[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH v3] arinc653: don't assume Dom0 is the control domain


  • To: Jan Beulich <jbeulich@xxxxxxxx>
  • From: Stewart Hildebrand <stewart.hildebrand@xxxxxxx>
  • Date: Tue, 7 Apr 2026 07:27:16 -0400
  • Arc-authentication-results: i=1; mx.microsoft.com 1; spf=pass (sender ip is 165.204.84.17) smtp.rcpttodomain=suse.com smtp.mailfrom=amd.com; dmarc=pass (p=quarantine sp=quarantine pct=100) action=none header.from=amd.com; dkim=none (message not signed); arc=none (0)
  • Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector10001; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=YfsFG8/SjaYpBSB12NCcvkBPt3OK1vfCpzahKSH4t90=; b=YCn6zgmAZ1AWcGDYSBkiLrugU4DLUyEB2j5LPwg0NUW9X2e/+nccEIzy2ognbVMq9D636P45HazClVtHlnjCX9uXQpHDPWKWmU9WkmTtwr0Ura1uqgRdiZkbVJhWvNQViSup8qj42WsZXMaIdgVDoZ9bY6wZsOrIIoKuNU+IurEnYhRGQSie7QAAp6zg/lJdJ4jetOViMCfKI08YmZuMpjTqU+KCtUR9YrS34+GQJ8GaaLLoCjJe9Vqr0wi9ZpFHjy14bXtcLAuHZ33advgTURWAbAnqrNNT6PGZvtr9WuK3rQl+nOHmcCvCXTsvtk4Wj5Aqe3QO33mL+HSij6z1vw==
  • Arc-seal: i=1; a=rsa-sha256; s=arcselector10001; d=microsoft.com; cv=none; b=dekDUFsnxRjJ9pN4Mu+LW4fMGaU2qHixsrR+4HXVuwEC2Js8LffbmVhjDRze9h6kgxVP+AZPY/1VDdKWbsgM6AUuUkaObD4ZVVuHSC36B01N5Ax3eYqy4obYR5Zg/QoPq3CSc9UZTqyuEJKHuplL3dYErp2p+PPJZ2hqvSHlNIgYsblxnLOByOjrRc99X2qvfR9q1LKosvHTK6k1r3D7Vne5SRShfrL6qNtNDESSTp3o8qaxPm0K1YuTVcMc6AeVt6YzyScPaWC6z8VfeHGqABekOjvbjH1N46R3yWPYvMJZrzxS8uxaOd7UWZW+gcsBgsNprNtWYe9cafVKBkU5lg==
  • Authentication-results: eu.smtp.expurgate.cloud; dkim=pass header.s=selector1 header.d=amd.com header.i="@amd.com" header.h="From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck"
  • Cc: Nathan Studer <nathan.studer@xxxxxxxxxxxxxxx>, Stewart Hildebrand <stewart@xxxxxxx>, "xen-devel@xxxxxxxxxxxxxxxxxxxx" <xen-devel@xxxxxxxxxxxxxxxxxxxx>, Jürgen Groß <jgross@xxxxxxxx>
  • Delivery-date: Tue, 07 Apr 2026 11:27:42 +0000
  • List-id: Xen developer discussion <xen-devel.lists.xenproject.org>

On 4/7/26 02:18, Jan Beulich wrote:
> On 03.04.2026 10:47, Stewart Hildebrand wrote:
>> On 4/1/26 09:03, Jan Beulich wrote:
>>> On 01.04.2026 14:57, Jürgen Groß wrote:
>>>> On 01.04.26 14:29, Jan Beulich wrote:
>>>>> Leaving aside highly disaggregated environments, the control domain is
>>>>> what will invoke XEN_SYSCTL_SCHEDOP_putinfo. Its vCPU-s therefore need to
>>>>> be able to run unconditionally, not those of the domain with ID 0 (which
>>>>> may not exist at all).
>>>>>
>>>>> Fixes: 9f0c658baedc ("arinc: add cpu-pool support to scheduler")
>>>>> Signed-off-by: Jan Beulich <jbeulich@xxxxxxxx>
>>>>> ---
>>>>> There being no "else" to the if(), what about other control domain vCPU-s?
>>>>
>>>> I guess this is a stale leftover. Doesn't matter for committing anyway.
>>>>
>>>>> ---
>>>>> v3: Don't mistakenly include the idle domain.
>>>>> v2: New.
>>>>>
>>>>> --- a/xen/common/sched/arinc653.c
>>>>> +++ b/xen/common/sched/arinc653.c
>>>>> @@ -411,10 +411,10 @@ a653sched_alloc_udata(const struct sched
>>>>>       spin_lock_irqsave(&sched_priv->lock, flags);
>>>>>   
>>>>>       /*
>>>>> -     * Add every one of dom0's units to the schedule, as long as there 
>>>>> are
>>>>> -     * slots available.
>>>>> +     * Add every one of the control domain's units to the schedule, as 
>>>>> long as
>>>>> +     * there are slots available.
>>>>>        */
>>>>> -    if ( unit->domain->domain_id == 0 )
>>>>> +    if ( is_control_domain(unit->domain) && 
>>>>> !is_idle_domain(unit->domain) )
>>>>>       {
>>>>>           entry = sched_priv->num_schedule_entries;
>>>>>   
>>>>
>>>> Hmm, is it really the control domain only which wants to be scheduled 
>>>> initially?
>>>> I would think that at least the hardware domain and probably a Xenstore 
>>>> domain
>>>> would want to be included, too.
>>>>
>>>> In the end it might even be that other domains created via dom0less would 
>>>> want
>>>> to be able to run initially. They could be part of a mandatory 
>>>> infrastructure.
>>>> Why would they need to be created at boot if they are NOT important?
>>>
>>> This part is easy to answer: Because in a dom0less setup you simply may have
>>> no toolstack at all. (At which point there may also be nothing to set a
>>> schedule, yes.)
>>
>> This is a known limitation. In a dom0less/hyperlaunch scenario, as future 
>> work,
>> I would like to see the ability to configure the ARINC653 schedule in device
>> tree, which would likely extend the existing boot time cpu pool work.
>>
>>>> The question is whether the arinc653 scheduler is really meant for such 
>>>> setups.
>>>> OTOH just modifying the test to:
>>>>
>>>>      if ( system_state < SYS_STATE_active &&
>>>>           unit->domain->domain_id < DOMID_FIRST_RESERVED )
>>>>
>>>> seems to be fine for catching all those cases.
>>>>
>>>> With or without this modification:
>>>>
>>>> Reviewed-by: Juergen Gross <jgross@xxxxxxxx>
>>>
>>> Thanks, yet I'll have to leave to the maintainers to decide which form it
>>> should ultimately take. One remark: A restartable control domain wouldn't
>>> pass that conditional. Granted that's looking far into the future.
>>
>> It may not be desirable to schedule domUs until the control domain has had a 
>> say
>> in the matter, considering that the default schedule is unlikely to contain 
>> the
>> desired minor frame runtimes. It's less clear whether to include hardware and
>> xenstore domains in the default schedule, though I'm leaning toward only
>> including the domain with ability to invoke XEN_SYSCTL_SCHEDOP_putinfo for 
>> now
>> (i.e. the control domain).
>>
>> Hm, the suggested 'system_state < SYS_STATE_active' check is possibly a good
>> addition. This reinforces that the default schedule's purpose is merely to 
>> get a
>> system booting until a user-provided schedule can be installed. Without this
>> check, restarting the control domain could result in new entries being added
>> while old entries remain, possibly ending up with duplicates and/or 
>> exhausting
>> the schedule. However, the restarted domain would need to retain its uuid if 
>> it
>> expects to be scheduled after restart.
>>
>> Lastly, we may consider restricting the default schedule to Pool-0, and
>> eventually we may want a mechanism to disable the default schedule altogether
>> (e.g. when boot time cpupools are in use), but I don't think it's necessary 
>> to
>> conflate those with the current patch.
> 
> So what does all of this mean for the patch here? Should I switch to Jürgen's
> suggestion? Should I merely add the system_state check, but otherwise keep as
> is? Or should I not change anything?

I think the patch is good as is, but I just want to give it a few more days to
give Nate or others a chance to chime in. I'll follow up next week if there's no
further discussion.



 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.