[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH 02/15] xen/arm/gic: Enable interrupt assignment to running VM

To: Henry Wang <xin.wang2@xxxxxxx>, xen-devel@xxxxxxxxxxxxxxxxxxxx
From: Julien Grall <julien@xxxxxxx>
Date: Tue, 7 May 2024 22:54:08 +0100
Cc: Stefano Stabellini <sstabellini@xxxxxxxxxx>, Bertrand Marquis <bertrand.marquis@xxxxxxx>, Michal Orzel <michal.orzel@xxxxxxx>, Volodymyr Babchuk <Volodymyr_Babchuk@xxxxxxxx>, Stefano Stabellini <stefano.stabellini@xxxxxxxxxx>
Delivery-date: Tue, 07 May 2024 21:54:28 +0000
List-id: Xen developer discussion <xen-devel.lists.xenproject.org>

Hi Henry,

On 06/05/2024 09:32, Henry Wang wrote:

On 5/1/2024 4:13 AM, Julien Grall wrote:
Hi Henry,

On 30/04/2024 04:50, Henry Wang wrote:
On 4/25/2024 10:28 PM, Julien Grall wrote:
Thanks for your feeedback. After checking the b8577547236f commitmessage I think I now understand your point. Do you have anysuggestion about how can I properly add the support to route/removethe IRQ to running domains? Thanks.
I spent some time going through the GIC/vGIC code and had somediscussions with Stefano and Stewart during the last couple of days,let me see if I can describe the use case properly now to continuethe discussion:
We have some use cases that requires assigning devices to domainsafter domain boot time. For example, suppose there is an FPGA on theboard which can simulate a device, and the bitstream for the FPGA isprovided and programmed after domain boot. So we need a way to assignthe device to the running domain. This series tries to implement thisuse case by using device tree overlay - users can firstly add theoverlay to Xen dtb, assign the device in the overlay to a domain bythe xl command, then apply the overlay to Linux.
Thanks for the description! This helps to understand your goal :).
Thank you very much for spending your time on discussing this andprovide these valuable comments!
I haven't really look at that code in quite a while. I think we needto make sure that the virtual and physical IRQ state matches at thetime we do the routing.
I am undecided on whether we want to simply prevent the action tohappen or try to reset the state.
There is also the question of what to do if the guest is enablingthe vIRQ before it is routed.
Sorry for bothering, would you mind elaborating a bit more about thetwo cases that you mentioned above? Commit b8577547236f ("xen/arm:Restrict when a physical IRQ can be routed/removed from/to a domain")only said there will be undesirable effects, so I am not sure if Iunderstand the concerns raised above and the consequences of thesetwo use cases.
I will try to explain them below after I answer the rest.
I am probably wrong, I think when we add the overlay, we are probablyfine as the interrupt is not being used before.
What if the DT overlay is unloaded and then reloaded? Wouldn't thesame interrupt be re-used? As a more generic case, this could also bea new bitstream for the FPGA.
But even if the interrupt is brand new every time for the DT overlay,you are effectively relaxing the check for every user (such asXEN_DOMCTL_bind_pt_irq). So the interrupt re-use case needs to betaken into account.
I agree. I think IIUC, with your explanation here and below, could wesimplify the problem to how to properly handle the removal of the IRQfrom a running guest, if we always properly remove and clean up theinformation when remove the IRQ from the guest? In this way, the IRQ canalways be viewed as a brand new one when we add it back.


If we can make sure the virtual IRQ and physical IRQ is cleaned then yes.

Then the onlycorner case that we need to take care of would be...

Can you clarify whether you say the "only corner case" because youlooked at the code? Or is it just because I mentioned only one?

Also since we only load the device driver after the IRQ is routed tothe guest,
This is what a well-behave guest will do. However, we need to thinkwhat will happen if a guest misbehaves. I am not concerned about aguest only impacting itself, I am more concerned about the case wherethe rest of the system is impacted.
I am not sure the guest can enable the vIRQ before it is routed.
Xen allows the guest to enable a vIRQ even if there is no pIRQassigned. Thanksfully, it looks like the vgic_connect_hw_irq(), inboth the current and new vGIC, will return an error if we are tryingto route a pIRQ to an already enabled vIRQ.
But we need to investigate all the possible scenarios to make surethat any inconsistencies between the physical state and virtual state(including the LRs) will not result to bigger problem.
The one that comes to my mind is: The physical interrupt isde-assigned from the guest before it was EOIed. In this case, theinterrupt will still be in the LR with the HW bit set. This wouldallow the guest to EOI the interrupt even if it is routed to someoneelse. It is unclear what would be the impact on the other guest.
...same as this case, i.e.
test_bit(_IRQ_INPROGRESS, &desc->status) || !test_bit(_IRQ_DISABLED,&desc->status)) when we try to remove the IRQ from a running domain.

We already call ->shutdown() which will disable the IRQ. So don't weonly need to take care of _IRQ_INPROGRESS?


[...]

we have 3 possible states which can be read from LR for this case :active, pending, pending and active.- I don't think we can do anything about the active state, so we shouldreturn -EBUSY and reject the whole operation of removing the IRQ fromrunning guest, and user can always retry this operation.

This would mean a malicious/buggy guest would be able to prevent adevice to be de-assigned. This is not a good idea in particular when thedomain is dying.

That said, I think you can handle this case. The LR has a bit toindicate whether the pIRQ needs to be EOIed. You can clear it and thiswould prevent the guest to touch the pIRQ. There might be other clean-upto do in the vGIC datastructure.

Anyway, we don't have to handle removing an active IRQ when the domainis still running (although we do when the domain is destroying). But Ithink this would need to be solved before the feature is (security)supported.

- For the pending (and active) case,

Shouldn't the pending and active case handled the same way as the activecase?

can we clear the LR and point theLR for the pending_irq to invalid?

LRs can be cleared. You will need to find which vCPU was used for theinjection and then pause it so the LR can be safely updated.

There will also be some private state to clear. I don't know how easy itwill be. However, we decided to not do anything for ICPENDR (whichrequires a similar behavior) as this was complex (?) to do with theexisting vGIC.

I vaguely remember we had some discussions on the ML. I didn't look forthem though.

Anyway, same as above, this could possibly handled later on. But thiswould probably need to be solved before the feature is (security supported).


Cheers,

--
Julien Grall

Follow-Ups:
- Re: [PATCH 02/15] xen/arm/gic: Enable interrupt assignment to running VM
  - From: Henry Wang

References:
- Re: [PATCH 02/15] xen/arm/gic: Enable interrupt assignment to running VM
  - From: Henry Wang

Prev by Date: [xen-unstable-smoke test] 185939: tolerable all pass - PUSHED
Next by Date: Re: [PATCH v6 8/8] xen: allow up to 16383 cpus
Previous by thread: Re: [PATCH 02/15] xen/arm/gic: Enable interrupt assignment to running VM
Next by thread: Re: [PATCH 02/15] xen/arm/gic: Enable interrupt assignment to running VM
Index(es):
- Date
- Thread

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.