[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] RE: Memory ordering question in the shutdown deferral code
> -----Original Message----- > From: Julien Grall <julien@xxxxxxx> > Sent: 21 September 2020 12:41 > To: Jan Beulich <jbeulich@xxxxxxxx>; Stefano Stabellini > <sstabellini@xxxxxxxxxx>; > andrew.cooper3@xxxxxxxxxx; George Dunlap <george.dunlap@xxxxxxxxxx>; Durrant, > Paul > <pdurrant@xxxxxxxxxxxx> > Cc: Xia, Hongyan <hongyxia@xxxxxxxxxx>; xen-devel@xxxxxxxxxxxxxxxxxxxx > Subject: RE: [EXTERNAL] Memory ordering question in the shutdown deferral code > > CAUTION: This email originated from outside of the organization. Do not click > links or open > attachments unless you can confirm the sender and know the content is safe. > > > > (+ Xen-devel) > > Sorry I forgot to CC xen-devel. > > On 21/09/2020 12:38, Julien Grall wrote: > > Hi all, > > > > I have started to look at the deferral code (see > > vcpu_start_shutdown_deferral()) because we need it for LiveUpdate and > > Arm will soon use it. > > > > The current implementation is using an smp_mb() to ensure ordering > > between a write then a read. The code looks roughly (I have slightly > > adapted it to make my question more obvious): > > > > domain_shutdown() > > d->is_shutting_down = 1; > > smp_mb(); > > if ( !vcpu0->defer_shutdown ) > > { > > vcpu_pause_nosync(v); > > v->paused_for_shutdown = 1; > > } > > > > vcpu_start_shutdown_deferral() > > vcpu0->defer_shutdown = 1; > > smp_mb(); > > if ( unlikely(d->is_shutting_down) ) > > vcpu_check_shutdown(v); > > > > return vcpu0->defer_shutdown; > > > > smp_mb() should only guarantee ordering (this may be stronger on some > > arch), so I think there is a race between the two functions. > > > > It would be possible to pause the vCPU in domain_shutdown() because > > vcpu0->defer_shutdown wasn't yet seen. > > > > Equally, vcpu_start_shutdown_deferral() may not see d->is_shutting_down > > and therefore Xen may continue to send the I/O. Yet the vCPU will be > > paused so the I/O will never complete. > > The barrier enforces global order, right? So, if domain_shutdown() pauses the vcpu then is_shutting_down must necessarily be visible all cpus. Thus vcpu_start_shutdown referral will execute vcpu_check_shutdown(), so I'm having a hard time seeing the race. > > I am not fully familiar with the IOREQ code, but it sounds to me this is > > not the behavior that was intended. Can someone more familiar with the > > code confirm it? > > No indeed. I think emulation should complete before the vcpu pauses. Paul > > Cheers, > > > > -- > Julien Grall
|
Lists.xenproject.org is hosted with RackSpace, monitoring our |