[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-devel] Xen crash after S3 suspend - Xen 4.13



On Wed, Mar 18, 2020 at 02:50:52PM +0000, Andrew Cooper wrote:
> On 18/03/2020 14:16, Marek Marczykowski-Górecki wrote:
> > Hi,
> >
> > In my test setup (inside KVM with nested virt enabled), I rather
> > frequently get Xen crash on resume from S3. Full message below.
> >
> > This is Xen 4.13.0, with some patches, including "sched: fix resuming
> > from S3 with smt=0".
> >
> > Contrary to the previous issue, this one does not happen always - I
> > would say in about 40% cases on this setup, but very rarely on physical
> > setup.
> >
> > This is _without_ core scheduling enabled, and also with smt=off.
> >
> > Do you think it would be any different on xen-unstable? I cat try, but
> > it isn't trivial in this setup, so I'd ask first.
> >
> >
> > (XEN) Xen BUG at sched_credit2.c:3474
> > (XEN) ----[ Xen-4.13.0  x86_64  debug=n   Not tainted ]----
> 
> Any chance you can reproduce with a debug build?
> 
> There is a good chance that an assertion might fire earlier than this BUG().

Got it:

(XEN) Assertion 'c2rqd(ops, sched_unit_master(unit)) == svc->rqd' failed at 
sched_credit2.c:2137
(XEN) ----[ Xen-4.13.0  x86_64  debug=y   Not tainted ]----
(XEN) CPU:    0
(XEN) RIP:    e008:[<ffff82d08022bee9>] 
sched_credit2.c#csched2_unit_wake+0x174/0x176
(XEN) RFLAGS: 0000000000010097   CONTEXT: hypervisor (d0v1)
(XEN) rax: ffff8301ba8fb198   rbx: ffff83008ea2ae10   rcx: 000000000000001b
(XEN) rdx: ffff8301ba8fb0c0   rsi: ffff8301ba8fb040   rdi: 0000000000000001
(XEN) rbp: ffff8300be84fd30   rsp: ffff8300be84fd10   r8:  deadbeefdeadf00d
(XEN) r9:  deadbeefdeadf00d   r10: 0000000000000000   r11: 0000000000000000
(XEN) r12: ffff83008ea2af30   r13: ffff82d080430720   r14: 0000000000000001
(XEN) r15: 0000004177a8cc12   cr0: 0000000080050033   cr4: 00000000000006e0
(XEN) cr3: 000000010e658000   cr2: 0000599b4e014010
(XEN) fsb: 00007d6c53bfd700   gsb: ffff88813ff00000   gss: 0000000000000000
(XEN) ds: 0000   es: 0000   fs: 0000   gs: 0000   ss: e010   cs: e008
(XEN) Xen code around <ffff82d08022bee9> 
(sched_credit2.c#csched2_unit_wake+0x174/0x176):
(XEN)  ef e8 e4 c3 ff ff eb a7 <0f> 0b 55 48 89 e5 41 57 41 56 41 55 41 54 53 48
(XEN) Xen stack trace from rsp=ffff8300be84fd10:
(XEN)    ffff83008dc3b000 ffff83008ea2ae10 ffff83008ea2ae10 ffff8301ba8fb198
(XEN)    ffff8300be84fd78 ffff82d0802346c6 0000000000000202 ffff8301ba8fb198
(XEN)    ffff83008dc3b000 ffff83011e6e4000 00007d6c503d2010 0000000000000000
(XEN)    0000000000000001 ffff8300be84fd90 ffff82d0802082f1 ffff83011e6e4000
(XEN)    ffff8300be84fda8 ffff82d08020843e 0000000000000000 ffff8300be84fec0
(XEN)    ffff82d080205ea5 0000000000000206 ffff830100e16060 0000000000007f88
(XEN)    ffff8301ba88b000 ffff8300be84fdf8 ffff82d08023bae1 ffff8301ba88b068
(XEN)    ffff82d080557300 ffff8300be84fe50 0000000000000206 0000001200000004
(XEN)    00007d6c2f6b0005 00007d6c3c00a250 0000000000000005 0000000000000001
(XEN)    00007d6c2f6b0189 000000030000007c 00007d6c10000e20 ffff010a00000005
(XEN)    000000000002cf56 000000000002e846 0000000000000000 0000000000000000
(XEN)    0000000000000000 0000000000000001 0000000554eea2c6 0000000000000001
(XEN)    7ba0548d00000000 ffff8300be84fef8 ffff8301ba88b000 0000000000000001
(XEN)    0000000000000000 0000000000000000 ffff8300be84fee8 ffff82d08034d922
(XEN)    ffff8301ba88b000 0000000000000000 0000000000000000 00007cff417b00e7
(XEN)    ffff82d080354432 ffff88813a157100 00007d6c53bfc590 00007d6c53bfc590
(XEN)    ffff8881358ad3e8 ffff88813a157100 ffff88813a157101 0000000000000282
(XEN)    0000000000000000 ffffc90001953e08 0000000000000000 0000000000000024
(XEN)    ffffffff8100148a 0000000000000000 0000000000000000 00007d6c503d2010
(XEN)    0000010000000000 ffffffff8100148a 000000000000e033 0000000000000282
(XEN) Xen call trace:
(XEN)    [<ffff82d08022bee9>] R sched_credit2.c#csched2_unit_wake+0x174/0x176
(XEN)    [<ffff82d0802346c6>] F vcpu_wake+0xdd/0x3ff
(XEN)    [<ffff82d0802082f1>] F domain_unpause+0x2f/0x3b
(XEN)    [<ffff82d08020843e>] F domain_unpause_by_systemcontroller+0x40/0x60
(XEN)    [<ffff82d080205ea5>] F do_domctl+0x9e4/0x1952
(XEN)    [<ffff82d08034d922>] F pv_hypercall+0x548/0x560
(XEN)    [<ffff82d080354432>] F lstar_enter+0x112/0x120
(XEN) 
(XEN) 
(XEN) ****************************************
(XEN) Panic on CPU 0:
(XEN) Assertion 'c2rqd(ops, sched_unit_master(unit)) == svc->rqd' failed at 
sched_credit2.c:2137
(XEN) ****************************************
(XEN) 
(XEN) Reboot in five seconds...


-- 
Best Regards,
Marek Marczykowski-Górecki
Invisible Things Lab
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?

Attachment: signature.asc
Description: PGP signature

_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxxx
https://lists.xenproject.org/mailman/listinfo/xen-devel

 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.