[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] Re: [Xen-devel] [PATCH 00/60] xen: add core scheduling support
Hi Juergen, I did some testing of this series (with sched-gran=core) and posting a couple of crash backtraces here for your information. Additionally, resuming a Debian 7 guest after suspend is broken. I will be able to provide any additional information only after XenSummit :) 1) This crash is quite likely to happen: [2019-07-04 18:22:46 UTC] (XEN) [ 3425.220660] Watchdog timer detects that CPU2 is stuck! [2019-07-04 18:22:46 UTC] (XEN) [ 3425.226293] ----[ Xen-4.13.0-8.0.6-d x86_64 debug=y Not tainted ]---- [2019-07-04 18:22:46 UTC] (XEN) [ 3425.233576] CPU: 2 [2019-07-04 18:22:46 UTC] (XEN) [ 3425.236348] RIP: e008:[<ffff82d08023d578>] vcpu_sleep_sync+0x50/0x71 [2019-07-04 18:22:46 UTC] (XEN) [ 3425.243458] RFLAGS: 0000000000000202 CONTEXT: hypervisor (d34v0) [2019-07-04 18:22:47 UTC] (XEN) [ 3425.250129] rax: 0000000000000001 rbx: ffff8305f29e6000 rcx: ffff8305f29e6128 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.258101] rdx: 0000000000000000 rsi: 0000000000000296 rdi: ffff8308066f9128 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.266076] rbp: ffff8308066f7cb8 rsp: ffff8308066f7ca8 r8: 00000000deadf00d [2019-07-04 18:22:47 UTC] (XEN) [ 3425.274052] r9: 00000000deadf00d r10: 0000000000000000 r11: 0000000000000000 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.282026] r12: 0000000000000000 r13: ffff8305f29e6000 r14: 0000000000000000 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.289994] r15: 0000000000000003 cr0: 000000008005003b cr4: 00000000001526e0 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.297970] cr3: 00000005f2de3000 cr2: 00000000c012ae78 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.303864] fsb: 0000000004724000 gsb: 00000000c52c4a20 gss: 0000000000000000 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.311836] ds: 007b es: 007b fs: 00d8 gs: 00e0 ss: 0000 cs: e008 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.319290] Xen code around <ffff82d08023d578> (vcpu_sleep_sync+0x50/0x71): [2019-07-04 18:22:47 UTC] (XEN) [ 3425.326744] ec 01 00 00 09 d0 48 98 <48> 0b 83 20 01 00 00 74 09 80 bb 07 01 00 00 00 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.335152] Xen stack trace from rsp=ffff8308066f7ca8: [2019-07-04 18:22:47 UTC] (XEN) [ 3425.340783] ffff82d0802aede4 ffff8305f29e6000 ffff8308066f7cc8 ffff82d080208370 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.348844] ffff8308066f7ce8 ffff82d08023e25d 0000000000000001 ffff8305f33f0000 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.356904] ffff8308066f7d58 ffff82d080209682 0000031c63c966ad 00000000ed601000 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.364963] 0000000092920063 0000000000000009 ffff8305f33f0000 0000000000000001 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.373024] 0000000000000292 ffff82d080242ee2 0000000000000001 ffff8305f29e6000 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.381084] 0000000000000000 ffff8305f33f0000 ffff8308066f7e28 ffff82d08024f970 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.389144] ffff8305f33f00d4 000000000000000c ffff8305f33f0000 00000000deadf00d [2019-07-04 18:22:47 UTC] (XEN) [ 3425.397207] ffff8308066f7da8 ffff82d0802b3754 ffff82d080209d46 ffff82d08020b6e7 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.405262] ffff8308066f7e28 ffff82d08020c658 00000002ec86be74 0000000000000002 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.413325] ffff8305c33d8300 ffff8305f33f00d4 aaaaaaaaaaaaaaaa 0000000c00000008 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.421383] 0000000000000009 ffff83081cca1000 ffff82d08038835a ffff8308066f7ef8 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.429445] ffff8306a2b11000 00000000deadf00d 0000000000000180 0000000000000003 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.437503] ffff8308066f7ec8 ffff82d080383964 ffff82d08038835a ffff82d000000007 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.445565] ffff82d000000001 ffff82d000000000 ffff82d0deadf00d ffff82d0deadf00d [2019-07-04 18:22:47 UTC] (XEN) [ 3425.453624] ffff82d08038835a ffff82d08038834e ffff82d08038835a ffff82d08038834e [2019-07-04 18:22:47 UTC] (XEN) [ 3425.461683] ffff82d08038835a ffff82d08038834e ffff82d08038835a ffff8308066f7ef8 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.469744] 0000000000000000 0000000000000000 0000000000000000 0000000000000000 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.477803] ffff8308066f7ee8 ffff82d080385644 ffff82d08038835a ffff8306a2b11000 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.485865] 00007cf7f99080e7 ffff82d08038839b 0000000000000000 0000000000000000 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.493923] 0000000000000000 0000000000000000 0000000000000001 0000000000000007 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.501989] Xen call trace: [2019-07-04 18:22:47 UTC] (XEN) [ 3425.505278] [<ffff82d08023d578>] vcpu_sleep_sync+0x50/0x71 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.511518] [<ffff82d080208370>] vcpu_pause+0x21/0x23 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.517326] [<ffff82d08023e25d>] vcpu_set_periodic_timer+0x27/0x73 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.524258] [<ffff82d080209682>] do_vcpu_op+0x2c9/0x668 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.530238] [<ffff82d08024f970>] compat_vcpu_op+0x250/0x390 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.536566] [<ffff82d080383964>] pv_hypercall+0x364/0x564 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.542719] [<ffff82d080385644>] do_entry_int82+0x26/0x2d [2019-07-04 18:22:47 UTC] (XEN) [ 3425.548876] [<ffff82d08038839b>] entry_int82+0xbb/0xc0 [2019-07-04 18:22:47 UTC] (XEN) [ 3425.554764] [2019-07-04 18:22:47 UTC] (XEN) [ 3425.556760] CPU1 @ beef:fffff88000a5f495 (0000000000000000) [2019-07-04 18:22:47 UTC] (XEN) [ 3425.562825] CPU0 @ e008:ffff82d080253c51 (ns16550.c#ns16550_interrupt+0/0x79) [2019-07-04 18:22:47 UTC] (XEN) [ 3425.570452] CPU7 @ e033:ffffffff810fb49b (0000000000000000) [2019-07-04 18:22:47 UTC] (XEN) [ 3425.576518] CPU6 @ e008:ffff82d080279bc1 (domain.c#default_idle+0xc3/0xda) [2019-07-04 18:22:47 UTC] (XEN) [ 3425.583887] CPU3 @ 0061:c04b668a (0000000000000000) [2019-07-04 18:22:47 UTC] (XEN) [ 3425.589259] CPU4 @ 0061:c053468e (0000000000000000) [2019-07-04 18:22:47 UTC] (XEN) [ 3425.594636] CPU5 @ e008:ffff82d080279bc1 (domain.c#default_idle+0xc3/0xda) [2019-07-04 18:22:47 UTC] (XEN) [ 3425.602537] [2019-07-04 18:22:47 UTC] (XEN) [ 3425.604532] **************************************** [2019-07-04 18:22:47 UTC] (XEN) [ 3425.609991] Panic on CPU 2: [2019-07-04 18:22:47 UTC] (XEN) [ 3425.613283] FATAL TRAP: vector = 2 (nmi) [2019-07-04 18:22:47 UTC] (XEN) [ 3425.617706] [error_code=0000] [2019-07-04 18:22:47 UTC] (XEN) [ 3425.621257] **************************************** [2019-07-04 18:22:47 UTC] (XEN) [ 3425.626715] [2019-07-04 18:22:47 UTC] (XEN) [ 3425.628711] Reboot in five seconds... 2) This one has been seen only once so far: [2019-07-05 00:37:16 UTC] (XEN) [24907.482686] Watchdog timer detects that CPU30 is stuck! [2019-07-05 00:37:16 UTC] (XEN) [24907.514180] ----[ Xen-4.13.0-8.0.6-d x86_64 debug=y Not tainted ]---- [2019-07-05 00:37:16 UTC] (XEN) [24907.552070] CPU: 30 [2019-07-05 00:37:16 UTC] (XEN) [24907.565281] RIP: e008:[<ffff82d0802406fc>] sched_context_switched+0xaf/0x101 [2019-07-05 00:37:16 UTC] (XEN) [24907.601232] RFLAGS: 0000000000000202 CONTEXT: hypervisor [2019-07-05 00:37:16 UTC] (XEN) [24907.629998] rax: 0000000000000002 rbx: ffff83202782e880 rcx: 000000000000001e [2019-07-05 00:37:16 UTC] (XEN) [24907.669651] rdx: ffff83202782e904 rsi: ffff832027823000 rdi: ffff832027823000 [2019-07-05 00:37:16 UTC] (XEN) [24907.706560] rbp: ffff83403cab7d20 rsp: ffff83403cab7d00 r8: 0000000000000000 [2019-07-05 00:37:16 UTC] (XEN) [24907.743258] r9: 0000000000000000 r10: 0200200200200200 r11: 0100100100100100 [2019-07-05 00:37:16 UTC] (XEN) [24907.779940] r12: ffff832027823000 r13: ffff832027823000 r14: ffff83202782e7b0 [2019-07-05 00:37:16 UTC] (XEN) [24907.816849] r15: ffff83202782e880 cr0: 000000008005003b cr4: 00000000000426e0 [2019-07-05 00:37:16 UTC] (XEN) [24907.854125] cr3: 00000000bd8a1000 cr2: 000000001851b798 [2019-07-05 00:37:16 UTC] (XEN) [24907.881483] fsb: 0000000000000000 gsb: 0000000000000000 gss: 0000000000000000 [2019-07-05 00:37:16 UTC] (XEN) [24907.918309] ds: 0000 es: 0000 fs: 0000 gs: 0000 ss: 0000 cs: e008 [2019-07-05 00:37:16 UTC] (XEN) [24907.952619] Xen code around <ffff82d0802406fc> (sched_context_switched+0xaf/0x101): [2019-07-05 00:37:16 UTC] (XEN) [24907.990277] 00 00 eb 18 f3 90 8b 02 <85> c0 75 f8 eb 0e 49 8b 7e 30 48 85 ff 74 05 e8 [2019-07-05 00:37:16 UTC] (XEN) [24908.032393] Xen stack trace from rsp=ffff83403cab7d00: [2019-07-05 00:37:16 UTC] (XEN) [24908.061298] ffff832027823000 ffff832027823000 0000000000000000 ffff83202782e880 [2019-07-05 00:37:16 UTC] (XEN) [24908.098529] ffff83403cab7d60 ffff82d0802407c0 0000000000000082 ffff83202782e7c8 [2019-07-05 00:37:16 UTC] (XEN) [24908.135622] 000000000000001e ffff83202782e7c8 000000000000001e ffff82d080602628 [2019-07-05 00:37:16 UTC] (XEN) [24908.172671] ffff83403cab7dc0 ffff82d080240d83 000000000000df99 000000000000001e [2019-07-05 00:37:16 UTC] (XEN) [24908.210212] ffff832027823000 000016a62dc8c6bc 000000fc00000000 000000000000001e [2019-07-05 00:37:16 UTC] (XEN) [24908.247181] ffff83202782e7c8 ffff82d080602628 ffff82d0805da460 000000000000001e [2019-07-05 00:37:16 UTC] (XEN) [24908.284279] ffff83403cab7e60 ffff82d080240ea4 00000002802aecc5 ffff832027823000 [2019-07-05 00:37:16 UTC] (XEN) [24908.321128] ffff83202782e7b0 ffff83202782e880 ffff83403cab7e10 ffff82d080273b4e [2019-07-05 00:37:16 UTC] (XEN) [24908.358308] ffff83403cab7e10 ffff82d080242f7f ffff83403cab7e60 ffff82d08024663a [2019-07-05 00:37:17 UTC] (XEN) [24908.395662] ffff83403cab7ea0 ffff82d0802ec32a ffff8340000000ff ffff82d0805bc880 [2019-07-05 00:37:17 UTC] (XEN) [24908.432376] ffff82d0805bb980 ffffffffffffffff ffff83403cab7fff 000000000000001e [2019-07-05 00:37:17 UTC] (XEN) [24908.469812] ffff83403cab7e90 ffff82d080242575 0000000000000f00 ffff82d0805bb980 [2019-07-05 00:37:17 UTC] (XEN) [24908.508373] 000000000000001e ffff82d0806026f0 ffff83403cab7ea0 ffff82d0802425ca [2019-07-05 00:37:17 UTC] (XEN) [24908.549856] ffff83403cab7ef0 ffff82d08027a601 ffff82d080242575 0000001e7ffde000 [2019-07-05 00:37:17 UTC] (XEN) [24908.588022] ffff832027823000 ffff832027823000 ffff83127ffde000 ffff83203ffe5000 [2019-07-05 00:37:17 UTC] (XEN) [24908.625217] 000000000000001e ffff831204092000 ffff83403cab7d78 00000000ffffffed [2019-07-05 00:37:17 UTC] (XEN) [24908.662932] ffffffff81800000 0000000000000000 ffffffff81800000 0000000000000000 [2019-07-05 00:37:17 UTC] (XEN) [24908.703246] ffffffff818f4580 ffff880039118848 00000e6a3c4b2698 00000000148900db [2019-07-05 00:37:17 UTC] (XEN) [24908.743671] 0000000000000000 ffffffff8101e650 ffffffff8185c3e0 0000000000000000 [2019-07-05 00:37:17 UTC] (XEN) [24908.781927] 0000000000000000 0000000000000000 0000beef0000beef ffffffff81054eb2 [2019-07-05 00:37:17 UTC] (XEN) [24908.820986] Xen call trace: [2019-07-05 00:37:17 UTC] (XEN) [24908.836789] [<ffff82d0802406fc>] sched_context_switched+0xaf/0x101 [2019-07-05 00:37:17 UTC] (XEN) [24908.869916] [<ffff82d0802407c0>] schedule.c#sched_context_switch+0x72/0x151 [2019-07-05 00:37:17 UTC] (XEN) [24908.907384] [<ffff82d080240d83>] schedule.c#sched_slave+0x2a3/0x2b2 [2019-07-05 00:37:17 UTC] (XEN) [24908.941241] [<ffff82d080240ea4>] schedule.c#schedule+0x112/0x2a1 [2019-07-05 00:37:17 UTC] (XEN) [24908.973939] [<ffff82d080242575>] softirq.c#__do_softirq+0x85/0x90 [2019-07-05 00:37:17 UTC] (XEN) [24909.007101] [<ffff82d0802425ca>] do_softirq+0x13/0x15 [2019-07-05 00:37:17 UTC] (XEN) [24909.035971] [<ffff82d08027a601>] domain.c#idle_loop+0xad/0xc0 [2019-07-05 00:37:17 UTC] (XEN) [24909.070546] [2019-07-05 00:37:17 UTC] (XEN) [24909.080286] CPU0 @ e008:ffff82d0802431ba (stop_machine.c#stopmachine_wait_state+0x1a/0x24) [2019-07-05 00:37:17 UTC] (XEN) [24909.122896] CPU1 @ e008:ffff82d0802406f8 (sched_context_switched+0xab/0x101) [2019-07-05 00:37:18 UTC] (XEN) [24909.159518] CPU3 @ e008:ffff82d0802431fa (stop_machine.c#stopmachine_action+0x36/0xa0) [2019-07-05 00:37:18 UTC] (XEN) [24909.199607] CPU2 @ e008:ffff82d0802406fc (sched_context_switched+0xaf/0x101) [2019-07-05 00:37:18 UTC] (XEN) [24909.235773] CPU5 @ e008:ffff82d0802431f4 (stop_machine.c#stopmachine_action+0x30/0xa0) [2019-07-05 00:37:18 UTC] (XEN) [24909.276039] CPU4 @ e008:ffff82d0802406fa (sched_context_switched+0xad/0x101) [2019-07-05 00:37:18 UTC] (XEN) [24909.312371] CPU7 @ e008:ffff82d0802431fa (stop_machine.c#stopmachine_action+0x36/0xa0) [2019-07-05 00:37:18 UTC] (XEN) [24909.352930] CPU6 @ e008:ffff82d0802406fc (sched_context_switched+0xaf/0x101) [2019-07-05 00:37:18 UTC] (XEN) [24909.388928] CPU8 @ e008:ffff82d0802406fa (sched_context_switched+0xad/0x101) [2019-07-05 00:37:18 UTC] (XEN) [24909.424664] CPU9 @ e008:ffff82d0802431fa (stop_machine.c#stopmachine_action+0x36/0xa0) [2019-07-05 00:37:18 UTC] (XEN) [24909.465376] CPU10 @ e008:ffff82d0802431fa (stop_machine.c#stopmachine_action+0x36/0xa0) [2019-07-05 00:37:18 UTC] (XEN) [24909.507449] CPU11 @ e008:ffff82d0802406fa (sched_context_switched+0xad/0x101) [2019-07-05 00:37:18 UTC] (XEN) [24909.544703] CPU13 @ e008:ffff82d0802431f2 (stop_machine.c#stopmachine_action+0x2e/0xa0) [2019-07-05 00:37:18 UTC] (XEN) [24909.588884] CPU12 @ e008:ffff82d0802406fc (sched_context_switched+0xaf/0x101) [2019-07-05 00:37:18 UTC] (XEN) [24909.625781] CPU15 @ e008:ffff82d0802431fa (stop_machine.c#stopmachine_action+0x36/0xa0) [2019-07-05 00:37:18 UTC] (XEN) [24909.666649] CPU14 @ e008:ffff82d0802406fa (sched_context_switched+0xad/0x101) [2019-07-05 00:37:18 UTC] (XEN) [24909.703396] CPU17 @ e008:ffff82d0802431f4 (stop_machine.c#stopmachine_action+0x30/0xa0) [2019-07-05 00:37:18 UTC] (XEN) [24909.744089] CPU16 @ e008:ffff82d0802406fa (sched_context_switched+0xad/0x101) [2019-07-05 00:37:18 UTC] (XEN) [24909.781117] CPU23 @ e008:ffff82d0802431fa (stop_machine.c#stopmachine_action+0x36/0xa0) [2019-07-05 00:37:18 UTC] (XEN) [24909.821692] CPU22 @ e008:ffff82d0802406fa (sched_context_switched+0xad/0x101) [2019-07-05 00:37:18 UTC] (XEN) [24909.858139] CPU27 @ e008:ffff82d0802431f4 (stop_machine.c#stopmachine_action+0x30/0xa0) [2019-07-05 00:37:18 UTC] (XEN) [24909.898704] CPU26 @ e008:ffff82d0802406fa (sched_context_switched+0xad/0x101) [2019-07-05 00:37:19 UTC] (XEN) [24909.936069] CPU19 @ e008:ffff82d0802431fa (stop_machine.c#stopmachine_action+0x36/0xa0) [2019-07-05 00:37:19 UTC] (XEN) [24909.977291] CPU18 @ e008:ffff82d0802406fa (sched_context_switched+0xad/0x101) [2019-07-05 00:37:19 UTC] (XEN) [24910.014078] CPU31 @ e008:ffff82d0802431fa (stop_machine.c#stopmachine_action+0x36/0xa0) [2019-07-05 00:37:19 UTC] (XEN) [24910.055692] CPU21 @ e008:ffff82d0802431fa (stop_machine.c#stopmachine_action+0x36/0xa0) [2019-07-05 00:37:19 UTC] (XEN) [24910.100486] CPU24 @ e008:ffff82d0802406fa (sched_context_switched+0xad/0x101) [2019-07-05 00:37:19 UTC] (XEN) [24910.136824] CPU25 @ e008:ffff82d0802431fa (stop_machine.c#stopmachine_action+0x36/0xa0) [2019-07-05 00:37:19 UTC] (XEN) [24910.177529] CPU29 @ e008:ffff82d0802431f4 (stop_machine.c#stopmachine_action+0x30/0xa0) [2019-07-05 00:37:19 UTC] (XEN) [24910.218420] CPU28 @ e008:ffff82d0802406fc (sched_context_switched+0xaf/0x101) [2019-07-05 00:37:19 UTC] (XEN) [24910.255219] CPU20 @ e008:ffff82d0802406fc (sched_context_switched+0xaf/0x101) [2019-07-05 00:37:19 UTC] (XEN) [24910.292152] [2019-07-05 00:37:19 UTC] (XEN) [24910.301667] **************************************** [2019-07-05 00:37:19 UTC] (XEN) [24910.327892] Panic on CPU 30: [2019-07-05 00:37:19 UTC] (XEN) [24910.344165] FATAL TRAP: vector = 2 (nmi) [2019-07-05 00:37:19 UTC] (XEN) [24910.365476] [error_code=0000] [2019-07-05 00:37:19 UTC] (XEN) [24910.382509] **************************************** [2019-07-05 00:37:19 UTC] (XEN) [24910.408547] [2019-07-05 00:37:19 UTC] (XEN) [24910.418129] Reboot in five seconds... Thanks, Sergey _______________________________________________ Xen-devel mailing list Xen-devel@xxxxxxxxxxxxxxxxxxxx https://lists.xenproject.org/mailman/listinfo/xen-devel
|
Lists.xenproject.org is hosted with RackSpace, monitoring our |