[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-devel] XSAVE/XRSTOR crash resurgence in 4.3



On Mon, Jul 8, 2013 at 10:24 AM, Jan Beulich <JBeulich@xxxxxxxx> wrote:
>>>> On 08.07.13 at 16:13, Ben Guthro <ben@xxxxxxxxxx> wrote:
>> On Fri, Jul 5, 2013 at 8:58 AM, Ben Guthro <ben.guthro@xxxxxxxxx> wrote:
>>> On Jul 5, 2013, at 8:15 AM, Jan Beulich <JBeulich@xxxxxxxx> wrote:
>>>
>>>>>>> On 05.07.13 at 14:10, Ben Guthro <ben.guthro@xxxxxxxxx> wrote:
>>>>> Since I am not in the office today, nor near a machine that I can
>>>>> access this, I asked someone else to apply and check in this patch, in
>>>>> the hope that we could get some useful debug info from the weekend
>>>>> test run.
>>>>>
>>>>> However, when he went to test booting a VM with this patch, the host
>>>>> machine rebooted - presumably a Xen crash.
>>>>
>>>> Oops - it's just printing stuff, so I can't immediately see how that
>>>> would happen. But perhaps a trivial oversight of mine...
>>
>> Here's the crash with this patch
>> I'm continuing to look at it, but if something jumps out at you,
>> please let me know.
>
> Quite obvious: hvm_guest_x86_mode() has this assertion. Yet
> the original, supposedly working patch had a use of this too iirc.

It did...which is worrying.

One difference here, is that 4.2 is running in debug=n mode, where 4.3
is debug=y

iirc, asserts are disabled on debug=n builds.



>
> Jan
>
>> (XEN) Assertion 'v == current' failed at
>> /data/home/bguthro/dev/orc-precise/xen/xen/include:275
>> (XEN) ----[ Xen-4.3-unstable  x86_64  debug=y  Not tainted ]----
>> (XEN) CPU:    1
>> (XEN) RIP:    e008:[<ffff82c4c01a4f65>] xsave+0x178/0x2c7
>> (XEN) RFLAGS: 0000000000010006   CONTEXT: hypervisor
>> (XEN) rax: ffff830215860000   rbx: ffff83020904ec00   rcx: ffff83021d63a000
>> (XEN) rdx: 00000000bfffffff   rsi: 0000000000000003   rdi: ffff83020904ec00
>> (XEN) rbp: ffff830215867d20   rsp: ffff830215867cd0   r8:  0000000000000000
>> (XEN) r9:  0000000000000000   r10: 0000000e23c7b646   r11: 0000000000000000
>> (XEN) r12: ffff8300d0840000   r13: 0000000000000008   r14: ffff830215860000
>> (XEN) r15: ffff830215867f18   cr0: 0000000080050033   cr4: 00000000001426f0
>> (XEN) cr3: 000000020bae2000   cr2: 0000000000000000
>> (XEN) ds: 0000   es: 0000   fs: 0000   gs: 0000   ss: 0000   cs: e008
>> (XEN) Xen stack trace from rsp=ffff830215867cd0:
>> (XEN)    ffff830215867df8 ffff830215867d38 ffff82c4c01b4622 ffff830215867d00
>> (XEN)    ffff83021d63ad80 0000000000000286 00000000000fa933 ffff8300d0840000
>> (XEN)    0000000000000001 ffff8300d0840000 ffff830215867d60 ffff82c4c0167a1a
>> (XEN)    ffff830215867d48 ffff830215867dc8 ffff82c4c017019c ffff830215867d98
>> (XEN)    0000000000000282 ffff8300d04fc000 ffff830215867db0 ffff82c4c0160c80
>> (XEN)    0000000a1d63ad18 0000000000000000 80000000d0840510 ffff8300d04fc000
>> (XEN)    ffff8300d072b000 0000000000000001 0000000000000001 ffff83021583d080
>> (XEN)    ffff830215867e20 ffff82c4c01634bc ffff82c4c01277b9 ffff8300d072b030
>> (XEN)    0000000000000000 0000000000000000 0000000000000000 0000000000000000
>> (XEN)    ffff830215867e20 ffff8300d072b000 ffff8300d04fc000 0000000e21f655a0
>> (XEN)    0000000000000001 ffff83021583d080 ffff830215867eb0 ffff82c4c0124364
>> (XEN)    0000000000000002 ffff83021583d060 0000000100867e60 ffff83021583d068
>> (XEN)    ffff82c4c0127824 ffff83021583c028 ffff830215867eb0 ffff8300d04fc000
>> (XEN)    0000000000989680 ffff82c4c01ab600 ffff8300d072b000 ffff82c4c02f0080
>> (XEN)    ffff82c4c02f0000 ffff830215860000 ffffffffffffffff ffff83021583d080
>> (XEN)    ffff830215867ee0 ffff82c4c012710e ffff830215860000 ffff830215860000
>> (XEN)    00000000ffffff01 0000000000000003 ffff830215867ef0 ffff82c4c0127174
>> (XEN)    ffff830215867f10 ffff82c4c0160aba ffff8300d072b000 ffff8300d0840000
>> (XEN)    ffff830215867dc8 0000000000000000 0000000000000000 0000000000000000
>> (XEN)    0000000000000000 0000000000000080 00000000000001f7 0000000000000000
>> (XEN) Xen call trace:
>> (XEN)    [<ffff82c4c01a4f65>] xsave+0x178/0x2c7
>> (XEN)    [<ffff82c4c0167a1a>] vcpu_save_fpu+0x83/0x159
>> (XEN)    [<ffff82c4c0160c80>] __context_switch+0x16e/0x50b
>> (XEN)    [<ffff82c4c01634bc>] context_switch+0x1ba/0xf4e
>> (XEN)    [<ffff82c4c0124364>] schedule+0x60b/0x61a
>> (XEN)    [<ffff82c4c012710e>] __do_softirq+0x8e/0x99
>> (XEN)    [<ffff82c4c0127174>] do_softirq+0x13/0x15
>> (XEN)    [<ffff82c4c0160aba>] idle_loop+0x68/0x6a
>> (XEN)
>> (XEN)
>> (XEN) ****************************************
>> (XEN) Panic on CPU 1:
>> (XEN) Assertion 'v == current' failed at
>> /data/home/bguthro/dev/orc-precise/xen/xen/include:275
>> (XEN) ****************************************
>> (XEN)
>> (XEN) Reboot in five seconds...
>> (XEN) Assertion 'current == idle_vcpu[smp_processor_id()]' failed at
>> domain.c:1494
>> (XEN) ----[ Xen-4.3-unstable  x86_64  debug=y  Not tainted ]----
>> (XEN) CPU:    1
>> (XEN) RIP:    e008:[<ffff82c4c01642cc>] __sync_local_execstate+0x63/0x86
>> (XEN) RFLAGS: 0000000000010087   CONTEXT: hypervisor
>> (XEN) rax: ffff830215860000   rbx: 0000000000000001   rcx: 0000000000000001
>> (XEN) rdx: ffff82c4c026c4a0   rsi: ffff8300d072b000   rdi: ffff830215867a68
>> (XEN) rbp: ffff8302158679b8   rsp: ffff8302158679a8   r8:  0000000000000004
>> (XEN) r9:  000000000000000a   r10: 0000000000000020   r11: 000000000000000a
>> (XEN) r12: 0000000000000082   r13: 00000000000000fd   r14: 0000000000000113
>> (XEN) r15: ffff830215867f18   cr0: 0000000080050033   cr4: 00000000001426f0
>> (XEN) cr3: 000000020bae2000   cr2: 0000000000000000
>> (XEN) ds: 0000   es: 0000   fs: 0000   gs: 0000   ss: 0000   cs: e008
>> (XEN) Xen stack trace from rsp=ffff8302158679a8:
>> (XEN)    00000000000000fd ffff830215867a68 ffff8302158679c8 ffff82c4c01860eb
>> (XEN)    ffff830215867a58 ffff82c4c017019c ffff82c4c024d581 0000000000000086
>> (XEN)    ffff830215867a48 0000000000000086 ffff830215867a18 ffff82c4c012788c
>> (XEN)    0000000000000000 8000000015867a28 ffff830215867a28 ffff82c4c01437dc
>> (XEN)    ffff830215867a58 ffff82c4c0261138 0000000000000082 ffff82c4c02566f0
>> (XEN)    0000000000000113 ffff830215867f18 00007cfdea798577 ffff82c4c0167bef
>> (XEN)    ffff830215867f18 0000000000000113 ffff82c4c02566f0 0000000000000082
>> (XEN)    ffff830215867b58 ffff82c4c0261138 000000000000000a 0000000000000020
>> (XEN)    000000000000000a 0000000000000004 0000000000000000 0000000000000000
>> (XEN)    ffff83021583d020 0000000000000082 ffff82c4c0287860 000000fd00000000
>> (XEN)    ffff82c4c0185be0 000000000000e008 0000000000000246 ffff830215867b18
>> (XEN)    0000000000000000 ffff82c4c0185bd8 ffff830215867b48 0000138800000061
>> (XEN)    ffff83021583d020 ffff82c4c0261138 0000000000000082 ffff82c4c02566f0
>> (XEN)    0000000000000113 ffff830215867f18 ffff830215867bc8 ffff82c4c0143670
>> (XEN)    ffff830215867ba8 ffff82c400000020 ffff830215867bd8 ffff830215867b88
>> (XEN)    ffff82c4c024cf6a ffff82c4c024cf6a ffff82c4c02566f0 0000000000000113
>> (XEN)    0000000000000004 0000000000000061 ffff830215867c28 ffff82c4c024cf6a
>> (XEN)    ffff830215867c18 ffff82c4c018dde0 ffff830200000000 0000000000000000
>> (XEN)    fbbc000000000000 044fc20b0f000a7f ffff8300d04fc000 ffff8300d0840000
>> (XEN)    0000000000000008 ffff830215860000 00007cfdea7983b7 ffff82c4c023020d
>> (XEN) Xen call trace:
>> (XEN)    [<ffff82c4c01642cc>] __sync_local_execstate+0x63/0x86
>> (XEN)    [<ffff82c4c01860eb>] invalidate_interrupt+0x33/0x86
>> (XEN)    [<ffff82c4c017019c>] do_IRQ+0x9e/0x68d
>> (XEN)    [<ffff82c4c0167bef>] common_interrupt+0x5f/0x70
>> (XEN)    [<ffff82c4c0185be0>] machine_restart+0x39/0x208
>> (XEN)    [<ffff82c4c0143670>] panic+0x120/0x129
>> (XEN)    [<ffff82c4c018dde0>] do_invalid_op+0x3d7/0x45b
>> (XEN)    [<ffff82c4c023020d>] handle_exception_saved+0x2e/0x6c
>> (XEN)    [<ffff82c4c01a4f65>] xsave+0x178/0x2c7
>> (XEN)    [<ffff82c4c0167a1a>] vcpu_save_fpu+0x83/0x159
>> (XEN)    [<ffff82c4c0160c80>] __context_switch+0x16e/0x50b
>> (XEN)    [<ffff82c4c01634bc>] context_switch+0x1ba/0xf4e
>> (XEN)    [<ffff82c4c0124364>] schedule+0x60b/0x61a
>> (XEN)    [<ffff82c4c012710e>] __do_softirq+0x8e/0x99
>> (XEN)    [<ffff82c4c0127174>] do_softirq+0x13/0x15
>> (XEN)    [<ffff82c4c0160aba>] idle_loop+0x68/0x6a
>> (XEN)
>> (XEN)
>> (XEN) ****************************************
>> (XEN) Panic on CPU 1:
>> (XEN) Assertion 'current == idle_vcpu[smp_processor_id()]' failed at
>> domain.c:1494
>> (XEN) ****************************************
>> (XEN)
>> (XEN) Reboot in five seconds...
>> (XEN) Resetting with ACPI MEMORY or I/O RESET_REG.
>>
>>
>> Ben
>>
>> _______________________________________________
>> Xen-devel mailing list
>> Xen-devel@xxxxxxxxxxxxx
>> http://lists.xen.org/xen-devel
>
>

_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxx
http://lists.xen.org/xen-devel


 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.