[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Xen-users] Xen failing to boot under Debian Etch. (FATAL TRAP)



Hi,
        Running xen on a Debian Etch platform and about every 1 in 10 boots I
am getting the following error:
(XEN) ************************************
(XEN) CPU0 FATAL TRAP 6 (invalid opcode), ERROR_CODE 0000, IN INTERRUPT
CONTEXT.
(XEN) System shutting down -- need manual reset.
(XEN) ************************************

(I have attached a file with the full output also of an OK boot and a
failed one also).

This is a Quad-Core, Dual CPU system currently running with 4Gb of RAM
with a Intel S5000PAL server motherboard. I have tried tweaking around
the APIC options to be bigsmp to no help at all. I am kind of at a loss
right now as to how to progress. I have upgraded the BIOS to the latest
revision to no avail also.

Here are the xen package versions from my system:
# dpkg -l | egrep '(\||xen)'
| Status=Not/Installed/Config-files/Unpacked/Failed-config/Half-installed
|/ Err?=(none)/Hold/Reinst-required/X=both-problems (Status,Err:
uppercase=bad)
||/ Name                                Version
Description
ii  linux-image-2.6-xen-amd64           2.6.18+6etch2
Linux kernel 2.6 image on AMD64
ii  linux-image-2.6.18-4-xen-amd64      2.6.18.dfsg.1-12etch2
Linux 2.6.18 image on AMD64
ii  linux-image-2.6.18-5-xen-amd64      2.6.18.dfsg.1-13etch2
Linux 2.6.18 image on AMD64
ii  linux-modules-2.6.18-4-xen-amd64    2.6.18.dfsg.1-12etch2
Linux 2.6.18 modules on AMD64
ii  linux-modules-2.6.18-5-xen-amd64    2.6.18.dfsg.1-13etch2
Linux 2.6.18 modules on AMD64
ii  xen-hypervisor-3.0.3-1-amd64        3.0.3-0-2
The Xen Hypervisor on AMD64
ii  xen-ioemu-3.0.3-1                   3.0.3-0-2
XEN administrative tools
ii  xen-linux-system-2.6.18-4-xen-amd64 2.6.18.dfsg.1-12etch2
XEN system with Linux 2.6.18 image on AMD64
ii  xen-tools                           2.8-2
Tools to manage debian XEN virtual servers
ii  xen-utils-3.0.3-1                   3.0.3-0-2
XEN administrative tools
ii  xen-utils-common                    3.0.3-0-2
XEN administrative tools - common files

And here is my grub boot section:
title           Xen 3.0.3-1-amd64 / Debian GNU/Linux, kernel
2.6.18-5-xen-amd64
root            (hd0,0)
kernel          /xen-3.0.3-1-amd64.gz com2=19200,8n1 console=com2
module          /vmlinuz-2.6.18-5-xen-amd64 root=/dev/mapper/hornet-root
ro console=ttyS1,19200 console=tty0
module          /initrd.img-2.6.18-5-xen-amd64

If I can help with any further information please let me know. Anyone
any ideas on how to fix this problem?

(I will start running the Intel Platform Test Suite to check its not the
hardware, but I doubt it is since this happens the same on two identical
boxes)

-- 
James Ray.                          <j.ray@xxxxxxxxxx>
Computing Services
Queen Mary, University of London
(XEN) Command line: /xen-3.0.3-1-amd64.gz com2=19200,8n1 console=com2 
noirqbalance watchdog (XEN) Physical RAM map:ry will be booted automatically in 
1 seconds. (XEN)  0000000000000000 - 000000000009fc00 (usable)
(XEN)  000000000009fc00 - 0000000000100000 (reserved)
(XEN)  0000000000100000 - 000000009e4a1000 (usable)
(XEN)  000000009e4a1000 - 000000009e562000 (ACPI NVS)
(XEN)  000000009e562000 - 000000009fa42000 (usable)
(XEN)  000000009fa42000 - 000000009fa9a000 (reserved)
(XEN)  000000009fa9a000 - 000000009fab1000 (usable)
(XEN)  000000009fab1000 - 000000009fb1a000 (ACPI NVS)
(XEN)  000000009fb1a000 - 000000009fb2a000 (usable)
(XEN)  000000009fb2a000 - 000000009fb3a000 (ACPI data)
(XEN)  000000009fb3a000 - 000000009fc00000 (usable)
(XEN)  000000009fc00000 - 00000000b0000000 (reserved)
(XEN)  00000000ffc00000 - 00000000ffc0c000 (reserved)
(XEN)  0000000100000000 - 0000000160000000 (usable)
(XEN) System RAM: 4090MB (4188212kB)
(XEN) Xen heap: 13MB (14304kB)
(XEN) found SMP MP-table at 000fd1b0
(XEN) DMI 2.5 present.
(XEN) Using APIC driver default
(XEN) ACPI: RSDP (v002 INTEL                                 ) @ 
0x00000000000f03c0
(XEN) ACPI: XSDT (v001 INTEL  S5000PAL 0x00000000 INTL 0x01000013) @ 
0x000000009fb39120
(XEN) ACPI: FADT (v003 INTEL  S5000PAL 0x00000000 INTL 0x01000013) @ 
0x000000009fb37000
(XEN) ACPI: MADT (v001 INTEL  S5000PAL 0x00000000 INTL 0x01000013) @ 
0x000000009fb36000
(XEN) ACPI: SPCR (v001 INTEL  S5000PAL 0x00000000 INTL 0x01000013) @ 
0x000000009fb2f000
(XEN) ACPI: HPET (v001 INTEL  S5000PAL 0x00000001 INTL 0x01000013) @ 
0x000000009fb2e000
(XEN) ACPI: MCFG (v001 INTEL  S5000PAL 0x00000001 INTL 0x01000013) @ 
0x000000009fb2d000
(XEN) ACPI: OEM1 (v001 INTEL  S5000PAL 0x00000001 INTL 0x01000013) @ 
0x000000009fb2c000
(XEN) ACPI: SSDT (v002 INTEL  EIST     0x00004000 INTL 0x01000013) @ 
0x000000009fb2b000
(XEN) ACPI: SSDT (v002 INTEL  IPMI     0x00004000 INTL 0x01000013) @ 
0x000000009fb2a000
(XEN) ACPI: DSDT (v002 INTEL  S5000PAL 0x00000008 INTL 0x01000013) @ 
0x0000000000000000
(XEN) ACPI: Local APIC address 0xfee00000
(XEN) ACPI: LAPIC (acpi_id[0x00] lapic_id[0x00] enabled)
(XEN) Processor #0 6:15 APIC version 20
(XEN) ACPI: LAPIC (acpi_id[0x01] lapic_id[0x02] enabled)
(XEN) Processor #2 6:15 APIC version 20
(XEN) ACPI: LAPIC (acpi_id[0x02] lapic_id[0x04] enabled)
(XEN) Processor #4 6:15 APIC version 20
(XEN) ACPI: LAPIC (acpi_id[0x03] lapic_id[0x06] enabled)
(XEN) Processor #6 6:15 APIC version 20
(XEN) ACPI: LAPIC (acpi_id[0x04] lapic_id[0x01] enabled)
(XEN) Processor #1 6:15 APIC version 20
(XEN) ACPI: LAPIC (acpi_id[0x05] lapic_id[0x03] enabled)
(XEN) Processor #3 6:15 APIC version 20
(XEN) ACPI: LAPIC (acpi_id[0x06] lapic_id[0x05] enabled)
(XEN) Processor #5 6:15 APIC version 20
(XEN) ACPI: LAPIC (acpi_id[0x07] lapic_id[0x07] enabled)
(XEN) Processor #7 6:15 APIC version 20
(XEN) ACPI: LAPIC_NMI (acpi_id[0x00] high level lint[0x1])
(XEN) ACPI: LAPIC_NMI (acpi_id[0x01] high level lint[0x1])
(XEN) ACPI: LAPIC_NMI (acpi_id[0x02] high level lint[0x1])
(XEN) ACPI: LAPIC_NMI (acpi_id[0x03] high level lint[0x1])
(XEN) ACPI: LAPIC_NMI (acpi_id[0x04] high level lint[0x1])
(XEN) ACPI: LAPIC_NMI (acpi_id[0x05] high level lint[0x1])
(XEN) ACPI: LAPIC_NMI (acpi_id[0x06] high level lint[0x1])
(XEN) ACPI: LAPIC_NMI (acpi_id[0x07] high level lint[0x1])
(XEN) ACPI: IOAPIC (id[0x08] address[0xfec00000] gsi_base[0])
(XEN) IOAPIC[0]: apic_id 8, version 32, address 0xfec00000, GSI 0-23
(XEN) ACPI: IOAPIC (id[0x09] address[0xfec80000] gsi_base[24])
(XEN) IOAPIC[1]: apic_id 9, version 32, address 0xfec80000, GSI 24-47
(XEN) ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
(XEN) ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level)
(XEN) ACPI: IRQ0 used by override.
(XEN) ACPI: IRQ2 used by override.
(XEN) ACPI: IRQ9 used by override.
(XEN) Enabling APIC mode:  Flat.  Using 2 I/O APICs
(XEN) ACPI: HPET id: 0x8086a201 base: 0xfed00000
(XEN) Using ACPI (MADT) for SMP configuration information
(XEN) Using scheduler: SMP Credit Scheduler (credit)
(XEN) Initializing CPU#0
(XEN) Detected 1861.936 MHz processor.
(XEN) CPU: L1 I cache: 32K, L1 D cache: 32K
(XEN) CPU: Physical Processor ID: 0
(XEN) CPU: Processor Core ID: 0
(XEN) VMXON is done
(XEN) Intel machine check architecture supported.
(XEN) Intel machine check reporting enabled on CPU#0.
(XEN) CPU0: Intel(R) Xeon(R) CPU           E5320  @ 1.86GHz stepping 07
(XEN) Booting processor 1/2 eip 90000
(XEN) Initializing CPU#1
(XEN) CPU: L1 I cache: 32K, L1 D cache: 32K
(XEN) CPU: Physical Processor ID: 0
(XEN) CPU: Processor Core ID: 2
(XEN) VMXON is done
(XEN) Intel machine check architecture supported.
(XEN) Intel machine check reporting enabled on CPU#1.
(XEN) CPU1: Intel(R) Xeon(R) CPU           E5320  @ 1.86GHz stepping 07
(XEN) Booting processor 2/4 eip 90000
(XEN) Initializing CPU#2
(XEN) CPU: L1 I cache: 32K, L1 D cache: 32K
(XEN) CPU: Physical Processor ID: 1
(XEN) CPU: Processor Core ID: 0
(XEN) VMXON is done
(XEN) Intel machine check architecture supported.
(XEN) Intel machine check reporting enabled on CPU#2.
(XEN) CPU2: Intel(R) Xeon(R) CPU           E5320  @ 1.86GHz stepping 07
(XEN) Booting processor 3/6 eip 90000
(XEN) Not responding.
(XEN) Inquiring remote APIC #6...
(XEN) ... APIC #6 ID: failed
(XEN) ... APIC #6 VERSION: failed
(XEN) ... APIC #6 SPIV: failed
(XEN) CPU #6 not responding - cannot use it.
(XEN) BUG at domain.c:78
(XEN) ----[ Xen-3.0.3-1  x86_64  debug=n  Not tainted ]----
(XEN) CPU:    0
(XEN) RIP:    e010:[<ffff8300001180f4>] __bug+0x24/0x30
(XEN) RFLAGS: 0000000000010086   CONTEXT: hypervisor
(XEN) rax: 0000000000000000   rbx: 000000000000004e   rcx: 00000000000015b6
(XEN) rdx: 000000000000000a   rsi: 000000000000000a   rdi: ffff8300001bf013
(XEN) rbp: ffff830000ffb080   rsp: ffff8300001b7d78   r8:  0000000000000000
(XEN) r9:  0000000000000000   r10: 00000000fffffffb   r11: ffff8300001186b0
(XEN) r12: ffff830000172cf5   r13: 0000000000000003   r14: 0000000000000003
(XEN) r15: 0000000000000003   cr0: 000000008005003b   cr4: 00000000000026f0
(XEN) cr3: 0000000000102000   cr2: 0000000000000000
(XEN) ds: 0000   es: 0000   fs: 0000   gs: 0000   ss: 0000   cs: e010
(XEN) Xen stack trace from rsp=ffff8300001b7d78:
(XEN)    ffff830000ffb080 0000000000000003 0000000000000003 ffff830000106f15
(XEN)    ffff8300001b7da8 0001000000000005 0000000000000003 0000000000000003
(XEN)    0000000000000001 0000000000000004 ffff83000017f9a4 ffff830000107369
(XEN)    0000000000000006 0000000000000000 0000000000000003 ffff830000169f49
(XEN)    ffff8300001b7e48 0000000000000006 ffff8300001b7e68 0000000300000020
(XEN)    0000000306000000 0000000000090000 000000000000000e 0000000000000092
(XEN)    000000000020a000 0000000500000690 ffff83000017656a ffff830000175506
(XEN)    ffff83000017550e ffff83000017fa80 0000000300000002 ffff83000000000f
(XEN)    ffff83000002dd40 ffff830000ffb080 ffff830000191000 000000000000000e
(XEN)    ffff8300000229e0 ffff83000002dd40 0000000000000002 ffff830000168f0c
(XEN)    0000000800000000 000000010000006e 0000000000000003 00000000000002f8
(XEN)    0000000000000000 000000000000000e 0000000000000000 0000000000000000
(XEN)    0000000000067eac 0000000000000000 0000000000000000 0000000000000000
(XEN)    0000000000000000 ffff8300001001c1 0000000000000000 0000000000000000
(XEN)    0000000000000000 0000000000000000 0000000000000000 0000000000000000
(XEN)    0000000000000000 0000000000000000 0000000000000000 0000000000000000
(XEN)    0000000000000000 0000000000000000 0000000000000000 0000000000000000
(XEN)    0000000000000000 0000000000000000 0000000000000000 0000000000000000
(XEN)    0000000000000000 0000000000000000 0000000000000000 0000000000000000
(XEN)    0000000000000000 0000000000000000 0000000000000000 0000000000000000
(XEN) Xen call trace:
(XEN)    [<ffff8300001180f4>] __bug+0x24/0x30
(XEN)    [<ffff830000106f15>] alloc_vcpu+0x45/0x130
(XEN)    [<ffff830000107369>] alloc_idle_vcpu+0x59/0x80
(XEN)    [<ffff830000169f49>] smp_prepare_cpus+0x4a9/0xd10
(XEN)    [<ffff830000168f0c>] __start_xen+0x75c/0xac0
(XEN)    [<ffff8300001001c1>] __high_start+0x94/0x96
(XEN)
(XEN) ************************************
(XEN) CPU0 FATAL TRAP 6 (invalid opcode), ERROR_CODE 0000, IN INTERRUPT CONTEXT.
(XEN) System shutting down -- need manual reset.
(XEN) ************************************
(XEN) System RAM: 4090MB (4188208kB)
(XEN) Xen heap: 13MB (14304kB)
(XEN) found SMP MP-table at 000fd1b0
(XEN) DMI 2.5 present.
(XEN) Using APIC driver default
(XEN) ACPI: RSDP (v002 INTEL                                 ) @ 
0x00000000000f03c0
(XEN) ACPI: XSDT (v001 INTEL  S5000PAL 0x00000000 INTL 0x01000013) @ 
0x000000009fb39120
(XEN) ACPI: FADT (v003 INTEL  S5000PAL 0x00000000 INTL 0x01000013) @ 
0x000000009fb37000
(XEN) ACPI: MADT (v001 INTEL  S5000PAL 0x00000000 INTL 0x01000013) @ 
0x000000009fb36000
(XEN) ACPI: SPCR (v001 INTEL  S5000PAL 0x00000000 INTL 0x01000013) @ 
0x000000009fb2f000
(XEN) ACPI: HPET (v001 INTEL  S5000PAL 0x00000001 INTL 0x01000013) @ 
0x000000009fb2e000
(XEN) ACPI: MCFG (v001 INTEL  S5000PAL 0x00000001 INTL 0x01000013) @ 
0x000000009fb2d000
(XEN) ACPI: OEM1 (v001 INTEL  S5000PAL 0x00000001 INTL 0x01000013) @ 
0x000000009fb2c000
(XEN) ACPI: SSDT (v002 INTEL  EIST     0x00004000 INTL 0x01000013) @ 
0x000000009fb2b000
(XEN) ACPI: SSDT (v002 INTEL  IPMI     0x00004000 INTL 0x01000013) @ 
0x000000009fb2a000
(XEN) ACPI: DSDT (v002 INTEL  S5000PAL 0x00000008 INTL 0x01000013) @ 
0x0000000000000000
(XEN) ACPI: Local APIC address 0xfee00000
(XEN) ACPI: LAPIC (acpi_id[0x00] lapic_id[0x00] enabled)
(XEN) Processor #0 6:15 APIC version 20
(XEN) ACPI: LAPIC (acpi_id[0x01] lapic_id[0x02] enabled)
(XEN) Processor #2 6:15 APIC version 20
(XEN) ACPI: LAPIC (acpi_id[0x02] lapic_id[0x04] enabled)
(XEN) Processor #4 6:15 APIC version 20
(XEN) ACPI: LAPIC (acpi_id[0x03] lapic_id[0x06] enabled)
(XEN) Processor #6 6:15 APIC version 20
(XEN) ACPI: LAPIC (acpi_id[0x04] lapic_id[0x01] enabled)
(XEN) Processor #1 6:15 APIC version 20
(XEN) ACPI: LAPIC (acpi_id[0x05] lapic_id[0x03] enabled)
(XEN) Processor #3 6:15 APIC version 20
(XEN) ACPI: LAPIC (acpi_id[0x06] lapic_id[0x05] enabled)
(XEN) Processor #5 6:15 APIC version 20
(XEN) ACPI: LAPIC (acpi_id[0x07] lapic_id[0x07] enabled)
(XEN) Processor #7 6:15 APIC version 20
(XEN) ACPI: LAPIC_NMI (acpi_id[0x00] high level lint[0x1])
(XEN) ACPI: LAPIC_NMI (acpi_id[0x01] high level lint[0x1])
(XEN) ACPI: LAPIC_NMI (acpi_id[0x02] high level lint[0x1])
(XEN) ACPI: LAPIC_NMI (acpi_id[0x03] high level lint[0x1])
(XEN) ACPI: LAPIC_NMI (acpi_id[0x04] high level lint[0x1])
(XEN) ACPI: LAPIC_NMI (acpi_id[0x05] high level lint[0x1])
(XEN) ACPI: LAPIC_NMI (acpi_id[0x06] high level lint[0x1])
(XEN) ACPI: LAPIC_NMI (acpi_id[0x07] high level lint[0x1])
(XEN) ACPI: IOAPIC (id[0x08] address[0xfec00000] gsi_base[0])
(XEN) IOAPIC[0]: apic_id 8, version 32, address 0xfec00000, GSI 0-23
(XEN) ACPI: IOAPIC (id[0x09] address[0xfec80000] gsi_base[24])
(XEN) IOAPIC[1]: apic_id 9, version 32, address 0xfec80000, GSI 24-47
(XEN) ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
(XEN) ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level)
(XEN) ACPI: IRQ0 used by override.
(XEN) ACPI: IRQ2 used by override.
(XEN) ACPI: IRQ9 used by override.
(XEN) Enabling APIC mode:  Flat.  Using 2 I/O APICs
(XEN) ACPI: HPET id: 0x8086a201 base: 0xfed00000
(XEN) Using ACPI (MADT) for SMP configuration information
(XEN) Using scheduler: SMP Credit Scheduler (credit)
(XEN) Initializing CPU#0
(XEN) Detected 1861.960 MHz processor.
(XEN) CPU: L1 I cache: 32K, L1 D cache: 32K
(XEN) CPU: Physical Processor ID: 0
(XEN) CPU: Processor Core ID: 0
(XEN) VMXON is done
(XEN) Intel machine check architecture supported.
(XEN) Intel machine check reporting enabled on CPU#0.
(XEN) CPU0: Intel(R) Xeon(R) CPU           E5320  @ 1.86GHz stepping 07
(XEN) Booting processor 1/2 eip 90000
(XEN) Initializing CPU#1
(XEN) CPU: L1 I cache: 32K, L1 D cache: 32K
(XEN) CPU: Physical Processor ID: 0
(XEN) CPU: Processor Core ID: 2
(XEN) VMXON is done
(XEN) Intel machine check architecture supported.
(XEN) Intel machine check reporting enabled on CPU#1.
(XEN) CPU1: Intel(R) Xeon(R) CPU           E5320  @ 1.86GHz stepping 07
(XEN) Booting processor 2/4 eip 90000
(XEN) Initializing CPU#2
(XEN) CPU: L1 I cache: 32K, L1 D cache: 32K
(XEN) CPU: Physical Processor ID: 1
(XEN) CPU: Processor Core ID: 0
(XEN) VMXON is done
(XEN) Intel machine check architecture supported.
(XEN) Intel machine check reporting enabled on CPU#2.
(XEN) CPU2: Intel(R) Xeon(R) CPU           E5320  @ 1.86GHz stepping 07
(XEN) Booting processor 3/6 eip 90000
(XEN) Initializing CPU#3
(XEN) CPU: L1 I cache: 32K, L1 D cache: 32K
(XEN) CPU: Physical Processor ID: 1
(XEN) CPU: Processor Core ID: 2
(XEN) VMXON is done
(XEN) Intel machine check architecture supported.
(XEN) Intel machine check reporting enabled on CPU#3.
(XEN) CPU3: Intel(R) Xeon(R) CPU           E5320  @ 1.86GHz stepping 07
(XEN) Booting processor 4/1 eip 90000
(XEN) Initializing CPU#4
(XEN) CPU: L1 I cache: 32K, L1 D cache: 32K
(XEN) CPU: Physical Processor ID: 0
(XEN) CPU: Processor Core ID: 1
(XEN) VMXON is done
(XEN) Intel machine check architecture supported.
(XEN) Intel machine check reporting enabled on CPU#4.
(XEN) CPU4: Intel(R) Xeon(R) CPU           E5320  @ 1.86GHz stepping 07
(XEN) Booting processor 5/3 eip 90000
(XEN) Initializing CPU#5
(XEN) CPU: L1 I cache: 32K, L1 D cache: 32K
(XEN) CPU: Physical Processor ID: 0
(XEN) CPU: Processor Core ID: 3
(XEN) VMXON is done
(XEN) Intel machine check architecture supported.
(XEN) Intel machine check reporting enabled on CPU#5.
(XEN) CPU5: Intel(R) Xeon(R) CPU           E5320  @ 1.86GHz stepping 07
(XEN) Booting processor 6/5 eip 90000
(XEN) Initializing CPU#6
(XEN) CPU: L1 I cache: 32K, L1 D cache: 32K
(XEN) CPU: Physical Processor ID: 1
(XEN) CPU: Processor Core ID: 1
(XEN) VMXON is done
(XEN) Intel machine check architecture supported.
(XEN) Intel machine check reporting enabled on CPU#6.
(XEN) CPU6: Intel(R) Xeon(R) CPU           E5320  @ 1.86GHz stepping 07
(XEN) Booting processor 7/7 eip 90000
(XEN) Initializing CPU#7
(XEN) CPU: L1 I cache: 32K, L1 D cache: 32K
(XEN) CPU: Physical Processor ID: 1
(XEN) CPU: Processor Core ID: 3
(XEN) VMXON is done
(XEN) Intel machine check architecture supported.
(XEN) Intel machine check reporting enabled on CPU#7.
(XEN) CPU7: Intel(R) Xeon(R) CPU           E5320  @ 1.86GHz stepping 07
(XEN) Total of 8 processors activated.
(XEN) ENABLING IO-APIC IRQs
(XEN)  -> Using new ACK method
(XEN) ..TIMER: vector=0xF0 apic1=0 pin1=2 apic2=-1 pin2=-1
(XEN) checking TSC synchronization across 8 CPUs: passed.
(XEN) Platform timer is 14.318MHz HPET
(XEN) Brought up 8 CPUs
(XEN) Machine check exception polling timer started.
_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users

 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.