[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-devel] [PATCH] xen: point xen_start_info to a dummy struct for PV on HVM guests



On Wed, 2012-10-03 at 15:11 +0100, Konrad Rzeszutek Wilk wrote:
> On Wed, Oct 03, 2012 at 02:54:42PM +0100, Ian Campbell wrote:
> > On Wed, 2012-10-03 at 14:51 +0100, Stefano Stabellini wrote:
> > > On Wed, 3 Oct 2012, Ian Campbell wrote:
> > > > On Wed, 2012-10-03 at 14:37 +0100, Stefano Stabellini wrote:
> > > > > PV on HVM guests don't have a start_info page mapped by Xen, so
> > > > > xen_start_info is just NULL for them.
> > > > > That is problem because other parts of the code expect xen_start_info 
> > > > > to
> > > > > point to something valid, for example xen_initial_domain() is defined 
> > > > > as
> > > > > follow:
> > > > > 
> > > > > #define xen_initial_domain()    (xen_domain() && \
> > > > >                  xen_start_info->flags & SIF_INITDOMAIN)
> > > > 
> > > > But anyone who calls this before xen_start_info is setup is going to get
> > > > a bogus result, specifically in this case they will think they are domU
> > > > when in reality they are dom0 -- wouldn't it be better to fix those
> > > > callsites?
> > > 
> > > That cannot be the case because setting up xen_start_info is the very
> > > first thing that is done, before even calling to C.
> > 
> > On PV, yes, but you are trying to fix PVHVM here, no?
> > 
> > Otherwise if this is always set before calling into C then what is the
> > purpose of this patch?
> 
> to fix this - as PVHVM has it set to NULL and we end up de-referencing
> the xen_start_info and crashing. As so::
> 

Right, so returning to my original point: The caller here is calling
xen_initial_domain() *before* start info is setup. This is bogus and is
your actual bug, all this patch does is hide that real issue.

With this "fix" the caller of xen_initial_domain shown in this trace now
gets a rubbish result based on the content of a dummy shared info
instead of the real answer from that actual shared info.

The right fix is to fix the caller to not call xen_initial_domain()
until after the shared info has been setup. Maybe that means moving
shinfo setup earlier, or maybe it means deferring this call until later
in the PVHVM case.

> 
> Decompressing Linux... Parsing ELF... done.
> Booting the kernel.
> [    0.000000] Initializing cgroup subsys cpuset
> [    0.000000] Initializing cgroup subsys cpu
> [    0.000000] Linux version 3.6.0upstream-04121-g0313983 
> (konrad@xxxxxxxxxxxxxxxxxx) (gcc version 4.4.4 20100503 (Red Hat 4.4.4-2) 
> (GCC) ) #1 SMP Tue Oct 2 16:31:21 EDT 2012
> [    0.000000] Command line: initrd=initramf.gz  console=ttyS0,115200 
> test=net nofb earlyprintk=serial,ttyS0,115200 BOOT_IMAGE=vmlinuz 
> [    0.000000] e820: BIOS-provided physical RAM map:
> [    0.000000] BIOS-e820: [mem 0x0000000000000000-0x000000000009dfff] usable
> [    0.000000] BIOS-e820: [mem 0x000000000009e000-0x000000000009ffff] reserved
> [    0.000000] BIOS-e820: [mem 0x00000000000e0000-0x00000000000fffff] reserved
> [    0.000000] BIOS-e820: [mem 0x0000000000100000-0x000000007fffffff] usable
> [    0.000000] BIOS-e820: [mem 0x00000000fc000000-0x00000000ffffffff] reserved
> [    0.000000] bootconsole [earlyser0] enabled
> [    0.000000] NX (Execute Disable) protection: active
> [    0.000000] DMI 2.4 present.
> [    0.000000] Hypervisor detected: Xen HVM
> [    0.000000] Xen version 4.1.
> [    0.000000] Netfront and the Xen platform PCI driver have been compiled 
> for this kernel: unplug emulated NICs.
> [    0.000000] Blkfront and the Xen platform PCI driver have been compiled 
> for this kernel: unplug emulated disks.
> [    0.000000] You might have to change the root device
> [    0.000000] from /dev/hd[a-d] to /dev/xvd[a-d]
> [    0.000000] in your root= kernel command line option
> [    0.000000] No AGP bridge found
> [    0.000000] e820: last_pfn = 0x80000 max_arch_pfn = 0x400000000
> [    0.000000] x86 PAT enabled: cpu 0, old 0x7040600070406, new 
> 0x7010600070106
> [    0.000000] found SMP MP-table at [mem 0x000fbc90-0x000fbc9f] mapped at 
> [ffff8800000fbc90]
> [    0.000000] init_memory_mapping: [mem 0x00000000-0x7fffffff]
> [    0.000000] RAMDISK: [mem 0x7abeb000-0x7ffdefff]
> [    0.000000] ACPI: RSDP 00000000000ea020 00024 (v02    Xen)
> [    0.000000] ACPI: XSDT 00000000fc00f2b0 00034 (v01    Xen      HVM 
> 00000000 HVML 00000000)
> [    0.000000] ACPI: FACP 00000000fc00f0d0 000F4 (v04    Xen      HVM 
> 00000000 HVML 00000000)
> [    0.000000] ACPI: DSDT 00000000fc003440 0BC09 (v02    Xen      HVM 
> 00000000 INTL 20100528)
> [    0.000000] ACPI: FACS 00000000fc003400 00040
> [    0.000000] ACPI: APIC 00000000fc00f1d0 000D8 (v02    Xen      HVM 
> 00000000 HVML 00000000)
> [    0.000000] No NUMA configuration found
> [    0.000000] Faking a node at [mem 0x0000000000000000-0x000000007fffffff]
> [    0.000000] Initmem setup node 0 [mem 0x00000000-0x7fffffff]
> [    0.000000]   NODE_DATA [mem 0x7fffc000-0x7fffffff]
> [    0.000000] Zone ranges:
> [    0.000000]   DMA      [mem 0x00010000-0x00ffffff]
> [    0.000000]   DMA32    [mem 0x01000000-0xffffffff]
> [    0.000000]   Normal   empty
> [    0.000000] Movable zone start for each node
> [    0.000000] Early memory node ranges
> [    0.000000]   node   0: [mem 0x00010000-0x0009dfff]
> [    0.000000]   node   0: [mem 0x00100000-0x7fffffff]
> [    0.000000] ACPI: PM-Timer IO Port: 0xb008
> [    0.000000] ACPI: LAPIC (acpi_id[0x00] lapic_id[0x00] enabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x01] lapic_id[0x02] enabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x02] lapic_id[0x04] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x03] lapic_id[0x06] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x04] lapic_id[0x08] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x05] lapic_id[0x0a] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x06] lapic_id[0x0c] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x07] lapic_id[0x0e] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x08] lapic_id[0x10] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x09] lapic_id[0x12] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x0a] lapic_id[0x14] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x0b] lapic_id[0x16] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x0c] lapic_id[0x18] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x0d] lapic_id[0x1a] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x0e] lapic_id[0x1c] disabled)
> [    0.000000] ACPI: IOAPIC (id[0x01] address[0xfec00000] gsi_base[0])
> [    0.000000] IOAPIC[0]: apic_id 1, version 17, address 0xfec00000, GSI 0-47
> [    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
> [    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 5 global_irq 5 low level)
> [    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 10 global_irq 10 low level)
> [    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 11 global_irq 11 low level)
> [    0.000000] Using ACPI (MADT) for SMP configuration information
> [    0.000000] smpboot: Allowing 15 CPUs, 13 hotplug CPUs
> [    0.000000] PM: Registered nosave memory: 000000000009e000 - 
> 00000000000a0000
> [    0.000000] PM: Registered nosave memory: 00000000000a0000 - 
> 00000000000e0000
> [    0.000000] PM: Registered nosave memory: 00000000000e0000 - 
> 0000000000100000
> [    0.000000] e820: [mem 0x80000000-0xfbffffff] available for PCI devices
> [    0.000000] Booting paravirtualized kernel on Xen HVM
> [    0.000000] setup_percpu: NR_CPUS:512 nr_cpumask_bits:512 nr_cpu_ids:15 
> nr_node_ids:1
> [    0.000000] PERCPU: Embedded 28 pages/cpu @ffff88007a800000 s84352 r8192 
> d22144 u131072
> [    0.000000] Built 1 zonelists in Node order, mobility grouping on.  Total 
> pages: 517000
> [    0.000000] Policy zone: DMA32
> [    0.000000] Kernel command line: initrd=initramf.gz  console=ttyS0,115200 
> test=net nofb earlyprintk=serial,ttyS0,115200 BOOT_IMAGE=vmlinuz 
> [    0.000000] PID hash table entries: 4096 (order: 3, 32768 bytes)
> [    0.000000] __ex_table already sorted, skipping sort
> [    0.000000] Checking aperture...
> [    0.000000] No AGP bridge found
> [    0.000000] Memory: 1967336k/2097152k available (6368k kernel code, 456k 
> absent, 129360k reserved, 4525k data, 752k init)
> [    0.000000] Hierarchical RCU implementation.
> [    0.000000]        RCU restricting CPUs from NR_CPUS=512 to nr_cpu_ids=15.
> [    0.000000] NR_IRQS:33024 nr_irqs:1208 16
> [    0.000000] Xen HVM callback vector for event delivery is enabled
> [    0.000000] Console: colour VGA+ 80x25
> [    0.000000] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000030
> [    0.000000] IP: [<ffffffff813ab3be>] xen_cons_init+0x1e/0x60
> [    0.000000] PGD 0 
> [    0.000000] Oops: 0000 [#1] SMP 
> [    0.000000] Modules linked in:
> [    0.000000] CPU 0 
> [    0.000000] Pid: 0, comm: swapper/0 Not tainted 
> 3.6.0upstream-04121-g0313983 #1 Xen HVM domU
> [    0.000000] RIP: 0010:[<ffffffff813ab3be>]  [<ffffffff813ab3be>] 
> xen_cons_init+0x1e/0x60
> [    0.000000] RSP: 0000:ffffffff81a01ef8  EFLAGS: 00010202
> [    0.000000] RAX: 0000000000000000 RBX: ffffffff81b3be60 RCX: 
> 0000000000000002
> [    0.000000] RDX: ffffffff81a59c40 RSI: ffffffff81a59b01 RDI: 
> ffffffff81ba7e81
> [    0.000000] RBP: ffffffff81a01ef8 R08: 00000000000003fd R09: 
> 0000000000000020
> [    0.000000] R10: 0000000000000000 R11: 000000000000000d R12: 
> ffffffff81b008e0
> [    0.000000] R13: ffffffff81b092e0 R14: 0000000000000000 R15: 
> 0000000000026bf0
> [    0.000000] FS:  0000000000000000(0000) GS:ffff88007a800000(0000) 
> knlGS:0000000000000000
> [    0.000000] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
> [    0.000000] CR2: 0000000000000030 CR3: 0000000001a0b000 CR4: 
> 00000000000006b0
> [    0.000000] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 
> 0000000000000000
> [    0.000000] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 
> 0000000000000400
> [    0.000000] Process swapper/0 (pid: 0, threadinfo ffffffff81a00000, task 
> ffffffff81a13420)
> [    0.000000] Stack:
> [    0.000000]  ffffffff81a01f18 ffffffff81aeb9fb ffffffff81b008e0 
> ffffffffffffffff
> [    0.000000]  ffffffff81a01f68 ffffffff81abac39 ffffffff81aba80d 
> 0000000000026bf0
> [    0.000000]  ffffffff81a01f58 ffffffff81b092e0 0000000001000000 
> 0000000001c72000
> [    0.000000] Call Trace:
> [    0.000000]  [<ffffffff81aeb9fb>] console_init+0x19/0x2a
> [    0.000000]  [<ffffffff81abac39>] start_kernel+0x24a/0x3a3
> [    0.000000]  [<ffffffff81aba80d>] ? kernel_init+0x1e8/0x1e8
> [    0.000000]  [<ffffffff81aba356>] x86_64_start_reservations+0x131/0x136
> [    0.000000]  [<ffffffff81aba45e>] x86_64_start_kernel+0x103/0x112
> [    0.000000] Code: 66 66 66 66 2e 0f 1f 84 00 00 00 00 00 8b 0d 5a 2e 7c 00 
> 55 31 c0 48 89 e5 85 c9 74 37 48 8b 05 51 2e 7c 00 48 c7 c2 40 9c a5 81 <f6> 
> 40 30 02 75 15 83 f9 02 74 27 e8 52 fc ff ff 85 c0 78 15 48 
> [    0.000000] RIP  [<ffffffff813ab3be>] xen_cons_init+0x1e/0x60
> [    0.000000]  RSP <ffffffff81a01ef8>
> [    0.000000] CR2: 0000000000000030
> [    0.000000] ---[ end trace 5cb378039a20e088 ]---
> [    0.000000] Kernel panic - not syncing: Attempted to kill the idle task!
> > 
> > > 
> > > 
> > > > Perhaps turn this into a static inline with a BUG_ON(!xen_start_info) to
> > > > make catching these cases easier?
> > > 
> > 



_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxx
http://lists.xen.org/xen-devel


 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.