[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-users] Debian Squeeze, xen, multipath and iscsi



On Fri, Feb 11, 2011 at 02:35:53PM +0100, Agustin Lopez wrote:
>
> Hi all!
>
> I want to update my Debian Lenny xen servers to Squeeze.
> I am testing with a new install. All installs Ok but when I install the  
> multipath
> package I get a kernel crash.
>
> I am searched a bit with google but I have not found any solution.
>
> Are there anybody in the list working with this configuration?
>
> PS: If I boot my server without Xen, with a standard kernel, multipath
> and the other is working right.
>

What exactly crashes? dom0 ? Do you get a kernel dump on the console?

I have pretty much the same setup here (iSCSI + multipath + Xen + 
Squeeze dom0 + Lenny/Etch PVM domUs) and I had some trouble with 
multipath and iSCSI beeing a little touchy.

Basically my dom0 kernel hates to have fast iSCSI logout/login 
sequences.

You'll have to give multipathd some time to cleanly remove
multipath devices before you do another login.

Otherwise I get stuff like this where kpartx (the thing that 
manages of device nodes for partitions) triggers some 
race condition:

Feb 10 06:46:43 xenhost03 kernel: [225060.039126] BUG: unable to handle kernel 
paging request at ffff88001558b010
Feb 10 06:46:43 xenhost03 kernel: [225060.039172] IP: [<ffffffff8100e428>] 
xen_set_pmd+0x15/0x2c
Feb 10 06:46:43 xenhost03 kernel: [225060.039210] PGD 1002067 PUD 1006067 PMD 
18a067 PTE 801000001558b065
Feb 10 06:46:43 xenhost03 kernel: [225060.039253] Oops: 0003 [#1] SMP
Feb 10 06:46:43 xenhost03 kernel: [225060.039284] last sysfs file: 
/sys/devices/virtual/block/dm-6/dm/suspended
Feb 10 06:46:43 xenhost03 kernel: [225060.039319] CPU 0
Feb 10 06:46:43 xenhost03 kernel: [225060.039344] Modules linked in: tun 
dm_round_robin crc32c xt_tcpudp nf_conntrack_ipv4 nf_defrag_ipv4 xt_state 
nf_conntrack xt_physdev iptable_filter ip_tables x_tables bridge stp xen_evtchn 
xenfs ib_iser rdma_cm ib_c
m iw_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi 
scsi_transport_iscsi dm_multipath scsi_dh loop snd_hda_intel snd_hda_codec 
snd_hwdep snd_pcm i915 drm_kms_helper drm snd_timer i2c_i801 evdev parport_pc 
psmouse serio_raw pcspkr i2c_
algo_bit parport i2c_core snd soundcore video output snd_page_alloc button 
processor acpi_processor ext3 jbd mbcache dm_mod sd_mod crc_t10dif usbhid hid 
uhci_hcd ata_generic ata_piix libata ehci_hcd scsi_mod e1000e usbcore nls_base 
thermal thermal_sys
 [last unloaded: scsi_wait_scan]
Feb 10 06:46:43 xenhost03 kernel: [225060.039851] Pid: 9259, comm: kpartx_id 
Not tainted 2.6.32-5-xen-amd64 #1 To Be Filled By O.E.M.
Feb 10 06:46:43 xenhost03 kernel: [225060.039904] RIP: 
e030:[<ffffffff8100e428>]  [<ffffffff8100e428>] xen_set_pmd+0x15/0x2c
Feb 10 06:46:43 xenhost03 kernel: [225060.039959] RSP: e02b:ffff880013ad3b18  
EFLAGS: 00010246
Feb 10 06:46:43 xenhost03 kernel: [225060.039990] RAX: 0000000000000000 RBX: 
ffff88001558b010 RCX: ffff880000000000
Feb 10 06:46:43 xenhost03 kernel: [225060.040006] RDX: ffffea0000000000 RSI: 
0000000001cc0000 RDI: ffff88001558b010
Feb 10 06:46:43 xenhost03 kernel: [225060.040006] RBP: 0000000000000000 R08: 
0000000001cc0000 R09: ffff880073c03100
Feb 10 06:46:43 xenhost03 kernel: [225060.040006] R10: 0000000000000000 R11: 
ffff88002ce3bd78 R12: 000000000061c000
Feb 10 06:46:43 xenhost03 kernel: [225060.040006] R13: 0000000000400000 R14: 
ffff88001558b010 R15: ffff88002e156000
Feb 10 06:46:43 xenhost03 kernel: [225060.040006] FS:  00007f5094f64700(0000) 
GS:ffff880003630000(0000) knlGS:0000000000000000
Feb 10 06:46:43 xenhost03 kernel: [225060.040006] CS:  e033 DS: 0000 ES: 0000 
CR0: 000000008005003b
Feb 10 06:46:43 xenhost03 kernel: [225060.040006] CR2: ffff88001558b010 CR3: 
0000000011a2b000 CR4: 0000000000002660
Feb 10 06:46:43 xenhost03 kernel: [225060.040006] DR0: 0000000000000000 DR1: 
0000000000000000 DR2: 0000000000000000
Feb 10 06:46:43 xenhost03 kernel: [225060.040006] DR3: 0000000000000000 DR6: 
00000000ffff0ff0 DR7: 0000000000000400
Feb 10 06:46:43 xenhost03 kernel: [225060.040006] Process kpartx_id (pid: 9259, 
threadinfo ffff880013ad2000, task ffff880002747810)
Feb 10 06:46:43 xenhost03 kernel: [225060.040006] Stack:
Feb 10 06:46:43 xenhost03 kernel: [225060.040006]  ffff880000000000 
0000000000600000 0000000000400000 ffffffff810cf886
Feb 10 06:46:43 xenhost03 kernel: [225060.040006] <0> ffff880013ad3fd8 
0000000017ab2067 ffff880002159180 0000000000000000
Feb 10 06:46:43 xenhost03 kernel: [225060.040636] <0> 0000000000000000 
000000000061bfff 000000000061bfff 0000000001c00000
Feb 10 06:46:43 xenhost03 kernel: [225060.040636] Call Trace:
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff810cf886>] ? 
free_pgd_range+0x226/0x3bf
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff810cfabb>] ? 
free_pgtables+0x9c/0xbd
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff810d129d>] ? 
exit_mmap+0xef/0x148
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff8104cb09>] ? 
mmput+0x3c/0xdf
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff810f44d6>] ? 
flush_old_exec+0x45c/0x548
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff811270d0>] ? 
load_elf_binary+0x0/0x1954
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff8112746d>] ? 
load_elf_binary+0x39d/0x1954
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff810cc572>] ? 
follow_page+0x2ad/0x303
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff810ce136>] ? 
__get_user_pages+0x3ea/0x47b
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff810f4fcb>] ? 
get_arg_page+0x61/0x110
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff811270d0>] ? 
load_elf_binary+0x0/0x1954
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff810f3caa>] ? 
search_binary_handler+0xb4/0x245
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff810f54a7>] ? 
do_execve+0x1e4/0x2c3
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff81010500>] ? 
sys_execve+0x35/0x4c
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff81011f9a>] ? 
stub_execve+0x6a/0xc0
Feb 10 06:46:43 xenhost03 kernel: [225060.040636] Code: fb ff ff e8 c6 f4 01 00 
bf 01 00 00 00 e8 c9 ea ff ff 59 5e 5b c3 55 48 89 f5 53 48 89 fb 48 83 ec 08 
e8 6e e3 ff ff 84 c0 75 08 <48> 89 2b 41 59 5b 5d c3 41 58 48 89 df 48 89 ee 5b 
5d e9 7e ff
Feb 10 06:46:43 xenhost03 kernel: [225060.042981] RIP  [<ffffffff8100e428>] 
xen_set_pmd+0x15/0x2c
Feb 10 06:46:43 xenhost03 kernel: [225060.042981]  RSP <ffff880013ad3b18>
Feb 10 06:46:43 xenhost03 kernel: [225060.042981] CR2: ffff88001558b010
Feb 10 06:46:43 xenhost03 kernel: [225060.042981] ---[ end trace 
9939eec096f5a2de ]---


Also I noticed dom0 lockups of more than a minute when starting
HVM domUs while another domU was creating heavy IO load. 
Those only disapeared when I gave my dom0 a fixed ammount of 
RAM instead of balooning it down.


Other than that I had no bad trouble. (Well, life migration of
Lenny 32 bit domUs on 64bit dom0 doesn't work because the 
Lenny domU kernel is not good at that).


I didn't do a new install of squeeze though. I started
with lenny and upgraded to squeeze.

cheers
-henrik


_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users


 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.