[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] Re: [Xen-users] Debian Squeeze, xen, multipath and iscsi
Hi! Thanks for the answer. Below is the kernel message I am repeatly getting in the log. The system crash only with the interaction between xen, iscsi and multipath. Again, system is Debian Squeeze running on a Fujitsu PRIMERGY RX200 S4 with 8 cores Intel(R) Xeon(R) CPU E5405 @ 2.00GHz iSCSI is from a EMC cabinet. Any help, please? AgustinModules linked in: dm_round_robin scsi_dh_emc crc32c ib_iser rdma_cm ib_cm iw_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi bridge stp xen_evtchn xenfs dm_multipath dm_mod scsi_dh loop i2c_i801 usbhid ioatdma dca hid shpchp i2c_cor Feb 16 12:08:01 ariete kernel: [ 358.633003] Pid: 1672, comm: dmsetup Tainted: G D 2.6.32-5-xen-amd64 #1 PRIMERGY RX200 S4 Feb 16 12:08:01 ariete kernel: [ 358.633003] RIP: e030:[<ffffffff8130cb16>] [<ffffffff8130cb16>] _spin_lock+0x13/0x1b Feb 16 12:08:01 ariete kernel: [ 358.633003] RSP: e02b:ffff8807dbccdb10 EFLAGS: 00000297 Feb 16 12:08:01 ariete kernel: [ 358.633003] RAX: 0000000000000022 RBX: ffff8807dbccdb28 RCX: ffff8807dbccdb68 Feb 16 12:08:01 ariete kernel: [ 358.633003] RDX: 0000000000000021 RSI: 0000000000000200 RDI: ffff8807dbe1c300 Feb 16 12:08:01 ariete kernel: [ 358.633003] RBP: 0000000000000200 R08: 0000000000000008 R09: ffffffff814eb870 Feb 16 12:08:01 ariete kernel: [ 358.633003] R10: 000000000000000b R11: ffff8807dbe1c280 R12: ffff8807dbe1c280 Feb 16 12:08:01 ariete kernel: [ 358.633003] R13: 000000000000c580 R14: ffff8807dbccdb28 R15: ffffffff814eb830 Feb 16 12:08:01 ariete kernel: [ 358.633003] FS: 00007fe9c607a7a0(0000) GS:ffff8800280c7000(0000) knlGS:0000000000000000 Feb 16 12:08:01 ariete kernel: [ 358.633003] CS: e033 DS: 0000 ES: 0000 CR0: 000000008005003b Feb 16 12:08:01 ariete kernel: [ 358.633003] CR2: 00007fe9c5803420 CR3: 0000000001001000 CR4: 0000000000002660 Feb 16 12:08:01 ariete kernel: [ 358.633003] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 Feb 16 12:08:01 ariete kernel: [ 358.633003] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 Feb 16 12:08:01 ariete kernel: [ 358.633003] Call Trace: Feb 16 12:08:01 ariete kernel: [ 358.633003] [<ffffffff8100dd87>] ? xen_exit_mmap+0xf8/0x136 Feb 16 12:08:01 ariete kernel: [ 358.633003] [<ffffffff810d1208>] ? exit_mmap+0x5a/0x148 Feb 16 12:08:01 ariete kernel: [ 358.633003] [<ffffffff8104cb09>] ? mmput+0x3c/0xdf Feb 16 12:08:01 ariete kernel: [ 358.633003] [<ffffffff81050702>] ? exit_mm+0x102/0x10d Feb 16 12:08:01 ariete kernel: [ 358.633003] [<ffffffff8130ca72>] ? _spin_lock_irq+0x7/0x22 Feb 16 12:08:01 ariete kernel: [ 358.633003] [<ffffffff81052127>] ? do_exit+0x1f8/0x6c6 Feb 16 12:08:01 ariete kernel: [ 358.642915] [<ffffffff8100ecdf>] ? xen_restore_fl_direct_end+0x0/0x1 Feb 16 12:08:01 ariete kernel: [ 358.642915] [<ffffffff8130cb3a>] ? _spin_unlock_irqrestore+0xd/0xe Feb 16 12:08:01 ariete kernel: [ 358.642915] [<ffffffff8104f3af>] ? release_console_sem+0x17e/0x1af Feb 16 12:08:01 ariete kernel: [ 358.642915] [<ffffffff8130d9dd>] ? oops_end+0xaf/0xb4 Feb 16 12:08:01 ariete kernel: [ 358.642915] [<ffffffff810135f0>] ? do_invalid_op+0x8b/0x95 Feb 16 12:08:01 ariete kernel: [ 358.642915] [<ffffffff8100c694>] ? pin_pagetable_pfn+0x2d/0x36 Feb 16 12:08:01 ariete kernel: [ 358.642915] [<ffffffffa01bb9ea>] ? copy_params+0x71/0xb1 [dm_mod] Feb 16 12:08:01 ariete kernel: [ 358.642915] [<ffffffff810baf07>] ? __alloc_pages_nodemask+0x11c/0x5f5 Feb 16 12:08:01 ariete kernel: [ 358.642915] [<ffffffff8101293b>] ? invalid_op+0x1b/0x20 Feb 16 12:08:01 ariete kernel: [ 358.642915] [<ffffffff8100c694>] ? pin_pagetable_pfn+0x2d/0x36 Feb 16 12:08:01 ariete kernel: [ 358.642915] [<ffffffff8100c690>] ? pin_pagetable_pfn+0x29/0x36 Feb 16 12:08:01 ariete kernel: [ 358.642915] [<ffffffff810cd4e2>] ? __pte_alloc+0x6b/0xc6 Feb 16 12:08:01 ariete kernel: [ 358.642915] [<ffffffff810cb394>] ? pmd_alloc+0x28/0x5b Feb 16 12:08:01 ariete kernel: [ 358.642915] [<ffffffff810cd60b>] ? handle_mm_fault+0xce/0x80f Feb 16 12:08:01 ariete kernel: [ 358.642915] [<ffffffff810fbc5c>] ? do_vfs_ioctl+0x48d/0x4cb Feb 16 12:08:01 ariete kernel: [ 358.642915] [<ffffffff8130f016>] ? do_page_fault+0x2e0/0x2fc Feb 16 12:08:01 ariete kernel: [ 358.642915] [<ffffffff8130ceb5>] ? page_fault+0x25/0x30 El 11/02/2011 15:18, Henrik Langos escribió: On Fri, Feb 11, 2011 at 02:35:53PM +0100, Agustin Lopez wrote:Hi all! I want to update my Debian Lenny xen servers to Squeeze. I am testing with a new install. All installs Ok but when I install the multipath package I get a kernel crash. I am searched a bit with google but I have not found any solution. Are there anybody in the list working with this configuration? PS: If I boot my server without Xen, with a standard kernel, multipath and the other is working right.What exactly crashes? dom0 ? Do you get a kernel dump on the console? I have pretty much the same setup here (iSCSI + multipath + Xen + Squeeze dom0 + Lenny/Etch PVM domUs) and I had some trouble with multipath and iSCSI beeing a little touchy. Basically my dom0 kernel hates to have fast iSCSI logout/login sequences. You'll have to give multipathd some time to cleanly remove multipath devices before you do another login. Otherwise I get stuff like this where kpartx (the thing that manages of device nodes for partitions) triggers some race condition: Feb 10 06:46:43 xenhost03 kernel: [225060.039126] BUG: unable to handle kernel paging request at ffff88001558b010 Feb 10 06:46:43 xenhost03 kernel: [225060.039172] IP: [<ffffffff8100e428>] xen_set_pmd+0x15/0x2c Feb 10 06:46:43 xenhost03 kernel: [225060.039210] PGD 1002067 PUD 1006067 PMD 18a067 PTE 801000001558b065 Feb 10 06:46:43 xenhost03 kernel: [225060.039253] Oops: 0003 [#1] SMP Feb 10 06:46:43 xenhost03 kernel: [225060.039284] last sysfs file: /sys/devices/virtual/block/dm-6/dm/suspended Feb 10 06:46:43 xenhost03 kernel: [225060.039319] CPU 0 Feb 10 06:46:43 xenhost03 kernel: [225060.039344] Modules linked in: tun dm_round_robin crc32c xt_tcpudp nf_conntrack_ipv4 nf_defrag_ipv4 xt_state nf_conntrack xt_physdev iptable_filter ip_tables x_tables bridge stp xen_evtchn xenfs ib_iser rdma_cm ib_c m iw_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi dm_multipath scsi_dh loop snd_hda_intel snd_hda_codec snd_hwdep snd_pcm i915 drm_kms_helper drm snd_timer i2c_i801 evdev parport_pc psmouse serio_raw pcspkr i2c_ algo_bit parport i2c_core snd soundcore video output snd_page_alloc button processor acpi_processor ext3 jbd mbcache dm_mod sd_mod crc_t10dif usbhid hid uhci_hcd ata_generic ata_piix libata ehci_hcd scsi_mod e1000e usbcore nls_base thermal thermal_sys [last unloaded: scsi_wait_scan] Feb 10 06:46:43 xenhost03 kernel: [225060.039851] Pid: 9259, comm: kpartx_id Not tainted 2.6.32-5-xen-amd64 #1 To Be Filled By O.E.M. Feb 10 06:46:43 xenhost03 kernel: [225060.039904] RIP: e030:[<ffffffff8100e428>] [<ffffffff8100e428>] xen_set_pmd+0x15/0x2c Feb 10 06:46:43 xenhost03 kernel: [225060.039959] RSP: e02b:ffff880013ad3b18 EFLAGS: 00010246 Feb 10 06:46:43 xenhost03 kernel: [225060.039990] RAX: 0000000000000000 RBX: ffff88001558b010 RCX: ffff880000000000 Feb 10 06:46:43 xenhost03 kernel: [225060.040006] RDX: ffffea0000000000 RSI: 0000000001cc0000 RDI: ffff88001558b010 Feb 10 06:46:43 xenhost03 kernel: [225060.040006] RBP: 0000000000000000 R08: 0000000001cc0000 R09: ffff880073c03100 Feb 10 06:46:43 xenhost03 kernel: [225060.040006] R10: 0000000000000000 R11: ffff88002ce3bd78 R12: 000000000061c000 Feb 10 06:46:43 xenhost03 kernel: [225060.040006] R13: 0000000000400000 R14: ffff88001558b010 R15: ffff88002e156000 Feb 10 06:46:43 xenhost03 kernel: [225060.040006] FS: 00007f5094f64700(0000) GS:ffff880003630000(0000) knlGS:0000000000000000 Feb 10 06:46:43 xenhost03 kernel: [225060.040006] CS: e033 DS: 0000 ES: 0000 CR0: 000000008005003b Feb 10 06:46:43 xenhost03 kernel: [225060.040006] CR2: ffff88001558b010 CR3: 0000000011a2b000 CR4: 0000000000002660 Feb 10 06:46:43 xenhost03 kernel: [225060.040006] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 Feb 10 06:46:43 xenhost03 kernel: [225060.040006] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 Feb 10 06:46:43 xenhost03 kernel: [225060.040006] Process kpartx_id (pid: 9259, threadinfo ffff880013ad2000, task ffff880002747810) Feb 10 06:46:43 xenhost03 kernel: [225060.040006] Stack: Feb 10 06:46:43 xenhost03 kernel: [225060.040006] ffff880000000000 0000000000600000 0000000000400000 ffffffff810cf886 Feb 10 06:46:43 xenhost03 kernel: [225060.040006]<0> ffff880013ad3fd8 0000000017ab2067 ffff880002159180 0000000000000000 Feb 10 06:46:43 xenhost03 kernel: [225060.040636]<0> 0000000000000000 000000000061bfff 000000000061bfff 0000000001c00000 Feb 10 06:46:43 xenhost03 kernel: [225060.040636] Call Trace: Feb 10 06:46:43 xenhost03 kernel: [225060.040636] [<ffffffff810cf886>] ? free_pgd_range+0x226/0x3bf Feb 10 06:46:43 xenhost03 kernel: [225060.040636] [<ffffffff810cfabb>] ? free_pgtables+0x9c/0xbd Feb 10 06:46:43 xenhost03 kernel: [225060.040636] [<ffffffff810d129d>] ? exit_mmap+0xef/0x148 Feb 10 06:46:43 xenhost03 kernel: [225060.040636] [<ffffffff8104cb09>] ? mmput+0x3c/0xdf Feb 10 06:46:43 xenhost03 kernel: [225060.040636] [<ffffffff810f44d6>] ? flush_old_exec+0x45c/0x548 Feb 10 06:46:43 xenhost03 kernel: [225060.040636] [<ffffffff811270d0>] ? load_elf_binary+0x0/0x1954 Feb 10 06:46:43 xenhost03 kernel: [225060.040636] [<ffffffff8112746d>] ? load_elf_binary+0x39d/0x1954 Feb 10 06:46:43 xenhost03 kernel: [225060.040636] [<ffffffff810cc572>] ? follow_page+0x2ad/0x303 Feb 10 06:46:43 xenhost03 kernel: [225060.040636] [<ffffffff810ce136>] ? __get_user_pages+0x3ea/0x47b Feb 10 06:46:43 xenhost03 kernel: [225060.040636] [<ffffffff810f4fcb>] ? get_arg_page+0x61/0x110 Feb 10 06:46:43 xenhost03 kernel: [225060.040636] [<ffffffff811270d0>] ? load_elf_binary+0x0/0x1954 Feb 10 06:46:43 xenhost03 kernel: [225060.040636] [<ffffffff810f3caa>] ? search_binary_handler+0xb4/0x245 Feb 10 06:46:43 xenhost03 kernel: [225060.040636] [<ffffffff810f54a7>] ? do_execve+0x1e4/0x2c3 Feb 10 06:46:43 xenhost03 kernel: [225060.040636] [<ffffffff81010500>] ? sys_execve+0x35/0x4c Feb 10 06:46:43 xenhost03 kernel: [225060.040636] [<ffffffff81011f9a>] ? stub_execve+0x6a/0xc0 Feb 10 06:46:43 xenhost03 kernel: [225060.040636] Code: fb ff ff e8 c6 f4 01 00 bf 01 00 00 00 e8 c9 ea ff ff 59 5e 5b c3 55 48 89 f5 53 48 89 fb 48 83 ec 08 e8 6e e3 ff ff 84 c0 75 08<48> 89 2b 41 59 5b 5d c3 41 58 48 89 df 48 89 ee 5b 5d e9 7e ff Feb 10 06:46:43 xenhost03 kernel: [225060.042981] RIP [<ffffffff8100e428>] xen_set_pmd+0x15/0x2c Feb 10 06:46:43 xenhost03 kernel: [225060.042981] RSP<ffff880013ad3b18> Feb 10 06:46:43 xenhost03 kernel: [225060.042981] CR2: ffff88001558b010 Feb 10 06:46:43 xenhost03 kernel: [225060.042981] ---[ end trace 9939eec096f5a2de ]--- Also I noticed dom0 lockups of more than a minute when starting HVM domUs while another domU was creating heavy IO load. Those only disapeared when I gave my dom0 a fixed ammount of RAM instead of balooning it down. Other than that I had no bad trouble. (Well, life migration of Lenny 32 bit domUs on 64bit dom0 doesn't work because the Lenny domU kernel is not good at that). I didn't do a new install of squeeze though. I started with lenny and upgraded to squeeze. cheers -henrik _______________________________________________ Xen-users mailing list Xen-users@xxxxxxxxxxxxxxxxxxx http://lists.xensource.com/xen-users _______________________________________________ Xen-users mailing list Xen-users@xxxxxxxxxxxxxxxxxxx http://lists.xensource.com/xen-users
|
Lists.xenproject.org is hosted with RackSpace, monitoring our |