
Re: [Xen-users] OpenSuSE 11.2 bug, dom0-cpus limit causes xenwatch_cb running 100% and xm command freeze and xend dead


  • To: jp.pozzi@xxxxxxxxx
  • From: Boris Derzhavets <bderzhavets@xxxxxxxxx>
  • Date: Tue, 24 Nov 2009 10:05:59 -0800 (PST)
  • Cc: xen-users@xxxxxxxxxxxxxxxxxxx
  • Delivery-date: Tue, 24 Nov 2009 10:08:56 -0800
  • List-id: Xen user discussion <xen-users.lists.xensource.com>

> I got a kernel patch and all is OK now; the VMs are running flawlessly,
> even though my server is a much smaller one.
Could you try installing an F12 PV DomU (minimal set of packages)?

Boris.

--- On Tue, 11/24/09, Moi meme <storm66@xxxxxxxxxxxxxxxx> wrote:

From: Moi meme <storm66@xxxxxxxxxxxxxxxx>
Subject: Re: [Xen-users] OpenSuSE 11.2 bug, dom0-cpus limit causes xenwatch_cb running 100% and xm command freeze and xend dead
To: "Vladislav Karpenko" <vladislav@xxxxxxxxxxxxxx>
Cc: xen-users@xxxxxxxxxxxxxxxxxxx
Date: Tuesday, November 24, 2009, 10:52 AM

Hello,

I had a problem while upgrading to openSUSE 11.2; see
https://bugzilla.novell.com/show_bug.cgi?id=552492#status_changes

I got a kernel patch and all is OK now; the VMs are running flawlessly,
even though my server is a much smaller one.

You didn't say how much RAM is in your system.

Regards

JPP

On Tuesday, 24 November 2009 at 17:36 +0200, Vladislav Karpenko wrote:
> Yes, I have that too. You could try to fix it by setting the number of dom0
> vCPUs in the boot options. Mine are:
> dom0_mem=512M dom0_vcpus_pin dom0_max_vcpus=1
>
> But for now I don't use SUSE 11.2; it's not stable with Xen 3.4.1.
>
>
> On 23 Nov 2009, at 15:26, Fischer Udo Attila wrote:
>
> > Hi all,
> >
> > I have upgraded a test machine from openSUSE 11.1 to 11.2
> > and found the following bug.
> > The server is a 2x quad-core Intel box, so 2x4 = 8 CPUs.
> >
> > If you limit the dom0 CPUs with dom0-cpus=[1-7]:
> > - [xenwatch_cb] runs at 100% CPU and a log entry appears in
> >   /var/log/messages every 65 seconds:
> >   BUG: soft lockup - CPU#X stuck for 61s!
> > - xm commands do not work
> > - xend is dead
> >
> > If dom0-cpus is set to 0 or 8:
> > - everything looks fine
> >
> > Can somebody else confirm this bug?
> >
> >
> > Best regards
> >
> > Udo Attila Fischer
> > ------------------------------
> >
> >
> >
> > For example, with vcpu=7:
> >
> > PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
> > 4532 root      15  -5     0    0    0 R  100  0.0  11:14.84 xenwatch_cb
> >
> >
> > # ps aux | grep xen
> > root        39  0.0  0.0      0     0 ?        S<   13:03   0:00 [xenwatch]
> > root        40  0.0  0.0      0     0 ?        S<   13:03   0:00 [xenbus]
> > root      3791  0.0  0.0  11300  1560 ?        S    13:04   0:00 /bin/bash /etc/init.d/xend start
> > root      4209  0.0  0.1 107504 13864 ?        S    13:04   0:00 /usr/bin/python2.6 /usr/sbin/xend start
> > root      4446  0.0  0.0   8488  1000 ?        S    13:04   0:00 xenstored --pid-file /var/run/xenstore.pid
> > root      4448  0.0  0.0      0     0 ?        Z    13:04   0:00 [xenconsoled] <defunct>
> > root      4450  0.0  0.0      0     0 ?        Zs   13:04   0:00 [xend] <defunct>
> > root      4451  0.0  0.1 107500 11500 ?        S    13:04   0:00 /usr/bin/python2.6 /usr/sbin/xend start
> > root      4453  0.0  0.0  22724   560 ?        Sl   13:04   0:00 xenconsoled
> > root      4455  0.0  0.2 148304 16652 ?        Sl   13:04   0:00 /usr/bin/python2.6 /usr/sbin/xend start
> > root      4532  100  0.0      0     0 ?        R<   13:04  40:35 [xenwatch_cb]
> > root      4533  0.0  0.0      0     0 ?        D<   13:04   0:00 [xenwatch_cb]
> > root      4534  0.0  0.0      0     0 ?        D<   13:04   0:00 [xenwatch_cb]
> > root      4535  0.0  0.0      0     0 ?        D<   13:04   0:00 [xenwatch_cb]
> > root      4536  0.0  0.0      0     0 ?        D<   13:04   0:00 [xenwatch_cb]
> >
> >
> >
> > From /var/log/messages, every 65 seconds:
> >
> > Nov 23 13:55:14 dom0-u2 kernel: [ 3112.781517] BUG: soft lockup - CPU#4
> > stuck for 61s! [xenwatch_cb:4532]
> > Nov 23 13:55:14 dom0-u2 kernel: [ 3112.781517] Modules linked in:
> > sha1_generic hmac cryptomgr aead pcompress crypto_blkcipher crypto_hash
> > crypto_algapi drbd netbk blkbk blkback_pagemap blktap xenbus_be
> > binfmt_misc xt_tcpudp ip6t_REJECT nf_conntrack_ipv6 ip6table_raw
> > xt_NOTRACK ipt_REJECT xt_physdev xt_state iptable_raw iptable_filter
> > ip6table_mangle nf_conntrack_netbios_ns nf_conntrack_ipv4 nf_conntrack
> > nf_defrag_ipv4 ip_tables ip6table_filter ip6_tables x_tables ipv6 bridge
> > stp llc dummy fuse loop dm_mod mptctl iTCO_wdt iTCO_vendor_support i5k_amb
> > sg i5000_edac ppdev 8250_pnp pcspkr sr_mod edac_core parport_pc shpchp
> > e1000e dcdbas 8250 pci_hotplug tg3 parport serio_raw serial_core button
> > usbhid hid uhci_hcd ehci_hcd xenblk cdrom xennet edd fan ide_pci_generic
> > piix ide_core ata_generic ata_piix mptsas mptscsih mptbase
> > scsi_transport_sas thermal processor thermal_sys hwmon
> > Nov 23 13:55:14 dom0-u2 kernel: [ 3112.781517] CPU 4:
> > Nov 23 13:55:14 dom0-u2 kernel: [ 3112.781517] Modules linked in:
> > sha1_generic hmac cryptomgr aead pcompress crypto_blkcipher crypto_hash
> > crypto_algapi drbd netbk blkbk blkback_pagemap blktap xenbus_be
> > binfmt_misc xt_tcpudp ip6t_REJECT nf_conntrack_ipv6 ip6table_raw
> > xt_NOTRACK ipt_REJECT xt_physdev xt_state iptable_raw iptable_filter
> > ip6table_mangle nf_conntrack_netbios_ns nf_conntrack_ipv4 nf_conntrack
> > nf_defrag_ipv4 ip_tables ip6table_filter ip6_tables x_tables ipv6 bridge
> > stp llc dummy fuse loop dm_mod mptctl iTCO_wdt iTCO_vendor_support i5k_amb
> > sg i5000_edac ppdev 8250_pnp pcspkr sr_mod edac_core parport_pc shpchp
> > e1000e dcdbas 8250 pci_hotplug tg3 parport serio_raw serial_core button
> > usbhid hid uhci_hcd ehci_hcd xenblk cdrom xennet edd fan ide_pci_generic
> > piix ide_core ata_generic ata_piix mptsas mptscsih mptbase
> > scsi_transport_sas thermal processor thermal_sys hwmon
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855] RIP:
> > e030:[<ffffffff8005f07f>]  [<ffffffff8005f07f>] lock_timer_base+0x7f/0x90
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855] RSP: e02b:ffff8801e8d0bc10
> > EFLAGS: 00000246
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855] RAX: 0000000000000000 RBX:
> > 0000000000000000 RCX: ffffffff80778370
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855] RDX: 0000000000000007 RSI:
> > ffff8801e8d0bc50 RDI: ffffc90000075280
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855] RBP: ffff8801e8d0bc40 R08:
> > ffffffff807813b0 R09: 0000000000000000
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855] R10: ffff8801e8d0bcf0 R11:
> > 00000000e15cfb6d R12: ffffc90000075280
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855] R13: ffff8801e8d0bc50 R14:
> > 0000000000000000 R15: ffffffff80778600
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855] FS:  00007f53d0abf6f0(0000)
> > GS:ffffc90000040000(0000) knlGS:0000000000000000
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855] CS:  e033 DS: 0000 ES: 0000
> > CR0: 000000008005003b
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855] CR2: 00007f53d0691260 CR3:
> > 0000000000003000 CR4: 0000000000002660
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855] DR0: 0000000000000000 DR1:
> > 0000000000000000 DR2: 0000000000000000
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855] DR3: 0000000000000000 DR6:
> > 00000000ffff0ff0 DR7: 0000000000000400
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855] Call Trace:
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855]  [<ffffffff8005f0bc>]
> > try_to_del_timer_sync+0x2c/0x90
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855]  [<ffffffff8005f14a>]
> > del_timer_sync+0x2a/0x50
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855]  [<ffffffff8046758f>]
> > mce_cpu_callback+0x122/0x1aa
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855]  [<ffffffff80471de7>]
> > notifier_call_chain+0x57/0xb0
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855]  [<ffffffff80075a1c>]
> > __raw_notifier_call_chain+0x1c/0x40
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855]  [<ffffffff8045b90f>]
> > _cpu_down+0xaf/0x310
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855]  [<ffffffff8045bbf7>]
> > cpu_down+0x87/0xb0
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855]  [<ffffffff8046a42c>]
> > vcpu_hotplug+0xce/0x102
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855]  [<ffffffff8046a4ab>]
> > handle_vcpu_hotplug_event+0x4b/0x61
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855]  [<ffffffff80306c4c>]
> > xenwatch_handle_callback+0x2c/0x80
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855]  [<ffffffff8006fb96>]
> > kthread+0xb6/0xc0
> > Nov 23 13:54:09 dom0-u2 kernel: [ 3047.280855]  [<ffffffff8000d38a>]
> > child_rip+0xa/0x20
> >
> >
> >
> > _______________________________________________
> > Xen-users mailing list
> > Xen-users@xxxxxxxxxxxxxxxxxxx
> > http://lists.xensource.com/xen-users
>
>
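
To check whether another machine shows the same "every 65 sec" pattern, the soft-lockup records can be pulled out of /var/log/messages with a short script. This is only a minimal sketch, assuming the log lines have the same layout as the excerpt quoted in this thread (one record per line, with the kernel uptime in square brackets). The names Lockup, find_lockups and intervals are made up for this illustration and are not part of Xen or of any distribution tool.

```python
import re
from typing import Iterable, List, NamedTuple

# Pattern for the "BUG: soft lockup" record, modelled on the line quoted in
# this thread (assumption: the real log keeps the record on a single line).
LOCKUP_RE = re.compile(
    r"\[\s*(?P<uptime>\d+\.\d+)\]\s+BUG: soft lockup - CPU#(?P<cpu>\d+)\s+"
    r"stuck for (?P<secs>\d+)s! \[(?P<task>[^:\]]+):(?P<pid>\d+)\]"
)


class Lockup(NamedTuple):
    """One soft-lockup report (hypothetical helper type)."""
    uptime: float    # kernel timestamp, seconds since boot
    cpu: int         # CPU number reported as stuck
    stuck_secs: int  # duration reported by the kernel
    task: str        # name of the task that was running
    pid: int         # PID of that task


def find_lockups(lines: Iterable[str]) -> List[Lockup]:
    """Return every soft-lockup record found in the given log lines."""
    hits = []
    for line in lines:
        m = LOCKUP_RE.search(line)
        if m:
            hits.append(Lockup(
                uptime=float(m.group("uptime")),
                cpu=int(m.group("cpu")),
                stuck_secs=int(m.group("secs")),
                task=m.group("task"),
                pid=int(m.group("pid")),
            ))
    return hits


def intervals(lockups: List[Lockup]) -> List[float]:
    """Seconds between consecutive reports, based on the kernel uptime."""
    return [round(b.uptime - a.uptime, 3) for a, b in zip(lockups, lockups[1:])]


if __name__ == "__main__":
    # The one complete record quoted in the thread, joined onto one line.
    sample = [
        "Nov 23 13:55:14 dom0-u2 kernel: [ 3112.781517] BUG: soft lockup - "
        "CPU#4 stuck for 61s! [xenwatch_cb:4532]",
    ]
    for hit in find_lockups(sample):
        print(hit)
```

On a real system the same functions could be fed the log file directly, for example `find_lockups(open("/var/log/messages", errors="replace"))`, and `intervals()` would then show whether the reports really repeat about every 65 seconds.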


