[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Xen-users] Possible bug with scsi disk and Xen

  • To: xen-users@xxxxxxxxxxxxxxxxxxx
  • From: Jordan Pittier <jordan.pittier@xxxxxxxxx>
  • Date: Sat, 29 Jan 2011 16:27:25 +0100
  • Delivery-date: Sat, 29 Jan 2011 07:29:12 -0800
  • Domainkey-signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:from:date:message-id:subject:to:content-type; b=B/demVXPxTsBitwRRD7cv2dnh8/N5NLQKz/+CEb8ZkW3D3ahT78PLqdsMAfH3Cf1Hy O29d4qFsAxuAbpx1yBCxzbKbFj9203aFnkvewZh+LwIM5+asZ+byaJS4wmNghLTsA/7U G/WuIsmnn/7UO/LRdEE5KYcy67cI5v8yZgimk=
  • List-id: Xen user discussion <xen-users.lists.xensource.com>

I have been tracking a bug affecting all my servers running Debian Squeeze for more than a month now, and I desperately need your help :) 

I have 10 Sun v20z servers (2*66GB SCSI disk in RAID 1 == mirror). 4 of them are running Debian Squeeze with the latest Xen Debian kernel (2.6.32-5-xen-amd64 == 2.6.32-29). The rest are running Debian Lenny (2.6.26-2-xen-amd64 == 2.6.26-26lenny1).

On a Squeeze boxe, under very high IO (such as running a IO stress test, ie bonnie++), server starts behaving weirdly and I see messages like these in kernel.log : [see attachement]. Then the server becomes totally unresponsive (but doesn't "freeze") and commands such as "ls" or "reboot" don't work anymore. I have to do an hard reboot. After the server has reboot, the RAID array seems degraded (I am using the mpt-status command) and starts rebuilding. After several hours, the raid array is "fine" ("clean"). The raid controler is "LSI53C1030" U320, with driver "Fusion MPT SPI Host driver 3.04.06". I have attached the result of "lsmod".

None of my Lenny boxes are affected by this issue, all of my Squeeze boxes are.

What does it have to do with Xen ? When I boot my Squeeze boxes without the Xen hypervisor but the same Xen kernel, bonnie++ runs absolutely fine. The issue appears only with the Xen hypervisor loaded. 

There is a debian bug report for this : http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=603727

Any suggestion ? 
Thanks !


Attachment: dmesg_output
Description: Binary data

Attachment: lsmod_output
Description: Binary data

Xen-users mailing list



Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.