This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/


[Xen-users] Possible bug with scsi disk and Xen

To: xen-users@xxxxxxxxxxxxxxxxxxx
Subject: [Xen-users] Possible bug with scsi disk and Xen
From: Jordan Pittier <jordan.pittier@xxxxxxxxx>
Date: Sat, 29 Jan 2011 16:27:25 +0100
Delivery-date: Sat, 29 Jan 2011 07:29:12 -0800
I have been tracking a bug affecting all my servers running Debian Squeeze for more than a month now, and I desperately need your help :)

I have 10 Sun v20z servers (2 x 66 GB SCSI disks in RAID 1, i.e. mirrored). 4 of them are running Debian Squeeze with the latest Xen Debian kernel (2.6.32-5-xen-amd64 == 2.6.32-29). The rest are running Debian Lenny (2.6.26-2-xen-amd64 == 2.6.26-26lenny1).

On a Squeeze box, under very high I/O (such as running an I/O stress test, e.g. bonnie++), the server starts behaving weirdly and I see messages like these in kernel.log: [see attachment]. Then the server becomes totally unresponsive (but doesn't "freeze"), and commands such as "ls" or "reboot" don't work anymore. I have to do a hard reboot. After the server has rebooted, the RAID array appears degraded (I am using the mpt-status command) and starts rebuilding. After several hours, the RAID array is "fine" ("clean"). The RAID controller is an "LSI53C1030" U320, with driver "Fusion MPT SPI Host driver 3.04.06". I have attached the output of "lsmod".

None of my Lenny boxes are affected by this issue; all of my Squeeze boxes are.

What does it have to do with Xen? When I boot my Squeeze boxes without the Xen hypervisor but with the same Xen kernel, bonnie++ runs absolutely fine. The issue appears only with the Xen hypervisor loaded.

There is a Debian bug report for this: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=603727

Any suggestions?
Thanks!

Attachment: dmesg_output
Description: Binary data

Attachment: lsmod_output
Description: Binary data
