WARNING - OLD ARCHIVES

This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/
   
 
 

Re: [Xen-users] dom0 freezes under high IO load - HP ML150 G2

To: Daniel Mealha Cabrita <dancab@xxxxxxxxxxxx>
Subject: Re: [Xen-users] dom0 freezes under high IO load - HP ML150 G2
From: Tom Mornini <tmornini@xxxxxxxxxxxxxx>
Date: Fri, 2 Mar 2007 19:24:36 -0800
Cc: xen-users@xxxxxxxxxxxxxxxxxxx
Delivery-date: Fri, 02 Mar 2007 19:23:59 -0800
Envelope-to: www-data@xxxxxxxxxxxxxxxxxx
In-reply-to: <200703030027.56503.dancab@xxxxxxxxxxxx>
List-help: <mailto:xen-users-request@lists.xensource.com?subject=help>
List-id: Xen user discussion <xen-users.lists.xensource.com>
List-post: <mailto:xen-users@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/cgi-bin/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/cgi-bin/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=unsubscribe>
References: <200703030027.56503.dancab@xxxxxxxxxxxx>
Sender: xen-users-bounces@xxxxxxxxxxxxxxxxxxx
Hello Daniel.

We've had similar problems, but our machines have given us very little diagnostic information.

Our setup is also not entirely similar to yours. :-)

Could you try something out?

Set up a cron job to run every minute:

  cat /proc/slabinfo >> /root/slabinfo.txt

When we do this, our problem gets *much* worse. I'd love to know if these are similar problems.
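
A minimal sketch of the snapshot step suggested above, in case it helps anyone reproduce the test. The function name snapshot_slabinfo, its two optional arguments, and the timestamp line are illustrative additions and not part of the original suggestion, which is only the plain cat command. The crontab line in the trailing comment is the usual way to schedule that command once a minute.

```shell
#!/bin/sh
# Append one timestamped snapshot of the slab allocator statistics to a log.
# snapshot_slabinfo, its arguments, and the timestamp line are illustrative
# additions; the original suggestion is only the plain cat command.
snapshot_slabinfo() {
    src="${1:-/proc/slabinfo}"        # file to read (may need root)
    out="${2:-/root/slabinfo.txt}"    # log file to append to
    { date; cat "$src"; } >> "$out"
}

# Example crontab entry for root, running the original command every minute:
# * * * * * cat /proc/slabinfo >> /root/slabinfo.txt
```

The timestamp only makes it easier to line up a snapshot with the moment of a freeze; without it the log is a plain concatenation of snapshots.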

--
-- Tom Mornini, CTO
-- Engine Yard, Ruby on Rails Hosting
-- Reliability, Ease of Use, Scalability
-- (866) 518-YARD (9273)

On Mar 2, 2007, at 7:27 PM, Daniel Mealha Cabrita wrote:


hi there,


        Does anyone have suggestions on how to proceed in this case?


I've been experiencing dom0 (xen 3.0.3, xen-3.0.4 and 3.0.4-testing) lockups under
heavy disk load (testing under dom0 directly).
The hardware is an HP ML150 G2 with an HP 4ch SATA fakeraid (OEM Adaptec 1420SA,
sata_mv driver).
The machine does not respond to network, keyboard, nor anything else noticeable
when it happens.

I've tried passing a number of parameters to the kernel, but with no success. The ones
below even make things worse, causing a CPU0 soft lockup during boot:
kernel = (hd0,0)/xen-3.0.4 dom0_mem=384M acpi=off noapic nolapic

The disks run in Linux kernel RAID5. High load to/from an individual disk
(the max an individual SATA HD can handle) does not cause any problem.

Also, I've noticed that just after booting, the machine does not respond to pings
or anything else from the network. If I locally ping from it to another
host, the network starts working. Or, if I wait long enough (several
minutes), the machine's network starts to respond normally.

I've got no soft lockups logged in /var/log/messages, nor anything strange
enough to catch my attention.

        The problem does not happen with a non-Xen kernel.
The machine firmware (BIOS if you like) is updated to the latest version.
Disabling all the non-essential hardware (USB, serial/parallel ports, IDE
ports, powersaving etc) makes no difference.

When not under high disk load the machine seems stable, with several domU VMs
running happily under it.

--
 Daniel Mealha Cabrita
 Divisao de Suporte Tecnico
 AINFO / Reitoria / UTFPR
 http://www.utfpr.edu.br

_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users

