This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/


[Xen-devel] Re: [Xen-users] dom0 hangs in xen 4.0.1-rc3-pre

To: Andreas Kinzler <ml-xen-users@xxxxxx>
Subject: [Xen-devel] Re: [Xen-users] dom0 hangs in xen 4.0.1-rc3-pre
From: Bruce Edge <bruce.edge@xxxxxxxxx>
Date: Mon, 27 Sep 2010 07:32:30 -0700
Cc: xen-devel@xxxxxxxxxxxxxxxxxxx, xen-users@xxxxxxxxxxxxxxxxxxx
Delivery-date: Mon, 27 Sep 2010 07:33:28 -0700
Envelope-to: www-data@xxxxxxxxxxxxxxxxxxx
In-reply-to: <4CA0A8AF.6010908@xxxxxx>
List-help: <mailto:xen-devel-request@lists.xensource.com?subject=help>
List-id: Xen developer discussion <xen-devel.lists.xensource.com>
List-post: <mailto:xen-devel@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=unsubscribe>
References: <AANLkTimPVj-AXyR8DuQRxuAwcFwHm0sVkgiXvkA1+f7-@xxxxxxxxxxxxxx> <4C9DE72E.1000006@xxxxxx> <AANLkTi=jxHQp3_GDML9JcoYNNkGTGLR3_okBspWnFdfC@xxxxxxxxxxxxxx> <4CA0A8AF.6010908@xxxxxx>
Sender: xen-devel-bounces@xxxxxxxxxxxxxxxxxxx

On Mon, Sep 27, 2010 at 7:22 AM, Andreas Kinzler <ml-xen-users@xxxxxx> wrote:
On 27.09.2010 16:06, Bruce Edge wrote:
I see reproducible hangs in dom0 when the system is under heavy load.
Four dom0s share an NFS server for domU images, with a total of 24 domUs
running across them. When the system is under heavy load, busy processing
e-commerce requests, one or two of the dom0s hang. No input is
accepted and a reboot is necessary.
Has anyone had the same experience? The causes I can come up with are the following:
Please post your hardware (mainboard, chipset, CPU, RAID controller).
I have found a severe problem on Lynnfield systems.
Does this affect all Nehalem chips or only the Lynnfields? The .21 kernel is
causing grief for us too. I was wondering if this was related.

I am still researching this. For testing I bought a system with a Westmere-EP CPU (Xeon E5620), which has ARAT. This system ran stably, even though Intel still lists it as affected by the C6 erratum. This leads me to conclude that Xen's HPET timer migration code (called HPET broadcast) is the root cause. It affects all CPUs that use it, but mainly Nehalem because of turbo mode.

Regards Andreas

Thanks for the info. I'll try disabling turbo mode in the BIOS and see if that helps.
Let me know if there's anything I can run/do/test/etc.
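
Since the ARAT observation above is the distinguishing factor between the stable and the hanging systems, one quick thing to check on a given box is whether the kernel reports the "arat" CPU flag. The following is only a rough sketch, not something posted in this thread: it assumes a Linux /proc/cpuinfo layout with "processor" and "flags" lines, and the helper name cpus_with_flag is made up for illustration. Note that inside a Xen dom0 the flags the kernel sees may be filtered by the hypervisor, so the result shows what dom0 sees, not necessarily what Xen itself uses.

```python
def cpus_with_flag(cpuinfo_text, flag="arat"):
    """Map each logical CPU number to True/False: is `flag` in its flags line?

    Assumes the x86 Linux /proc/cpuinfo layout ("processor : N" followed
    later by "flags : ..."). The function name is hypothetical.
    """
    result = {}
    current = None
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        if not sep:
            continue  # blank separator lines between CPU stanzas
        key = key.strip()
        if key == "processor":
            current = int(value.strip())
        elif key == "flags" and current is not None:
            result[current] = flag in value.split()
    return result


if __name__ == "__main__":
    try:
        with open("/proc/cpuinfo") as f:
            text = f.read()
    except OSError:
        text = ""  # not on Linux, or /proc is unavailable
    for cpu, has_flag in sorted(cpus_with_flag(text).items()):
        print(f"cpu{cpu}: arat={'yes' if has_flag else 'no'}")
```

If every CPU reports arat=yes, the local APIC timer keeps running in deep C-states, so by the reasoning above the HPET broadcast path should not be needed on that machine.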
