WARNING - OLD ARCHIVES

This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/
   
 
 
Xen 
 
Home Products Support Community News
 
   
 

xen-devel

[Xen-users] Re: [Xen-devel] Kernel panic with 2.6.32-30 under network ac

To: Jan Beulich <JBeulich@xxxxxxxxxx>
Subject: [Xen-users] Re: [Xen-devel] Kernel panic with 2.6.32-30 under network activity
From: Olivier Hanesse <olivier.hanesse@xxxxxxxxx>
Date: Thu, 17 Mar 2011 13:11:46 +0100
Cc: "xen-devel@xxxxxxxxxxxxxxxxxxx" <xen-devel@xxxxxxxxxxxxxxxxxxx>, Ian Campbell <Ian.Campbell@xxxxxxxxxx>, Konrad Rzeszutek Wilk <konrad.wilk@xxxxxxxxxx>, Xen Users <xen-users@xxxxxxxxxxxxxxxxxxx>
Delivery-date: Thu, 17 Mar 2011 05:13:34 -0700
Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:in-reply-to:references:date :message-id:subject:from:to:cc:content-type; bh=hjMEbH9++f6IJG/mQ5FwAjnj9M36N76fTbABbHtuf7w=; b=u4uYmJEtNY6E2mKs9Gji5vtMhrLkc14Wod6in+cNUTlofsunQEtLPGHmsLcvL8BlqM s/Vcm3bFXVDEChOjVtBdECtWJfRMOtzMYjkD2ymCAmY3HrOdw++7tJLF1aQhfhY5M+82 54R1bBXwHSbZOMDH623sv8hWUln0/sdDerikY=
Domainkey-signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; b=qM1Box4Kp1JoQlwCQ7jUY+5TX8YzNEWy04Qkizr5JcV2Zb6/3CqNdPldbmPITtrGfe u2Qw3z4CifmEidE3z9p2T6p+mkc/qKvU1tX28D+JLSQVjiwuxTb8bPzL1bNE98jM62w5 /3h4A10AQaTXsWbJivHZWTnirFfusiuCsB32k=
Envelope-to: www-data@xxxxxxxxxxxxxxxxxxx
In-reply-to: <4D82054F02000078000370F4@xxxxxxxxxxxxxxxxxx>
List-help: <mailto:xen-users-request@lists.xensource.com?subject=help>
List-id: Xen user discussion <xen-users.lists.xensource.com>
List-post: <mailto:xen-users@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=unsubscribe>
References: <AANLkTimVdAG6y+-9jNuQM78Bz+O7CuBteQdF1yK1YYCo@xxxxxxxxxxxxxx> <20110316032018.GC7905@xxxxxxxxxxxx> <4D8092240200007800036C9B@xxxxxxxxxxxxxxxxxx> <1300270268.17339.2417.camel@xxxxxxxxxxxxxxxxxxxxxx> <4D80A1940200007800036D07@xxxxxxxxxxxxxxxxxx> <AANLkTikmb=agRRpjbPtxMU0iEmuPFBgWfh7xMypXYWe-@xxxxxxxxxxxxxx> <4D82054F02000078000370F4@xxxxxxxxxxxxxxxxxx>
Sender: xen-users-bounces@xxxxxxxxxxxxxxxxxxx


2011/3/17 Jan Beulich <JBeulich@xxxxxxxxxx>
>>> On 17.03.11 at 11:34, Olivier Hanesse <olivier.hanesse@xxxxxxxxx> wrote:
> It happens again a few minutes ago. It is the same kernel stack each time
> (alignment check: 0000 [#1] SMP etc ...)
>
> The dom0 where all the faulty domU are running is a dual Xeon 5420 so 8 real
> cores available.
> 20 domUs are running on it, 35 vcpus are set up, is that too much ? The bug
> happens randomly on domUs
> I was running the same config with xen3.2 without any issue.

Are we to read this as "same kernels in DomU-s and Dom0"? If so,
that would hint at some subtle Xen regression. If not, you'd need
to be more precise as to what works and what doesn't, and would
possibly want to try intermediate versions to narrow when this
got introduced.


Dom0 et DomU are using different kernels (both coming from Debian repository, but same version)

domU : 
ii  linux-image-2.6.32-bpo.5-amd64              2.6.32-30~bpo50+1          Linux 2.6.32 for 64-bit PCs

dom0: 
ii  linux-image-2.6.32-bpo.5-xen-amd64        2.6.32-30~bpo50+1          Linux 2.6.32 for 64-bit PCs, Xen dom0 suppor

I was running Debian Lenny's version for xen 3.2, so it was 2.6.26 
 
> I found this old post :
> http://lists.xensource.com/archives/html/xen-devel/2010-03/msg01561.html
>
> It may be related, no issue with 2.6.24, and issue with 2.6.32.

Yes, that indeed looks very similar. Nevertheless, without this
being generally reproducible, we'll have to rely on you doing
some analysis/debugging work on this.

I "pinned" all domUs cpus in order that they don't use the same cpu as dom0 (which is pinned to cpu0).
I can run any analysis/debugging tools you want. 
I will also try an older kernel (for example 2.6.32-10) and see what happens.

 

Jan


Regards

Olivier 

_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users