[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-devel] [SPAM] Re: kernel BUG at arch/x86/xen/mmu.c:1860! - ideas.


  • To: Konrad Rzeszutek Wilk <konrad.wilk@xxxxxxxxxx>
  • From: Teck Choon Giam <giamteckchoon@xxxxxxxxx>
  • Date: Thu, 17 Mar 2011 00:26:20 +0800
  • Cc: Andreas Olsowski <andreas.olsowski@xxxxxxxxxxx>, xen-devel@xxxxxxxxxxxxxxxxxxx
  • Delivery-date: Wed, 16 Mar 2011 09:26:53 -0700
  • Domainkey-signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; b=TqDfat3mzy4bhbIuUWsSdPhchS+17jyjk6wPCcTIYxpjqYvqaY7jKSaZ74UR4BYxGa lwOpMyvbdKDlhjEQETTTVbzhCIBjomOQljdMFohgmtUmsYFKOClEEhphZd4kL1Gi8P0+ mdCMDrTwg3JKALR4At/qc4KNnHJvaoCxVro64=
  • List-id: Xen developer discussion <xen-devel.lists.xensource.com>



On Wed, Mar 16, 2011 at 11:52 PM, Konrad Rzeszutek Wilk <konrad.wilk@xxxxxxxxxx> wrote:
> ======================================================
> Kernel 2.6.32.28 without XEN:
> about 50 successful runs of Teck Choon Giams "test.sh" script.
> (modified for handling 10 test volumes and sleeping 2 seconds)
> multipathd restarted succesfully s
> multipath module loaded/unloaded successfully
> lvm2 restarted successfully
.. snip..
> ======================================================
> Kernel 2.6.32.28 with XEN 4.0.1:
> at about loop 2 for volume 7 of "test.sh" it stopped doing ... well anything
> there has been no output on the screen and neitehr syslog nor dmesg entry.
> I left it hanging for about 15 Minutes until i decided to write this
> one off as a side effect of the same underlying problem.
> All lvm2 tools stopped working and i couldnt shut it down.
> Killing the hangig process ended it properly.

Jeremy and I were brainstorming this yesterday and couple of things
that we thought might be interesting are to:

 - turn on CONFIG_DEBUG_PAGEALLOC
 - turn on CONFIG_DEBUG_LIST
 - turn on CONFIG_DEBUG_KMEMLEAK
 - turn on CONFIG_JBD_DEBUG, CONFIG_JBD2_DEBUG
 - turn on CONFIG_SLUB_DEBUG_ON

And see if anything starts coming out.

Thanks a lot for both of you spending time to do so.  It isn't easy as I believe this is something related to kernel 2.6.32.x and just wondering is there something related to *sched_domains?  I read recent mails in LKML about rebuild_sched_domains consider dangerous issues... and that is about recent kernels but won't know what recent kernels that refer to... ...

I will do those config changes in one of my test server when time permit and will post results/output here when done.
 

Also looking in the changes for the drivers/dm/ between 2.6.32 and 2.6.38 and
see if we just hitting some memory leak bugs that hadn't been back-ported.

(Still busy with the upstream effort, can't work on this).

You mean 2.6.39 merge window?

Once again, thanks.

Kindest regards,
Giam Teck Choon
_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel

 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.