WARNING - OLD ARCHIVES

This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/
   
 
 
Xen 
 
Home Products Support Community News
 
   
 

xen-users

Re: [Xen-users] pv_ops domU crashes on pv_ops dom0 (directly at boot)

To: Markus Schuster <ml@xxxxxxxxxxxxxxxxxxxx>
Subject: Re: [Xen-users] pv_ops domU crashes on pv_ops dom0 (directly at boot)
From: Pasi Kärkkäinen <pasik@xxxxxx>
Date: Sun, 6 Dec 2009 02:40:46 +0200
Cc: Jeremy Fitzhardinge <jeremy@xxxxxxxx>, xen-users@xxxxxxxxxxxxxxxxxxx
Delivery-date: Sat, 05 Dec 2009 16:41:25 -0800
Envelope-to: www-data@xxxxxxxxxxxxxxxxxxx
In-reply-to: <200912052352.06078.ml@xxxxxxxxxxxxxxxxxxxx>
List-help: <mailto:xen-users-request@lists.xensource.com?subject=help>
List-id: Xen user discussion <xen-users.lists.xensource.com>
List-post: <mailto:xen-users@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=unsubscribe>
References: <200912052352.06078.ml@xxxxxxxxxxxxxxxxxxxx>
Sender: xen-users-bounces@xxxxxxxxxxxxxxxxxxx
User-agent: Mutt/1.5.13 (2006-08-11)
Hello,

Jeremy: Is this already known bug/crash?

-- Pasi

On Sat, Dec 05, 2009 at 11:52:06PM +0100, Markus Schuster wrote:
> Hi list, 
> 
> I'm experiencing some really strange behavior when booting a pv_ops domU more 
> recent than kernel 2.6.29 (!) on a pv_ops dom0 (directly from git). 
> I'm using XEN 3.4.2 (Debian unstable packages (3.4.2-2) compiled for lenny) 
> on 
> Debian lenny. 
> I've checked out pv_ops dom0 source as described in the wiki - currently I'm 
> working with changeset 8735edb4a976105fd29c97c00c6d14760537e4ee. 
> domU kernel is a prebuilt kernel image from Debian unstable.
> 
> The strange thing: With the same binaries, I can't trigger this problem in my 
> test environment (Athlon 64 3500+ socket 939, 2G RAM), only on my production 
> server (2x Opteron 2218 (dual core) socket F, 8GB RAM) - maybe it's a SMP 
> related problem?
> 
> dom0 .config:
> http://markus.schuster.name/bugreport/xen/dom0-config.txt
> 
> kernel output of crashing domU
> http://markus.schuster.name/bugreport/xen/domU-2.6.31.txt
> http://markus.schuster.name/bugreport/xen/domU-2.6.32-rc8.txt
> 
> kernel output of working domU
> http://markus.schuster.name/bugreport/xen/domU-2.6.29.txt
> 
> For those just wanting to have a quick look, here is the relevant kernel 
> output from a 2.6.31 domU:
> [..]
> [    0.012368] Booting paravirtualized kernel on Xen
> [    0.012368] Xen version: 3.4 (preserve-AD)
> [    0.012368] Grant table initialized
> [    0.012368] regulator: core version 0.5
> [    0.012368] NET: Registered protocol family 16
> [    0.108050] PCI: Fatal: No config space access function found
> [    0.108555] bio: create slab <bio-0> at 0
> [    0.108555] ACPI: Interpreter disabled.
> [    0.108555] xen_balloon: Initialising balloon driver.
> [    0.112044] usbcore: registered new interface driver usbfs
> [    0.112074] usbcore: registered new interface driver hub
> [    0.112099] usbcore: registered new device driver usb
> [    0.112099] PCI: System does not support PCI
> [    0.112099] PCI: System does not support PCI
> [    0.125982] pnp: PnP ACPI: disabled
> [    0.126234] NET: Registered protocol family 2
> [    0.126234] IP route cache hash table entries: 65536 (order: 7, 524288 
> bytes)
> [    0.128563] TCP established hash table entries: 262144 (order: 10, 4194304 
> bytes)
> [    0.130630] TCP bind hash table entries: 65536 (order: 8, 1048576 bytes)
> [    0.131149] TCP: Hash tables configured (established 262144 bind 65536)
> [    0.131164] TCP reno registered
> [    0.131416] NET: Registered protocol family 1
> [    0.131510] Unpacking initramfs...
> [    0.162893] Freeing initrd memory: 21240k freed
> [    0.176404] platform rtc_cmos: registered platform RTC device (no PNP 
> device found)
> [    0.176433] BUG: unable to handle kernel NULL pointer dereference at (null)
> [    0.176445] IP: [<ffffffff81386142>] cache_add_dev+0x2e/0x2ae
> [    0.176461] PGD 0
> [    0.176468] Oops: 0000 [#1] SMP
> [    0.176477] last sysfs file:
> [    0.176484] CPU 0
> [    0.176490] Modules linked in:
> [    0.176500] Pid: 1, comm: swapper Tainted: G        W  2.6.31-1-amd64 #1
> [    0.176508] RIP: e030:[<ffffffff81386142>]  [<ffffffff81386142>] 
> cache_add_dev+0x2e/0x2ae
> [    0.176522] RSP: e02b:ffff88007ffe5e50  EFLAGS: 00010246
> [    0.176529] RAX: 0000000000000003 RBX: 0000000000000000 RCX: 
> 000000008a54cc73
> [    0.176537] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 
> 0000000000000000
> [    0.176545] RBP: ffffffff8152df60 R08: 0000000000000200 R09: 
> ffffffff8152df60
> [    0.176553] R10: 0000000000000000 R11: ffffffff81200d1d R12: 
> 00000000fffffffe
> [    0.176561] R13: ffffffff8152dfa0 R14: ffffffff8158af20 R15: 
> 0000000000000000
> [    0.176573] FS:  0000000000000000(0000) GS:ffffc90000000000(0000) 
> knlGS:0000000000000000
> [    0.176583] CS:  e033 DS: 0000 ES: 0000 CR0: 000000008005003b
> [    0.176590] CR2: 0000000000000000 CR3: 0000000001001000 CR4: 
> 0000000000000660
> [    0.176599] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 
> 0000000000000000
> [    0.176607] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 
> 0000000000000400
> [    0.176615] Process swapper (pid: 1, threadinfo ffff88007ffe4000, task 
> ffff88007ffc6740)
> [    0.176625] Stack:
> [    0.176630]  0000000000000002 00000000ffffffea ffffffff8152dfa0 
> ffffffff8158af20
> [    0.176643] <0> 000000008a54cc73 0000000000000000 ffffffff8152df60 
> 0000000000000000
> [    0.176658] <0> ffffffff8152dfa0 ffffffff8158af20 00000000014bd800 
> ffffffff81386503
> [    0.176677] Call Trace:
> [    0.176687]  [<ffffffff81386503>] ? cache_sysfs_init+0x3b/0x86
> [    0.176699]  [<ffffffff81550787>] ? i8237A_init_sysfs+0x0/0x5e
> [    0.176708]  [<ffffffff813864c8>] ? cache_sysfs_init+0x0/0x86
> [    0.176718]  [<ffffffff8100a0a6>] ? do_one_initcall+0x78/0x1a8
> [    0.176727]  [<ffffffff81549a5b>] ? kernel_init+0x178/0x1e5
> [    0.176738]  [<ffffffff81012f2a>] ? child_rip+0xa/0x20
> [    0.180013]  [<ffffffff810120e7>] ? int_ret_from_sys_call+0x7/0x1b
> [    0.180013]  [<ffffffff8101289d>] ? retint_restore_args+0x5/0x6
> [    0.180013]  [<ffffffff81200d1d>] ? dummycon_dummy+0x0/0x2e
> [    0.180013]  [<ffffffff81200d1d>] ? dummycon_dummy+0x0/0x2e
> [    0.180013]  [<ffffffff81012f20>] ? child_rip+0x0/0x20
> [    0.180013] Code: 89 ff 41 56 41 55 41 54 41 bc fe ff ff ff 55 53 48 83 ec 
> 28 65 48 8b 04 25 28 00 00 00 48 89 44 24 20 31 c0 66 8b 05 4e 35 29 00 <44> 
> 8b 37 66 85 c0 0f 84 4d 02 00 00 45 89 f5 0f b7 f8 48 c7 c5
> [    0.180013] RIP  [<ffffffff81386142>] cache_add_dev+0x2e/0x2ae
> [    0.180013]  RSP <ffff88007ffe5e50>
> [    0.180013] CR2: 0000000000000000
> [    0.180013] ---[ end trace a7919e7f17c0a726 ]---
> [    0.209859] Kernel panic - not syncing: Attempted to kill init!
> [    0.209869] Pid: 1, comm: swapper Tainted: G      D W  2.6.31-1-amd64 #1
> [    0.209877] Call Trace:
> [    0.209886]  [<ffffffff8138df3c>] ? panic+0x96/0x14a
> [    0.209895]  [<ffffffff81200d1d>] ? dummycon_dummy+0x0/0x2e
> [    0.209906]  [<ffffffff8100e592>] ? check_events+0x12/0x20
> [    0.209915]  [<ffffffff813908a6>] ? _write_lock_irq+0x1b/0x43
> [    0.209926]  [<ffffffff8106828b>] ? exit_ptrace+0xb7/0x14b
> [    0.209935]  [<ffffffff813908a6>] ? _write_lock_irq+0x1b/0x43
> [    0.209945]  [<ffffffff81060184>] ? do_exit+0x82/0x698
> [    0.209955]  [<ffffffff8105cd81>] ? release_console_sem+0x1a4/0x1ed
> [    0.209965]  [<ffffffff8101685f>] ? oops_end+0xc3/0xdf
> [    0.209974]  [<ffffffff8103bcab>] ? no_context+0x1fc/0x220
> [    0.209984]  [<ffffffff81200d1d>] ? dummycon_dummy+0x0/0x2e
> [    0.209993]  [<ffffffff8103be6f>] ? __bad_area_nosemaphore+0x1a0/0x1dc
> [    0.210002]  [<ffffffff8100e57f>] ? xen_restore_fl_direct_end+0x0/0x1
> [    0.210012]  [<ffffffff8105d46c>] ? vprintk+0x300/0x343
> [    0.210021]  [<ffffffff8100db31>] ? xen_force_evtchn_callback+0x1d/0x37
> [    0.210031]  [<ffffffff8103c16c>] ? do_page_fault+0xae/0x315
> [    0.210040]  [<ffffffff81391095>] ? page_fault+0x25/0x30
> [    0.210049]  [<ffffffff81200d1d>] ? dummycon_dummy+0x0/0x2e
> [    0.210058]  [<ffffffff81386142>] ? cache_add_dev+0x2e/0x2ae
> [    0.210068]  [<ffffffff81386503>] ? cache_sysfs_init+0x3b/0x86
> [    0.210079]  [<ffffffff81550787>] ? i8237A_init_sysfs+0x0/0x5e
> [    0.210088]  [<ffffffff813864c8>] ? cache_sysfs_init+0x0/0x86
> [    0.210097]  [<ffffffff8100a0a6>] ? do_one_initcall+0x78/0x1a8
> [    0.210107]  [<ffffffff81549a5b>] ? kernel_init+0x178/0x1e5
> [    0.210116]  [<ffffffff81012f2a>] ? child_rip+0xa/0x20
> [    0.210125]  [<ffffffff810120e7>] ? int_ret_from_sys_call+0x7/0x1b
> [    0.210133]  [<ffffffff8101289d>] ? retint_restore_args+0x5/0x6
> [    0.210143]  [<ffffffff81200d1d>] ? dummycon_dummy+0x0/0x2e
> [    0.210152]  [<ffffffff81200d1d>] ? dummycon_dummy+0x0/0x2e
> [    0.210161]  [<ffffffff81012f20>] ? child_rip+0x0/0x20
> 
> Regards,
> Markus Schuster
> 
> _______________________________________________
> Xen-users mailing list
> Xen-users@xxxxxxxxxxxxxxxxxxx
> http://lists.xensource.com/xen-users

_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users