WARNING - OLD ARCHIVES

This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/
   
 
 
[Xen-users] pv_ops domU crashes on pv_ops dom0 (directly at boot)

To: xen-users@xxxxxxxxxxxxxxxxxxx
From: Markus Schuster <ml@xxxxxxxxxxxxxxxxxxxx>
Date: Sat, 5 Dec 2009 23:52:06 +0100
Hi list, 

I'm seeing some really strange behavior when booting any pv_ops domU kernel
newer than 2.6.29 (!) on a pv_ops dom0 built directly from git.
I'm using Xen 3.4.2 (the Debian unstable packages, 3.4.2-2, compiled for
lenny) on Debian lenny.
I checked out the pv_ops dom0 source as described in the wiki; currently I'm
working with changeset 8735edb4a976105fd29c97c00c6d14760537e4ee.
The domU kernel is a prebuilt kernel image from Debian unstable.

The strange thing: with the same binaries, I can't trigger this problem in my
test environment (Athlon 64 3500+ socket 939, 2 GB RAM), only on my production
server (2x Opteron 2218 (dual core) socket F, 8 GB RAM). Maybe it's an
SMP-related problem?

dom0 .config:
http://markus.schuster.name/bugreport/xen/dom0-config.txt

kernel output of crashing domU:
http://markus.schuster.name/bugreport/xen/domU-2.6.31.txt
http://markus.schuster.name/bugreport/xen/domU-2.6.32-rc8.txt

kernel output of working domU:
http://markus.schuster.name/bugreport/xen/domU-2.6.29.txt

For those just wanting to have a quick look, here is the relevant kernel 
output from a 2.6.31 domU:
[..]
[    0.012368] Booting paravirtualized kernel on Xen
[    0.012368] Xen version: 3.4 (preserve-AD)
[    0.012368] Grant table initialized
[    0.012368] regulator: core version 0.5
[    0.012368] NET: Registered protocol family 16
[    0.108050] PCI: Fatal: No config space access function found
[    0.108555] bio: create slab <bio-0> at 0
[    0.108555] ACPI: Interpreter disabled.
[    0.108555] xen_balloon: Initialising balloon driver.
[    0.112044] usbcore: registered new interface driver usbfs
[    0.112074] usbcore: registered new interface driver hub
[    0.112099] usbcore: registered new device driver usb
[    0.112099] PCI: System does not support PCI
[    0.112099] PCI: System does not support PCI
[    0.125982] pnp: PnP ACPI: disabled
[    0.126234] NET: Registered protocol family 2
[    0.126234] IP route cache hash table entries: 65536 (order: 7, 524288 bytes)
[    0.128563] TCP established hash table entries: 262144 (order: 10, 4194304 bytes)
[    0.130630] TCP bind hash table entries: 65536 (order: 8, 1048576 bytes)
[    0.131149] TCP: Hash tables configured (established 262144 bind 65536)
[    0.131164] TCP reno registered
[    0.131416] NET: Registered protocol family 1
[    0.131510] Unpacking initramfs...
[    0.162893] Freeing initrd memory: 21240k freed
[    0.176404] platform rtc_cmos: registered platform RTC device (no PNP device found)
[    0.176433] BUG: unable to handle kernel NULL pointer dereference at (null)
[    0.176445] IP: [<ffffffff81386142>] cache_add_dev+0x2e/0x2ae
[    0.176461] PGD 0
[    0.176468] Oops: 0000 [#1] SMP
[    0.176477] last sysfs file:
[    0.176484] CPU 0
[    0.176490] Modules linked in:
[    0.176500] Pid: 1, comm: swapper Tainted: G        W  2.6.31-1-amd64 #1
[    0.176508] RIP: e030:[<ffffffff81386142>]  [<ffffffff81386142>] cache_add_dev+0x2e/0x2ae
[    0.176522] RSP: e02b:ffff88007ffe5e50  EFLAGS: 00010246
[    0.176529] RAX: 0000000000000003 RBX: 0000000000000000 RCX: 000000008a54cc73
[    0.176537] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[    0.176545] RBP: ffffffff8152df60 R08: 0000000000000200 R09: ffffffff8152df60
[    0.176553] R10: 0000000000000000 R11: ffffffff81200d1d R12: 00000000fffffffe
[    0.176561] R13: ffffffff8152dfa0 R14: ffffffff8158af20 R15: 0000000000000000
[    0.176573] FS:  0000000000000000(0000) GS:ffffc90000000000(0000) knlGS:0000000000000000
[    0.176583] CS:  e033 DS: 0000 ES: 0000 CR0: 000000008005003b
[    0.176590] CR2: 0000000000000000 CR3: 0000000001001000 CR4: 0000000000000660
[    0.176599] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[    0.176607] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[    0.176615] Process swapper (pid: 1, threadinfo ffff88007ffe4000, task ffff88007ffc6740)
[    0.176625] Stack:
[    0.176630]  0000000000000002 00000000ffffffea ffffffff8152dfa0 ffffffff8158af20
[    0.176643] <0> 000000008a54cc73 0000000000000000 ffffffff8152df60 0000000000000000
[    0.176658] <0> ffffffff8152dfa0 ffffffff8158af20 00000000014bd800 ffffffff81386503
[    0.176677] Call Trace:
[    0.176687]  [<ffffffff81386503>] ? cache_sysfs_init+0x3b/0x86
[    0.176699]  [<ffffffff81550787>] ? i8237A_init_sysfs+0x0/0x5e
[    0.176708]  [<ffffffff813864c8>] ? cache_sysfs_init+0x0/0x86
[    0.176718]  [<ffffffff8100a0a6>] ? do_one_initcall+0x78/0x1a8
[    0.176727]  [<ffffffff81549a5b>] ? kernel_init+0x178/0x1e5
[    0.176738]  [<ffffffff81012f2a>] ? child_rip+0xa/0x20
[    0.180013]  [<ffffffff810120e7>] ? int_ret_from_sys_call+0x7/0x1b
[    0.180013]  [<ffffffff8101289d>] ? retint_restore_args+0x5/0x6
[    0.180013]  [<ffffffff81200d1d>] ? dummycon_dummy+0x0/0x2e
[    0.180013]  [<ffffffff81200d1d>] ? dummycon_dummy+0x0/0x2e
[    0.180013]  [<ffffffff81012f20>] ? child_rip+0x0/0x20
[    0.180013] Code: 89 ff 41 56 41 55 41 54 41 bc fe ff ff ff 55 53 48 83 ec 28 65 48 8b 04 25 28 00 00 00 48 89 44 24 20 31 c0 66 8b 05 4e 35 29 00 <44> 8b 37 66 85 c0 0f 84 4d 02 00 00 45 89 f5 0f b7 f8 48 c7 c5
[    0.180013] RIP  [<ffffffff81386142>] cache_add_dev+0x2e/0x2ae
[    0.180013]  RSP <ffff88007ffe5e50>
[    0.180013] CR2: 0000000000000000
[    0.180013] ---[ end trace a7919e7f17c0a726 ]---
[    0.209859] Kernel panic - not syncing: Attempted to kill init!
[    0.209869] Pid: 1, comm: swapper Tainted: G      D W  2.6.31-1-amd64 #1
[    0.209877] Call Trace:
[    0.209886]  [<ffffffff8138df3c>] ? panic+0x96/0x14a
[    0.209895]  [<ffffffff81200d1d>] ? dummycon_dummy+0x0/0x2e
[    0.209906]  [<ffffffff8100e592>] ? check_events+0x12/0x20
[    0.209915]  [<ffffffff813908a6>] ? _write_lock_irq+0x1b/0x43
[    0.209926]  [<ffffffff8106828b>] ? exit_ptrace+0xb7/0x14b
[    0.209935]  [<ffffffff813908a6>] ? _write_lock_irq+0x1b/0x43
[    0.209945]  [<ffffffff81060184>] ? do_exit+0x82/0x698
[    0.209955]  [<ffffffff8105cd81>] ? release_console_sem+0x1a4/0x1ed
[    0.209965]  [<ffffffff8101685f>] ? oops_end+0xc3/0xdf
[    0.209974]  [<ffffffff8103bcab>] ? no_context+0x1fc/0x220
[    0.209984]  [<ffffffff81200d1d>] ? dummycon_dummy+0x0/0x2e
[    0.209993]  [<ffffffff8103be6f>] ? __bad_area_nosemaphore+0x1a0/0x1dc
[    0.210002]  [<ffffffff8100e57f>] ? xen_restore_fl_direct_end+0x0/0x1
[    0.210012]  [<ffffffff8105d46c>] ? vprintk+0x300/0x343
[    0.210021]  [<ffffffff8100db31>] ? xen_force_evtchn_callback+0x1d/0x37
[    0.210031]  [<ffffffff8103c16c>] ? do_page_fault+0xae/0x315
[    0.210040]  [<ffffffff81391095>] ? page_fault+0x25/0x30
[    0.210049]  [<ffffffff81200d1d>] ? dummycon_dummy+0x0/0x2e
[    0.210058]  [<ffffffff81386142>] ? cache_add_dev+0x2e/0x2ae
[    0.210068]  [<ffffffff81386503>] ? cache_sysfs_init+0x3b/0x86
[    0.210079]  [<ffffffff81550787>] ? i8237A_init_sysfs+0x0/0x5e
[    0.210088]  [<ffffffff813864c8>] ? cache_sysfs_init+0x0/0x86
[    0.210097]  [<ffffffff8100a0a6>] ? do_one_initcall+0x78/0x1a8
[    0.210107]  [<ffffffff81549a5b>] ? kernel_init+0x178/0x1e5
[    0.210116]  [<ffffffff81012f2a>] ? child_rip+0xa/0x20
[    0.210125]  [<ffffffff810120e7>] ? int_ret_from_sys_call+0x7/0x1b
[    0.210133]  [<ffffffff8101289d>] ? retint_restore_args+0x5/0x6
[    0.210143]  [<ffffffff81200d1d>] ? dummycon_dummy+0x0/0x2e
[    0.210152]  [<ffffffff81200d1d>] ? dummycon_dummy+0x0/0x2e
[    0.210161]  [<ffffffff81012f20>] ? child_rip+0x0/0x20
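
For anyone who wants to compare the linked logs without reading every line,
here is a minimal Python sketch that pulls the faulting symbol, a few
registers and the call chain out of an oops like the one above. The names
OOPS and summarize_oops are made up for illustration, the excerpt is abridged
from the 2.6.31 trace (with the wrapped lines joined), and the regular
expressions only assume the line layout visible in this log.

```python
import re

# Abridged excerpt of the 2.6.31 domU oops quoted above (wrapped lines joined).
OOPS = """\
[    0.176433] BUG: unable to handle kernel NULL pointer dereference at (null)
[    0.176445] IP: [<ffffffff81386142>] cache_add_dev+0x2e/0x2ae
[    0.176529] RAX: 0000000000000003 RBX: 0000000000000000 RCX: 000000008a54cc73
[    0.176537] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[    0.176590] CR2: 0000000000000000 CR3: 0000000001001000 CR4: 0000000000000660
[    0.176677] Call Trace:
[    0.176687]  [<ffffffff81386503>] ? cache_sysfs_init+0x3b/0x86
[    0.176718]  [<ffffffff8100a0a6>] ? do_one_initcall+0x78/0x1a8
[    0.176727]  [<ffffffff81549a5b>] ? kernel_init+0x178/0x1e5
"""


def summarize_oops(text):
    """Extract the faulting symbol, registers and call trace from oops text."""
    # "IP: [<addr>] symbol+0xoffset/0xsize" names the faulting instruction.
    ip = re.search(
        r"IP: \[<([0-9a-f]+)>\] (\w+)\+(0x[0-9a-f]+)/(0x[0-9a-f]+)", text
    )
    # General-purpose and control registers are printed as 16 hex digits.
    regs = {
        name: int(value, 16)
        for name, value in re.findall(
            r"\b(R[A-Z0-9]{2}|CR\d): ([0-9a-f]{16})\b", text
        )
    }
    # Call-trace entries carry a "?" marker before the symbol name.
    trace = re.findall(r"\[<[0-9a-f]+>\] \? (\w+)\+0x", text)
    return {
        "address": int(ip.group(1), 16),
        "symbol": ip.group(2),
        "offset": int(ip.group(3), 16),
        "size": int(ip.group(4), 16),
        "regs": regs,
        "trace": trace,
    }


if __name__ == "__main__":
    info = summarize_oops(OOPS)
    print(info["symbol"], hex(info["offset"]), info["trace"])
    # CR2 holds the faulting address; zero matches the NULL dereference.
    print("CR2 is zero:", info["regs"]["CR2"] == 0)
```

Running the same function over the other linked logs should make it easy to
see whether they fail at the same symbol and offset.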

Regards,
Markus Schuster
