WARNING - OLD ARCHIVES

This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/
   
 
 
Xen 
 
Home Products Support Community News
 
   
 

xen-users

Re: [Xen-users] pv_ops domU crashes on pv_ops dom0 (directly at boot)

To: Pasi Kärkkäinen <pasik@xxxxxx>
Subject: Re: [Xen-users] pv_ops domU crashes on pv_ops dom0 (directly at boot)
From: Jeremy Fitzhardinge <jeremy@xxxxxxxx>
Date: Sun, 06 Dec 2009 11:04:49 -0800
Cc: Markus Schuster <ml@xxxxxxxxxxxxxxxxxxxx>, xen-users@xxxxxxxxxxxxxxxxxxx
Delivery-date: Sun, 06 Dec 2009 11:05:35 -0800
Envelope-to: www-data@xxxxxxxxxxxxxxxxxxx
In-reply-to: <20091206004046.GL16033@xxxxxxxxxxx>
List-help: <mailto:xen-users-request@lists.xensource.com?subject=help>
List-id: Xen user discussion <xen-users.lists.xensource.com>
List-post: <mailto:xen-users@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=unsubscribe>
References: <200912052352.06078.ml@xxxxxxxxxxxxxxxxxxxx> <20091206004046.GL16033@xxxxxxxxxxx>
Sender: xen-users-bounces@xxxxxxxxxxxxxxxxxxx
User-agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.1.5) Gecko/20091125 Fedora/3.0-3.12.rc1.fc12 Lightning/1.0pre Thunderbird/3.0
On 12/05/09 16:40, Pasi Kärkkäinen wrote:
Hello,

Jeremy: Is this already known bug/crash?

I don't recognize it. What CPU is it? It would be useful to work out exactly what line is crashing.

    J

-- Pasi

On Sat, Dec 05, 2009 at 11:52:06PM +0100, Markus Schuster wrote:
Hi list,

I'm experiencing some really strange behavior when booting a pv_ops domU more
recent than kernel 2.6.29 (!) on a pv_ops dom0 (directly from git).
I'm using XEN 3.4.2 (Debian unstable packages (3.4.2-2) compiled for lenny) on
Debian lenny.
I've checked out pv_ops dom0 source as described in the wiki - currently I'm
working with changeset 8735edb4a976105fd29c97c00c6d14760537e4ee.
domU kernel is a prebuilt kernel image from Debian unstable.

The strange thing: With the same binaries, I can't trigger this problem in my
test environment (Athlon 64 3500+ socket 939, 2G RAM), only on my production
server (2x Opteron 2218 (dual core) socket F, 8GB RAM) - maybe it's a SMP
related problem?

dom0 .config:
http://markus.schuster.name/bugreport/xen/dom0-config.txt

kernel output of crashing domU
http://markus.schuster.name/bugreport/xen/domU-2.6.31.txt
http://markus.schuster.name/bugreport/xen/domU-2.6.32-rc8.txt

kernel output of working domU
http://markus.schuster.name/bugreport/xen/domU-2.6.29.txt

For those just wanting to have a quick look, here is the relevant kernel
output from a 2.6.31 domU:
[..]
[    0.012368] Booting paravirtualized kernel on Xen
[    0.012368] Xen version: 3.4 (preserve-AD)
[    0.012368] Grant table initialized
[    0.012368] regulator: core version 0.5
[    0.012368] NET: Registered protocol family 16
[    0.108050] PCI: Fatal: No config space access function found
[    0.108555] bio: create slab<bio-0>  at 0
[    0.108555] ACPI: Interpreter disabled.
[    0.108555] xen_balloon: Initialising balloon driver.
[    0.112044] usbcore: registered new interface driver usbfs
[    0.112074] usbcore: registered new interface driver hub
[    0.112099] usbcore: registered new device driver usb
[    0.112099] PCI: System does not support PCI
[    0.112099] PCI: System does not support PCI
[    0.125982] pnp: PnP ACPI: disabled
[    0.126234] NET: Registered protocol family 2
[    0.126234] IP route cache hash table entries: 65536 (order: 7, 524288
bytes)
[    0.128563] TCP established hash table entries: 262144 (order: 10, 4194304
bytes)
[    0.130630] TCP bind hash table entries: 65536 (order: 8, 1048576 bytes)
[    0.131149] TCP: Hash tables configured (established 262144 bind 65536)
[    0.131164] TCP reno registered
[    0.131416] NET: Registered protocol family 1
[    0.131510] Unpacking initramfs...
[    0.162893] Freeing initrd memory: 21240k freed
[    0.176404] platform rtc_cmos: registered platform RTC device (no PNP
device found)
[    0.176433] BUG: unable to handle kernel NULL pointer dereference at (null)
[    0.176445] IP: [<ffffffff81386142>] cache_add_dev+0x2e/0x2ae
[    0.176461] PGD 0
[    0.176468] Oops: 0000 [#1] SMP
[    0.176477] last sysfs file:
[    0.176484] CPU 0
[    0.176490] Modules linked in:
[    0.176500] Pid: 1, comm: swapper Tainted: G        W  2.6.31-1-amd64 #1
[    0.176508] RIP: e030:[<ffffffff81386142>]  [<ffffffff81386142>]
cache_add_dev+0x2e/0x2ae
[    0.176522] RSP: e02b:ffff88007ffe5e50  EFLAGS: 00010246
[    0.176529] RAX: 0000000000000003 RBX: 0000000000000000 RCX:
000000008a54cc73
[    0.176537] RDX: 0000000000000000 RSI: 0000000000000000 RDI:
0000000000000000
[    0.176545] RBP: ffffffff8152df60 R08: 0000000000000200 R09:
ffffffff8152df60
[    0.176553] R10: 0000000000000000 R11: ffffffff81200d1d R12:
00000000fffffffe
[    0.176561] R13: ffffffff8152dfa0 R14: ffffffff8158af20 R15:
0000000000000000
[    0.176573] FS:  0000000000000000(0000) GS:ffffc90000000000(0000)
knlGS:0000000000000000
[    0.176583] CS:  e033 DS: 0000 ES: 0000 CR0: 000000008005003b
[    0.176590] CR2: 0000000000000000 CR3: 0000000001001000 CR4:
0000000000000660
[    0.176599] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
0000000000000000
[    0.176607] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7:
0000000000000400
[    0.176615] Process swapper (pid: 1, threadinfo ffff88007ffe4000, task
ffff88007ffc6740)
[    0.176625] Stack:
[    0.176630]  0000000000000002 00000000ffffffea ffffffff8152dfa0
ffffffff8158af20
[    0.176643]<0>  000000008a54cc73 0000000000000000 ffffffff8152df60
0000000000000000
[    0.176658]<0>  ffffffff8152dfa0 ffffffff8158af20 00000000014bd800
ffffffff81386503
[    0.176677] Call Trace:
[    0.176687]  [<ffffffff81386503>] ? cache_sysfs_init+0x3b/0x86
[    0.176699]  [<ffffffff81550787>] ? i8237A_init_sysfs+0x0/0x5e
[    0.176708]  [<ffffffff813864c8>] ? cache_sysfs_init+0x0/0x86
[    0.176718]  [<ffffffff8100a0a6>] ? do_one_initcall+0x78/0x1a8
[    0.176727]  [<ffffffff81549a5b>] ? kernel_init+0x178/0x1e5
[    0.176738]  [<ffffffff81012f2a>] ? child_rip+0xa/0x20
[    0.180013]  [<ffffffff810120e7>] ? int_ret_from_sys_call+0x7/0x1b
[    0.180013]  [<ffffffff8101289d>] ? retint_restore_args+0x5/0x6
[    0.180013]  [<ffffffff81200d1d>] ? dummycon_dummy+0x0/0x2e
[    0.180013]  [<ffffffff81200d1d>] ? dummycon_dummy+0x0/0x2e
[    0.180013]  [<ffffffff81012f20>] ? child_rip+0x0/0x20
[    0.180013] Code: 89 ff 41 56 41 55 41 54 41 bc fe ff ff ff 55 53 48 83 ec
28 65 48 8b 04 25 28 00 00 00 48 89 44 24 20 31 c0 66 8b 05 4e 35 29 00<44>
8b 37 66 85 c0 0f 84 4d 02 00 00 45 89 f5 0f b7 f8 48 c7 c5
[    0.180013] RIP  [<ffffffff81386142>] cache_add_dev+0x2e/0x2ae
[    0.180013]  RSP<ffff88007ffe5e50>
[    0.180013] CR2: 0000000000000000
[    0.180013] ---[ end trace a7919e7f17c0a726 ]---
[    0.209859] Kernel panic - not syncing: Attempted to kill init!
[    0.209869] Pid: 1, comm: swapper Tainted: G      D W  2.6.31-1-amd64 #1
[    0.209877] Call Trace:
[    0.209886]  [<ffffffff8138df3c>] ? panic+0x96/0x14a
[    0.209895]  [<ffffffff81200d1d>] ? dummycon_dummy+0x0/0x2e
[    0.209906]  [<ffffffff8100e592>] ? check_events+0x12/0x20
[    0.209915]  [<ffffffff813908a6>] ? _write_lock_irq+0x1b/0x43
[    0.209926]  [<ffffffff8106828b>] ? exit_ptrace+0xb7/0x14b
[    0.209935]  [<ffffffff813908a6>] ? _write_lock_irq+0x1b/0x43
[    0.209945]  [<ffffffff81060184>] ? do_exit+0x82/0x698
[    0.209955]  [<ffffffff8105cd81>] ? release_console_sem+0x1a4/0x1ed
[    0.209965]  [<ffffffff8101685f>] ? oops_end+0xc3/0xdf
[    0.209974]  [<ffffffff8103bcab>] ? no_context+0x1fc/0x220
[    0.209984]  [<ffffffff81200d1d>] ? dummycon_dummy+0x0/0x2e
[    0.209993]  [<ffffffff8103be6f>] ? __bad_area_nosemaphore+0x1a0/0x1dc
[    0.210002]  [<ffffffff8100e57f>] ? xen_restore_fl_direct_end+0x0/0x1
[    0.210012]  [<ffffffff8105d46c>] ? vprintk+0x300/0x343
[    0.210021]  [<ffffffff8100db31>] ? xen_force_evtchn_callback+0x1d/0x37
[    0.210031]  [<ffffffff8103c16c>] ? do_page_fault+0xae/0x315
[    0.210040]  [<ffffffff81391095>] ? page_fault+0x25/0x30
[    0.210049]  [<ffffffff81200d1d>] ? dummycon_dummy+0x0/0x2e
[    0.210058]  [<ffffffff81386142>] ? cache_add_dev+0x2e/0x2ae
[    0.210068]  [<ffffffff81386503>] ? cache_sysfs_init+0x3b/0x86
[    0.210079]  [<ffffffff81550787>] ? i8237A_init_sysfs+0x0/0x5e
[    0.210088]  [<ffffffff813864c8>] ? cache_sysfs_init+0x0/0x86
[    0.210097]  [<ffffffff8100a0a6>] ? do_one_initcall+0x78/0x1a8
[    0.210107]  [<ffffffff81549a5b>] ? kernel_init+0x178/0x1e5
[    0.210116]  [<ffffffff81012f2a>] ? child_rip+0xa/0x20
[    0.210125]  [<ffffffff810120e7>] ? int_ret_from_sys_call+0x7/0x1b
[    0.210133]  [<ffffffff8101289d>] ? retint_restore_args+0x5/0x6
[    0.210143]  [<ffffffff81200d1d>] ? dummycon_dummy+0x0/0x2e
[    0.210152]  [<ffffffff81200d1d>] ? dummycon_dummy+0x0/0x2e
[    0.210161]  [<ffffffff81012f20>] ? child_rip+0x0/0x20

Regards,
Markus Schuster

_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users


_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users