WARNING - OLD ARCHIVES

This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/
   
 
 
Xen 
 
Home Products Support Community News
 
   
 

xen-users

Re: [Xen-users] Debian Squeeze, xen, multipath and iscsi

To: xen-users@xxxxxxxxxxxxxxxxxxx
Subject: Re: [Xen-users] Debian Squeeze, xen, multipath and iscsi
From: Agustin Lopez <Agustin.Lopez@xxxxx>
Date: Wed, 16 Feb 2011 12:43:10 +0100
Delivery-date: Wed, 16 Feb 2011 03:44:48 -0800
Envelope-to: www-data@xxxxxxxxxxxxxxxxxxx
In-reply-to: <20110211141822.GZ3435@xxxxxxxxxxxxxx>
List-help: <mailto:xen-users-request@lists.xensource.com?subject=help>
List-id: Xen user discussion <xen-users.lists.xensource.com>
List-post: <mailto:xen-users@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=unsubscribe>
References: <4D553B39.9040508@xxxxx> <20110211141822.GZ3435@xxxxxxxxxxxxxx>
Sender: xen-users-bounces@xxxxxxxxxxxxxxxxxxx
User-agent: Mozilla/5.0 (Windows; U; Windows NT 6.0; es-ES; rv:1.9.2.13) Gecko/20101207 Thunderbird/3.1.7

Hi!

Thanks for the answer.
Below is the kernel message I am repeatly getting in the log.
The system crash only with the interaction between xen, iscsi and multipath.
Again, system is Debian Squeeze running on a Fujitsu  PRIMERGY RX200 S4 with 8 
cores Intel(R) Xeon(R) CPU           E5405  @ 2.00GHz
iSCSI is from a EMC cabinet.

Any help, please?

Agustin


Modules linked in: dm_round_robin scsi_dh_emc crc32c ib_iser rdma_cm ib_cm iw_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi bridge stp xen_evtchn xenfs dm_multipath dm_mod scsi_dh loop i2c_i801 usbhid ioatdma dca hid shpchp i2c_cor
Feb 16 12:08:01 ariete kernel: [  358.633003] Pid: 1672, comm: dmsetup Tainted: 
G      D    2.6.32-5-xen-amd64 #1 PRIMERGY RX200 S4
Feb 16 12:08:01 ariete kernel: [  358.633003] RIP: e030:[<ffffffff8130cb16>]  
[<ffffffff8130cb16>] _spin_lock+0x13/0x1b
Feb 16 12:08:01 ariete kernel: [  358.633003] RSP: e02b:ffff8807dbccdb10  
EFLAGS: 00000297
Feb 16 12:08:01 ariete kernel: [  358.633003] RAX: 0000000000000022 RBX: 
ffff8807dbccdb28 RCX: ffff8807dbccdb68
Feb 16 12:08:01 ariete kernel: [  358.633003] RDX: 0000000000000021 RSI: 
0000000000000200 RDI: ffff8807dbe1c300
Feb 16 12:08:01 ariete kernel: [  358.633003] RBP: 0000000000000200 R08: 
0000000000000008 R09: ffffffff814eb870
Feb 16 12:08:01 ariete kernel: [  358.633003] R10: 000000000000000b R11: 
ffff8807dbe1c280 R12: ffff8807dbe1c280
Feb 16 12:08:01 ariete kernel: [  358.633003] R13: 000000000000c580 R14: 
ffff8807dbccdb28 R15: ffffffff814eb830
Feb 16 12:08:01 ariete kernel: [  358.633003] FS:  00007fe9c607a7a0(0000) 
GS:ffff8800280c7000(0000) knlGS:0000000000000000
Feb 16 12:08:01 ariete kernel: [  358.633003] CS:  e033 DS: 0000 ES: 0000 CR0: 
000000008005003b
Feb 16 12:08:01 ariete kernel: [  358.633003] CR2: 00007fe9c5803420 CR3: 
0000000001001000 CR4: 0000000000002660
Feb 16 12:08:01 ariete kernel: [  358.633003] DR0: 0000000000000000 DR1: 
0000000000000000 DR2: 0000000000000000
Feb 16 12:08:01 ariete kernel: [  358.633003] DR3: 0000000000000000 DR6: 
00000000ffff0ff0 DR7: 0000000000000400
Feb 16 12:08:01 ariete kernel: [  358.633003] Call Trace:
Feb 16 12:08:01 ariete kernel: [  358.633003]  [<ffffffff8100dd87>] ? 
xen_exit_mmap+0xf8/0x136
Feb 16 12:08:01 ariete kernel: [  358.633003]  [<ffffffff810d1208>] ? 
exit_mmap+0x5a/0x148
Feb 16 12:08:01 ariete kernel: [  358.633003]  [<ffffffff8104cb09>] ? 
mmput+0x3c/0xdf
Feb 16 12:08:01 ariete kernel: [  358.633003]  [<ffffffff81050702>] ? 
exit_mm+0x102/0x10d
Feb 16 12:08:01 ariete kernel: [  358.633003]  [<ffffffff8130ca72>] ? 
_spin_lock_irq+0x7/0x22
Feb 16 12:08:01 ariete kernel: [  358.633003]  [<ffffffff81052127>] ? 
do_exit+0x1f8/0x6c6
Feb 16 12:08:01 ariete kernel: [  358.642915]  [<ffffffff8100ecdf>] ? 
xen_restore_fl_direct_end+0x0/0x1
Feb 16 12:08:01 ariete kernel: [  358.642915]  [<ffffffff8130cb3a>] ? 
_spin_unlock_irqrestore+0xd/0xe
Feb 16 12:08:01 ariete kernel: [  358.642915]  [<ffffffff8104f3af>] ? 
release_console_sem+0x17e/0x1af
Feb 16 12:08:01 ariete kernel: [  358.642915]  [<ffffffff8130d9dd>] ? 
oops_end+0xaf/0xb4
Feb 16 12:08:01 ariete kernel: [  358.642915]  [<ffffffff810135f0>] ? 
do_invalid_op+0x8b/0x95
Feb 16 12:08:01 ariete kernel: [  358.642915]  [<ffffffff8100c694>] ? 
pin_pagetable_pfn+0x2d/0x36
Feb 16 12:08:01 ariete kernel: [  358.642915]  [<ffffffffa01bb9ea>] ? 
copy_params+0x71/0xb1 [dm_mod]
Feb 16 12:08:01 ariete kernel: [  358.642915]  [<ffffffff810baf07>] ? 
__alloc_pages_nodemask+0x11c/0x5f5
Feb 16 12:08:01 ariete kernel: [  358.642915]  [<ffffffff8101293b>] ? 
invalid_op+0x1b/0x20
Feb 16 12:08:01 ariete kernel: [  358.642915]  [<ffffffff8100c694>] ? 
pin_pagetable_pfn+0x2d/0x36
Feb 16 12:08:01 ariete kernel: [  358.642915]  [<ffffffff8100c690>] ? 
pin_pagetable_pfn+0x29/0x36
Feb 16 12:08:01 ariete kernel: [  358.642915]  [<ffffffff810cd4e2>] ? 
__pte_alloc+0x6b/0xc6
Feb 16 12:08:01 ariete kernel: [  358.642915]  [<ffffffff810cb394>] ? 
pmd_alloc+0x28/0x5b
Feb 16 12:08:01 ariete kernel: [  358.642915]  [<ffffffff810cd60b>] ? 
handle_mm_fault+0xce/0x80f
Feb 16 12:08:01 ariete kernel: [  358.642915]  [<ffffffff810fbc5c>] ? 
do_vfs_ioctl+0x48d/0x4cb
Feb 16 12:08:01 ariete kernel: [  358.642915]  [<ffffffff8130f016>] ? 
do_page_fault+0x2e0/0x2fc
Feb 16 12:08:01 ariete kernel: [  358.642915]  [<ffffffff8130ceb5>] ? 
page_fault+0x25/0x30


El 11/02/2011 15:18, Henrik Langos escribió:
On Fri, Feb 11, 2011 at 02:35:53PM +0100, Agustin Lopez wrote:
Hi all!

I want to update my Debian Lenny xen servers to Squeeze.
I am testing with a new install. All installs Ok but when I install the
multipath
package I get a kernel crash.

I am searched a bit with google but I have not found any solution.

Are there anybody in the list working with this configuration?

PS: If I boot my server without Xen, with a standard kernel, multipath
and the other is working right.

What exactly crashes? dom0 ? Do you get a kernel dump on the console?

I have pretty much the same setup here (iSCSI + multipath + Xen +
Squeeze dom0 + Lenny/Etch PVM domUs) and I had some trouble with
multipath and iSCSI beeing a little touchy.

Basically my dom0 kernel hates to have fast iSCSI logout/login
sequences.

You'll have to give multipathd some time to cleanly remove
multipath devices before you do another login.

Otherwise I get stuff like this where kpartx (the thing that
manages of device nodes for partitions) triggers some
race condition:

Feb 10 06:46:43 xenhost03 kernel: [225060.039126] BUG: unable to handle kernel 
paging request at ffff88001558b010
Feb 10 06:46:43 xenhost03 kernel: [225060.039172] IP: [<ffffffff8100e428>] 
xen_set_pmd+0x15/0x2c
Feb 10 06:46:43 xenhost03 kernel: [225060.039210] PGD 1002067 PUD 1006067 PMD 
18a067 PTE 801000001558b065
Feb 10 06:46:43 xenhost03 kernel: [225060.039253] Oops: 0003 [#1] SMP
Feb 10 06:46:43 xenhost03 kernel: [225060.039284] last sysfs file: 
/sys/devices/virtual/block/dm-6/dm/suspended
Feb 10 06:46:43 xenhost03 kernel: [225060.039319] CPU 0
Feb 10 06:46:43 xenhost03 kernel: [225060.039344] Modules linked in: tun 
dm_round_robin crc32c xt_tcpudp nf_conntrack_ipv4 nf_defrag_ipv4 xt_state 
nf_conntrack xt_physdev iptable_filter ip_tables x_tables bridge stp xen_evtchn 
xenfs ib_iser rdma_cm ib_c
m iw_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi 
scsi_transport_iscsi dm_multipath scsi_dh loop snd_hda_intel snd_hda_codec 
snd_hwdep snd_pcm i915 drm_kms_helper drm snd_timer i2c_i801 evdev parport_pc 
psmouse serio_raw pcspkr i2c_
algo_bit parport i2c_core snd soundcore video output snd_page_alloc button 
processor acpi_processor ext3 jbd mbcache dm_mod sd_mod crc_t10dif usbhid hid 
uhci_hcd ata_generic ata_piix libata ehci_hcd scsi_mod e1000e usbcore nls_base 
thermal thermal_sys
  [last unloaded: scsi_wait_scan]
Feb 10 06:46:43 xenhost03 kernel: [225060.039851] Pid: 9259, comm: kpartx_id 
Not tainted 2.6.32-5-xen-amd64 #1 To Be Filled By O.E.M.
Feb 10 06:46:43 xenhost03 kernel: [225060.039904] RIP: e030:[<ffffffff8100e428>]  
[<ffffffff8100e428>] xen_set_pmd+0x15/0x2c
Feb 10 06:46:43 xenhost03 kernel: [225060.039959] RSP: e02b:ffff880013ad3b18  
EFLAGS: 00010246
Feb 10 06:46:43 xenhost03 kernel: [225060.039990] RAX: 0000000000000000 RBX: 
ffff88001558b010 RCX: ffff880000000000
Feb 10 06:46:43 xenhost03 kernel: [225060.040006] RDX: ffffea0000000000 RSI: 
0000000001cc0000 RDI: ffff88001558b010
Feb 10 06:46:43 xenhost03 kernel: [225060.040006] RBP: 0000000000000000 R08: 
0000000001cc0000 R09: ffff880073c03100
Feb 10 06:46:43 xenhost03 kernel: [225060.040006] R10: 0000000000000000 R11: 
ffff88002ce3bd78 R12: 000000000061c000
Feb 10 06:46:43 xenhost03 kernel: [225060.040006] R13: 0000000000400000 R14: 
ffff88001558b010 R15: ffff88002e156000
Feb 10 06:46:43 xenhost03 kernel: [225060.040006] FS:  00007f5094f64700(0000) 
GS:ffff880003630000(0000) knlGS:0000000000000000
Feb 10 06:46:43 xenhost03 kernel: [225060.040006] CS:  e033 DS: 0000 ES: 0000 
CR0: 000000008005003b
Feb 10 06:46:43 xenhost03 kernel: [225060.040006] CR2: ffff88001558b010 CR3: 
0000000011a2b000 CR4: 0000000000002660
Feb 10 06:46:43 xenhost03 kernel: [225060.040006] DR0: 0000000000000000 DR1: 
0000000000000000 DR2: 0000000000000000
Feb 10 06:46:43 xenhost03 kernel: [225060.040006] DR3: 0000000000000000 DR6: 
00000000ffff0ff0 DR7: 0000000000000400
Feb 10 06:46:43 xenhost03 kernel: [225060.040006] Process kpartx_id (pid: 9259, 
threadinfo ffff880013ad2000, task ffff880002747810)
Feb 10 06:46:43 xenhost03 kernel: [225060.040006] Stack:
Feb 10 06:46:43 xenhost03 kernel: [225060.040006]  ffff880000000000 
0000000000600000 0000000000400000 ffffffff810cf886
Feb 10 06:46:43 xenhost03 kernel: [225060.040006]<0>  ffff880013ad3fd8 
0000000017ab2067 ffff880002159180 0000000000000000
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]<0>  0000000000000000 
000000000061bfff 000000000061bfff 0000000001c00000
Feb 10 06:46:43 xenhost03 kernel: [225060.040636] Call Trace:
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff810cf886>] ? 
free_pgd_range+0x226/0x3bf
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff810cfabb>] ? 
free_pgtables+0x9c/0xbd
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff810d129d>] ? 
exit_mmap+0xef/0x148
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff8104cb09>] ? 
mmput+0x3c/0xdf
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff810f44d6>] ? 
flush_old_exec+0x45c/0x548
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff811270d0>] ? 
load_elf_binary+0x0/0x1954
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff8112746d>] ? 
load_elf_binary+0x39d/0x1954
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff810cc572>] ? 
follow_page+0x2ad/0x303
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff810ce136>] ? 
__get_user_pages+0x3ea/0x47b
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff810f4fcb>] ? 
get_arg_page+0x61/0x110
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff811270d0>] ? 
load_elf_binary+0x0/0x1954
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff810f3caa>] ? 
search_binary_handler+0xb4/0x245
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff810f54a7>] ? 
do_execve+0x1e4/0x2c3
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff81010500>] ? 
sys_execve+0x35/0x4c
Feb 10 06:46:43 xenhost03 kernel: [225060.040636]  [<ffffffff81011f9a>] ? 
stub_execve+0x6a/0xc0
Feb 10 06:46:43 xenhost03 kernel: [225060.040636] Code: fb ff ff e8 c6 f4 01 00 bf 01 
00 00 00 e8 c9 ea ff ff 59 5e 5b c3 55 48 89 f5 53 48 89 fb 48 83 ec 08 e8 6e e3 ff 
ff 84 c0 75 08<48>  89 2b 41 59 5b 5d c3 41 58 48 89 df 48 89 ee 5b 5d e9 7e ff
Feb 10 06:46:43 xenhost03 kernel: [225060.042981] RIP  [<ffffffff8100e428>] 
xen_set_pmd+0x15/0x2c
Feb 10 06:46:43 xenhost03 kernel: [225060.042981]  RSP<ffff880013ad3b18>
Feb 10 06:46:43 xenhost03 kernel: [225060.042981] CR2: ffff88001558b010
Feb 10 06:46:43 xenhost03 kernel: [225060.042981] ---[ end trace 
9939eec096f5a2de ]---


Also I noticed dom0 lockups of more than a minute when starting
HVM domUs while another domU was creating heavy IO load.
Those only disapeared when I gave my dom0 a fixed ammount of
RAM instead of balooning it down.


Other than that I had no bad trouble. (Well, life migration of
Lenny 32 bit domUs on 64bit dom0 doesn't work because the
Lenny domU kernel is not good at that).


I didn't do a new install of squeeze though. I started
with lenny and upgraded to squeeze.

cheers
-henrik


_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users



_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users

<Prev in Thread] Current Thread [Next in Thread>