WARNING - OLD ARCHIVES

This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/
   
 
 
Xen 
 
Home Products Support Community News
 
   
 

xen-users

[Xen-users] old issue after 1024 live migrations seems to still exist.

To: Xen Users <xen-users@xxxxxxxxxxxxxxxxxxx>
Subject: [Xen-users] old issue after 1024 live migrations seems to still exist.
From: Florian Heigl <florian.heigl@xxxxxxxxx>
Date: Wed, 21 Jul 2010 16:38:28 +0200
Delivery-date: Wed, 21 Jul 2010 07:40:07 -0700
Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:received:date:message-id :subject:from:to:content-type; bh=ayr88XJ7tXz0n2r/pTdGokN3ZFIEJolap9HlMMO5z1A=; b=YqN0yJYweezacFHoVRlSmoFVuEG58WyHG6rqWCDr4/ZMHGtcxppu2JL8AW3HlxtFgu OlWDfAPZahsNHklZWxrQhNlQCBMtNSE/HKs5NQCyeqTryTFXLgdxKPFAysRqAeoyyMPJ q53eKhKDfZVVypgZ3jvfVLiwHBHURwARRdqVc=
Domainkey-signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:date:message-id:subject:from:to:content-type; b=B5P3HjQhH+kXFZiagpp+ygtkCm3xmNpCHegLsRwR2VSwSDJYC7CD7cNHuxif8EdVKb PujGr+QEs9qlsxY/P+PdDfk4hQE+AwzEx8ejMowBJailT6XyZ0xobMvCOWpMhNJz7ogC jUOvnAb7weVvKBjBvblxYZANiGdqBigsMyTNI=
Envelope-to: www-data@xxxxxxxxxxxxxxxxxxx
List-help: <mailto:xen-users-request@lists.xensource.com?subject=help>
List-id: Xen user discussion <xen-users.lists.xensource.com>
List-post: <mailto:xen-users@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=unsubscribe>
Sender: xen-users-bounces@xxxxxxxxxxxxxxxxxxx
Hi,

last month I did some checkig of old Xen issues that I remember and
found this one to still exist - if you do a high amount of live
migrations at some point the xen daemon chokes and dies.
The issue was reported by someone on the list like 4-5 years ago, but
it seems it hasn't been fixed (not sure if anyone even replied back
then)
The Xen version I used to test as 3.4.0 from Oracle VM 2.2

Basically You just ping-pong one domU and somewhere after 900
migrations you first see it drop the ball a few times (vm needs to be
restarted) and then about 100 times later one one of the hosts the xen
daemon will crash, restart and not be able to boot vm's any more.

(I waited a while to post this, but about time now I get it done)
I'm building some power management magic witrh loadbalancing so that
idle servers can automatically shutdown and startup, and cpu intensive
vm's can be distributed evenly.That this bug still exists is a
nightmare: 1024 migrations sounds a lot, but with 128 VMs on a host it
just equals just 4 migrations per VM, right? Without the loadbalancing
bit this wouldn't have to happen very often, but I think it's a key
feature.
If the RDMA live migration ever comes around, there'd be nothing against it...

I've also prepared a clumsy script for the test, which can be found here:

http://wartungsfenster.pastebin.org/410803

I can open a bug report but i think it'd be best if someone re-test on
Xen4 first.

Regards,
Florian

p.s.:
why is live migration so slow (2-3 seconds)  - without sdp i had 2-3
gbit of bandwidth, the vm was 64MB size  (that means 1/6 second of
transfer for the main bulk) and idle without networking!
is it just the gratious arp?

-- 
'Sie brauchen sich um Ihre Zukunft keine Gedanken zu machen'

_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users