This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/
Home Products Support Community News


Re: [Xen-users] live migration with xen 2.0.7 with fibre channelonDebian

To: Steven Hand <Steven.Hand@xxxxxxxxxxxx>
Subject: Re: [Xen-users] live migration with xen 2.0.7 with fibre channelonDebian - help needed
From: Michael Mey <michael.mey@xxxxxx>
Date: Thu, 8 Dec 2005 10:54:05 +0100
Cc: xen-users@xxxxxxxxxxxxxxxxxxx
Delivery-date: Thu, 08 Dec 2005 09:54:59 +0000
Envelope-to: www-data@xxxxxxxxxxxxxxxxxxx
In-reply-to: <E1Ek276-0008Nf-00@xxxxxxxxxxxxxxxxx>
List-help: <mailto:xen-users-request@lists.xensource.com?subject=help>
List-id: Xen user discussion <xen-users.lists.xensource.com>
List-post: <mailto:xen-users@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/cgi-bin/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/cgi-bin/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=unsubscribe>
Organization: Thinking Objects Software GmbH
References: <E1Ek276-0008Nf-00@xxxxxxxxxxxxxxxxx>
Sender: xen-users-bounces@xxxxxxxxxxxxxxxxxxx
User-agent: KMail/1.9
On Wednesday 07 December 2005 17:25, Steven Hand wrote:
> > I had this exact same problem with 2.0.7.  I had done a little
> > investigation and found scheduled_work gets called to schedule the
> > shutdown in the user domain kernel, but the shutdown work that gets
> > scheduled never actually gets called.  I'm glad someone else is
> > seeing this same problem now :-) Like you, it worked a number of
> > times in a row, then would fail, and it didn't seem to matter if
> > there was really any load going on or not.
> At least one issue with live migration in 2.0.7 is fixed in the
> the 2.0-testing tree (cset 3513:80a8b005b669 if you want to just
> apply the patch directly)
> This doesn't explain the situation where your domain dies tho; do
> you have any console information from the domain in question?
The only thing I have is some syslog output of the domU, but there's not more 
than this usual lines:
Dec  6 15:47:16 debian1 kernel: Xen reported: 2806.425 MHz processor.
Dec  6 15:51:12 debian1 kernel: Xen reported: 2806.429 MHz processor.
Dec  6 15:54:54 debian1 kernel: Xen reported: 2806.425 MHz processor.

and the log file of my i/o testing script, but these are only md5 checksums. 
The script writes several times a dummy text into a file on the hd and 
finally builds a md5 checksum of the file for integrity test.
That logfile is fine, it's always the same md5.

Is there a possibility to further investigate what is happening with domU?

The funny thing is, yesterday afternoon I started a new testrun, this time 
using a ramdisk as storage for the i/o test. That domU is still running 
(after approx. 350 migrations). 
So it seems to me as if there is a problem with storage during migration, 
which sometimes occurs earlier, sometimes later.

The fibre channel adapters and the san is tested and works 100% correctly, so 
this shouldn't be the reason.



Michael Mey                                  
Thinking Objects Software GmbH    |   mailto: michael.mey@xxxxxx 
Lilienthalstrasse 2/1                         |   phone: +49 711 88770-147
70825 Stuttgart-Korntal, Germany  |   fax: +49 711 88770-449

Attachment: pgpeiXnzDf1Rx.pgp
Description: PGP signature

Xen-users mailing list