This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/
Home Products Support Community News


[Xen-devel] xen zombie while starting on "Secondary" DRBD device

To: xen-devel@xxxxxxxxxxxxxxxxxxx
Subject: [Xen-devel] xen zombie while starting on "Secondary" DRBD device
From: Fabio Pietrosanti <lists@xxxxxxxxxxxxxx>
Date: Fri, 21 Jul 2006 11:32:24 +0200
Delivery-date: Fri, 21 Jul 2006 05:03:00 -0700
Envelope-to: www-data@xxxxxxxxxxxxxxxxxx
List-help: <mailto:xen-devel-request@lists.xensource.com?subject=help>
List-id: Xen developer discussion <xen-devel.lists.xensource.com>
List-post: <mailto:xen-devel@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/cgi-bin/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/cgi-bin/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=unsubscribe>
Sender: xen-devel-bounces@xxxxxxxxxxxxxxxxxxx
Hi all,

i am implementing a HA infrastructure composed of 2 phisical server and
8 virtual server with XEN.

Xen 3.0.2
DRBD 0.7 (debian)

Data redundancy and replication is handled by DRBD.

I used this howto:

I am stacking several technology:
XEN over DRBD over LVM over RAID1(software)

The HA is handled by heartbeat.
I found a very bad race condition that, when appear, require a hard
reboot of the machine (standard reboot doesn't work and hang).

Basically if a XEN server start when the corresponding DRBD device is in
Secondary state it became a zombie and it's not possible to remove it.
It's not even possible to reboot the server because xendomains stop hang
the reboot process.

Doing an "xm list" show this:
Zombie-admin-server0           15       32     2 ----cd     0.4

Zombie xen server are a very bad problem in this kind of infrastructure.

It's a problem of XEN, it's a problem of DRBD or of both?

Xen-devel mailing list

<Prev in Thread] Current Thread [Next in Thread>
  • [Xen-devel] xen zombie while starting on "Secondary" DRBD device, Fabio Pietrosanti <=