This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/
Home Products Support Community News


Re: [Xen-users] Scary!!! Lost domU!!!

To: James Pifer <jep@xxxxxxxxxxxxxxxx>
Subject: Re: [Xen-users] Scary!!! Lost domU!!!
From: Pasi Kärkkäinen <pasik@xxxxxx>
Date: Thu, 31 Dec 2009 15:59:24 +0200
Cc: Xen List <xen-users@xxxxxxxxxxxxxxxxxxx>
Delivery-date: Thu, 31 Dec 2009 06:00:10 -0800
Envelope-to: www-data@xxxxxxxxxxxxxxxxxxx
In-reply-to: <1262233922.15120.9.camel@xxxxxxxxxxxxxxxxxxxxxxxx>
List-help: <mailto:xen-users-request@lists.xensource.com?subject=help>
List-id: Xen user discussion <xen-users.lists.xensource.com>
List-post: <mailto:xen-users@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=unsubscribe>
References: <1262233922.15120.9.camel@xxxxxxxxxxxxxxxxxxxxxxxx>
Sender: xen-users-bounces@xxxxxxxxxxxxxxxxxxx
User-agent: Mutt/1.5.13 (2006-08-11)
On Wed, Dec 30, 2009 at 11:32:01PM -0500, James Pifer wrote:
> I've been on vacation (and still am) but had to work on a couple
> problems. I have a couple Citrix servers that are domU's on a sles10sp2
> server that has local storage and connects to two ocfs2 volumes. 
> I tried to restart one of the Citrix servers and it would not restart,
> giving an error that the disk was already mounted in a loopback, etc. I
> looked at mount and didn't see anything mounted and I had just shut the
> domU down. I assumed it had not shut down completely. This domU runs
> from the local disk. 
> So I decided to a restart of the host was in order. I downed the rest of
> the domU's, including an oracle server running off one of the ocfs2
> clusters. This servers has been being used for the last three weeks from
> this location. 
> After restarting dom0 I started bringing the domU's back up. All of them
> came back up fine, except for the oracle server. It gave an error that
> the disk files did not exist, and they don't, they aren't there
> anymore. 
> I checked and double checked history to see if any rm commands had been
> given and I didn't find any. 
> ???When I restarted, there was an error on one of the local file systems
> that said "JDB: barrier-based sync failed...".
> Luckily I have a copy of this domU from a few weeks ago BEFORE I copied
> it to the ocfs2 volume. What could explain the sudden deletion of a
> directory like this?
> If this happened on some of the other domU's it could be ugly. 
> Any advice is appreciated!!!!

Sounds like a problem with OCFS2. This is exactly the reason why I don't
like storing VM disk images on a filesystem - fsck or this kind of weird
filesystem error can completely f*ck up the disk images.

I suggest LVM for guest disks.

Sorry, I can't really help with the problem. Did you try fsck? 

-- Pasi

Xen-users mailing list

<Prev in Thread] Current Thread [Next in Thread>