WARNING - OLD ARCHIVES

This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/
   
 
 
Xen 
 
Home Products Support Community News
 
   
 

xen-users

Re: [Xen-users] Scary!!! Lost domU!!!

To: Xen List <xen-users@xxxxxxxxxxxxxxxxxxx>
Subject: Re: [Xen-users] Scary!!! Lost domU!!!
From: Jamon Camisso <jamonation@xxxxxxxxx>
Date: Fri, 01 Jan 2010 14:07:17 -0500
Delivery-date: Fri, 01 Jan 2010 11:08:46 -0800
Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from :user-agent:mime-version:to:subject:references:in-reply-to :content-type:content-transfer-encoding; bh=AfRXFzT+Io+GNMSVivH/F+l1je1MhteBZXMLVeWED5o=; b=WFLt8ykteWdyPJs/liHaegahWCnZsFgW8zRWQx2J08Ism4vPV5kBN8yAYvpCixTw19 JxdmelFtp5HMzu5C/y0gB6AmtoIHSGQQ/4NxvMVhQlTvSN6NcakHZCRHfADnEGH/4HOQ wgggpKSV9KCpg1yuokX2fHNsNpSpdpY/vwmgk=
Domainkey-signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:user-agent:mime-version:to:subject:references :in-reply-to:content-type:content-transfer-encoding; b=uiYqG55sqOZ0eT9Fi6TCcAnfJmf+f1DYoFbXqw+u1i89cwo6G6jSHwa7xOZyVsNmaD chSjzpNN1Q8zi+G0uJ2ES2kO9pz6SMwO2NphVH6RcqsstPh/k3HC67JZDHzHkM9O+xuB v3WEXfV2V99st139CrcQ1pnLxaQadrqzYyfAA=
Envelope-to: www-data@xxxxxxxxxxxxxxxxxxx
In-reply-to: <1262233922.15120.9.camel@xxxxxxxxxxxxxxxxxxxxxxxx>
List-help: <mailto:xen-users-request@lists.xensource.com?subject=help>
List-id: Xen user discussion <xen-users.lists.xensource.com>
List-post: <mailto:xen-users@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=unsubscribe>
References: <1262233922.15120.9.camel@xxxxxxxxxxxxxxxxxxxxxxxx>
Sender: xen-users-bounces@xxxxxxxxxxxxxxxxxxx
User-agent: Mozilla-Thunderbird 2.0.0.22 (X11/20090707)
James Pifer wrote:
I've been on vacation (and still am) but had to work on a couple
problems. I have a couple Citrix servers that are domU's on a sles10sp2
server that has local storage and connects to two ocfs2 volumes.

Is there more than just the sles server using both volumes? If not, have you considered using another filesystem? Personally I've had nothing but trouble with ocfs2 in Debian and Centos -- clusters would just randomly fall apart. I've also found that unless filesystem throughput is very good, ocfs2 would end up loosing writes by getting ahead of itself somehow. All depends on the storage backend I suppose.

I tried to restart one of the Citrix servers and it would not restart,
giving an error that the disk was already mounted in a loopback, etc. I
looked at mount and didn't see anything mounted and I had just shut the
domU down. I assumed it had not shut down completely. This domU runs
from the local disk.
So I decided to a restart of the host was in order. I downed the rest of
the domU's, including an oracle server running off one of the ocfs2
clusters. This servers has been being used for the last three weeks from
this location.
After restarting dom0 I started bringing the domU's back up. All of them
came back up fine, except for the oracle server. It gave an error that
the disk files did not exist, and they don't, they aren't there
anymore.
I checked and double checked history to see if any rm commands had been
given and I didn't find any.

Do you still have the system logs from the reboots? You might see the cluster falling apart there depending on how your dom0 shutdown the domUs.

When I restarted, there was an error on one of the local file systems
that said "JDB: barrier-based sync failed...".

Luckily I have a copy of this domU from a few weeks ago BEFORE I copied
it to the ocfs2 volume. What could explain the sudden deletion of a
directory like this?

Per above, what kind of storage architecture are you using underneath your ocfs2 volumes? I recall reading a bug that described the lost writes I mentioned, though I can't for the life of me find it now.

If this happened on some of the other domU's it could be ugly.
Any advice is appreciated!!!!

I've not used SLES, so maybe that's the determining factor, but have you tried gfs2 or lustre (http://wiki.lustre.org/index.php/Main_Page)? I particularly want to try the latter.

Jamon


_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users

<Prev in Thread] Current Thread [Next in Thread>