[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-devel] xend falls over *a lot* in past 2 weeks



SD> Any thoughts on why this is might be the case?

So, as far as I can tell, there is some state being kept in xend,
which causes the problem.  In my testing, I create and destroy a
domain repeatedly with the same name.  Sometimes a destroy operation
marks the domain in XenDomainDict as "terminated", but doesn't
actually remove it.  Then, xend allows another domain by the same name
to be created, thus corrupting xend's internal domain list.  Next, the
create routines in xend try to unpause the domain referenced by the
name, which turns up the record from the list of the old domain, and
therefore the old domid.  The unpause routine makes a call to libxc to
unpause the old domid, which isn't found in the list, so ESRCH ("No
such process") is returned.

It seems to me that there are (at least) two problems here:

1. The domain objects in xend's list sometimes seem to stick around
   longer than they should after a destroy operation.

2. Xend will create a duplicate domain if asked, and therefore will
   corrupt its own internal list.

I'm testing a patch right now that will cause xend to do a quick
sanity check before creating a domain to make sure that the list does
not currently contain a domain object of the same name.

-- 
Dan Smith
IBM Linux Technology Center
Open Hypervisor Team
email: danms@xxxxxxxxxx


_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel


 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.