WARNING - OLD ARCHIVES

This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/
   
 
 
To: xen-devel@xxxxxxxxxxxxxxxxxxx
Subject: [Xen-devel] Re: A race condition introduced by changeset 15175: Re-init hypercall stubs page after HVM save/restore
From: Mukesh <mukesh.rathor@xxxxxxxxxx>
Date: Tue, 21 Oct 2008 03:38:48 +0000 (UTC)
Delivery-date: Mon, 20 Oct 2008 20:39:24 -0700
Envelope-to: www-data@xxxxxxxxxxxxxxxxxxx
List-help: <mailto:xen-devel-request@lists.xensource.com?subject=help>
List-id: Xen developer discussion <xen-devel.lists.xensource.com>
List-post: <mailto:xen-devel@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=unsubscribe>
References: <F4AE3CDE26E0164D9E990A34F2D4E0DF08A5F0CFA1@xxxxxxxxxxxxxxxxxxxxxxxxxxxx> <C510F587.27ACC%keir.fraser@xxxxxxxxxxxxx>
Sender: xen-devel-bounces@xxxxxxxxxxxxxxxxxxx
User-agent: Loom/3.14 (http://gmane.org/)

Keir Fraser <keir.fraser <at> eu.citrix.com> writes:

> 
> Hi Dexuan,
> 
> Are you really sure that this is the problem? The suspend_lock was
> introduced specifically to solve this problem. Note that the BSP takes this
> lock before messing with the hypercall page.
> 
>  -- Keir

I'm also looking at this now (I'm on 3.1.4, BTW). I see both a hang and a
panic. It appears I see the hang because the "master" vcpu is trying to catch
the other vcpus right at the cpu_relax() so it can grab the lock in write
mode. With many VCPUs that's just not happening. I'm not sure I like the
design of this very much; I'm going to try to modify it a bit.

thanks
mukesh
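
The hang described above can be illustrated with a toy model. This is only a sketch under stated assumptions, not Xen or Linux code: it assumes a reader-preference lock (a writer gets in only when no reader holds the lock), that each AP spends one tick at cpu_relax() and a fixed number of ticks holding the read lock, and that the APs are staggered by one tick each. The names HOLD_TICKS, ap_holds_read_lock and first_write_acquire are hypothetical.

```c
#include <stdbool.h>

/* Toy model (assumption, not Xen/Linux code): each AP repeats a cycle of
 * 1 tick outside the lock (the cpu_relax() step) followed by HOLD_TICKS
 * ticks holding the read lock (around the yield hypercall). */
#define HOLD_TICKS 3
#define PERIOD (1 + HOLD_TICKS)

/* True if AP number `ap` holds the read lock at tick `t`.
 * APs are staggered by one tick each (an assumption of this model). */
static bool ap_holds_read_lock(int ap, int t)
{
    return ((t + ap) % PERIOD) != 0;
}

/* Returns the first tick at which the "master" vcpu could take the lock in
 * write mode (no AP holds it for read), or -1 if that never happens within
 * max_ticks. */
static int first_write_acquire(int num_aps, int max_ticks)
{
    for (int t = 0; t < max_ticks; t++) {
        int readers = 0;
        for (int ap = 0; ap < num_aps; ap++)
            readers += ap_holds_read_lock(ap, t);
        if (readers == 0)
            return t;
    }
    return -1;
}
```

In this model, first_write_acquire(1, 1000) returns 0, because a single AP is at cpu_relax() on the first tick. first_write_acquire(4, 1000) returns -1, because at every tick at least one of the staggered APs is inside its read-locked window, so the writer never finds the lock free. Real vcpus are not scheduled this regularly, but the more of them there are, the less likely it is that all of them sit outside the lock at the same moment.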




> On 7/10/08 11:08, "Cui, Dexuan" <dexuan.cui <at> intel.com> wrote:
> 
> > For an SMP Linux HVM guest with PV drivers inserted, when we do save/restore
> > (or LiveMigration) for the guest, it might panic after it's restored.
> > The panic point is inside ap_suspend():
> >     ...
> >     while (info->do_spin) {
> >         cpu_relax();
> >         read_lock(&suspend_lock);
> >         HYPERVISOR_yield();   /* <-- guest might panic on the invocation of this function */
> >         read_unlock(&suspend_lock);
> >     }
> >     ...
> > 


_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel