
[Xen-devel] Re: [PATCH 00/13] Nested Virtualization: Overview



Eddie,

I think I have found a way to address most if not all of your concerns.

As a first step I have:
- made SVM's VMRUN emulation SVM-specific
- turned nh_arch into a union (see the sketch below this list)
- moved msr permission maps into SVM
- made nested pagefault vmexit injection SVM-specific
- removed nh_arch_size
- renamed nh_vmaddr to nh_vmcxaddr
- moved hvm_intblk_svm_gif into SVM
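
As a rough sketch of where this is heading (the struct names nestedsvm
and nestedvmx below are tentative):

    /* Per-arch nested state embedded as a union instead of the old
     * void *nh_arch + nh_arch_size pair; no separate allocation or
     * size bookkeeping is needed. */
    struct nestedsvm {
        uint64_t ns_msrpm_pa;   /* MSR permission map, SVM-specific */
        /* further SVM-specific fields */
    };

    struct nestedvmx {
        /* VMX-specific fields */
    };

    struct nestedhvm {
        bool_t nh_guestmode;    /* vcpu in guestmode? */
        uint64_t nh_vmcxaddr;   /* renamed from nh_vmaddr */
        union {
            struct nestedsvm nh_svm;
            struct nestedvmx nh_vmx;
        } u;
    };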

Next I will:
- make fields like nh_vm_guestcr3, nh_vm_hostcr3, and nh_guest_asid
SVM-specific (we need to cache those values for architectural reasons) and
provide a wrapper for where generic code may need these values (see the
sketch after this list)
- move nestedhvm_vcpu_interrupt() into SVM. This will remove the generic 
exitcode field from 'struct nestedhvm' and will allow me to move 
the 'exitinfo1' and 'exitinfo2' fields into SVM.
- make SVM's VMEXIT emulation SVM-specific. That should do away with state 
validation and mapping issues and end the confusion around nestedhvm_vmexit, 
nestedhvm_vcpu_vmexit, nhvm_vcpu_vmexit, too.
- address other outstanding terminology issues such as nhvm, struct nvcpu
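
The wrapper I have in mind looks roughly like this (the hook names are
tentative; dispatch goes through hvm_funcs as usual):

    /* Generic code asks the arch for the values instead of reading
     * cached copies out of struct nestedhvm; on SVM the hooks return
     * the values cached from the l1 guest's VMCB. */
    static inline uint64_t nhvm_vcpu_guestcr3(struct vcpu *v)
    {
        return hvm_funcs.nhvm_vcpu_guestcr3(v);
    }

    static inline uint64_t nhvm_vcpu_hostcr3(struct vcpu *v)
    {
        return hvm_funcs.nhvm_vcpu_hostcr3(v);
    }

    static inline uint32_t nhvm_vcpu_asid(struct vcpu *v)
    {
        return hvm_funcs.nhvm_vcpu_asid(v);
    }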

It will probably take a few days to get this straightened out, so stay tuned.

Christoph


On Monday 18 October 2010 03:30:10 Dong, Eddie wrote:
> Christoph Egger wrote:
> > Hi!
> >
> > This patch series brings Nested Virtualization to Xen.
> > This is the fifth patch series. Improvements to the
> > previous patch submission:
>
> It is a step forward. At least the obviously SVM-specific stuff has
> been moved back to the SVM tree, although there are still some things
> left which are not good but maybe not that critical, like:
>
> -    hvm_intblk_nmi_iret   /* NMI blocked until IRET */
> +    hvm_intblk_nmi_iret,  /* NMI blocked until IRET */
> +    hvm_intblk_svm_gif,   /* GIF cleared */
>
> However the majority of my comments are still not addressed
> (http://www.mailinglistarchive.com/html/xen-devel@xxxxxxxxxxxxxxxxxxx/2010-09/msg01078.html).
>
> Let me restate my comments here:
>
> +struct nestedhvm {
> +    bool_t nh_guestmode; /* vcpu in guestmode? */
> +    void *nh_vmcx; /* l1 guest virtual VMCB/VMCS */
> +
> +    /* address of l1 guest virtual VMCB/VMCS, needed for VMEXIT */
> +    uint64_t nh_vmaddr;
> +
> +    void *nh_arch; /* SVM/VMX specific data */
> +    size_t nh_arch_size;
>
> I hate the pointer+size style. I would rather use a union for the
> SVM/VMX data structures.
>
> Once this is done, the APIs
> nestedhvm_vcpu_initialise/nestedhvm_vcpu_reset/nestedhvm_vcpu_destroy
> can become mere wrappers, if they are not removed completely.
>
>
> +    /* Cache guest cr3/host cr3 the guest sets up for the nested guest.
> +     * Used by Shadow-on-Shadow and Nested-on-Nested.
> +     * nh_vm_guestcr3: in l2 guest physical address space and points to
> +     *     the l2 guest page table
> +     * nh_vm_hostcr3: in l1 guest physical address space and points to
> +     *     the l1 guest nested page table
> +     */
> +    uint64_t nh_vm_guestcr3, nh_vm_hostcr3;
> +    uint32_t nh_guest_asid;
>
> Duplicating the data (caching) is really not a good solution to me. I
> prefer using a wrapper API for this, as in the proposal I sent you in
> private mail. Caching really doesn't bring additional value here.
>
>
> +    /* Only meaningful when forcevmexit flag is set */
> +    struct {
> +        uint64_t exitcode;  /* generic exitcode */
> +        uint64_t exitinfo1; /* additional information to the exitcode */
> +        uint64_t exitinfo2; /* additional information to the exitcode */
> +    } nh_forcevmexit;
> +    union {
> +        uint32_t bytes;
> +        struct {
> +            uint32_t forcevmexit : 1;
> +            uint32_t use_native_exitcode : 1;
> +            uint32_t vmentry : 1;   /* true during vmentry/vmexit emulation */
> +            uint32_t reserved : 29;
> +        } fields;
> +    } nh_hostflags;
>
> This introduces a new namespace for VM exit info and entry control.
>
> I am not convinced of the value of this new namespace compared with a
> solution that demuxes in each arch and calls first-level helper APIs in
> common code. The only difference in the result is that we pay with one
> extra API, nestedhvm_vcpu_vmexit, but we gain by removing the
> conversions to and from the new and old namespace APIs. I strongly
> prefer the demux + first-level helper solution.
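>
> Roughly, with made-up helper names (nsvm_l1_intercepts,
> nsvm_vmexit_inject), the SVM side would look like this:
>
>     /* Each arch demuxes the intercept in its own exit handler and
>      * calls a common first-level helper with the arch-native exit
>      * codes; no conversion into a generic namespace is needed. */
>     void svm_vmexit_handler(struct cpu_user_regs *regs)
>     {
>         struct vcpu *v = current;
>         struct vmcb_struct *vmcb = v->arch.hvm_svm.vmcb;
>         uint64_t exitcode = vmcb->exitcode;
>
>         if ( nestedhvm_vcpu_in_guestmode(v) &&
>              nsvm_l1_intercepts(v, exitcode) )
>         {
>             /* Inject a VMEXIT into the l1 guest with the native
>              * exitcode/exitinfo1/exitinfo2 values. */
>             nsvm_vmexit_inject(v, regs, exitcode,
>                                vmcb->exitinfo1, vmcb->exitinfo2);
>             return;
>         }
>         /* otherwise handle the intercept in the host as usual */
>     }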
>
> +int
> +nestedhvm_vcpu_vmentry(struct vcpu *v, struct cpu_user_regs *regs,
> +     uint64_t vmaddr, unsigned int inst_len)
> +{
> +     int ret;
> +     struct nestedhvm *hvm = &vcpu_nestedhvm(v);
> +
> +     hvm->nh_hostflags.fields.vmentry = 1;
> +
> +     ret = nestedhvm_vcpu_state_validate(v, vmaddr);
> +     if (ret) {
> +             gdprintk(XENLOG_ERR,
> +                     "nestedhvm_vcpu_state_validate failed, injecting 
> 0x%x\n",
> +                     ret);
> +             hvm_inject_exception(ret, HVM_DELIVER_NO_ERROR_CODE, 0);
> +             return ret;
> +     }
> +
> +     /* Save vmaddr. Needed for VMEXIT */
> +     hvm->nh_vmaddr = vmaddr;
> +
> +     /* get nested vm */
> +     ASSERT(hvm->nh_vmcx == NULL);
> +     hvm->nh_vmcx = hvm_map_guest_frame_ro(vmaddr >> PAGE_SHIFT);
> +     if (hvm->nh_vmcx == NULL) {
> +             gdprintk(XENLOG_ERR,
> +                     "hvm_map_guest_frame_ro failed, injecting #GP\n");
> +             hvm_inject_exception(TRAP_gp_fault,
> +                     HVM_DELIVER_NO_ERROR_CODE, 0);
> +             hvm->nh_hostflags.fields.vmentry = 0;
> +             return TRAP_gp_fault;
> +     }
> +
> +     /* save host state */
> +     ret = nhvm_vcpu_vmentry(v, regs, inst_len);
> +     if (ret) {
> +             gdprintk(XENLOG_ERR,
> +                     "nhvm_vcpu_vmentry failed, injecting #UD\n");
> +             hvm_inject_exception(TRAP_invalid_op,
> +                     HVM_DELIVER_NO_ERROR_CODE, 0);
> +             hvm->nh_hostflags.fields.vmentry = 0;
> +             hvm_unmap_guest_frame(hvm->nh_vmcx);
> +             hvm->nh_vmcx = NULL;
> +             return ret;
> +     }
> +
> +     hvm_unmap_guest_frame(hvm->nh_vmcx);
> +     hvm->nh_vmcx = NULL;
> +
> +     /* Switch vcpu to guest mode.
> +      */
> +     nestedhvm_vcpu_enter_guestmode(v);
> +
> +     hvm->nh_hostflags.fields.vmentry = 0;
> +     return 0;
> +}
>
> You must not have read the VMX code yet, or you didn't understand why
> VMX can't do this. Apparently this common code doesn't apply to VMX and
> has to be moved into arch-specific code. VMX can only switch the
> context (l1 or l2 guest) at certain points (now deferred to the last
> step before resume), because only one VMCS can be accessed in a given
> context.
>
> Thx, Eddie



-- 
---to satisfy European Law for business letters:
Advanced Micro Devices GmbH
Einsteinring 24, 85609 Dornach b. Muenchen
Geschaeftsfuehrer: Alberto Bozzo, Andrew Bowd
Sitz: Dornach, Gemeinde Aschheim, Landkreis Muenchen
Registergericht Muenchen, HRB Nr. 43632

