Xen project Mailing List

Re: [PATCH v6 5/5] tools/libxc: make use of domain context SHARED_INFO record...

To: Paul Durrant <paul@xxxxxxx>, xen-devel@xxxxxxxxxxxxxxxxxxxx

From: Andrew Cooper <andrew.cooper3@xxxxxxxxxx>

Date: Sat, 30 May 2020 01:29:47 +0100

Cc: Paul Durrant <pdurrant@xxxxxxxxxx>, Ian Jackson <ian.jackson@xxxxxxxxxxxxx>, Wei Liu <wl@xxxxxxx>

Delivery-date: Sat, 30 May 2020 00:30:13 +0000

List-id: Xen developer discussion <xen-devel.lists.xenproject.org>

On 27/05/2020 18:34, Paul Durrant wrote: > ... in the save/restore code. > > This patch replaces direct mapping of the shared_info_frame (retrieved > using XEN_DOMCTL_getdomaininfo) with save/load of the domain context > SHARED_INFO record. > > No modifications are made to the definition of the migration stream at > this point. Subsequent patches will define a record in the libxc domain > image format for passing domain context and convert the save/restore code > to use that. > > Signed-off-by: Paul Durrant <pdurrant@xxxxxxxxxx> I was going to fix up the remaining issues and commit this, but there is a rather major problem in the way. Therefore, here is a rather more full review. > diff --git a/tools/libxc/xc_sr_common.c b/tools/libxc/xc_sr_common.c > index dd9a11b4b5..1acb3765aa 100644 > --- a/tools/libxc/xc_sr_common.c > +++ b/tools/libxc/xc_sr_common.c > @@ -138,6 +138,73 @@ int read_record(struct xc_sr_context *ctx, int fd, > struct xc_sr_record *rec) > return 0; > }; > > +int get_domain_context(struct xc_sr_context *ctx) > +{ > + xc_interface *xch = ctx->xch; > + size_t len = 0; > + int rc; > + > + if ( ctx->domain_context.buffer ) > + { > + ERROR("Domain context already present"); > + return -1; > + } > + > + rc = xc_domain_getcontext(xch, ctx->domid, NULL, &len); > + if ( rc < 0 ) > + { > + PERROR("Unable to get size of domain context"); > + return -1; > + } > + > + ctx->domain_context.buffer = malloc(len); > + if ( !ctx->domain_context.buffer ) > + { > + PERROR("Unable to allocate memory for domain context"); > + return -1; > + } There is an xc_sr_blob interface, as this is a common pattern. > + > + rc = xc_domain_getcontext(xch, ctx->domid, ctx->domain_context.buffer, > + &len); > + if ( rc < 0 ) > + { > + PERROR("Unable to get domain context"); > + return -1; > + } > + > + ctx->domain_context.len = len; > + > + return 0; > +} > + > +int set_domain_context(struct xc_sr_context *ctx) > +{ > + xc_interface *xch = ctx->xch; > + int rc; > + > + if ( !ctx->domain_context.buffer ) > + { > + ERROR("Domain context not present"); > + return -1; > + } > + > + rc = xc_domain_setcontext(xch, ctx->domid, ctx->domain_context.buffer, > + ctx->domain_context.len); > + > + if ( rc < 0 ) > + { > + PERROR("Unable to set domain context"); > + return -1; > + } > + > + return 0; > +} > + > +void common_cleanup(struct xc_sr_context *ctx) > +{ > + free(ctx->domain_context.buffer); There is only a single caller to this function, so there is a (possibly latent) memory leak. > +} > + > static void __attribute__((unused)) build_assertions(void) > { > BUILD_BUG_ON(sizeof(struct xc_sr_ihdr) != 24); > diff --git a/tools/libxc/xc_sr_common.h b/tools/libxc/xc_sr_common.h > index 5dd51ccb15..0d61978b08 100644 > --- a/tools/libxc/xc_sr_common.h > +++ b/tools/libxc/xc_sr_common.h > @@ -208,6 +208,11 @@ struct xc_sr_context > > xc_dominfo_t dominfo; > > + struct { > + void *buffer; > + unsigned int len; > + } domain_context; As noted above, xc_sr_blob domain_context; > diff --git a/tools/libxc/xc_sr_restore_x86_pv.c > b/tools/libxc/xc_sr_restore_x86_pv.c > index 904ccc462a..21982a38ad 100644 > --- a/tools/libxc/xc_sr_restore_x86_pv.c > +++ b/tools/libxc/xc_sr_restore_x86_pv.c > @@ -865,7 +865,7 @@ static int handle_shared_info(struct xc_sr_context *ctx, > xc_interface *xch = ctx->xch; > unsigned int i; > int rc = -1; > - shared_info_any_t *guest_shinfo = NULL; > + shared_info_any_t *guest_shinfo; > const shared_info_any_t *old_shinfo = rec->data; > > if ( !ctx->x86.pv.restore.seen_pv_info ) > @@ -878,18 +878,14 @@ static int handle_shared_info(struct xc_sr_context *ctx, > { > ERROR("X86_PV_SHARED_INFO record wrong size: length %u" > ", expected 4096", rec->length); > - goto err; > + return -1; > } > > - guest_shinfo = xc_map_foreign_range( > - xch, ctx->domid, PAGE_SIZE, PROT_READ | PROT_WRITE, > - ctx->dominfo.shared_info_frame); > - if ( !guest_shinfo ) > - { > - PERROR("Failed to map Shared Info at mfn %#lx", > - ctx->dominfo.shared_info_frame); > - goto err; > - } > + rc = x86_pv_get_shinfo(ctx); > + if ( rc ) > + return rc; If I'm following this logic correctly, we're now in the final throws of completing migration with data in hand, and we ask the hypervisor to gather the entire domain context for this (not-yet-run) guest, copy it (twice, even) down into userspace, so we can scan through it to find the appropriate tag, copy less than a page's worth of data, then pass the full buffer back to Xen to unserialise the whole thing over the guest. The restore path shouldn't be calling DOMCTL_get* at all. It is conceptually wrong, and a waste of time/effort which would be better spent with the guest unpaused. What the restore path should be doing is passing data from the stream, straight into DOMCTL_set* (and attempting to do this will probably demonstrate why hvmctxt was an especially poor API to copy). The layout of existing migration-v2 blocks was designed around blobs which Xen produces and consumes directly, specifically to minimise the processing required. I think it is a layering violation to have libxc pick apart and reframe the internals of another layers' blob. What is not currently clear is whether the domain context wants sending as a discrete blob itself (and have a new chunk type allocated for the purpose), or each bit of it is going to want picking apart. This largely depends on what else is going in the blob at a Xen level. Also, I'd like to see the plans for the HVM side of this logic before deciding on whether the PV side is appropriate. I know we have a dedicated SHARED_INFO record right now, but it would be fine (AFAICT) to relax the migration spec to state that one of {SHARED_INFO, DOMAIN_CONTEXT} must be sent for PV. > @@ -854,13 +835,27 @@ static int write_x86_pv_p2m_frames(struct xc_sr_context > *ctx) > */ > static int write_shared_info(struct xc_sr_context *ctx) > { > + xc_interface *xch = ctx->xch; > struct xc_sr_record rec = { > .type = REC_TYPE_SHARED_INFO, > .length = PAGE_SIZE, > - .data = ctx->x86.pv.shinfo, > }; > + int rc; > > - return write_record(ctx, &rec); > + if ( !(rec.data = calloc(1, PAGE_SIZE)) ) > + { > + ERROR("Cannot allocate buffer for SHARED_INFO data"); > + return -1; > + } > + > + BUILD_BUG_ON(sizeof(*ctx->x86.pv.shinfo) > PAGE_SIZE); > + memcpy(rec.data, ctx->x86.pv.shinfo, sizeof(*ctx->x86.pv.shinfo)); These are some very contorted hoops to extend the data size sent. write_split_record() is the the tool to use here, although the above suggestions may render this change unnecessary. > @@ -1041,7 +1036,7 @@ static int x86_pv_setup(struct xc_sr_context *ctx) > if ( rc ) > return rc; > > - rc = map_shinfo(ctx); > + rc = x86_pv_get_shinfo(ctx); > if ( rc ) > return rc; > > @@ -1112,12 +1107,11 @@ static int x86_pv_cleanup(struct xc_sr_context *ctx) > if ( ctx->x86.pv.p2m ) > munmap(ctx->x86.pv.p2m, ctx->x86.pv.p2m_frames * PAGE_SIZE); > > - if ( ctx->x86.pv.shinfo ) > - munmap(ctx->x86.pv.shinfo, PAGE_SIZE); > - > if ( ctx->x86.pv.m2p ) > munmap(ctx->x86.pv.m2p, ctx->x86.pv.nr_m2p_frames * PAGE_SIZE); > > + common_cleanup(ctx); > + > return 0; > } This pair highlights an asymmetry in the patch, which will need fixing by whomever adds a second field to domain_context. Obtaining the context for use should shouldn't be a hidden side effect of getting the shared info. ~Andrew

©2013 Xen Project, A Linux Foundation Collaborative Project. All Rights Reserved.
Linux Foundation is a registered trademark of The Linux Foundation.
Xen Project is a trademark of The Linux Foundation.