[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Xen-devel] Re: vram_dirty vs. shadow paging dirty tracking



Ian Pratt wrote:
When thinking about multithreading the device model, it occurred to me
that it's a little odd that we're doing a memcmp to determine which
portions of the VRAM has changed.  Couldn't we just use dirty page
tracking in the shadow paging code?  That should significantly lower
the
overhead of this plus I believe the infrastructure is already mostly
there in the shadow2 code.

Yep, its been in the roadmap doc for quite a while. However, the log
dirty code isn't ideal for this. We'd need to extend it to enable it to
be turned on for just a subset of the GFN range (we could use a xen
rangeset for this).

Okay, I was curious if the log dirty stuff could do ranges.  I guess not.

Even so, I'm not super keen on the idea of tearing down and rebuilding
1024 PTE's up to 50 times a second.
A lower overhead solution would be to do scanning and resetting of the
dirty bits on the PTEs (and a global tlb flush).

Right, this is the approach I was assuming. There's really no use in tearing down the whole PTE (since you would have to take an extraneous read fault).

In the general case
this is tricky as the framebuffer could be mapped by multiple PTEs. In
practice, I believe this doesn't happen for either Linux or Windows.

I wouldn't think so, but showing my ignorance for a moment, does shadow2 not provide a mechanism to lookup VA's given a GFN? This lookup could be cheap if the structures are built during shadow page table construction.

Sounds like this is a good long term goal but I think I'll stick with the threading as an intermediate goal.

I've got a minor concern that threading isn't going to help us much when dom0 is UP since the VGA scanning won't happen while an MMIO/PIO request happens. With an SMP dom0, you could potentially do all the VGA scanning on one processor ensuring that qemu-dm wasn't ever "busy" when a request occurs. I'm slightly concerned though that having a thread that's as CPU hungry as the VGA scanning may increase context-switches during the MMIO/PIO handling which would actually hurt performance.

We'll see soon enough though.

Regards,

Anthony Liguori

There's always a good fallback of just returning 'all dirty' if the
heuristic is violated. Would be good to knock this up.

Best,
Ian


_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel


 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.