|  |  | 
  
    |  |  | 
 
  |   |  | 
  
    |  |  | 
  
    |  |  | 
  
    |   xen-devel
[Xen-devel] Re: vram_dirty vs. shadow paging dirty tracking 
| 
Ian Pratt wrote:
 
When thinking about multithreading the device model, it occurred to me
that it's a little odd that we're doing a memcmp to determine which
portions of the VRAM has changed.  Couldn't we just use dirty page
tracking in the shadow paging code?  That should significantly lower
the
overhead of this plus I believe the infrastructure is already mostly
there in the shadow2 code.
 
Yep, its been in the roadmap doc for quite a while. However, the log
dirty code isn't ideal for this. We'd need to extend it to enable it to
be turned on for just a subset of the GFN range (we could use a xen
rangeset for this).
 
Okay, I was curious if the log dirty stuff could do ranges.  I guess not.
 
Even so, I'm not super keen on the idea of tearing down and rebuilding
1024 PTE's up to 50 times a second. 
A lower overhead solution would be to do scanning and resetting of the
dirty bits on the PTEs (and a global tlb flush).
 
Right, this is the approach I was assuming.  There's really no use in 
tearing down the whole PTE (since you would have to take an extraneous 
read fault). 
 
In the general case
this is tricky as the framebuffer could be mapped by multiple PTEs. In
practice, I believe this doesn't happen for either Linux or Windows.
 
I wouldn't think so, but showing my ignorance for a moment, does shadow2 
not provide a mechanism to lookup VA's given a GFN?  This lookup could 
be cheap if the structures are built during shadow page table construction. 
Sounds like this is a good long term goal but I think I'll stick with 
the threading as an intermediate goal. 
I've got a minor concern that threading isn't going to help us much when 
dom0 is UP since the VGA scanning won't happen while an MMIO/PIO request 
happens.  With an SMP dom0, you could potentially do all the VGA 
scanning on one processor ensuring that qemu-dm wasn't ever "busy" when 
a request occurs.  I'm slightly concerned though that having a thread 
that's as CPU hungry as the VGA scanning may increase context-switches 
during the MMIO/PIO handling which would actually hurt performance. 
We'll see soon enough though.
Regards,
Anthony Liguori
 
There's always a good fallback of just returning 'all dirty' if the
heuristic is violated. Would be good to knock this up.
Best,
Ian
 
_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel
 | 
 |  | 
  
    |  |  |