[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-devel] Converting heap page_infos to contiguous virtual



On Fri, Jul 15, 2016 at 10:53:51AM -0400, Boris Ostrovsky wrote:
> On 07/14/2016 09:29 AM, Andrew Cooper wrote:
> >
> > However, I would recommend getting something functioning first, before
> > trying to optimise it.
> 
> There are two fairly independent parts to improving scrubbing: one is
> making it asynchronous and second is improving clear_page() performance.
> Whole-RAM mapping is needed for the latter.

Attaching a nice graph of different memset on Broadwell (credits go to
Joao for doing the testing). Skylake is 10% faster than Broadwell.


> 
> >
> > There is probably a lot to be gained simply by improving clear_page().
> 
> The biggest improvement comes from switching to AVX(2) when available.
> It's been a while since I ran those tests so I will have to re-measure
> it but my recollection is that 4K was too small to see significant changes.
> 
> A potential improvement might come from dropping (or, rather, deferring)
> sfence in clear_page_sse2. I don't know how much this would buy us though.
> 
> -boris
> 
> 
> 
> _______________________________________________
> Xen-devel mailing list
> Xen-devel@xxxxxxxxxxxxx
> https://lists.xen.org/xen-devel

Attachment: broadwell_memset.png
Description: PNG image

_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxx
https://lists.xen.org/xen-devel

 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.