[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-devel] [PATCH v2 0/2] Introduce runstate area registration with phys address

To: Andrii Anisov <andrii.anisov@xxxxxxxxx>, xen-devel@xxxxxxxxxxxxx
From: Julien Grall <julien.grall@xxxxxxx>
Date: Mon, 13 May 2019 12:16:49 +0100
Cc: Andrii Anisov <andrii_anisov@xxxxxxxx>, Jan Beulich <JBeulich@xxxxxxxx>, Roger Pau Monné <roger.pau@xxxxxxxxxx>
Delivery-date: Mon, 13 May 2019 11:17:03 +0000
List-id: Xen developer discussion <xen-devel.lists.xenproject.org>



On 5/13/19 11:15 AM, Andrii Anisov wrote:

Hello Julien,

On 08.05.19 16:59, Julien Grall wrote:
Hi,

On 23/04/2019 09:10, Andrii Anisov wrote:
From: Andrii Anisov <andrii_anisov@xxxxxxxx>

Following discussion [1] it is introduced and implemented a runstate
registration interface which uses guest's phys address instead of avirtual one.The new hypercall employes the same data structures as a predecessor,but
expects the vcpu_runstate_info structure to not cross a page boundary.
The interface is implemented in a way vcpu_runstate_info structure ismapped tothe hypervisor on the hypercall processing and is directly accessedduring itsupdates. This runstate area mapping follows vcpu_info structureregistration.
Permanent mapping of runstate area would consume vmap area on arm32what islimited to 1G. Though it is assumed that ARM32 does not target theserver marketand the rest of possible applications will not host a huge number ofVCPUs to
render the limitation into the issue.
I am afraid I can't possible back this assumption. As I pointed out inthe previous version, I would be OK with the always map solution onArm32 (pending performance) because it would be possible to increasethe virtual address area by reworking the address space.
I'm sorry, I'm not sure what should be my actions about that.

There no code modification involved so far. Just updating your coverletter with what I just said above.

The series is tested for ARM64. Build tested for x86. I'd appreciateif someone
could check it with x86.
The Linux kernel patch is here [2]. Though it is for 4.14.
The patch looks wrong to me. You are using virt_to_phys() on a percpuarea. What does actually promise you the physical address will alwaysbe the same?
Sorry for my ignorance here, could you please elaborate more about whatis wrong here?

While the virtual address will never change over over the life cycle ofa variable, I am not entirely sure we can make the same assumption forthe physical address.

I know that kmalloc() is promising you that the physical address willnot change. But percpu does not seem to use kmalloc() so have youconfirmed this assumption can hold?

Are you saying that the command dd is the CPUBurn? I am not sure howthis could be considered as a CPUBurn. IHMO, this is more IO related.
Both /dev/null and /dev/zero are virtual devices no actual IO isperformed during their operations, all the load is CPU (user and sys).

Thank you for the explanation. Shall I guess this is an existingbenchmark [1]?

       VCPU(dX)->idle->VCPU(dX).
       with following results:

                             mapped               mapped
                             on access            on init
       GLMark2 320x240       2852                 2877          +0.8%
           +Dom0 CPUBurn     2088                 2094          +0.2%
       GLMark2 800x600       2368                 2375          +0.3%
           +Dom0 CPUBurn     1868                 1921          +2.8%
       GLMark2 1920x1080     931                  931            0%
           +Dom0 CPUBurn     892                  894           +0.2%

       Please note that "mapped on access" means using the old runstate
registering interface. And runstate update in this case stilloften fails to map runstate area like [5], despite the fact that our Linuxkernel does not have KPTI enabled. So runstate area update, in thiscase, is
       really shortened.
We know that the old interface is broken, so telling us the newinterface is faster is not entirely useful. What I am more interestedis how it if you use a guest physical address on the version "Mappedon access".
Hm, I see your point. Well, I can make it for ARM to compare performance.
Also it was checked IRQ latency difference using TBM in asetup similar to
       [5]. Please note that the IRQ rate is one in 30 seconds, and only
VCPU->idle->VCPU use-case is considered. With followingresults (in ns,
       the timer granularity 120ns):
How long did you run the benchmark?
I did run it until avg more ore less stabilizes (2-3 minutes), then tookthe minimal avg (note, we have a moving average there).

Did you re-run multiple time?

       mapped on access:
           max=9960 warm_max=8640 min=7200 avg=7626
       mapped on init:
           max=9480 warm_max=8400 min=7080 avg=7341
Unfortunately there are no consitent results yet fromprofiling using Lauterbach PowerTrace. Still in communication with the tracervendor in
       order to setup the proper configuration.

[1] https://patrickmn.com/projects/cpuburn/? If so, a link to thebenchmark


--
Julien Grall

_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxxx
https://lists.xenproject.org/mailman/listinfo/xen-devel

Follow-Ups:
- Re: [Xen-devel] [PATCH v2 0/2] Introduce runstate area registration with phys address
  - From: Andrii Anisov

References:
- Re: [Xen-devel] [PATCH v2 0/2] Introduce runstate area registration with phys address
  - From: Julien Grall
- Re: [Xen-devel] [PATCH v2 0/2] Introduce runstate area registration with phys address
  - From: Andrii Anisov

Prev by Date: [Xen-devel] [xen-4.7-testing test] 136050: regressions - FAIL
Next by Date: Re: [Xen-devel] [PATCH v2 06/12] x86/IRQ: consolidate use of ->arch.cpu_mask
Previous by thread: Re: [Xen-devel] [PATCH v2 0/2] Introduce runstate area registration with phys address
Next by thread: Re: [Xen-devel] [PATCH v2 0/2] Introduce runstate area registration with phys address
Index(es):
- Date
- Thread

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.