[Xen-devel] Credit scheduler and latencies
Hello,

I'm currently testing Xen-3.0.4-rc1 and its new credit scheduler to see how well it fits my latency needs. For me, latencies of up to 1-5 ms are OK; latencies below 1 ms would be better, which is what I achieved so far with the bvt scheduler and a quantum of 0.2 ms.

My test setup is as follows: Xen running on a single-core Athlon64 3000+, reachable via 192.168.1.34. Three domUs on 192.168.1.35, .36 and .37. Two of the domUs are always spinning (python -c "while True: pass") and the third is idle. Unless mentioned otherwise, all domains have the default weight of 256 and no cap.

First of all, I find it interesting that VCPUs are rescheduled only every 30 ms when the PCPU is under full load, but if a domain doesn't use much PCPU, the credit scheduler will happily interrupt the currently running domain almost whenever needed, e.g. at an interval of 5 ms:

| ping -c 500 -i .005 192.168.1.34
| ...
| --- 192.168.1.34 ping statistics ---
| 500 packets transmitted, 500 received, 0% packet loss, time 2495ms
| rtt min/avg/max/mdev = 0.055/0.062/2.605/0.113 ms

(dom0 is idle and being pinged; as described above, there are two spinning and one idle domUs)

Average response time is 0.062 ms, mean deviation is 0.113 ms. In this light, my current plan to force the scheduler to reschedule more often (as I formerly did with bvt; see below) doesn't seem that bad to me :)

Next, I checked how ping latencies to dom0 depend on dom0's CPU usage. I used a script which alternately sleeps and then spins for a certain amount of time (based on wall-clock time).
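For reference, a sleep/spin load generator along these lines can be sketched in a few lines of Python (this is my reconstruction of what such a script might look like, not the exact script used; the interval lengths are parameters):

```python
import time

def sleep_spin(sleep_ms, spin_ms, duration_s=24.0):
    """Alternate sleeping and busy-waiting, based on wall-clock time.

    sleep_ms / spin_ms: lengths of one sleeping / one spinning interval.
    duration_s: total wall-clock run time (e.g. 24 s for a ping -i .2 -c 120 run).
    """
    deadline = time.time() + duration_s
    while time.time() < deadline:
        time.sleep(sleep_ms / 1000.0)
        spin_until = time.time() + spin_ms / 1000.0
        while time.time() < spin_until:
            pass  # busy-wait: burns PCPU until the spin interval ends

# Example: the "sleep 4 ms, spin 1 ms" case from the table below
# sleep_spin(4, 1)
```

With such a script the nominal CPU share is spin_ms / (sleep_ms + spin_ms), although as the measurements show, xentop can report a noticeably different figure.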
These are the results:

| dom0 CPU     sleep (ms)  spin (ms)  ping avg (ms)  ping mdev (ms)
| idle         -           -          0.099          0.024
| idle         -           -          0.091          0.029
| idle         -           -          0.087          0.031
| 25% (.2)     4           1          0.084          0.026
| 25% (.2)     8           2          0.084          0.026
| 25% (.2)     40          10         0.088          0.030
| 38% (.3)     1.5         3.5        0.084          0.025
| 44% (.35)    1.75        3.75       0.075          0.023
| 44% (.35)    3.5         6.5        0.271          1.445
| 30% (.35)    17.5        32.5       6.685          14.633
| 34.5% (.4)   2           3          11.003         17.638
| 45.6% (.9)   0.2         1.8        0.111          0.238
| === domain0 with weight 3072, capped @ .2 ===
| 25% (.2)     4           1          0.101          0.031
| 20% (.2)     40          10         10.698         18.643
| 36% (.3)     1.5         3.5        0.061          0.713

The first column shows the CPU usage reported by xentop and, in parentheses, the fraction of time the script was spinning. Next are the lengths of one sleeping and one spinning interval, followed by the latency results (measured with ping -i .2 192.168.1.34 -c 120).

As it seems, a domain/VCPU can in some cases use more than its fair share of the PCPU and still interrupt other VCPUs, as long as it sleeps frequently enough. If a domain/VCPU spins for a long enough stretch at a time, it does indeed stop interrupting other VCPUs, with direct effects on the latency I measured. The results with capping enabled are also interesting (a domain may use more than its cap if it sleeps frequently enough), but not a solution for my needs.

Therefore I will try reducing the rescheduling interval from 30 ms to 10 ms (should be possible?) and to 1 ms (which may break the credit accounting code completely? I haven't fully understood in which way it depends on the timer interrupt). I'd be happy about any advice :)

Regards, Milan

PS: Would it be easily possible to use bvt with 3.0.4-rc1? I know it has been dropped...
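Follow-up on the timeslice question above: if I read xen/common/sched_credit.c correctly, the 30 ms figure comes from a handful of constants near the top of the file (the exact names below are from my reading of the source, so treat them as an assumption):

```c
/* Sketch of the relevant constants in xen/common/sched_credit.c */
#define CSCHED_TICKS_PER_TSLICE     3   /* 1 would give a 10 ms timeslice */
#define CSCHED_TICKS_PER_ACCT       3   /* accounting runs every 3 ticks  */
#define CSCHED_MSECS_PER_TICK       10
#define CSCHED_MSECS_PER_TSLICE     \
    (CSCHED_MSECS_PER_TICK * CSCHED_TICKS_PER_TSLICE)
```

If that reading is right, a 10 ms timeslice only needs CSCHED_TICKS_PER_TSLICE lowered to 1, while a 1 ms timeslice would require shrinking CSCHED_MSECS_PER_TICK itself, which also speeds up the credit accounting (via CSCHED_TICKS_PER_ACCT) and so might have the side effects I worried about above.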
Attachment: pgpeAKh8fbu5u.pgp

_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel