[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-users] Bad iSCSI I/O performance on Xen 4.6


  • To: Matthieu Cerda <matthieu.cerda@xxxxxxxxxxxxxx>, xen-users@xxxxxxxxxxxxxxxxxxxx
  • From: Jean-Louis Dupond <jean-louis@xxxxxxxxx>
  • Date: Wed, 17 Oct 2018 17:51:38 +0200
  • Arc-authentication-results: i=1; ORIGINATING; auth=pass smtp.auth=jean-louis@xxxxxxxxx smtp.mailfrom=jean-louis@xxxxxxxxx
  • Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=dupond.be; s=dkim; t=1539791498; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:mime-version:mime-version: content-type:content-type:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=dmBQpu5cwWmHL6WE7ZKrmX9Kl9GcAog4JChGk21SWZI=; b=IndBRFi4FGuKP+C8LNEbXsX0XzkgKtU2BnfGmaHfZxSDVm2GesNxDvsH8FmwJT+3CQn2U0 xayOPhVV49qTAXBwqUUFOQr5djFlD6Y1wwbYsp4rF9w8vhu0DhRe0lZUu+wDxxbeetAbPD NhXlMjgoz6ybCvd1Vs/vjb0JghmJcN2XMBM9je2o41XacZ7kg4tq1uD0+iVrACfB4Y4zMI yg0+eJE/veYe8OFhNfk7tSoErZyQBzuZ6iPlf9Mr54PLuNIOTjcrQ0BLDcdmE6VKgS1drk pKlgZaTfnIMJutKGPldEfoK3+iTNemCessYUVn9DDNgrgacqajbcPEdRoLnEwg==
  • Arc-seal: i=1; s=dkim; d=dupond.be; t=1539791498; a=rsa-sha256; cv=none; b=dzpK6DvP9ZWVAnd27BsIzLzyfDuq+kn+Ox0jACTLcK+Irqn5DDXr7MeKbb3C9WXz0qdYbZgH5G3eAzXbg/zVKfMdLSABtTKgHjPy5eEE7MRyue5HNc1fyYkjf16dcFX9V1N2OXRgASYAKJHbM9vHHE4xTyKgpaztj86wopmxpA7Ln12AA5xFHy4DgbUVpVx3EV9yHc1/yDri/mIlfbcHHXsjdZcvGAzZ2ut1M1xP55GyPxDNnBqKMcBwzlVccQac+rJaUbQw8QyWbquimHRcSXiDUCD+gDxpxLEUTU2XWDWap1854h1Qet1InKvOJl57M7uyy34/q2hhtwvwH0OSeA==
  • Delivery-date: Wed, 17 Oct 2018 15:52:57 +0000
  • List-id: Xen user discussion <xen-users.lists.xenproject.org>

Even did some more tests today :)

Xen 4.8 and 4.10 have the same results. No change there unfortunately :(

But when playing around with dom0 CPU's and pinning, got the following results:
dom0 -> 2cpu - no pinning => 73.5 MB/s
dom0 -> 2cpu + pinning => 519 MB/s
dom0 -> 8cpu + pinning => 124 MB/s

This was with 4.9.127-32.el6.x86_64 and Xen 4.10.2-1.el6.

Perfomance inside the VM is still really bad: 18.4 MB/s :(

Thanks
Jean-Louis

On 16/10/18 17:28, Jean-Louis Dupond wrote:

Hi Matthieu,

I did all tests on dom0, and the bug listed there seems only to affect domU.
So I doubt this will be related.

Thanks
Jean-Louis

On 16/10/18 17:11, Matthieu Cerda wrote:
Hello,

You might be hitting the issue that has been discussed here: https://lists.xenproject.org/archives/html/xen-users/2017-07/msg00023.html

We solved the issue by upgrading to the latest backport kernel (4.14 at the time), it seemed due to a loop device regression in Linux.

Maybe you should try with a more recent kernel ?

Cheers,
--
Matthieu CERDA

Le 16/10/2018 à 16:53, Jean-Louis Dupond a écrit :

Did even more testing today, and seems like we hit 2 problems.

On a plain 4.9.13 kernel, without Xen, I get 686 MB/s.
With Xen (on dom0) => 58.8 MB/s

But after upgrade to 4.9.127
without Xen => 161 MB/s
with Xen => 40.3 MB/s

Then I tried downgrading to Xen xen-4.6.6-8.el6.x86_64
And that gives the following.

With 4.9.13 kernel: 107 MB/s
With 4.9.127 kernel: 61.9 MB/s

It might be retpoline/spectre/meltdown changes?
Tried adding 'spectre_v2=off nopti' to the boot line on 4.9.127 kernel (without Xen), and then it speed is 171 MB/s (10 MB/s faster)

Seems like this will be a hard one to debug further :(

Any other idea's are welcome :)

Thanks
Jean-Louis

On 11/10/18 17:15, Jean-Louis Dupond wrote:

FYI,

After some additional debugging, I found out that on the machine the speed is perfect when running stock CentOS 6 kernel (2.6.32).
When using a 4.9.x of 4.18.x kernel, the speed is degraded again.

Speed on 2.6.32: 320 MB/s
Speed on 4.9.x : 55.2 MB/s

But when I disable gro on the storage NIC, It boosts to 157 MB/s.
That is already better, but still way below what we have on 2.6.32 ...

I also did tests on plain machine without Xen, and with the same results.
So it doesn't looks like its Xen related, but more iSCSi/Kernel.

Thanks
Jean-Louis


On 11-10-18 11:18, Dario Faggioli wrote:
[Adding Roger]

On Mon, 2018-10-08 at 13:10 +0200, Jean-Louis Dupond wrote:
Hi,

We are hitting some I/O limitation on some of our Xen hypervisors.
The hypervisors are running CentOS 6 with Xen 4.6.6-12.el6 and
4.9.105+ 
kernels.

The hypervisors are attached with 10G network to the SAN network.
And 
there is no congestion at all.
Storage is exported via iSCSI and we use multipathd for failover.
Now we see a performance of +-200MB/sec write speed, but only a poor 
20-30mb/sec read speed on a LUN on the SAN.
This is while testing this on dom0. Same speeds on domU.

If I do the same test on a Xen 4.4.4-34.el6 hypervisor to the same
LUN 
(but attached with 1G), I max out the link (100MB read/write).

Right. But, if I've understood correctly, you're changing two things (I
mean between the two tests), i.e., the hypervisor and the NIC.
(BTW, is dom0 kernel the same, or does that also change?).

This makes it harder to narrow things down to where the problem could
be.

What would be useful to see would be the results of running:
- Xen 4.4.4-34.el6, with 4.9.105+ dom0 kernel on the 10G NIC / host,
  and compare this with Xen 4.6.6-12.el6, with the same kernel on the 
  same NIC / host;
- Xen 4.6.6-12.el6, with 4.9.105+ dom0 kernel on the 1G NIC / host,
  and compare this with Xen 4.4.4-34.el6, with the same kernel on the
  same NIC / host.

This will tell us, if there is a regression between Xen 4.4.x and Xen
4.6.x (as that is _the_only_ thing that varies).

And this is assuming the versions of the dom0 kernels, and of all the
other components involved are the same. If they're not, we need to go
checking, changing one component at a time.

So it really looks like the Xen 4.6 hypervisors are reaching some 
bottleneck. But we couldn't locate it yet :)

There seems to be issues, but from the tests you've performed so far, I
don't think we can conclude the problem is in Xen. And we need to know
at least where the problem most likely is, in order to have any chance
to find it! :-)

The hypervisor's dom0 has 8 vCPU and 8GB RAM, which should be plenty!

Probably. But, just in case, have you tried increasing, e.g., the
number of dom0's vcpus? Is things like vcpu-pinning or similar features
being used? Is the host a NUMA box? (Or, more generally, what are the
characteristics of the host[s]?)

Regards,
Dario

_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxxx
https://lists.xenproject.org/mailman/listinfo/xen-users
_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxxx
https://lists.xenproject.org/mailman/listinfo/xen-users

 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.