[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-devel] [PATCH v12 09/11] pvqspinlock, x86: Add para-virtualization support

To: Peter Zijlstra <peterz@xxxxxxxxxxxxx>
From: Waiman Long <waiman.long@xxxxxx>
Date: Mon, 27 Oct 2014 16:50:42 -0400
Cc: linux-arch@xxxxxxxxxxxxxxx, Raghavendra K T <raghavendra.kt@xxxxxxxxxxxxxxxxxx>, Oleg Nesterov <oleg@xxxxxxxxxx>, kvm@xxxxxxxxxxxxxxx, Scott J Norton <scott.norton@xxxxxx>, x86@xxxxxxxxxx, Paolo Bonzini <paolo.bonzini@xxxxxxxxx>, linux-kernel@xxxxxxxxxxxxxxx, virtualization@xxxxxxxxxxxxxxxxxxxxxxxxxx, Ingo Molnar <mingo@xxxxxxxxxx>, David Vrabel <david.vrabel@xxxxxxxxxx>, "H. Peter Anvin" <hpa@xxxxxxxxx>, xen-devel@xxxxxxxxxxxxxxxxxxxx, Thomas Gleixner <tglx@xxxxxxxxxxxxx>, "Paul E. McKenney" <paulmck@xxxxxxxxxxxxxxxxxx>, Linus Torvalds <torvalds@xxxxxxxxxxxxxxxxxxxx>, Boris Ostrovsky <boris.ostrovsky@xxxxxxxxxx>, Douglas Hatch <doug.hatch@xxxxxx>
Delivery-date: Mon, 27 Oct 2014 20:51:22 +0000
List-id: Xen developer discussion <xen-devel.lists.xen.org>

On 10/27/2014 01:27 PM, Peter Zijlstra wrote:

On Mon, Oct 27, 2014 at 01:15:53PM -0400, Waiman Long wrote:

On 10/24/2014 06:04 PM, Peter Zijlstra wrote:

On Fri, Oct 24, 2014 at 04:53:27PM -0400, Waiman Long wrote:

The additional register pressure may just cause a few more register moves
which should be negligible in the overall performance . The additional
icache pressure, however, may have some impact on performance. I was trying
to balance the performance of the pv and non-pv versions so that we won't
penalize the pv code too much for a bit more performance in the non-pv code.
Doing it your way will add a lot of function call and register
saving/restoring to the pv code.

If people care about performance they should not be using virt crap :-)

I only really care about bare metal.

Yes, I am aware of that. However, the whole point of doing PV spinlock is to
improve performance in a virtual guest.

Anything that avoids the lock holder preemption nonsense is a _massive_
win for them, a few function calls should not even register on that
scale.

I would say all the PV stuffs are mostly useful for a over-committedguest where a single CPU is shared in more than one guest. When theguests are not overcommitted, the PV code seldom get triggered. In thosecases, the overhead of the extra function call and registersaving/restoring will show up.

+#ifdef _GEN_PV_LOCK_SLOWPATH
+static void pv_queue_spin_lock_slowpath(struct qspinlock *lock, u32 val)
+#else
  void queue_spin_lock_slowpath(struct qspinlock *lock, u32 val)
+#endif

If you have two functions you might as well use the PV stuff to patch in
the right function call at the usage sites and avoid:

+       if (pv_enabled()) {
+               pv_queue_spin_lock_slowpath(lock, val);
+               return;
+       }

this alltogether.

Good point! I will do some investigation on how to do this kind offunction address patching and eliminate the extra function call overhead.

         this_cpu_dec(mcs_nodes[0].count);
  }
  EXPORT_SYMBOL(queue_spin_lock_slowpath);
+
+#if !defined(_GEN_PV_LOCK_SLOWPATH)&&  defined(CONFIG_PARAVIRT_SPINLOCKS)
+/*
+ * Generate the PV version of the queue_spin_lock_slowpath function
+ */
+#undef pv_init_node
+#undef pv_wait_check
+#undef pv_link_and_wait_node
+#undef pv_wait_head
+#undef EXPORT_SYMBOL
+#undef in_pv_code
+
+#define _GEN_PV_LOCK_SLOWPATH
+#define EXPORT_SYMBOL(x)
+#define in_pv_code     return_true
+#define pv_enabled     return_false
+
+#include "qspinlock.c"
+
+#endif

That's properly disgusting :-) But a lot better than actually
duplicating everything I suppose.

I know you don't like this kind of preprocessor trick, but this is theeasiest way that I can think of to generate two separate functions fromthe same source code.


-Longman

_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxx
http://lists.xen.org/xen-devel

References:
- [Xen-devel] [PATCH v12 00/11] qspinlock: a 4-byte queue spinlock with PV support
  - From: Waiman Long
- [Xen-devel] [PATCH v12 09/11] pvqspinlock, x86: Add para-virtualization support
  - From: Waiman Long
- Re: [Xen-devel] [PATCH v12 09/11] pvqspinlock, x86: Add para-virtualization support
  - From: Peter Zijlstra
- Re: [Xen-devel] [PATCH v12 09/11] pvqspinlock, x86: Add para-virtualization support
  - From: Waiman Long
- Re: [Xen-devel] [PATCH v12 09/11] pvqspinlock, x86: Add para-virtualization support
  - From: Peter Zijlstra
- Re: [Xen-devel] [PATCH v12 09/11] pvqspinlock, x86: Add para-virtualization support
  - From: Waiman Long
- Re: [Xen-devel] [PATCH v12 09/11] pvqspinlock, x86: Add para-virtualization support
  - From: Peter Zijlstra

Prev by Date: Re: [Xen-devel] xl/SR-IOV: disposition of VFs when PF disappears?
Next by Date: Re: [Xen-devel] [PATCH v12 09/11] pvqspinlock, x86: Add para-virtualization support
Previous by thread: Re: [Xen-devel] [PATCH v12 09/11] pvqspinlock, x86: Add para-virtualization support
Next by thread: Re: [Xen-devel] [PATCH v12 09/11] pvqspinlock, x86: Add para-virtualization support
Index(es):
- Date
- Thread

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.