[Xen-devel] [PATCH 02/20] x86/ticketlock: convert spin loop to C

To:	Peter Zijlstra <peterz@xxxxxxxxxxxxx>
Subject:	[Xen-devel] [PATCH 02/20] x86/ticketlock: convert spin loop to C
From:	Jeremy Fitzhardinge <jeremy@xxxxxxxx>
Date:	Wed, 3 Nov 2010 10:59:43 -0400
Cc:	Nick Piggin <npiggin@xxxxxxx>, Xen-devel <xen-devel@xxxxxxxxxxxxxxxxxxx>, Srivatsa Vaddagiri <vatsa@xxxxxxxxxxxxxxxxxx>, Linux Kernel Mailing List <linux-kernel@xxxxxxxxxxxxxxx>, Jan Beulich <JBeulich@xxxxxxxxxx>, Linux Virtualization <virtualization@xxxxxxxxxxxxxxxxxxxxxxxxxx>, Jeremy Fitzhardinge <jeremy.fitzhardinge@xxxxxxxxxx>, Avi Kivity <avi@xxxxxxxxxx>, "H. Peter Anvin" <hpa@xxxxxxxxx>
Delivery-date:	Wed, 03 Nov 2010 08:24:27 -0700
Envelope-to:	www-data@xxxxxxxxxxxxxxxxxxx
In-reply-to:	<cover.1288794124.git.jeremy.fitzhardinge@xxxxxxxxxx>
List-help:	<mailto:xen-devel-request@lists.xensource.com?subject=help>
List-id:	Xen developer discussion <xen-devel.lists.xensource.com>
List-post:	<mailto:xen-devel@lists.xensource.com>
List-subscribe:	<http://lists.xensource.com/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=subscribe>
List-unsubscribe:	<http://lists.xensource.com/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=unsubscribe>
References:	<cover.1288794124.git.jeremy.fitzhardinge@xxxxxxxxxx>
Sender:	xen-devel-bounces@xxxxxxxxxxxxxxxxxxx

From: Jeremy Fitzhardinge <jeremy.fitzhardinge@xxxxxxxxxx>

The inner loop of __ticket_spin_lock isn't doing anything very special,
so reimplement it in C.

For the 8-bit ticket lock variant, we use a register union to get direct
access to the lower and upper bytes of the tickets. Unfortunately, gcc
won't generate a direct comparison between the two halves of the register,
so the generated asm isn't quite as pretty as the hand-coded version.
However, benchmarking shows that this is actually a small improvement in
runtime performance on some benchmarks, and never a slowdown.
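
As a rough illustration of the union trick described above, here is a
minimal userspace sketch. The names raw_tickets_demo, ticket_word_demo
and demo_ticket_is_ours are hypothetical and exist only for this example;
they are not the kernel's definitions, and the real code keeps the union
in a register rather than passing a snapshot around.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical two-byte ticket pair, loosely modelled on the idea of
 * struct __raw_tickets in the 8-bit variant (an assumption, not the
 * kernel's actual layout). */
struct raw_tickets_demo {
	uint8_t head;	/* ticket currently being served */
	uint8_t tail;	/* next ticket to be handed out */
};

/* Overlay the two bytes on one 16-bit word so a single 16-bit
 * operation can fetch both halves at once. */
union ticket_word_demo {
	struct raw_tickets_demo tickets;
	uint16_t slock;
};

/* The overlay only works if the struct has no padding. */
static_assert(sizeof(union ticket_word_demo) == sizeof(uint16_t),
	      "ticket pair must fit exactly in 16 bits");

/* Given a 16-bit snapshot of the lock word, report whether the two
 * halves match. The comparison is symmetric, so the result does not
 * depend on which byte the ABI places first. */
static int demo_ticket_is_ours(uint16_t snapshot)
{
	union ticket_word_demo u = { .slock = snapshot };

	return u.tickets.head == u.tickets.tail;
}
```

A caller would pass in the value returned by the atomic add and spin,
refreshing only the head byte, until this predicate holds.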

We also need a barrier at the end of the lock loop so that the compiler
doesn't move any instructions from within the locked region into the
region where we don't yet own the lock.
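
To make the ordering requirement concrete, the following is a small
sketch of a ticket lock written with C11 atomics instead of the kernel's
own primitives. It is an assumption-laden analogue, not the code in this
patch: demo_ticketlock, demo_init, demo_lock and demo_unlock are
hypothetical names, and the acquire load stands in for the combination of
ACCESS_ONCE() and barrier(). Attaching the ordering to the load that
exits the loop means it applies on every path out of the loop.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdint.h>

/* Hypothetical ticket lock for illustration only. */
struct demo_ticketlock {
	_Atomic uint16_t next;	/* tail: next ticket to hand out */
	_Atomic uint16_t owner;	/* head: ticket currently served */
};

static void demo_init(struct demo_ticketlock *l)
{
	atomic_init(&l->next, 0);
	atomic_init(&l->owner, 0);
}

static void demo_lock(struct demo_ticketlock *l)
{
	/* Take a ticket; this plays the role of the locked xadd. */
	uint16_t me = atomic_fetch_add_explicit(&l->next, 1,
						memory_order_relaxed);

	/* Spin in plain C until our ticket is served. The acquire load
	 * keeps later accesses (the critical section) from being moved
	 * ahead of the point where we observe ownership. */
	while (atomic_load_explicit(&l->owner, memory_order_acquire) != me)
		;	/* the kernel would call cpu_relax() here */
}

static void demo_unlock(struct demo_ticketlock *l)
{
	uint16_t cur = atomic_load_explicit(&l->owner, memory_order_relaxed);

	/* Unlocking a lock nobody holds is a caller bug. */
	assert(atomic_load_explicit(&l->next, memory_order_relaxed) != cur);

	/* Release so the critical section is visible before handover. */
	atomic_store_explicit(&l->owner, (uint16_t)(cur + 1),
			      memory_order_release);
}
```

Typical use is demo_init() once, then demo_lock()/demo_unlock() around
each critical section. Unlike this sketch, the kernel's barrier() is a
compiler-only barrier; the asm comments in the code being removed note
that no lfence is needed because loads are in-order.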

Signed-off-by: Jeremy Fitzhardinge <jeremy.fitzhardinge@xxxxxxxxxx>
---
 arch/x86/include/asm/spinlock.h |   58 +++++++++++++++++++-------------------
 1 files changed, 29 insertions(+), 29 deletions(-)

diff --git a/arch/x86/include/asm/spinlock.h b/arch/x86/include/asm/spinlock.h
index d6d5784..6711d36 100644
--- a/arch/x86/include/asm/spinlock.h
+++ b/arch/x86/include/asm/spinlock.h
@@ -58,21 +58,21 @@
 #if (NR_CPUS < 256)
 static __always_inline void __ticket_spin_lock(arch_spinlock_t *lock)
 {
-       unsigned short inc = 1 << TICKET_SHIFT;
-
-       asm volatile (
-               LOCK_PREFIX "xaddw %w0, %1\n"
-               "1:\t"
-               "cmpb %h0, %b0\n\t"
-               "je 2f\n\t"
-               "rep ; nop\n\t"
-               "movb %1, %b0\n\t"
-               /* don't need lfence here, because loads are in-order */
-               "jmp 1b\n"
-               "2:"
-               : "+Q" (inc), "+m" (lock->slock)
-               :
-               : "memory", "cc");
+       register union {
+               struct __raw_tickets tickets;
+               unsigned short slock;
+       } inc = { .slock = 1 << TICKET_SHIFT };
+
+       asm volatile (LOCK_PREFIX "xaddw %w0, %1\n"
+                     : "+Q" (inc), "+m" (lock->slock) : : "memory", "cc");
+
+       for (;;) {
+               if (inc.tickets.head == inc.tickets.tail)
+                       return;
+               cpu_relax();
+               inc.tickets.head = ACCESS_ONCE(lock->tickets.head);
+       }
+       barrier();              /* make sure nothing creeps before the lock is taken */
 }
 
 static __always_inline int __ticket_spin_trylock(arch_spinlock_t *lock)
@@ -105,22 +105,22 @@ static __always_inline void __ticket_spin_unlock(arch_spinlock_t *lock)
 static __always_inline void __ticket_spin_lock(arch_spinlock_t *lock)
 {
        unsigned inc = 1 << TICKET_SHIFT;
-       unsigned tmp;
+       __ticket_t tmp;
 
-       asm volatile(LOCK_PREFIX "xaddl %0, %1\n"
-                    "movzwl %w0, %2\n\t"
-                    "shrl $16, %0\n\t"
-                    "1:\t"
-                    "cmpl %0, %2\n\t"
-                    "je 2f\n\t"
-                    "rep ; nop\n\t"
-                    "movzwl %1, %2\n\t"
-                    /* don't need lfence here, because loads are in-order */
-                    "jmp 1b\n"
-                    "2:"
-                    : "+r" (inc), "+m" (lock->slock), "=&r" (tmp)
-                    :
-                    : "memory", "cc");
+       asm volatile(LOCK_PREFIX "xaddl %0, %1\n\t"
+                    : "+r" (inc), "+m" (lock->slock)
+                    : : "memory", "cc");
+
+       tmp = inc;
+       inc >>= TICKET_SHIFT;
+
+       for (;;) {
+               if ((__ticket_t)inc == tmp)
+                       return;
+               cpu_relax();
+               tmp = ACCESS_ONCE(lock->tickets.head);
+       }
+       barrier();              /* make sure nothing creeps before the lock is taken */
 }
 
 static __always_inline int __ticket_spin_trylock(arch_spinlock_t *lock)
-- 
1.7.2.3

