[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] [Xen-devel] [PATCH v2] x86/current: Provide additional information to optimise get_cpu_info()
Exactly as with c/s d55c5eefe "x86: use compiler visible "add" instead of inline assembly "or" in get_cpu_info()", this is achieved by providing more information to the compiler. This causes a net drop of almost 4K of .text Signed-off-by: Andrew Cooper <andrew.cooper3@xxxxxxxxxx> CC: Jan Beulich <JBeulich@xxxxxxxx> --- v2: Less speculation about generated code in the comment --- xen/include/asm-x86/current.h | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/xen/include/asm-x86/current.h b/xen/include/asm-x86/current.h index 2081015..b95fd79 100644 --- a/xen/include/asm-x86/current.h +++ b/xen/include/asm-x86/current.h @@ -25,9 +25,9 @@ struct cpu_info { static inline struct cpu_info *get_cpu_info(void) { - unsigned long tos; - __asm__ ( "and %%rsp,%0" : "=r" (tos) : "0" (~(STACK_SIZE-1)) ); - return (struct cpu_info *)(tos + STACK_SIZE) - 1; + register unsigned long sp asm("rsp"); + + return (struct cpu_info *)((sp & ~(STACK_SIZE-1)) + STACK_SIZE) - 1; } #define get_current() (get_cpu_info()->current_vcpu) -- 1.7.10.4 _______________________________________________ Xen-devel mailing list Xen-devel@xxxxxxxxxxxxx http://lists.xen.org/xen-devel
|
Lists.xenproject.org is hosted with RackSpace, monitoring our |