[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] Re: [PATCH 4/5] x86/pv: Optimise to the segment context switching paths
On 11.09.2020 14:53, Andrew Cooper wrote: > On 11/09/2020 10:49, Jan Beulich wrote: >> On 09.09.2020 11:59, Andrew Cooper wrote: >>> Save the segment selectors with explicit asm, rather than with read_sreg(). >>> This permits the use of MOV's m16 encoding, which avoids indirecting the >>> selector value through a register. >> Instead of this, how about making read_sreg() look like >> >> #define read_sreg(val, name) ({ \ >> if ( sizeof(val) == 2 ) \ >> asm volatile ( "mov %%" STR(name) ",%0" : "=m" (val) ); \ >> else \ >> asm volatile ( "mov %%" STR(name) ",%k0" : "=r" (val) ); \ >> }) >> >> which then also covers read_registers()? I have a full patch >> ready to send, if you agree. > > That will go wrong for > > uint16_t ds; read_sreg(ds, ds); > > as it will force the variable to be spilled onto the stack. Let me quote the main part of the description of the patch then: "Under the assumption that standalone variables preferably wouldn't be of uint16_t (or unsigned short) type, build in a heuristic to do a store to memory when the output expression is two bytes wide. In the register variant, add a 'k' modifier to avoid assemblers possibly generating operand size of REX prefixes." A local variable has no reason to be uint16_t; nowadays even ./CODING_STYLE says so. > I don't think this is a clever move. > > > Furthermore, it is bad enough that read_sreg() already takes one magic > parameter which doesn't follow normal C rules - renaming to READ_SREG() > would be an improvement - but this is now adding a second which is a > capture by name. I was expecting this to be a possible objection from you. I wouldn't mind upper-casing the name, but ... > I'm afraid that is a firm no from me. ... looks like you prefer the open-coding, while I'd like to avoid it whenever reasonably possible. > There is one other place where we read all segment registers at once. > Maybe having a static inline save_sregs(struct cpu_user_regs *) hiding > the asm block, but I'm not necessarily convinced of this plan either. > At least it would cleanly separate the "I've obviously got a memory > operand" and "I almost certainly want it in a register anyway" logic. I could live with this as a compromise. >>> @@ -1556,18 +1557,24 @@ static void load_segments(struct vcpu *n) >>> : [ok] "+r" (all_segs_okay) \ >>> : [_val] "rm" (val) ) >>> >>> -#ifdef CONFIG_HVM >>> - if ( cpu_has_svm && !compat && (uregs->fs | uregs->gs) <= 3 ) >>> + if ( !compat ) >>> { >>> - unsigned long gsb = n->arch.flags & TF_kernel_mode >>> - ? n->arch.pv.gs_base_kernel : n->arch.pv.gs_base_user; >>> - unsigned long gss = n->arch.flags & TF_kernel_mode >>> - ? n->arch.pv.gs_base_user : n->arch.pv.gs_base_kernel; >>> + gsb = n->arch.pv.gs_base_kernel; >>> + gss = n->arch.pv.gs_base_user; >>> + >>> + /* >>> + * Figure out which way around gsb/gss want to be. gsb needs to be >>> + * the active context, and gss needs to be the inactive context. >>> + */ >>> + if ( !(n->arch.flags & TF_kernel_mode) ) >>> + SWAP(gsb, gss); >>> >>> - fs_gs_done = svm_load_segs(n->arch.pv.ldt_ents, LDT_VIRT_START(n), >>> - n->arch.pv.fs_base, gsb, gss); >>> + if ( IS_ENABLED(CONFIG_HVM) && cpu_has_svm && >> The change from #ifdef to IS_ENABLED() wants mirroring to the >> prefetching site imo. I wonder though whether the adjustment is a >> good idea: The declaration lives in svm.h, and I would view it as >> quite reasonable for that header to not get included in !HVM builds >> (there may be a lot of disentangling to do to get there, but still). > > I'm not overly fussed, but there will absolutely have to be HVM stubs > for normal code. I don't see why we should special case svm_load_segs() > to not have a stub. I don't see why they "absolutely" have to exist. With our relying on DCE I don't think there'll need to be ones consistently for _every_ HVM function used from more generic code. I also don't view this as "special casing" - there are already various functions not having stubs, but merely declarations. The special thing here is that, by their placement, these declarations may not be in scope long term. Which is because we're deliberately breaking proper layering here by using a HVM feature for PV. Jan
|
Lists.xenproject.org is hosted with RackSpace, monitoring our |