[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] Re: [Xen-devel] [PATCH v3 2/2] xen/arm32: implement VFP context switch
On Wed, 2013-06-05 at 14:06 +0100, Julien Grall wrote: > On 06/03/2013 04:14 PM, Ian Campbell wrote: > > > On Mon, 2013-06-03 at 15:57 +0100, Julien Grall wrote: > >> On 06/03/2013 03:38 PM, Ian Campbell wrote: > >> > >>> On Mon, 2013-06-03 at 15:32 +0100, Julien Grall wrote: > >>>> On 06/03/2013 03:15 PM, Ian Campbell wrote: > >>>> > >>>>> On Mon, 2013-06-03 at 15:00 +0100, Julien Grall wrote: > >>>>>> Add support for VFP context switch on arm32 and a dummy support for > >>>>>> arm64 > >>>>>> > >>>>>> Signed-off-by: Julien Grall <julien.grall@xxxxxxxxxx> > >>>>>> > >>>>>> Changes in v3: > >>>>>> - Add vfp_init to check if the processor supports VFP 3 > >>>>>> - Add clobber memory > >>>>>> - Remove tmps > >>>>>> - s/COFNIG_ARM64/CONFIG_ARM64/ in include/asm/arm.h > >>>>>> > >>>>>> Changes in v2: > >>>>>> - Fix all the small errors (type, lost headers...) > >>>>>> - Add some comments > >>>>>> --- > >>>>>> xen/arch/arm/arm32/Makefile | 1 + > >>>>>> xen/arch/arm/arm32/vfp.c | 99 > >>>>>> +++++++++++++++++++++++++++++++++++++++ > >>>>>> xen/arch/arm/arm64/Makefile | 1 + > >>>>>> xen/arch/arm/arm64/vfp.c | 13 +++++ > >>>>>> xen/arch/arm/domain.c | 7 ++- > >>>>>> xen/include/asm-arm/arm32/vfp.h | 41 ++++++++++++++++ > >>>>>> xen/include/asm-arm/arm64/vfp.h | 16 +++++++ > >>>>>> xen/include/asm-arm/cpregs.h | 9 ++++ > >>>>>> xen/include/asm-arm/domain.h | 4 ++ > >>>>>> xen/include/asm-arm/vfp.h | 25 ++++++++++ > >>>>>> 10 files changed, 214 insertions(+), 2 deletions(-) > >>>>>> create mode 100644 xen/arch/arm/arm32/vfp.c > >>>>>> create mode 100644 xen/arch/arm/arm64/vfp.c > >>>>>> create mode 100644 xen/include/asm-arm/arm32/vfp.h > >>>>>> create mode 100644 xen/include/asm-arm/arm64/vfp.h > >>>>>> create mode 100644 xen/include/asm-arm/vfp.h > >>>>>> > >>>>>> diff --git a/xen/arch/arm/arm32/Makefile b/xen/arch/arm/arm32/Makefile > >>>>>> index aaf277a..b903803 100644 > >>>>>> --- a/xen/arch/arm/arm32/Makefile > >>>>>> +++ b/xen/arch/arm/arm32/Makefile > >>>>>> @@ -6,5 +6,6 @@ obj-y += proc-ca15.o > >>>>>> > >>>>>> obj-y += traps.o > >>>>>> obj-y += domain.o > >>>>>> +obj-y += vfp.o > >>>>>> > >>>>>> obj-$(EARLY_PRINTK) += debug.o > >>>>>> diff --git a/xen/arch/arm/arm32/vfp.c b/xen/arch/arm/arm32/vfp.c > >>>>>> new file mode 100644 > >>>>>> index 0000000..2ece43d > >>>>>> --- /dev/null > >>>>>> +++ b/xen/arch/arm/arm32/vfp.c > >>>>>> @@ -0,0 +1,99 @@ > >>>>>> +#include <xen/sched.h> > >>>>>> +#include <xen/init.h> > >>>>>> +#include <asm/processor.h> > >>>>>> +#include <asm/vfp.h> > >>>>>> + > >>>>>> +void vfp_save_state(struct vcpu *v) > >>>>>> +{ > >>>>>> + v->arch.vfp.fpexc = READ_CP32(FPEXC); > >>>>>> + > >>>>>> + WRITE_CP32(v->arch.vfp.fpexc | FPEXC_EN, FPEXC); > >>>>>> + > >>>>>> + v->arch.vfp.fpscr = READ_CP32(FPSCR); > >>>>>> + > >>>>>> + if ( v->arch.vfp.fpexc & FPEXC_EX ) /* Check for sub-architecture > >>>>>> */ > >>>>>> + { > >>>>>> + v->arch.vfp.fpinst = READ_CP32(FPINST); > >>>>>> + > >>>>>> + if ( v->arch.vfp.fpexc & FPEXC_FP2V ) > >>>>>> + v->arch.vfp.fpinst2 = READ_CP32(FPINST2); > >>>>>> + /* Disable FPEXC_EX */ > >>>>>> + WRITE_CP32((v->arch.vfp.fpexc | FPEXC_EN) & ~FPEXC_EX, FPEXC); > >>>>>> + } > >>>>>> + > >>>>>> + /* Save {d0-d15} */ > >>>>>> + asm volatile("stc p11, cr0, [%0], #32*4" > >>>>>> + : : "r" (v->arch.vfp.fpregs1) > >>>>>> + : "memory"); > >>>>> > >>>>> http://gcc.gnu.org/onlinedocs/gcc/Machine-Constraints.html suggests that > >>>>> "=m" (v->arch.vfp.fpregs1) or "=Q" (...) as output constraints might do > >>>>> the job without clobbering the whole of memory. > >>>> > >>>> I'm not sure to fully understand the concept behind "=m". Does it mean > >>>> that gcc will clobber all the memory range addressed by fpregs? > >>> > >>> I'm not totally confident in this stuff myself.... > >>> > >>> Apparently the "=" modified means[0] "this operand is write-only for > >>> this instruction: the previous value is discarded and replaced by output > >>> data." In the case of a memory constraint you'd want to hope that > >>> "discarded and replaced" would be equivalent to clobbering the address, > >>> at least in cases where the compiler knows the size of the thing, as it > >>> does here. > >>> > >>> Ian. > >>> > >>> [0] http://gcc.gnu.org/onlinedocs/gcc-4.8.1/gcc/Modifiers.html#Modifiers > >>> > >> > >> > >> Shall I use this contrainst for the stc instructions and send a new > >> patch series? > > > > If you buy my reasoning and if it works then I guess so. > > > Hi, > > So I gave a try and it doesn't work. That's a shame, oh well we'll have to use the big clobber then -- not the end of the world. > The modifiers =m replaces %0 by [rn,imm] which is wrong. So it's not > possible to specify the amount of data which the instruction need to > read (the #32*4) because pre-index and post-index aren't allowed in a > same instruction. > > If I remove #32*4, the instruction is different. With objdump on each > generate object: > - vstmia r3!, {d0,d15} > - vstr d0, [rn, imm] > > -- > Julien > _______________________________________________ Xen-devel mailing list Xen-devel@xxxxxxxxxxxxx http://lists.xen.org/xen-devel
|
Lists.xenproject.org is hosted with RackSpace, monitoring our |