[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] Re: [Xen-devel] [RFC PATCH 02/24] ARM: GICv3: allocate LPI pending and property table
On Thu, 10 Nov 2016, Andre Przywara wrote: > Hi, > > On 26/10/16 02:10, Stefano Stabellini wrote: > > Hi Andre, > > > > Sorry for the late reply, I'll try to be faster for the next rounds of > > review. The patch looks good for a first iteration. Some comments below. > > No worries and thanks for the thorough review, much appreciated. > As you can see I took my time to respond as well ;-) > > > > > On Wed, 28 Sep 2016, Andre Przywara wrote: > >> The ARM GICv3 ITS provides a new kind of interrupt called LPIs. > >> The pending bits and the configuration data (priority, enable bits) for > >> those LPIs are stored in tables in normal memory, which software has to > >> provide to the hardware. > >> Allocate the required memory, initialize it and hand it over to each > >> ITS. We limit the number of LPIs we use with a compile time constant to > >> avoid wasting memory. > >> > >> Signed-off-by: Andre Przywara <andre.przywara@xxxxxxx> > >> --- > >> xen/arch/arm/Kconfig | 6 ++++ > >> xen/arch/arm/efi/efi-boot.h | 1 - > >> xen/arch/arm/gic-its.c | 76 > >> +++++++++++++++++++++++++++++++++++++++ > >> xen/arch/arm/gic-v3.c | 27 ++++++++++++++ > >> xen/include/asm-arm/cache.h | 4 +++ > >> xen/include/asm-arm/gic-its.h | 22 +++++++++++- > >> xen/include/asm-arm/gic_v3_defs.h | 48 ++++++++++++++++++++++++- > >> 7 files changed, 181 insertions(+), 3 deletions(-) > >> > >> diff --git a/xen/arch/arm/Kconfig b/xen/arch/arm/Kconfig > >> index 9fe3b8e..66e2bb8 100644 > >> --- a/xen/arch/arm/Kconfig > >> +++ b/xen/arch/arm/Kconfig > >> @@ -50,6 +50,12 @@ config HAS_ITS > >> depends on ARM_64 > >> depends on HAS_GICV3 > >> > >> +config HOST_LPI_BITS > >> + depends on HAS_ITS > >> + int "Maximum bits for GICv3 host LPIs (14-32)" > >> + range 14 32 > >> + default "20" > >> + > >> config ALTERNATIVE > >> bool > >> > >> diff --git a/xen/arch/arm/efi/efi-boot.h b/xen/arch/arm/efi/efi-boot.h > >> index 045d6ce..dc64aec 100644 > >> --- a/xen/arch/arm/efi/efi-boot.h > >> +++ b/xen/arch/arm/efi/efi-boot.h > >> @@ -10,7 +10,6 @@ > >> #include "efi-dom0.h" > >> > >> void noreturn efi_xen_start(void *fdt_ptr, uint32_t fdt_size); > >> -void __flush_dcache_area(const void *vaddr, unsigned long size); > >> > >> #define DEVICE_TREE_GUID \ > >> {0xb1b621d5, 0xf19c, 0x41a5, {0x83, 0x0b, 0xd9, 0x15, 0x2c, 0x69, 0xaa, > >> 0xe0}} > >> diff --git a/xen/arch/arm/gic-its.c b/xen/arch/arm/gic-its.c > >> index 0f42a77..b52dff3 100644 > >> --- a/xen/arch/arm/gic-its.c > >> +++ b/xen/arch/arm/gic-its.c > >> @@ -20,10 +20,86 @@ > >> #include <xen/lib.h> > >> #include <xen/device_tree.h> > >> #include <xen/libfdt/libfdt.h> > >> +#include <asm/p2m.h> > >> #include <asm/gic.h> > >> #include <asm/gic_v3_defs.h> > >> #include <asm/gic-its.h> > >> > >> +/* Global state */ > >> +static struct { > >> + uint8_t *lpi_property; > >> + int host_lpi_bits; > >> +} lpi_data; > >> + > >> +/* Pending table for each redistributor */ > >> +static DEFINE_PER_CPU(void *, pending_table); > >> + > >> +#define MAX_HOST_LPI_BITS \ > > > > To avoid confusion, I would call this MAX_PHYS_LPI_BITS > > > > > >> + min_t(unsigned int, lpi_data.host_lpi_bits, CONFIG_HOST_LPI_BITS) > >> +#define MAX_HOST_LPIS (BIT(MAX_HOST_LPI_BITS) - 8192) > > > > And this MAX_PHYS_LPIS > > Done. > > >> +uint64_t gicv3_lpi_allocate_pendtable(void) > >> +{ > >> + uint64_t reg, attr; > >> + void *pendtable; > > > > I would introduce a check to make sure that this_cpu(pending_table) == NULL. > > Can do. So I return back this value then, though this should never happen. > > > > >> + attr = GIC_BASER_CACHE_RaWaWb << > >> GICR_PENDBASER_INNER_CACHEABILITY_SHIFT; > >> + attr |= GIC_BASER_CACHE_SameAsInner << > >> GICR_PENDBASER_OUTER_CACHEABILITY_SHIFT; > >> + attr |= GIC_BASER_InnerShareable << GICR_PENDBASER_SHAREABILITY_SHIFT; > >> + > >> + /* > >> + * The pending table holds one bit per LPI, so we need three bits less > >> + * than the number of LPI_BITs. > > > > Why 3 bit less? Please add more info on how you came up with 3. > > 3 bits as in 2 << 3 = 8 = BITS_PER_BYTES. We need to divide by that, > which is shift by 3, which is ORDER - 3. Does that make sense? > But this mayhem goes away anyway with _xmalloc. Please add info to the in code comment (if it will still be there). > > > >> But the alignment requirement from the > >> + * ITS is 64K, so make order at least 16 (-12). > > > > Does it need to be 64K aligned or does it need to be at least 64K in > > size? > > The first. > > > That makes a big difference. If it just needs to be 64K aligned, > > you can do that with xmalloc. > > Well, not xmalloc (since I don't have a data structure of that size), > but _xmalloc. I just saw that this is exported as well (I dismissed this > before because of the leading underscore). > Also "alloc pages" sounded more like what I had in mind, but I guess > aligning it to 64K serves the same purpose. > > >> + */ > >> + pendtable = alloc_xenheap_pages(MAX(lpi_data.host_lpi_bits - 3, 16) - > >> 12, 0); > > > > Shouldn't we be using MAX_HOST_LPI_BITS instead of > > lpi_data.host_lpi_bits to make this calculation? > > I was under the impression that the redistributors expect the pending > table to cover every possible LPI as reported in GICD_TYPER (because in > contrast to PROPBASER the PENDBASER register lacks a size field). > But thinking about this again this seems to be insane, since 32 bit > worth of LPIs would lead to a 0.5GB pending table. But as the LPI > numbers are under the control of software, we can go with allocating > less - up to our internal limit - which is also what Linux does. > > > > >> + if ( !pendtable ) > >> + return 0; > >> + > >> + memset(pendtable, 0, BIT(lpi_data.host_lpi_bits - 3)); > > > > flush_dcache? > > Uhm, yes. > > > > >> + this_cpu(pending_table) = pendtable; > >> + > >> + reg = attr | GICR_PENDBASER_PTZ; > >> + reg |= virt_to_maddr(pendtable) & GENMASK(51, 16); > >> + > >> + return reg; > >> +} > >> + > >> +uint64_t gicv3_lpi_get_proptable() > >> +{ > >> + uint64_t attr; > >> + static uint64_t reg = 0; > >> + > >> + /* The property table is shared across all redistributors. */ > >> + if ( reg ) > >> + return reg; > > > > Can't you just use lpi_data.lpi_property != NULL instead of introducing > > a new static local variable? > > Seems like a good idea actually. We have to reconstruct the register > content, but that seems doable. > > >> + attr = GIC_BASER_CACHE_RaWaWb << > >> GICR_PENDBASER_INNER_CACHEABILITY_SHIFT; > >> + attr |= GIC_BASER_CACHE_SameAsInner << > >> GICR_PENDBASER_OUTER_CACHEABILITY_SHIFT; > >> + attr |= GIC_BASER_InnerShareable << GICR_PENDBASER_SHAREABILITY_SHIFT; > >> + > >> + lpi_data.lpi_property = alloc_xenheap_pages(MAX_HOST_LPI_BITS - 12, > >> 0); > > > > Please add a comment on how the order is calculated. > > Does " ... - PAGE_SHIFT" suffice? Yes, that would work. > > > > > >> + if ( !lpi_data.lpi_property ) > >> + return 0; > >> + > >> + memset(lpi_data.lpi_property, GIC_PRI_IRQ | LPI_PROP_RES1, > >> MAX_HOST_LPIS); > >> + __flush_dcache_area(lpi_data.lpi_property, MAX_HOST_LPIS); > >> + > >> + reg = attr | ((MAX_HOST_LPI_BITS - 1) << 0); > >> + reg |= virt_to_maddr(lpi_data.lpi_property) & GENMASK(51, 12); > >> + > >> + return reg; > >> +} > >> + > >> +int gicv3_lpi_init_host_lpis(int lpi_bits) > >> +{ > >> + lpi_data.host_lpi_bits = lpi_bits; > >> + > >> + printk("GICv3: using at most %ld LPIs on the host.\n", MAX_HOST_LPIS); > >> + > >> + return 0; > >> +} > >> + > >> void gicv3_its_dt_init(const struct dt_device_node *node) > >> { > >> const struct dt_device_node *its = NULL; > >> diff --git a/xen/arch/arm/gic-v3.c b/xen/arch/arm/gic-v3.c > >> index 238da84..2534aa5 100644 > >> --- a/xen/arch/arm/gic-v3.c > >> +++ b/xen/arch/arm/gic-v3.c > >> @@ -546,6 +546,9 @@ static void __init gicv3_dist_init(void) > >> type = readl_relaxed(GICD + GICD_TYPER); > >> nr_lines = 32 * ((type & GICD_TYPE_LINES) + 1); > >> > >> + if ( type & GICD_TYPE_LPIS ) > >> + gicv3_lpi_init_host_lpis(((type >> GICD_TYPE_ID_BITS_SHIFT) & > >> 0x1f) + 1); > > > > Please #define a mask instead of using 0x1f > > > > > >> + > >> printk("GICv3: %d lines, (IID %8.8x).\n", > >> nr_lines, readl_relaxed(GICD + GICD_IIDR)); > >> > >> @@ -615,6 +618,26 @@ static int gicv3_enable_redist(void) > >> > >> return 0; > >> } > >> +static void gicv3_rdist_init_lpis(void __iomem * rdist_base) > >> +{ > >> + uint32_t reg; > >> + uint64_t table_reg; > >> + > >> + if ( list_empty(&host_its_list) ) > >> + return; > >> + > >> + /* Make sure LPIs are disabled before setting up the BASERs. */ > >> + reg = readl_relaxed(rdist_base + GICR_CTLR); > >> + writel_relaxed(reg & ~GICR_CTLR_ENABLE_LPIS, rdist_base + GICR_CTLR); > >> + > >> + table_reg = gicv3_lpi_allocate_pendtable(); > >> + if ( table_reg ) > >> + writeq_relaxed(table_reg, rdist_base + GICR_PENDBASER); > > > > Maybe we want to return in case table_reg == NULL ? > > I guess so. I just wonder what we would do in this case? Panic? > Theoretically we could just proceed without enabling LPIs on this > redistributor, but that's probably not what a user would expect. Print an error and/or panic are good options. _______________________________________________ Xen-devel mailing list Xen-devel@xxxxxxxxxxxxx https://lists.xen.org/xen-devel
|
Lists.xenproject.org is hosted with RackSpace, monitoring our |