[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [XEN v3 11/12] xen/Arm: GICv3: Define macros to read/write 64 bit

To: Julien Grall <julien@xxxxxxx>, Xenia Ragiadakou <burzalodowa@xxxxxxxxx>, Ayan Kumar Halder <ayan.kumar.halder@xxxxxxx>, xen-devel@xxxxxxxxxxxxxxxxxxxx
From: Ayan Kumar Halder <ayankuma@xxxxxxx>
Date: Mon, 28 Nov 2022 12:32:34 +0000
Arc-authentication-results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=amd.com; dmarc=pass action=none header.from=amd.com; dkim=pass header.d=amd.com; arc=none
Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=BoxEh5LRVcud7DCQK4+ASWl7mfl+zMSuSSEO7THBMhQ=; b=SCNLgGa3BtXZX08QrARcsNl119u5ttyHTodMam06Dqk5JMZc0vo2B/lP15KzE3muNpPT+NVBx+fsogBpcIwTpDq6gAIeaOYFUvGJN5topCnd9UPDxldfewD1fFNSm9mwQqkVlvN+yWLnlerwZGVo0C+wgAHY2EZcNLIvGH1O9hhb7sDoH5kLpgULEPhQqzrLq8+Y7vj0A8fc/fE/hdyjeQqfE+Pq/8cIx3rIzOhQVvzPSpt5RnPirV4HEb3/ZR/D3VzTSpFfzudwQw5YaVhMgAGR5+T7KFkwdlQW7ffmHe1gT3K2TV1qtPvwHWluB7md7jmxsYb1tro6xCU42QFP1Q==
Arc-seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=XfhWh/x1r5V3qgGpLVLFm/8YmJtnmE3wAQFTpJuiyODukpeAIMbYCDWD/5pMTbekWd1Wa+S5j+lPWz6Yv+eqECVb5+EOvs23PN8s8jiEe/haFyLNG1ZGXzJ2C74Jtv2gc42dWPIOFKMFQcSEZE2jfYvIv9XFCWLWRw3j5vaOEG0UDlfax6bnbxR9oTEnYjxIi/zhHuK9r8hpMUgwke4gLJZkEBpePE3FKCx4lhi4egBcwQX/VicDCqogVmGFtthVttljeD9otG3IMPbxLs6X1+twLA+kUeQE/mzlgJ4AGkkGOpPikNNogNEgdRF4rSPxza5dHxl37IgE+0QXRlOowA==
Authentication-results: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=amd.com;
Cc: sstabellini@xxxxxxxxxx, stefanos@xxxxxxxxxx, Volodymyr_Babchuk@xxxxxxxx, bertrand.marquis@xxxxxxx, michal.orzel@xxxxxxx, jgrall@xxxxxxxxxx
Delivery-date: Mon, 28 Nov 2022 12:32:53 +0000
List-id: Xen developer discussion <xen-devel.lists.xenproject.org>


On 11/11/2022 17:53, Julien Grall wrote:

Hi,


Hi Julien,

I need a clarification.

On 11/11/2022 17:37, Ayan Kumar Halder wrote:
On 11/11/2022 16:17, Xenia Ragiadakou wrote:
Hi Ayan,
Hi Xenia,
On 11/11/22 16:17, Ayan Kumar Halder wrote:
On AArch32, ldrd/strd instructions are not atomic when used toaccess MMIO.Furthermore, ldrd/strd instructions are not decoded by Arm whenrunning as
a guest to access emulated MMIO region.
Thus, we have definedreadq_relaxed_non_atomic()/writeq_relaxed_non_atomic()which in turn calls readl_relaxed()/writel_relaxed() for the lowerand upper
32 bits.
As GICv3 registers (GICD_IROUTER, GICR_TYPER) can be accessed in anon atomic
fashion, so we have used {read/write}q_relaxed_non_atomic() on Arm32.

Signed-off-by: Ayan Kumar Halder <ayan.kumar.halder@xxxxxxx>
---

Changes from :-
v1 - 1. Use ldrd/strd for readq_relaxed()/writeq_relaxed().
2. No need to use le64_to_cpu() as the returned byte order isalready in cpu
endianess.
v2 - 1. Replace {read/write}q_relaxed with{read/write}q_relaxed_non_atomic().
  xen/arch/arm/gic-v3.c               | 12 ++++++++++++
  xen/arch/arm/include/asm/arm32/io.h |  9 +++++++++
  2 files changed, 21 insertions(+)

diff --git a/xen/arch/arm/gic-v3.c b/xen/arch/arm/gic-v3.c
index 6457e7033c..a5bc549765 100644
--- a/xen/arch/arm/gic-v3.c
+++ b/xen/arch/arm/gic-v3.c
@@ -651,7 +651,11 @@ static void __init gicv3_dist_init(void)
      affinity &= ~GICD_IROUTER_SPI_MODE_ANY;
        for ( i = NR_GIC_LOCAL_IRQS; i < nr_lines; i++ )
+#ifdef CONFIG_ARM_32
+ writeq_relaxed_non_atomic(affinity, GICD + GICD_IROUTER +i * 8);
+#else
          writeq_relaxed(affinity, GICD + GICD_IROUTER + i * 8);
+#endif
I would have been OK if there was one place needed a #ifdef. But 2 isa bit too much.
Please provide a wrapper writeq_relaxed_non_atomic() for arm64. Theimplementation would call writeq(). The same stands for...

For arm64, why shouldn't we invoke {read/write}q_relaxed() for{read/write}q_relaxed_non_atomic() ?

As I understand, this will be less expensive than invokingreadq()/writeq() (as it will introduce memory barriers).


Also, the original code was invoking {read/write}q_relaxed() for arm64.

Please let me know if I am missing something ?

- Ayan

  }
    static int gicv3_enable_redist(void)
@@ -745,7 +749,11 @@ static int __init gicv3_populate_rdist(void)
          }
            do {
+#ifdef CONFIG_ARM_32
+            typer = readq_relaxed_non_atomic(ptr + GICR_TYPER);
+#else
              typer = readq_relaxed(ptr + GICR_TYPER);
+#endif
... here.
                if ( (typer >> 32) == aff )
              {
@@ -1265,7 +1273,11 @@ static void gicv3_irq_set_affinity(structirq_desc *desc, const cpumask_t *mask)
      affinity &= ~GICD_IROUTER_SPI_MODE_ANY;
        if ( desc->irq >= NR_GIC_LOCAL_IRQS )
+#ifdef CONFIG_ARM_32
+ writeq_relaxed_non_atomic(affinity, (GICD + GICD_IROUTER +desc->irq * 8));
+#else
writeq_relaxed(affinity, (GICD + GICD_IROUTER + desc->irq* 8));
+#endif
        spin_unlock(&gicv3.lock);
  }
diff --git a/xen/arch/arm/include/asm/arm32/io.hb/xen/arch/arm/include/asm/arm32/io.h
index 73a879e9fb..4ddfbea5c2 100644
--- a/xen/arch/arm/include/asm/arm32/io.h
+++ b/xen/arch/arm/include/asm/arm32/io.h
@@ -80,17 +80,26 @@ static inline u32 __raw_readl(const volatilevoid __iomem *addr)
                                          __raw_readw(c)); __r; })
  #define readl_relaxed(c) ({ u32 __r = le32_to_cpu((__force __le32) \
                                          __raw_readl(c)); __r; })
+#define readq_relaxed_non_atomic(c) \
+ ({ u64 __r = (((u64)readl_relaxed((c) +4)) << 32) | \
+ readl_relaxed(c); __r; })
As Julien pointed out, the expression c will be evaluated twice andif it produces side effects they will be performed twice.To prevent this, you can either assign the expression to a localvariable and pass this one to readl_relaxed()
Just to make sure I understand you correctly, you are suggesting this :-

#define readq_relaxed_non_atomic(c) \

                         ({ void _iomem *__addr = (c); \
u64 __r = (((u64)readl_relaxed(__addr +4)) << 32) | \
readl_relaxed(__addr); __r; })

#define writeq_relaxed_non_atomic(v,c) \

                        (( u64 __v = (v); \

                           void _iomem *__addr = (c); \

                           writel_relaxed((u32)__v, __addr); \
writel_relaxed((u32)((__v) >> 32), (__addr+ 4); })
Is this correct understanding ?
or use a static inline function instead of a macro, for implementingreadq_relaxed_non_atomic().The latter is the MISRA C recommended (not strictly required)approach according to Dir 4.9 "A function should be used inpreference to a function-like macro where
 they are interchangeable".
I have mixed opinion about this.
On one hand, there will be a performance penalty when invoking afunction (compared to macro).
Most of the compilers are nowadays clever enough to inline smallfunctions. But we can force the compiler with the attributealways_inline.
On the other hand {readq/writeq}_relaxed_non_atomic() are calledduring init (gicv3 initialization, setting up the interrupthandlers), so the impact will not be bad.
I am fine with whatever you and any maintainer suggest.
Project wide, we are trying to use "static inline" whenever it ispossible because it adds a bit more type-safety (the error you madewouldn't have happened) and the code is clearer (no slash).
So my preference is to use static line.

Cheers,

References:
- [XEN v3 00/12] Arm: Enable GICv3 for AArch32
  - From: Ayan Kumar Halder
- [XEN v3 11/12] xen/Arm: GICv3: Define macros to read/write 64 bit
  - From: Ayan Kumar Halder
- Re: [XEN v3 11/12] xen/Arm: GICv3: Define macros to read/write 64 bit
  - From: Xenia Ragiadakou
- Re: [XEN v3 11/12] xen/Arm: GICv3: Define macros to read/write 64 bit
  - From: Ayan Kumar Halder
- Re: [XEN v3 11/12] xen/Arm: GICv3: Define macros to read/write 64 bit
  - From: Julien Grall

Prev by Date: Re: [PATCH] ioreq_broadcast(): accept partial broadcast success
Next by Date: Re: [PATCH v3] xen-pciback: Consider INTx disabled when MSI/MSI-X is enabled
Previous by thread: Re: [XEN v3 11/12] xen/Arm: GICv3: Define macros to read/write 64 bit
Next by thread: Re: Making KMSAN compatible with paravirtualization
Index(es):
- Date
- Thread

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.