Xen project Mailing List

Re: [PATCH] xen/arm: fix SBDF calculation for vPCI MMIO handlers

To: Julien Grall <julien@xxxxxxx>, "xen-devel@xxxxxxxxxxxxxxxxxxxx" <xen-devel@xxxxxxxxxxxxxxxxxxxx>

From: Oleksandr Andrushchenko <Oleksandr_Andrushchenko@xxxxxxxx>

Date: Thu, 28 Oct 2021 14:16:02 +0000

Cc: "sstabellini@xxxxxxxxxx" <sstabellini@xxxxxxxxxx>, "iwj@xxxxxxxxxxxxxx" <iwj@xxxxxxxxxxxxxx>, Bertrand Marquis <bertrand.marquis@xxxxxxx>, Rahul Singh <rahul.singh@xxxxxxx>

Delivery-date: Thu, 28 Oct 2021 14:16:20 +0000

List-id: Xen developer discussion <xen-devel.lists.xenproject.org>

On 28.10.21 16:22, Julien Grall wrote:
> On 28/10/2021 13:09, Oleksandr Andrushchenko wrote:
>> Hi, Julien!
>
> Hello,
>
>> On 27.10.21 20:35, Julien Grall wrote:
>>> Hi Oleksandr,
>>>
>>> On 27/10/2021 09:25, Oleksandr Andrushchenko wrote:
>>>> From: Oleksandr Andrushchenko <oleksandr_andrushchenko@xxxxxxxx>
>>>>
>>>> While in vPCI MMIO trap handlers for the guest PCI host bridge it is not
>>>> enough for SBDF translation to simply call VPCI_ECAM_BDF(info->gpa) as
>>>> the base address may not be aligned in the way that the translation
>>>> always works. If not adjusted with respect to the base address it may not be
>>>> able to properly convert SBDF and crashes:
>>>>
>>>> (XEN) vpci_mmio_read 0000:65:1a.0 reg 8bc gpa e65d08bc
>>>
>>> I can't find a printk() that may output this message. Where does this come
>>> from?
>> That was a debug print. I shouldn't have used that in the patch description,
>> but probably after "---" to better explain what's happening
>>>
>>> Anyway, IIUC the guest physical address is 0xe65d08bc which, if I am not
>>> mistaken, doesn't belong to the range advertised for GUEST_VPCI_ECAM.
>> This is from dom0 I am working on now.
>>>
>>> IMHO, the stack trace should come from upstream Xen or need some
>>> information to explain how this was reproduced.
>>>
>>>> (XEN) Data Abort Trap. Syndrome=0x6
>>>> (XEN) Walking Hypervisor VA 0x467a28bc on CPU0 via TTBR 0x00000000481d5000
>>> I can understand that if we don't subtract GUEST_VPCI_ECAM, we would (in
>>> theory) not get the correct BDF. But... I don't understand how this would
>>> result in a data abort in the hypervisor.
>>>
>>> In fact, I think the vPCI code should be resilient enough to not crash if
>>> we pass the wrong BDF.
>> Well, there is no (?) easy way to validate SBDF.
>
> AFAICT pci_ecam_map_bus() is already doing some validation for the bus
> number. So...
what it does is not enough as...

    if ( busn < cfg->busn_start || busn > cfg->busn_end )
        return NULL;

    busn -= cfg->busn_start;
    base = cfg->win + (busn << ops->bus_shift);

    return base + (PCI_DEVFN2(sbdf.bdf) << devfn_shift) + where;

this can still overrun
>
>> And this could be a problem if we have a misbehaving
>> guest which may force Xen to access the memory beyond that of PCI host bridge
>>>
>>> When there is a data abort in Xen, you should get a stack trace from where
>>> this comes from. Can you paste it here?
>> (XEN) Data Abort Trap. Syndrome=0x6
>> (XEN) Walking Hypervisor VA 0x467a28bc on CPU0 via TTBR 0x00000000481d5000
>> (XEN) 0TH[0x0] = 0x00000000481d4f7f
>> (XEN) 1ST[0x1] = 0x00000000481d2f7f
>> (XEN) 2ND[0x33] = 0x0000000000000000
>> (XEN) CPU0: Unexpected Trap: Data Abort
>
> ... I am getting quite confused why this is crashing. Are we validating
> the access correctly?
See above. If provided with a big enough SBDF we can end up getting out of the
window.
>
>
>> (XEN) ----[ Xen-4.16-unstable arm64 debug=y Not tainted ]----
>> (XEN) CPU: 0
>> (XEN) PC: 000000000026d3d4 pci_generic_config_read+0x88/0x9c
>> (XEN) LR: 000000000026d36c
>> (XEN) SP: 000080007ff97c00
>> (XEN) CPSR: 0000000060400249 MODE:64-bit EL2h (Hypervisor, handler)
>> (XEN) X0: 00000000467a28bc X1: 00000000065d08bc X2: 00000000000008bc
>> (XEN) X3: 000000000000000c X4: 000080007ffc6fd0 X5: 0000000000000000
>> (XEN) X6: 0000000000000014 X7: ffff800011a58000 X8: ffff0000225a0380
>> (XEN) X9: 0000000000000000 X10: 0101010101010101 X11: 0000000000000028
>> (XEN) X12: 0101010101010101 X13: 0000000000000020 X14: ffffffffffffffff
>> (XEN) X15: 0000000000000001 X16: ffff800010da6708 X17: 0000000000000020
>> (XEN) X18: 0000000000000002 X19: 0000000000000004 X20: 000080007ff97c5c
>> (XEN) X21: 00000000000008bc X22: 00000000000008bc X23: 0000000000000004
>> (XEN) X24: 0000000000000000 X25: 00000000000008bc X26: 00000000000065d0
>> (XEN) X27: 000080007ffb9010 X28: 0000000000000000 FP: 000080007ff97c00
>> (XEN)
>> (XEN) VTCR_EL2: 00000000800a3558
>> (XEN) VTTBR_EL2: 00010000bffba000
>> (XEN)
>> (XEN) SCTLR_EL2: 0000000030cd183d
>> (XEN) HCR_EL2: 00000000807c663f
>> (XEN) TTBR0_EL2: 00000000481d5000
>> (XEN)
>> (XEN) ESR_EL2: 0000000096000006
>> (XEN) HPFAR_EL2: 0000000000e65d00
>> (XEN) FAR_EL2: 00000000467a28bc
>> (XEN)
>> [snip]
>> (XEN) Xen call trace:
>> (XEN) [<000000000026d3d4>] pci_generic_config_read+0x88/0x9c (PC)
>> (XEN) [<000000000026d36c>] pci_generic_config_read+0x20/0x9c (LR)
>> (XEN) [<000000000026d2c8>] pci-access.c#pci_config_read+0x60/0x84
>> (XEN) [<000000000026d4a8>] pci_conf_read32+0x10/0x18
>> (XEN) [<000000000024dcf8>] vpci.c#vpci_read_hw+0x48/0xb8
>> (XEN) [<000000000024e3e4>] vpci_read+0xac/0x24c
>> (XEN) [<000000000024e934>] vpci_ecam_read+0x78/0xa8
>> (XEN) [<000000000026e368>] vpci.c#vpci_mmio_read+0x44/0x7c
>> (XEN) [<0000000000275054>] try_handle_mmio+0x1ec/0x264
>> (XEN) [<000000000027ea50>] traps.c#do_trap_stage2_abort_guest+0x18c/0x2d8
>> (XEN) [<000000000027f088>] do_trap_guest_sync+0xf0/0x618
>> (XEN) [<0000000000269c58>] entry.o#guest_sync_slowpath+0xa4/0xd4
>> (XEN)
>> (XEN)
>> (XEN) ****************************************
>> (XEN) Panic on CPU 0:
>> (XEN) CPU0: Unexpected Trap: Data Abort
>> (XEN) ****************************************
>>
>>>
>>>>
>>>> Fix this by adjusting the gpa with respect to the host bridge base address
>>>> in a way as it is done for x86.
>>>>
>>>> Fixes: d59168dc05a5 ("xen/arm: Enable the existing x86 virtual PCI support
>>>> for ARM")
>>>>
>>>> Signed-off-by: Oleksandr Andrushchenko <oleksandr_andrushchenko@xxxxxxxx>
>>>> ---
>>>>  xen/arch/arm/vpci.c | 4 ++--
>>>>  1 file changed, 2 insertions(+), 2 deletions(-)
>>>>
>>>> diff --git a/xen/arch/arm/vpci.c b/xen/arch/arm/vpci.c
>>>> index 8f40a0dec6d2..23f45386f4b3 100644
>>>> --- a/xen/arch/arm/vpci.c
>>>> +++ b/xen/arch/arm/vpci.c
>>>> @@ -24,7 +24,7 @@ static int vpci_mmio_read(struct vcpu *v, mmio_info_t
>>>> *info,
>>>>      unsigned long data;
>>>>      /* We ignore segment part and always handle segment 0 */
>>>> -    sbdf.sbdf = VPCI_ECAM_BDF(info->gpa);
>>>> +    sbdf.sbdf = VPCI_ECAM_BDF(info->gpa - GUEST_VPCI_ECAM_BASE);
>>>
>>> Looking at the rest of the code, it seems that
>>>  1) the issue is latent as the bits 0-27 are clear
>>>  2) this will need to be modified to take into account dom0.
>>>
>>> So I would prefer if the base address is passed differently (maybe in
>>> priv?) from the start. This will avoid extra modification that you already
>>> plan to have in a follow-up series.
>> I was thinking about the same, but the future code will use priv for other
>> purpose:
>>
>> static int vpci_mmio_read(struct vcpu *v, mmio_info_t *info,
>>                           register_t *r, void *p)
>> {
>>     struct pci_host_bridge *bridge = p;
>>     pci_sbdf_t sbdf;
>>     /* data is needed to prevent a pointer cast on 32bit */
>>     unsigned long data;
>>
>>     if ( bridge )
>>     {
>>         sbdf.sbdf = VPCI_ECAM_BDF(info->gpa - bridge->cfg->phys_addr);
>>         sbdf.seg = bridge->segment;
>>     }
>>     else
>>         sbdf.sbdf = VPCI_ECAM_BDF(info->gpa - GUEST_VPCI_ECAM_BASE);
>
> Is it the only place you are going to use bridge? If so, then I think we can
> simply have a structure that would contain phys_addr and segment.
>
> This would be included in the bridge for dom0 and for guest this could be a
> static global variable for now.
Hm. I don't think a global is any better than using
info->gpa - GUEST_VPCI_ECAM_BASE.
But I am fine with the structure: please let me know your preference, so I can
have an acceptable fix
>
> Cheers,
>
Thank you,
Oleksandr
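
For readers following the arithmetic in this thread, here is a minimal standalone sketch of the translation being discussed. It is not Xen code: ECAM_BDF() is assumed to use the same standard ECAM bit layout as VPCI_ECAM_BDF() (bits 27:20 bus, 19:15 device, 14:12 function, 11:0 register), and struct ecam_window, ecam_translate() and the window values in the note below are hypothetical names and numbers chosen only for illustration. The static_assert reproduces the debug print above: masking the raw gpa e65d08bc yields 0x65d0, i.e. 65:1a.0.

```c
#include <assert.h>   /* static_assert (C11) */
#include <stdbool.h>
#include <stdint.h>

/*
 * Assumed to mirror VPCI_ECAM_BDF(): standard ECAM layout with
 * bits 27:20 bus, 19:15 device, 14:12 function, 11:0 register.
 */
#define ECAM_BDF(addr)  (((addr) & 0x0ffff000u) >> 12)
#define ECAM_REG(addr)  ((addr) & 0x00000fffu)

/* Decoding the raw gpa from the debug print gives 0x65d0 (65:1a.0). */
static_assert(ECAM_BDF(0xe65d08bcu) == 0x65d0u,
              "unadjusted gpa decodes to bus 0x65, devfn 0xd0");

/* Hypothetical description of an ECAM window; not a Xen structure. */
struct ecam_window {
    uint64_t base;  /* guest physical address where the window starts */
    uint64_t size;  /* window size in bytes */
};

/*
 * Translate a trapped guest physical address into BDF and register offset.
 * Returns false when gpa lies outside the window, so a caller never builds
 * a pointer beyond the mapped config space.
 */
static bool ecam_translate(const struct ecam_window *w, uint64_t gpa,
                           uint16_t *bdf, uint16_t *reg)
{
    if ( gpa < w->base || gpa - w->base >= w->size )
        return false;

    /* Subtract the base first; only the offset carries the BDF bits. */
    uint64_t off = gpa - w->base;

    *bdf = (uint16_t)ECAM_BDF(off);
    *reg = (uint16_t)ECAM_REG(off);

    return true;
}
```

With a hypothetical window starting at 0xe6000000 and 16 MiB long, the same gpa e65d08bc translates to BDF 0x05d0 (05:1a.0) and register 0x8bc rather than 65:1a.0, and any address at or beyond 0xe7000000 is rejected instead of being turned into an offset past the window.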
