|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] Re: [PATCH v9 02/16] vpci: use per-domain PCI lock to protect vpci structure
On Wed, Sep 20, 2023 at 03:16:00PM -0400, Stewart Hildebrand wrote:
> On 9/19/23 11:39, Roger Pau Monné wrote:
> > On Tue, Aug 29, 2023 at 11:19:42PM +0000, Volodymyr Babchuk wrote:
> >> diff --git a/xen/arch/x86/hvm/vmx/vmx.c b/xen/arch/x86/hvm/vmx/vmx.c
> >> index 1edc7f1e91..545a27796e 100644
> >> --- a/xen/arch/x86/hvm/vmx/vmx.c
> >> +++ b/xen/arch/x86/hvm/vmx/vmx.c
> >> @@ -413,8 +413,6 @@ static int cf_check vmx_pi_update_irte(const struct
> >> vcpu *v,
> >>
> >> spin_unlock_irq(&desc->lock);
> >>
> >> - ASSERT(pcidevs_locked());
> >> -
> >
> > Hm, this removal seems dubious, same with some of the removal below.
> > And I don't see any comment in the log message as to why removing the
> > asserts here and in __pci_enable_msi{,x}(), pci_prepare_msix() is
> > safe.
> >
>
> I suspect we may want:
>
> ASSERT(pcidevs_locked() || rw_is_locked(&d->pci_lock));
>
> However, we don't have d. Using v->domain here is tricky because v may be
> NULL. How about passing struct domain *d as an arg to
> {hvm,vmx}_pi_update_irte()? Or ensuring that all callers pass a valid v?
I guess there was a reason to expect a path with v == NULL, but would
need to go trough the call paths that lead here.
Another option might be use use:
ASSERT(pcidevs_locked() || (v && rw_is_locked(&v->domain->pci_lock)));
But we would need some understanding of the call site of
vmx_pi_update_irte().
>
> >> return iommu_update_ire_from_msi(msi_desc, &msi_desc->msg);
> >>
> >> unlock_out:
> >> diff --git a/xen/arch/x86/msi.c b/xen/arch/x86/msi.c
> >> index d0bf63df1d..ba2963b7d2 100644
> >> --- a/xen/arch/x86/msi.c
> >> +++ b/xen/arch/x86/msi.c
> >> @@ -613,7 +613,7 @@ static int msi_capability_init(struct pci_dev *dev,
> >> u8 slot = PCI_SLOT(dev->devfn);
> >> u8 func = PCI_FUNC(dev->devfn);
> >>
> >> - ASSERT(pcidevs_locked());
> >> + ASSERT(pcidevs_locked() || rw_is_locked(&dev->domain->pci_lock));
> >> pos = pci_find_cap_offset(seg, bus, slot, func, PCI_CAP_ID_MSI);
> >> if ( !pos )
> >> return -ENODEV;
> >> @@ -783,7 +783,7 @@ static int msix_capability_init(struct pci_dev *dev,
> >> if ( !pos )
> >> return -ENODEV;
> >>
> >> - ASSERT(pcidevs_locked());
> >> + ASSERT(pcidevs_locked() || rw_is_locked(&dev->domain->pci_lock));
> >>
> >> control = pci_conf_read16(dev->sbdf, msix_control_reg(pos));
> >> /*
> >> @@ -1000,7 +1000,6 @@ static int __pci_enable_msi(struct msi_info *msi,
> >> struct msi_desc **desc)
> >> struct pci_dev *pdev;
> >> struct msi_desc *old_desc;
> >>
> >> - ASSERT(pcidevs_locked());
> >> pdev = pci_get_pdev(NULL, msi->sbdf);
> >> if ( !pdev )
> >> return -ENODEV;
>
> I think we can move the ASSERT here, after we obtain the pdev. Then we can
> add the pdev->domain->pci_lock check into the mix:
>
> ASSERT(pcidevs_locked() || rw_is_locked(&pdev->domain->pci_lock));
Hm, it would be better to perform the ASSERT before possibly accessing
the pdev list without holding any locks, but it's just an assert so
that might be the best option.
>
> >> @@ -1055,7 +1054,6 @@ static int __pci_enable_msix(struct msi_info *msi,
> >> struct msi_desc **desc)
> >> struct pci_dev *pdev;
> >> struct msi_desc *old_desc;
> >>
> >> - ASSERT(pcidevs_locked());
> >> pdev = pci_get_pdev(NULL, msi->sbdf);
> >> if ( !pdev || !pdev->msix )
> >> return -ENODEV;
>
> Same here
>
> >> @@ -1170,8 +1168,6 @@ int pci_prepare_msix(u16 seg, u8 bus, u8 devfn, bool
> >> off)
> >> */
> >> int pci_enable_msi(struct msi_info *msi, struct msi_desc **desc)
> >> {
> >> - ASSERT(pcidevs_locked());
> >> -
>
> This removal inside pci_enable_msi() may be okay if both __pci_enable_msi()
> and __pci_enable_msix() have an appropriate ASSERT.
Hm, yes, that's likely fine, but would want a small mention in the
commit message.
> >> if ( !use_msi )
> >> return -EPERM;
> >>
>
> Related: in xen/drivers/passthrough/pci.c:pci_get_pdev() I run into an ASSERT
> with a PVH dom0:
>
> (XEN) Assertion 'd || pcidevs_locked()' failed at
> drivers/passthrough/pci.c:534
> (XEN) ----[ Xen-4.18-unstable x86_64 debug=y Tainted: C ]----
> ...
> (XEN) Xen call trace:
> (XEN) [<ffff82d040285a3b>] R pci_get_pdev+0x4c/0xab
> (XEN) [<ffff82d04034742e>] F arch/x86/msi.c#__pci_enable_msi+0x1d/0xb4
> (XEN) [<ffff82d0403477b5>] F pci_enable_msi+0x20/0x28
> (XEN) [<ffff82d04034cfa4>] F map_domain_pirq+0x2b0/0x718
> (XEN) [<ffff82d04034e37c>] F allocate_and_map_msi_pirq+0xff/0x26b
> (XEN) [<ffff82d0402e088b>] F arch/x86/hvm/vmsi.c#vpci_msi_enable+0x53/0x9d
> (XEN) [<ffff82d0402e19d5>] F vpci_msi_arch_enable+0x36/0x6c
> (XEN) [<ffff82d04026f49d>] F drivers/vpci/msi.c#control_write+0x71/0x114
> (XEN) [<ffff82d04026d050>] F
> drivers/vpci/vpci.c#vpci_write_helper+0x6f/0x7c
> (XEN) [<ffff82d04026de39>] F vpci_write+0x249/0x2f9
> ...
>
> With the patch applied, it's valid to call pci_get_pdev() with only
> d->pci_lock held, so the ASSERT in pci_get_pdev() needs to be reworked too.
> Inside pci_get_pdev(), d may be null, so we can't easily add ||
> rw_is_locked(&d->pci_lock) into the ASSERT. Instead I propose something like
> the following, which resolves the observed assertion failure:
>
> diff --git a/xen/drivers/passthrough/pci.c b/xen/drivers/passthrough/pci.c
> index 572643abe412..2b4ad804510c 100644
> --- a/xen/drivers/passthrough/pci.c
> +++ b/xen/drivers/passthrough/pci.c
> @@ -531,8 +531,6 @@ struct pci_dev *pci_get_pdev(const struct domain *d,
> pci_sbdf_t sbdf)
> {
> struct pci_dev *pdev;
>
> - ASSERT(d || pcidevs_locked());
> -
> /*
> * The hardware domain owns the majority of the devices in the system.
> * When there are multiple segments, traversing the per-segment list is
> @@ -549,12 +547,18 @@ struct pci_dev *pci_get_pdev(const struct domain *d,
> pci_sbdf_t sbdf)
> list_for_each_entry ( pdev, &pseg->alldevs_list, alldevs_list )
> if ( pdev->sbdf.bdf == sbdf.bdf &&
> (!d || pdev->domain == d) )
> + {
> + ASSERT(d || pcidevs_locked() ||
> rw_is_locked(&pdev->domain->pci_lock));
Hm, strictly speaking iterating over the pseg list while just holding
the d->pci_lock is not safe, we should instead iterate over d->pdev_list.
We might have to slightly modify pci_enable_msi() to take a pdev so
that the search can be done by the caller (holding the right lock).
Thanks, Roger.
|
![]() |
Lists.xenproject.org is hosted with RackSpace, monitoring our |