[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] [Xen-devel] [PATCH v4 1/3] xen/pt: fix some pass-thru devices don't work across reboot
I find some pass-thru devices don't work any more across guest reboot. Assigning it to another domain also meets the same issue. And the only way to make it work again is un-binding and binding it to pciback. Someone reported this issue one year ago [1]. If the device's driver doesn't disable MSI-X during shutdown or qemu is killed/crashed before the domain shutdown, this domain's pirq won't be unmapped. Then xen takes over this work, unmapping all pirq-s, when destroying guest. But as pciback has already disabled meory decoding before xen unmapping pirq, Xen has to sets the host_maskall flag and maskall bit to mask a MSI rather than sets maskbit in MSI-x table. The call trace of this process is: ->arch_domain_destroy ->free_domain_pirqs ->unmap_domain_pirq (if pirq isn't unmapped by qemu) ->pirq_guest_force_unbind ->__pirq_guest_unbind ->mask_msi_irq(=desc->handler->disable()) ->the warning in msi_set_mask_bit() The host_maskall bit will prevent guests from clearing the maskall bit even the device is assigned to another guest later. Then guests cannot receive MSIs from this device. To fix this issue, a pirq is unmapped before memory decoding is disabled by pciback. Specifically, when a device is detached from a guest, all established mappings between pirq and msi are destroying before changing the ownership. [1]: https://lists.xenproject.org/archives/html/xen-devel/2017-09/msg02520.html Signed-off-by: Chao Gao <chao.gao@xxxxxxxxx> --- Changes in v4: - split out change to 'msix->warned' field - handle multiple msi cases - use list_first_entry_or_null to traverse 'pdev->msi_list' --- xen/drivers/passthrough/io.c | 57 +++++++++++++++++++++++++++++-------------- xen/drivers/passthrough/pci.c | 51 ++++++++++++++++++++++++++++++++++++++ xen/include/xen/iommu.h | 1 + 3 files changed, 91 insertions(+), 18 deletions(-) diff --git a/xen/drivers/passthrough/io.c b/xen/drivers/passthrough/io.c index a6eb8a4..56ee1ef 100644 --- a/xen/drivers/passthrough/io.c +++ b/xen/drivers/passthrough/io.c @@ -619,6 +619,42 @@ int pt_irq_create_bind( return 0; } +static void pt_irq_destroy_bind_common(struct domain *d, struct pirq *pirq) +{ + struct hvm_pirq_dpci *pirq_dpci = pirq_dpci(pirq); + + ASSERT(spin_is_locked(&d->event_lock)); + + if ( pirq_dpci && (pirq_dpci->flags & HVM_IRQ_DPCI_MAPPED) && + list_empty(&pirq_dpci->digl_list) ) + { + pirq_guest_unbind(d, pirq); + msixtbl_pt_unregister(d, pirq); + if ( pt_irq_need_timer(pirq_dpci->flags) ) + kill_timer(&pirq_dpci->timer); + pirq_dpci->flags = 0; + /* + * See comment in pt_irq_create_bind's PT_IRQ_TYPE_MSI before the + * call to pt_pirq_softirq_reset. + */ + pt_pirq_softirq_reset(pirq_dpci); + + pirq_cleanup_check(pirq, d); + } +} + +void pt_irq_destroy_bind_msi(struct domain *d, struct pirq *pirq) +{ + struct hvm_pirq_dpci *pirq_dpci = pirq_dpci(pirq); + + ASSERT(spin_is_locked(&d->event_lock)); + + if ( pirq_dpci && pirq_dpci->gmsi.posted ) + pi_update_irte(NULL, pirq, 0); + + pt_irq_destroy_bind_common(d, pirq); +} + int pt_irq_destroy_bind( struct domain *d, const struct xen_domctl_bind_pt_irq *pt_irq_bind) { @@ -727,26 +763,11 @@ int pt_irq_destroy_bind( } else what = "bogus"; - } - else if ( pirq_dpci && pirq_dpci->gmsi.posted ) - pi_update_irte(NULL, pirq, 0); - - if ( pirq_dpci && (pirq_dpci->flags & HVM_IRQ_DPCI_MAPPED) && - list_empty(&pirq_dpci->digl_list) ) - { - pirq_guest_unbind(d, pirq); - msixtbl_pt_unregister(d, pirq); - if ( pt_irq_need_timer(pirq_dpci->flags) ) - kill_timer(&pirq_dpci->timer); - pirq_dpci->flags = 0; - /* - * See comment in pt_irq_create_bind's PT_IRQ_TYPE_MSI before the - * call to pt_pirq_softirq_reset. - */ - pt_pirq_softirq_reset(pirq_dpci); - pirq_cleanup_check(pirq, d); + pt_irq_destroy_bind_common(d, pirq); } + else + pt_irq_destroy_bind_msi(d, pirq); spin_unlock(&d->event_lock); diff --git a/xen/drivers/passthrough/pci.c b/xen/drivers/passthrough/pci.c index 1277ce2..ccae0ec 100644 --- a/xen/drivers/passthrough/pci.c +++ b/xen/drivers/passthrough/pci.c @@ -1514,6 +1514,55 @@ static int assign_device(struct domain *d, u16 seg, u8 bus, u8 devfn, u32 flag) return rc; } +/* + * Unmap established mappings between domain's pirq and device's MSI. + * These mappings were set up by qemu/guest and are expected to be + * destroyed when changing the device's ownership. + */ +static void pci_unmap_msi(struct pci_dev *pdev) +{ + struct msi_desc *entry; + struct domain *d = pdev->domain; + + ASSERT(pcidevs_locked()); + + if ( !d ) + return; + + spin_lock(&d->event_lock); + while ( (entry = list_first_entry_or_null(&pdev->msi_list, + struct msi_desc, list)) != NULL ) + { + struct pirq *info; + int pirq = 0; + unsigned int nr = entry->msi_attrib.type != PCI_CAP_ID_MSIX + ? entry->msi.nvec : 1; + + while ( nr -- ) + { + struct hvm_pirq_dpci *pirq_dpci; + + pirq = domain_irq_to_pirq(d, entry[nr].irq); + WARN_ON(pirq < 0); + if ( pirq <= 0 ) + continue; + + info = pirq_info(d, pirq); + if ( !info ) + continue; + + pirq_dpci = pirq_dpci(info); + if ( pirq_dpci && + (pirq_dpci->flags & HVM_IRQ_DPCI_MACH_MSI) && + (pirq_dpci->flags & HVM_IRQ_DPCI_GUEST_MSI) ) + pt_irq_destroy_bind_msi(d, info); + } + if ( pirq > 0 ) + unmap_domain_pirq(d, pirq); + } + spin_unlock(&d->event_lock); +} + /* caller should hold the pcidevs_lock */ int deassign_device(struct domain *d, u16 seg, u8 bus, u8 devfn) { @@ -1529,6 +1578,8 @@ int deassign_device(struct domain *d, u16 seg, u8 bus, u8 devfn) if ( !pdev ) return -ENODEV; + pci_unmap_msi(pdev); + while ( pdev->phantom_stride ) { devfn += pdev->phantom_stride; diff --git a/xen/include/xen/iommu.h b/xen/include/xen/iommu.h index 3d78126..8aecf43 100644 --- a/xen/include/xen/iommu.h +++ b/xen/include/xen/iommu.h @@ -108,6 +108,7 @@ struct pirq; int hvm_do_IRQ_dpci(struct domain *, struct pirq *); int pt_irq_create_bind(struct domain *, const struct xen_domctl_bind_pt_irq *); int pt_irq_destroy_bind(struct domain *, const struct xen_domctl_bind_pt_irq *); +void pt_irq_destroy_bind_msi(struct domain *d, struct pirq *pirq); void hvm_dpci_isairq_eoi(struct domain *d, unsigned int isairq); struct hvm_irq_dpci *domain_get_irq_dpci(const struct domain *); -- 1.8.3.1 _______________________________________________ Xen-devel mailing list Xen-devel@xxxxxxxxxxxxxxxxxxxx https://lists.xenproject.org/mailman/listinfo/xen-devel
|
Lists.xenproject.org is hosted with RackSpace, monitoring our |