
[Xen-devel] [BUG] Xen BUG at irq.c:1705 after [VT-D]d1:PCIe: unmap


  • To: Xen Devel <xen-devel@xxxxxxxxxxxxx>
  • From: Håkon Alstadheim <hakon@xxxxxxxxxxxxxxxxxx>
  • Date: Sat, 5 Mar 2016 13:22:44 +0100
  • Delivery-date: Sat, 05 Mar 2016 12:23:18 +0000
  • List-id: Xen developer discussion <xen-devel.lists.xen.org>

This has happened in two of the last four shutdowns of one specific
domU, "garbo.hvm" (domid=1).
------- domU specs (no tmem in cmdline arg to domU): -----
Linux garbo 3.16.0-4-amd64 #1 SMP Debian 3.16.7-ckt20-1+deb8u4 (2016-02-29) x86_64
garbo:~# cat /proc/cmdline
BOOT_IMAGE=/boot/vmlinuz-3.16.0-4-amd64 root=UUID=042054cf-2a70-46de-bd90-c712fbedbf81 ro intel_iommu=on 3 console=ttyS0 console=vga
--------
serial log:
(XEN) [2016-03-04 23:38:48] tmem: allocating persistent-private tmem pool for domid=3...<G><2>pool_id=0
(XEN) [2016-03-04 23:39:12] grant_table.c:1491:d2v1 Expanding dom (2) grant table from (5) to (6) frames.
(XEN) [2016-03-04 23:40:28] [VT-D]d1:PCIe: unmap 0000:08:00.0
(XEN) [2016-03-04 23:40:28] [VT-D]d0:PCIe: map 0000:08:00.0
(XEN) [2016-03-04 23:40:30] tmem: flushing tmem pools for domid=1
(XEN) [2016-03-04 23:40:30] Destroying persistent-private tmem pool domid=1 pool_id=0
(XEN) [2016-03-04 23:40:30] Xen BUG at irq.c:1705
(XEN) [2016-03-04 23:40:30] ----[ Xen-4.6.1  x86_64  debug=y  Tainted:    C ]----
(XEN) [2016-03-04 23:40:30] CPU:    0
(XEN) [2016-03-04 23:40:30] RIP:    e008:[<ffff82d08016e89e>] __pirq_guest_unbind+0x83/0x273
(XEN) [2016-03-04 23:40:30] RFLAGS: 0000000000010046   CONTEXT: hypervisor (d0v0)
(XEN) [2016-03-04 23:40:30] rax: 0000000000000001   rbx: ffff8304caaee350   rcx: 0000000000000001
(XEN) [2016-03-04 23:40:30] rdx: ffff8304caaee398   rsi: ffff8304c0791e80   rdi: ffff8304caa6f000
(XEN) [2016-03-04 23:40:30] rbp: ffff83006e907c08   rsp: ffff83006e907bb8   r8:  ffff83006e907c48
(XEN) [2016-03-04 23:40:30] r9:  0000000000000040   r10: ffff8304c0786010   r11: ffffffffffffffc0
(XEN) [2016-03-04 23:40:30] r12: ffff830839c01300   r13: ffff8304c0791e80   r14: ffff83006e907ca8
(XEN) [2016-03-04 23:40:30] r15: 0000000000000000   cr0: 0000000080050033   cr4: 00000000001526e0
(XEN) [2016-03-04 23:40:30] cr3: 00000004d3ac5000   cr2: 00007f1d1d38ca00
(XEN) [2016-03-04 23:40:30] ds: 0000   es: 0000   fs: 0000   gs: 0000   ss: e010   cs: e008
(XEN) [2016-03-04 23:40:30] Xen stack trace from rsp=ffff83006e907bb8:
(XEN) [2016-03-04 23:40:30]    ffff82d08012cf3e ffff830839c01300 ffff83006e907c08 ffff82d080170501
(XEN) [2016-03-04 23:40:30]    0000000000000206 ffff830839c01300 ffff8304c0791e80 ffff8304caa6f000
(XEN) [2016-03-04 23:40:30]    ffff83006e907ca8 0000000000000000 ffff83006e907c38 ffff82d080170f0a
(XEN) [2016-03-04 23:40:30]    0000000000000003 ffff8304c0791ea8 0000000000000008 ffff83006e907ca8
(XEN) [2016-03-04 23:40:30]    ffff83006e907c78 ffff82d080149406 0000000000000018 0000000000000003
(XEN) [2016-03-04 23:40:30]    0000000000000013 0000000000000008 ffff83006e907ca8 0000000000000000
(XEN) [2016-03-04 23:40:30]    ffff83006e907d18 ffff82d0801489e0 ffff8304caa6f100 0000000000000000
(XEN) [2016-03-04 23:40:30]    ffff82d0801493e9 ffff8304caa6f000 ffff8304c0786f00 ffff83091583c080
(XEN) [2016-03-04 23:40:30]    ffff83091583c180 ffff8304c0791e80 ffff83091583c280 ffff83091583c380
(XEN) [2016-03-04 23:40:30]    ffff83091583c500 ffff83091583c600 0000000000000000 ffff8304caa6f000
(XEN) [2016-03-04 23:40:30]    ffff8304caa6f0dc ffff8304caa6f000 ffff8304c0786240 00007f1d1e399004
(XEN) [2016-03-04 23:40:30]    ffff83006e907d58 ffff82d08014aa43 ffff83083ff44108 ffff8304caa6f000
(XEN) [2016-03-04 23:40:30]    ffff8304caa6f000 ffff880071f4cb00 0000000000305000 00007f1d1e399004
(XEN) [2016-03-04 23:40:30]    ffff83006e907d78 ffff82d0801652e7 ffff8304caa6f000 ffff8304caa6f000
(XEN) [2016-03-04 23:40:30]    ffff83006e907da8 ffff82d080106a6e ffff82d08012cfcb 0000000000000000
(XEN) [2016-03-04 23:40:30]    ffff8304caa6f000 ffff880071f4cb00 ffff83006e907ef8 ffff82d080103bdb
(XEN) [2016-03-04 23:40:30]    00000000000000e0 ffff830839c01300 ffff830839b0f000 ffff8304c074f400
(XEN) [2016-03-04 23:40:30]    ffff83006e907df8 ffff82d08016a1c1 ffff83006e907df8 ffff82d08012cf72
(XEN) [2016-03-04 23:40:30]    ffff83006e907e38 ffff82d080170811 ffff83006e6fb000 0000000000000100
(XEN) [2016-03-04 23:40:30]    ffff83006e907e78 0000000000000046 ffff830839b0f100 00000000001526e0
(XEN) [2016-03-04 23:40:30] Xen call trace:
(XEN) [2016-03-04 23:40:30]    [<ffff82d08016e89e>] __pirq_guest_unbind+0x83/0x273
(XEN) [2016-03-04 23:40:30]    [<ffff82d080170f0a>] pirq_guest_unbind+0x9e/0xd4
(XEN) [2016-03-04 23:40:30]    [<ffff82d080149406>] pci_clean_dpci_irq+0x1d/0xc8
(XEN) [2016-03-04 23:40:30]    [<ffff82d0801489e0>] pt_pirq_iterate+0xa6/0xe8
(XEN) [2016-03-04 23:40:30]    [<ffff82d08014aa43>] pci_release_devices+0x6e/0x13f
(XEN) [2016-03-04 23:40:30]    [<ffff82d0801652e7>] domain_relinquish_resources+0x4d/0x2c2
(XEN) [2016-03-04 23:40:30]    [<ffff82d080106a6e>] domain_kill+0x9e/0x13a
(XEN) [2016-03-04 23:40:30]    [<ffff82d080103bdb>] do_domctl+0xa42/0x1d3e
(XEN) [2016-03-04 23:40:30]    [<ffff82d08023bbf2>] lstar_enter+0xe2/0x13c
(XEN) [2016-03-04 23:40:30]
(XEN) [2016-03-04 23:40:30]
(XEN) [2016-03-04 23:40:30] ****************************************
(XEN) [2016-03-04 23:40:30] Panic on CPU 0:
(XEN) [2016-03-04 23:40:30] Xen BUG at irq.c:1705
(XEN) [2016-03-04 23:40:30] ****************************************
(XEN) [2016-03-04 23:40:30]
(XEN) [2016-03-04 23:40:30] Reboot in five seconds...
-----------------------------------------------------------------------------------------------------------------
I have qemu-dm logs for the domain, but I am not certain which log goes
with which run. I am attaching the ones I THINK are correct. My timezone
is GMT+1 and Xen appears to log in GMT, so the qemu-dm log with
file-system timestamp Mar 5 00:40 ought to be the right one.
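
The timestamp matching above can be double-checked with a few lines of
Python. This is only a sketch of the arithmetic: it assumes the Xen
console timestamps really are GMT/UTC and that the local zone is a fixed
GMT+1 offset, and the variable names are made up for illustration.

```python
from datetime import datetime, timedelta, timezone

# Timestamp of the "Xen BUG at irq.c:1705" line in the serial log.
# Assumption: the Xen console clock is GMT/UTC.
xen_bug_time = datetime(2016, 3, 4, 23, 40, 30, tzinfo=timezone.utc)

# Assumption: the reporter's local time is a fixed GMT+1 offset.
local_tz = timezone(timedelta(hours=1))

# Convert the hypervisor timestamp into local time so it can be compared
# with the file-system timestamp of the qemu-dm log.
local_bug_time = xen_bug_time.astimezone(local_tz)

print(local_bug_time.isoformat())  # 2016-03-05T00:40:30+01:00
```

Under those assumptions the crash falls at 00:40 local time on Mar 5,
which agrees with the modification time of the attached qemu-dm log.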
--------------------------- Pci device info:
# xl pci-list garbo.hvm
Vdev Device
05.0 0000:08:00.0
# lspci -vv -s 0000:08:00.0
08:00.0 Ethernet controller: Intel Corporation I210 Gigabit Network Connection (rev 03)
    Subsystem: ASUSTeK Computer Inc. I210 Gigabit Network Connection
    Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
    Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Latency: 0, Cache Line Size: 64 bytes
    Interrupt: pin A routed to IRQ 19
    Region 0: Memory at c6200000 (32-bit, non-prefetchable) [size=512K]
    Region 2: I/O ports at 3000 [size=32]
    Region 3: Memory at c6280000 (32-bit, non-prefetchable) [size=16K]
    Capabilities: [40] Power Management version 3
        Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA PME(D0+,D1-,D2-,D3hot+,D3cold+)
        Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=1 PME-
    Capabilities: [50] MSI: Enable- Count=1/1 Maskable+ 64bit+
        Address: 0000000000000000  Data: 0000
        Masking: 00000000  Pending: 00000000
    Capabilities: [70] MSI-X: Enable+ Count=5 Masked-
        Vector table: BAR=3 offset=00000000
        PBA: BAR=3 offset=00002000
    Capabilities: [a0] Express (v2) Endpoint, MSI 00
        DevCap:    MaxPayload 512 bytes, PhantFunc 0, Latency L0s <512ns, L1 <64us
            ExtTag- AttnBtn- AttnInd- PwrInd- RBE+ FLReset+
        DevCtl:    Report errors: Correctable- Non-Fatal- Fatal- Unsupported-
            RlxdOrd- ExtTag- PhantFunc- AuxPwr- NoSnoop+ FLReset-
            MaxPayload 128 bytes, MaxReadReq 512 bytes
        DevSta:    CorrErr- UncorrErr- FatalErr- UnsuppReq- AuxPwr+ TransPend-
        LnkCap:    Port #0, Speed 2.5GT/s, Width x1, ASPM L0s L1, Exit Latency L0s <2us, L1 <16us
            ClockPM- Surprise- LLActRep- BwNot- ASPMOptComp+
        LnkCtl:    ASPM L1 Enabled; RCB 64 bytes Disabled- CommClk+
            ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
        LnkSta:    Speed 2.5GT/s, Width x1, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
        DevCap2: Completion Timeout: Range ABCD, TimeoutDis+, LTR-, OBFF Not Supported
        DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-, LTR-, OBFF Disabled
        LnkCtl2: Target Link Speed: 2.5GT/s, EnterCompliance- SpeedDis-
             Transmit Margin: Normal Operating Range, EnterModifiedCompliance- ComplianceSOS-
             Compliance De-emphasis: -6dB
        LnkSta2: Current De-emphasis Level: -6dB, EqualizationComplete-, EqualizationPhase1-
             EqualizationPhase2-, EqualizationPhase3-, LinkEqualizationRequest-
    Capabilities: [100 v2] Advanced Error Reporting
        UESta:    DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq+ ACSViol-
        UEMsk:    DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
        UESvrt:    DLP+ SDES+ TLP- FCP+ CmpltTO- CmpltAbrt- UnxCmplt- RxOF+ MalfTLP+ ECRC- UnsupReq- ACSViol-
        CESta:    RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr+
        CEMsk:    RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr-
        AERCap:    First Error Pointer: 14, GenCap+ CGenEn- ChkCap+ ChkEn-
    Capabilities: [140 v1] Device Serial Number 38-2c-4a-ff-ff-c8-82-95
    Capabilities: [1a0 v1] Transaction Processing Hints
        Device specific mode supported
        Steering table in TPH capability structure
    Kernel driver in use: pciback
    Kernel modules: igb
--------------dom0 info: --------
# xl info
host                   : gentoo
release                : 4.1.18-gentoo
version                : #1 SMP Thu Mar 3 00:15:01 CET 2016
machine                : x86_64
nr_cpus                : 24
max_cpu_id             : 23
nr_nodes               : 2
cores_per_socket       : 6
threads_per_core       : 2
cpu_mhz                : 2394
hw_caps                : bfebfbff:2c100800:00000000:00007f00:77fefbff:00000000:00000021:000037ab
virt_caps              : hvm hvm_directio
total_memory           : 65376
free_memory            : 32820
sharing_freed_memory   : 0
sharing_used_memory    : 0
outstanding_claims     : 0
free_cpus              : 0
xen_major              : 4
xen_minor              : 6
xen_extra              : .1
xen_version            : 4.6.1
xen_caps               : xen-3.0-x86_64 xen-3.0-x86_32p hvm-3.0-x86_32 hvm-3.0-x86_32p hvm-3.0-x86_64
xen_scheduler          : credit
xen_pagesize           : 4096
platform_params        : virt_start=0xffff800000000000
xen_changeset          :
xen_commandline        : ssd-xen-dbg-noidle-lx-4.1.18-marker console_timestamps=date loglvl=all guest_loglvl=all sync_console iommu=1,verbose,debug iommu_inclusive_mapping=1 com1=115200,8n1 console=com1 dom0_max_vcpus=4 dom0_vcpus_pin=1 dom0_mem=8G,max:8G cpufreq=xen,performance,verbose tmem=1 sched_smt_power_savings=1 apic_verbosity=debug e820-verbose=1 core_parking=power cpuidle=0
cc_compiler            : x86_64-pc-linux-gnu-gcc (Gentoo 4.9.3 p1.5, pie-0.6.4) 4.9.3
cc_compile_by          :
cc_compile_domain      : alstadheim.priv.no
cc_compile_date        : Fri Feb 19 18:44:38 CET 2016
xend_config_format     : 4

Attachment: qemu-dm-garbo.hvm.log.1
Description: Text document

Attachment: xl-garbo.hvm.log.1
Description: Binary data
