[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-devel] [PATCH 00/60] xen: add core scheduling support


  • To: Juergen Gross <jgross@xxxxxxxx>, "xen-devel@xxxxxxxxxxxxxxxxxxxx" <xen-devel@xxxxxxxxxxxxxxxxxxxx>
  • From: Sergey Dyasli <sergey.dyasli@xxxxxxxxxx>
  • Date: Mon, 22 Jul 2019 15:22:20 +0100
  • Authentication-results: esa2.hc3370-68.iphmx.com; dkim=none (message not signed) header.i=none; spf=None smtp.pra=sergey.dyasli@xxxxxxxxxx; spf=Pass smtp.mailfrom=sergey.dyasli@xxxxxxxxxx; spf=None smtp.helo=postmaster@xxxxxxxxxxxxxxx
  • Autocrypt: addr=sergey.dyasli@xxxxxxxxxx; keydata= mQINBFtMVHEBEADc/hZcLexrB6vGTdGqEUsYZkFGQh6Z1OO7bCtM1go1RugSMeq9tkFHQSOc 9c7W9NVQqLgn8eefikIHxgic6tGgKoIQKcPuSsnqGao2YabsTSSoeatvmO5HkR0xGaUd+M6j iqv3cD7/WL602NhphT4ucKXCz93w0TeoJ3gleLuILxmzg1gDhKtMdkZv6TngWpKgIMRfoyHQ jsVzPbTTjJl/a9Cw99vuhFuEJfzbLA80hCwhoPM+ZQGFDcG4c25GQGQFFatpbQUhNirWW5b1 r2yVOziSJsvfTLnyzEizCvU+r/Ek2Kh0eAsRFr35m2X+X3CfxKrZcePxzAf273p4nc3YIK9h cwa4ZpDksun0E2l0pIxg/pPBXTNbH+OX1I+BfWDZWlPiPxgkiKdgYPS2qv53dJ+k9x6HkuCy i61IcjXRtVgL5nPGakyOFQ+07S4HIJlw98a6NrptWOFkxDt38x87mSM7aSWp1kjyGqQTGoKB VEx5BdRS5gFdYGCQFc8KVGEWPPGdeYx9Pj2wTaweKV0qZT69lmf/P5149Pc81SRhuc0hUX9K DnYBa1iSHaDjifMsNXKzj8Y8zVm+J6DZo/D10IUxMuExvbPa/8nsertWxoDSbWcF1cyvZp9X tUEukuPoTKO4Vzg7xVNj9pbK9GPxSYcafJUgDeKEIlkn3iVIPwARAQABtChTZXJnZXkgRHlh c2xpIDxzZXJnZXkuZHlhc2xpQGNpdHJpeC5jb20+iQJOBBMBCgA4FiEEkI7HMI5EbM2FLA1L Aa+w5JvbyusFAltMVHECGwMFCwkIBwIGFQoJCAsCBBYCAwECHgECF4AACgkQAa+w5JvbyuuQ JBAAry/oRK6m0I+ck1Tarz9a1RrF73r1YoJUk5Bw+PSxsBJOPp3vDeAz3Kqw58qmBXeNlMU4 1cqAxFxCCKMtER1gpmrKWBA1/H1ZoBRtzhaHgPTQLyR7LB1OgdpgwEOjN1Q5gME8Pk21y/3N cG5YBgD/ZHbq8nWS/G3r001Ie3nX55uacGk/Ry175cS48+asrerShKMDNMT1cwimo9zH/3Lm RTpWloh2dG4jjwtCXqB7s+FEE5wQVCpPp9p55+9pPd+3DXmsQEcJ/28XHo/UJW663WjRlRc4 wgPwiC9Co1HqaMKSzdPpZmI5D4HizWH8jF7ppUjWoPapwk4dEA7Al0vx1Bz3gbJAL8DaRgQp H4j/16ifletfGUNbHJR2vWljZ5SEf2vMVcdubf9eFUfBF/9OOR1Kcj1PISP8sPhcP7oCfFtH RcxXh1OStrRFtltJt2VlloKXAUggdewwyyD4xl9UHCfI4lSexOK37wNSQYPQcVcOS1bl4NhQ em6pw2AC32NsnQE5PmczFADDIpWhO/+WtkTFeE2HHfAn++y3YDtKQd7xes9UJjQNiGziArST l6Zrx4/nShVLeYRVW76l27gI5a8BZLWwBVRsWniGM50OOJULvSag7kh+cjsrXXpNuA4rfEoB Bxr7pso9e5YghupDc8XftsYd7mlAgOTCAC8uZme5Ag0EW0xUcQEQAMKi97v3DwwPgYVPYIbQ JAvoMgubJllC9RcE0PQsE6nEKSrfOT6Gh5/LHOXLbQI9nzU/xdr6kMfwbYVTnZIY/SwsLrJa gSKm64t11MjC1Vf03/sncx1tgI7nwqMMIAYLsXnQ9X/Up5L/gLO2YDIPxrQ6g4glgRYPT53i r6/hTz3dlpqyPCorpuF+WY7P2ujhlFlXCAaD6btPPM/9LZSmI0xS4aCBLH+pZeCr0UGSMhsX JYN0QRLjfsIDGyqaXVH9gwV2Hgsq6z8fNPQlBc3IpDvfXa1rYtgldYBfG521L3wnsMcKoFSr R5dpH7Jtvv5YBuAk8r571qlMhyAmVKiEnc+RonWl503D5bAHqNmFNjV248J5scyRD/+BcYLI 2CFG28XZrCvjxq3ux5hpmg2fCu+y98h6/yuwB/JhbFlDOSoluEpysiEL3R5GTKbxOF664q5W fiSObxNONxs86UtghqNDRUJgyS0W6TfykGOnZDVYAC9Gg8SbQDta1ymA0q76S/NG2MrJEOIr 1GtOr/UjNv2x4vW56dzX/3yuhK1ilpgzh1q504ETC6EKXMaFT8cNgsMlk9dOvWPwlsIJ249+ PizMDFGITxGTIrQAaUBO+HRLSBYdHNrHJtytkBoTjykCt7M6pl7l+jFYjGSw4fwexVy0MqsD AZ2coH82RTPb6Q7JABEBAAGJAjYEGAEKACAWIQSQjscwjkRszYUsDUsBr7Dkm9vK6wUCW0xU cQIbDAAKCRABr7Dkm9vK6+9uD/9Ld3X5cvnrwrkFMddpjFKoJ4yphtX2s+EQfKT6vMq3A1dJ tI7zHTFm60uBhX6eRbQow8fkHPcjXGJEoCSJf8ktwx/HYcBcnUK/aulHpvHIIYEma7BHry4x L+Ap7oBbBNiraS3Wu1k+MaX07BWhYYkpu7akUEtaYsCceVc4vpYNITUzPYCHeMwc5pLICA+7 VdI1rrTSAwlCtLGBt7ttbvaAKN4dysiN+/66Hlxnn8n952lZdG4ThPPzafG50EgcTa+dASgm tc6HaQAmJiwb4iWUOoUoM+udLRHcN6cE0bQivyH1bqF4ROeFBRz00MUJKvzUynR9E50F9hmd DOBJkyM3Z5imQ0RayEkRHhlhj7uECaojnUeewq4zjpAg2HTSMkdEzKRbdMEyXCdQXFnSCmUB 5yMIULuDbOODWo3EufExLjAKzIRWEKQ/JidLzO6hrhlQffsJ7MPTU+Hg7WxqWfn4zhuUcIQB SlkiRMalSiJITC2jG7oQRRh9tyNaDMkKzTbeFtHKRmUUAuhE0LBXP8Wc+5W7b3WOf2SO8JMR 4TqDZ0K06s66S5fOTW0h56iCCxTsAnRvM/tA4SERyRoFs/iTqJzboskZY0yKeWV4/IQxfOyC YwdU3//zANM1ZpqeE/8lnW/kx+fyzVyEioLSwkjDvdG++4GQ5r6PHQ7BbdEWhA==
  • Cc: Tim Deegan <tim@xxxxxxx>, Stefano Stabellini <sstabellini@xxxxxxxxxx>, Wei Liu <wl@xxxxxxx>, Konrad Rzeszutek Wilk <konrad.wilk@xxxxxxxxxx>, George Dunlap <george.dunlap@xxxxxxxxxxxxx>, Andrew Cooper <andrew.cooper3@xxxxxxxxxx>, Ian Jackson <ian.jackson@xxxxxxxxxxxxx>, RobertVanVossen <robert.vanvossen@xxxxxxxxxxxxxxx>, Dario Faggioli <dfaggioli@xxxxxxxx>, Julien Grall <julien.grall@xxxxxxx>, Josh Whitehead <josh.whitehead@xxxxxxxxxxxxxxx>, Meng Xu <mengxu@xxxxxxxxxxxxx>, Jan Beulich <jbeulich@xxxxxxxx>, "sergey.dyasli@xxxxxxxxxx >> Sergey Dyasli" <sergey.dyasli@xxxxxxxxxx>, Roger Pau Monné <roger.pau@xxxxxxxxxx>
  • Delivery-date: Mon, 22 Jul 2019 14:22:42 +0000
  • Ironport-sdr: 7Nl7OwGg0o5/fJbztAznx/vtXtKav7MuBtpH+PJSr3eSkqmYSWq8WhOXGp8b7lKQUwwC6qh3XO fHt1mkTF/aI4RvtrAObUcicbJYjyWrAChXPYHfnWAYZOhS27mzR47NB0AyilAtTkrAkMuFfNIz ymVC9U5TbLjNCUIq99NT7ViX0hCj8G1KIvaw9R8HvemNK3MojaTlLOUCaHqfgD0zYk3DQS9YcQ GiB/Gk8J4Spj0Doew4UTFSuNwXvKhUTeylAqL/OnUTKyeQCWyUOcT/ho9lU9Q2HNZ7ueZhrTQA a8k=
  • List-id: Xen developer discussion <xen-devel.lists.xenproject.org>
  • Openpgp: preference=signencrypt

On 19/07/2019 14:57, Juergen Gross wrote:

> I have now a git branch with the two problems corrected and rebased to
> current staging available:
> 
> github.com/jgross1/xen.git sched-v1b

Many thanks for the branch! As for the crashes, vcpu_sleep_sync() one
seems to be fixed now. But I can still reproduce the shutdown one.
Interestingly, it now happens only if a host has running VMs (which
are automatically powered off via PV tools):

(XEN) [  332.981355] Preparing system for ACPI S5 state.
(XEN) [  332.981419] Disabling non-boot CPUs ...
(XEN) [  337.703896] Watchdog timer detects that CPU1 is stuck!
(XEN) [  337.709532] ----[ Xen-4.13.0-8.0.6-d  x86_64  debug=y   Not tainted 
]----
(XEN) [  337.716808] CPU:    1
(XEN) [  337.719582] RIP:    e008:[<ffff82d08024041c>] 
sched_context_switched+0xaf/0x101
(XEN) [  337.727384] RFLAGS: 0000000000000202   CONTEXT: hypervisor
(XEN) [  337.733364] rax: 0000000000000002   rbx: ffff83081cc615b0   rcx: 
0000000000000001
(XEN) [  337.741338] rdx: ffff83081cc61634   rsi: ffff83081cc72000   rdi: 
ffff83081cc72000
(XEN) [  337.749312] rbp: ffff83081cc8fdc0   rsp: ffff83081cc8fda0   r8:  
0000000000000000
(XEN) [  337.757284] r9:  0000000000000000   r10: 0000004d88fc535e   r11: 
0000004df8675ce7
(XEN) [  337.765256] r12: ffff83081cc72000   r13: ffff83081cc72000   r14: 
ffff83081ccb0e80
(XEN) [  337.773232] r15: ffff83081cc615b0   cr0: 000000008005003b   cr4: 
00000000001526e0
(XEN) [  337.781206] cr3: 00000000dd2a1000   cr2: ffff88809ed1fb80
(XEN) [  337.787100] fsb: 0000000000000000   gsb: ffff8880a38c0000   gss: 
0000000000000000
(XEN) [  337.795072] ds: 002b   es: 002b   fs: 0000   gs: 0000   ss: e010   cs: 
e008
(XEN) [  337.802525] Xen code around <ffff82d08024041c> 
(sched_context_switched+0xaf/0x101):
(XEN) [  337.810672]  00 00 eb 18 f3 90 8b 02 <85> c0 75 f8 eb 0e 49 8b 7e 30 
48 85 ff 74 05 e8
(XEN) [  337.819080] Xen stack trace from rsp=ffff83081cc8fda0:
(XEN) [  337.824713]    ffff83081cc72000 ffff83081cc72000 0000000000000000 
ffff83081cc615b0
(XEN) [  337.832772]    ffff83081cc8fe00 ffff82d0802404e0 0000000000000082 
ffff83081ccb0e98
(XEN) [  337.840832]    0000000000000001 ffff83081ccb0e98 0000000000000001 
ffff82d080602628
(XEN) [  337.848895]    ffff83081cc8fe60 ffff82d080240aca 0000004d873bd669 
0000000000000001
(XEN) [  337.856952]    ffff83081cc72000 0000004d873bdc1c ffff8308000000ff 
ffff82d0805bba00
(XEN) [  337.865012]    ffff82d0805bb980 ffffffffffffffff ffff83081cc8ffff 
0000000000000001
(XEN) [  337.873072]    ffff83081cc8fe90 ffff82d080242315 0000000000000080 
ffff82d0805bb980
(XEN) [  337.881132]    0000000000000001 ffff82d0806026f0 ffff83081cc8fea0 
ffff82d08024236a
(XEN) [  337.889196]    ffff83081cc8fef0 ffff82d08027a151 ffff82d080242315 
000000010665f000
(XEN) [  337.897256]    ffff83081cc72000 ffff83081cc72000 ffff83080665f000 
ffff83081cc63000
(XEN) [  337.905313]    0000000000000001 ffff830806684000 ffff83081cc8fd78 
ffff88809ee08000
(XEN) [  337.913373]    ffff88809ee08000 0000000000000000 0000000000000000 
0000000000000003
(XEN) [  337.921434]    ffff88809ee08000 0000000000000246 aaaaaaaaaaaaaaaa 
0000000000000000
(XEN) [  337.929497]    0000000096968abe 0000000000000000 ffffffff810013aa 
ffffffff8203c190
(XEN) [  337.937554]    deadbeefdeadf00d deadbeefdeadf00d 0000010000000000 
ffffffff810013aa
(XEN) [  337.945615]    000000000000e033 0000000000000246 ffffc900400afeb0 
000000000000e02b
(XEN) [  337.953674]    000000000000beef 000000000000beef 000000000000beef 
000000000000beef
(XEN) [  337.961736]    0000e01000000001 ffff83081cc72000 000000379c66db80 
00000000001526e0
(XEN) [  337.969797]    0000000000000000 0000000000000000 0000060000000000 
0000000000000000
(XEN) [  337.977856] Xen call trace:
(XEN) [  337.981152]    [<ffff82d08024041c>] sched_context_switched+0xaf/0x101
(XEN) [  337.988083]    [<ffff82d0802404e0>] 
schedule.c#sched_context_switch+0x72/0x151
(XEN) [  337.995796]    [<ffff82d080240aca>] schedule.c#sched_slave+0x2a3/0x2b2
(XEN) [  338.002817]    [<ffff82d080242315>] softirq.c#__do_softirq+0x85/0x90
(XEN) [  338.009664]    [<ffff82d08024236a>] do_softirq+0x13/0x15
(XEN) [  338.015471]    [<ffff82d08027a151>] domain.c#idle_loop+0xb2/0xc9
(XEN) [  338.021970]
(XEN) [  338.023965] CPU7 @ e008:ffff82d080242f94 
(stop_machine.c#stopmachine_action+0x30/0xa0)
(XEN) [  338.032372] CPU5 @ e008:ffff82d080242f94 
(stop_machine.c#stopmachine_action+0x30/0xa0)
(XEN) [  338.040776] CPU4 @ e008:ffff82d080242f94 
(stop_machine.c#stopmachine_action+0x30/0xa0)
(XEN) [  338.049182] CPU2 @ e008:ffff82d080242f9a 
(stop_machine.c#stopmachine_action+0x36/0xa0)
(XEN) [  338.057591] CPU6 @ e008:ffff82d080242f9a 
(stop_machine.c#stopmachine_action+0x36/0xa0)
(XEN) [  338.065999] CPU3 @ e008:ffff82d080242f9a 
(stop_machine.c#stopmachine_action+0x36/0xa0)
(XEN) [  338.074406] CPU0 @ e008:ffff82d0802532d1 
(ns16550.c#ns_read_reg+0x21/0x42)
(XEN) [  338.081773]
(XEN) [  338.083764] ****************************************
(XEN) [  338.089226] Panic on CPU 1:
(XEN) [  338.092521] FATAL TRAP: vector = 2 (nmi)
(XEN) [  338.096940] [error_code=0000]
(XEN) [  338.100491] ****************************************
(XEN) [  338.105951]
(XEN) [  338.107946] Reboot in five seconds...
(XEN) [  338.112105] Executing kexec image on cpu1
(XEN) [  338.117383] Shot down all CPUs

And since Igor managed to fix kdump, I can now post backtraces from
all CPUs as well: https://paste.debian.net/1092609/

Thanks,
Sergey

_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxxx
https://lists.xenproject.org/mailman/listinfo/xen-devel

 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.