[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-devel] [PATCH 00/60] xen: add core scheduling support


  • To: Juergen Gross <jgross@xxxxxxxx>, "xen-devel@xxxxxxxxxxxxxxxxxxxx" <xen-devel@xxxxxxxxxxxxxxxxxxxx>
  • From: Sergey Dyasli <sergey.dyasli@xxxxxxxxxx>
  • Date: Wed, 24 Jul 2019 15:54:58 +0100
  • Authentication-results: esa6.hc3370-68.iphmx.com; dkim=none (message not signed) header.i=none; spf=None smtp.pra=sergey.dyasli@xxxxxxxxxx; spf=Pass smtp.mailfrom=sergey.dyasli@xxxxxxxxxx; spf=None smtp.helo=postmaster@xxxxxxxxxxxxxxx
  • Autocrypt: addr=sergey.dyasli@xxxxxxxxxx; keydata= mQINBFtMVHEBEADc/hZcLexrB6vGTdGqEUsYZkFGQh6Z1OO7bCtM1go1RugSMeq9tkFHQSOc 9c7W9NVQqLgn8eefikIHxgic6tGgKoIQKcPuSsnqGao2YabsTSSoeatvmO5HkR0xGaUd+M6j iqv3cD7/WL602NhphT4ucKXCz93w0TeoJ3gleLuILxmzg1gDhKtMdkZv6TngWpKgIMRfoyHQ jsVzPbTTjJl/a9Cw99vuhFuEJfzbLA80hCwhoPM+ZQGFDcG4c25GQGQFFatpbQUhNirWW5b1 r2yVOziSJsvfTLnyzEizCvU+r/Ek2Kh0eAsRFr35m2X+X3CfxKrZcePxzAf273p4nc3YIK9h cwa4ZpDksun0E2l0pIxg/pPBXTNbH+OX1I+BfWDZWlPiPxgkiKdgYPS2qv53dJ+k9x6HkuCy i61IcjXRtVgL5nPGakyOFQ+07S4HIJlw98a6NrptWOFkxDt38x87mSM7aSWp1kjyGqQTGoKB VEx5BdRS5gFdYGCQFc8KVGEWPPGdeYx9Pj2wTaweKV0qZT69lmf/P5149Pc81SRhuc0hUX9K DnYBa1iSHaDjifMsNXKzj8Y8zVm+J6DZo/D10IUxMuExvbPa/8nsertWxoDSbWcF1cyvZp9X tUEukuPoTKO4Vzg7xVNj9pbK9GPxSYcafJUgDeKEIlkn3iVIPwARAQABtChTZXJnZXkgRHlh c2xpIDxzZXJnZXkuZHlhc2xpQGNpdHJpeC5jb20+iQJOBBMBCgA4FiEEkI7HMI5EbM2FLA1L Aa+w5JvbyusFAltMVHECGwMFCwkIBwIGFQoJCAsCBBYCAwECHgECF4AACgkQAa+w5JvbyuuQ JBAAry/oRK6m0I+ck1Tarz9a1RrF73r1YoJUk5Bw+PSxsBJOPp3vDeAz3Kqw58qmBXeNlMU4 1cqAxFxCCKMtER1gpmrKWBA1/H1ZoBRtzhaHgPTQLyR7LB1OgdpgwEOjN1Q5gME8Pk21y/3N cG5YBgD/ZHbq8nWS/G3r001Ie3nX55uacGk/Ry175cS48+asrerShKMDNMT1cwimo9zH/3Lm RTpWloh2dG4jjwtCXqB7s+FEE5wQVCpPp9p55+9pPd+3DXmsQEcJ/28XHo/UJW663WjRlRc4 wgPwiC9Co1HqaMKSzdPpZmI5D4HizWH8jF7ppUjWoPapwk4dEA7Al0vx1Bz3gbJAL8DaRgQp H4j/16ifletfGUNbHJR2vWljZ5SEf2vMVcdubf9eFUfBF/9OOR1Kcj1PISP8sPhcP7oCfFtH RcxXh1OStrRFtltJt2VlloKXAUggdewwyyD4xl9UHCfI4lSexOK37wNSQYPQcVcOS1bl4NhQ em6pw2AC32NsnQE5PmczFADDIpWhO/+WtkTFeE2HHfAn++y3YDtKQd7xes9UJjQNiGziArST l6Zrx4/nShVLeYRVW76l27gI5a8BZLWwBVRsWniGM50OOJULvSag7kh+cjsrXXpNuA4rfEoB Bxr7pso9e5YghupDc8XftsYd7mlAgOTCAC8uZme5Ag0EW0xUcQEQAMKi97v3DwwPgYVPYIbQ JAvoMgubJllC9RcE0PQsE6nEKSrfOT6Gh5/LHOXLbQI9nzU/xdr6kMfwbYVTnZIY/SwsLrJa gSKm64t11MjC1Vf03/sncx1tgI7nwqMMIAYLsXnQ9X/Up5L/gLO2YDIPxrQ6g4glgRYPT53i r6/hTz3dlpqyPCorpuF+WY7P2ujhlFlXCAaD6btPPM/9LZSmI0xS4aCBLH+pZeCr0UGSMhsX JYN0QRLjfsIDGyqaXVH9gwV2Hgsq6z8fNPQlBc3IpDvfXa1rYtgldYBfG521L3wnsMcKoFSr R5dpH7Jtvv5YBuAk8r571qlMhyAmVKiEnc+RonWl503D5bAHqNmFNjV248J5scyRD/+BcYLI 2CFG28XZrCvjxq3ux5hpmg2fCu+y98h6/yuwB/JhbFlDOSoluEpysiEL3R5GTKbxOF664q5W fiSObxNONxs86UtghqNDRUJgyS0W6TfykGOnZDVYAC9Gg8SbQDta1ymA0q76S/NG2MrJEOIr 1GtOr/UjNv2x4vW56dzX/3yuhK1ilpgzh1q504ETC6EKXMaFT8cNgsMlk9dOvWPwlsIJ249+ PizMDFGITxGTIrQAaUBO+HRLSBYdHNrHJtytkBoTjykCt7M6pl7l+jFYjGSw4fwexVy0MqsD AZ2coH82RTPb6Q7JABEBAAGJAjYEGAEKACAWIQSQjscwjkRszYUsDUsBr7Dkm9vK6wUCW0xU cQIbDAAKCRABr7Dkm9vK6+9uD/9Ld3X5cvnrwrkFMddpjFKoJ4yphtX2s+EQfKT6vMq3A1dJ tI7zHTFm60uBhX6eRbQow8fkHPcjXGJEoCSJf8ktwx/HYcBcnUK/aulHpvHIIYEma7BHry4x L+Ap7oBbBNiraS3Wu1k+MaX07BWhYYkpu7akUEtaYsCceVc4vpYNITUzPYCHeMwc5pLICA+7 VdI1rrTSAwlCtLGBt7ttbvaAKN4dysiN+/66Hlxnn8n952lZdG4ThPPzafG50EgcTa+dASgm tc6HaQAmJiwb4iWUOoUoM+udLRHcN6cE0bQivyH1bqF4ROeFBRz00MUJKvzUynR9E50F9hmd DOBJkyM3Z5imQ0RayEkRHhlhj7uECaojnUeewq4zjpAg2HTSMkdEzKRbdMEyXCdQXFnSCmUB 5yMIULuDbOODWo3EufExLjAKzIRWEKQ/JidLzO6hrhlQffsJ7MPTU+Hg7WxqWfn4zhuUcIQB SlkiRMalSiJITC2jG7oQRRh9tyNaDMkKzTbeFtHKRmUUAuhE0LBXP8Wc+5W7b3WOf2SO8JMR 4TqDZ0K06s66S5fOTW0h56iCCxTsAnRvM/tA4SERyRoFs/iTqJzboskZY0yKeWV4/IQxfOyC YwdU3//zANM1ZpqeE/8lnW/kx+fyzVyEioLSwkjDvdG++4GQ5r6PHQ7BbdEWhA==
  • Cc: Tim Deegan <tim@xxxxxxx>, Stefano Stabellini <sstabellini@xxxxxxxxxx>, Wei Liu <wl@xxxxxxx>, Konrad Rzeszutek Wilk <konrad.wilk@xxxxxxxxxx>, George Dunlap <george.dunlap@xxxxxxxxxxxxx>, Andrew Cooper <andrew.cooper3@xxxxxxxxxx>, Ian Jackson <ian.jackson@xxxxxxxxxxxxx>, RobertVanVossen <robert.vanvossen@xxxxxxxxxxxxxxx>, Dario Faggioli <dfaggioli@xxxxxxxx>, Julien Grall <julien.grall@xxxxxxx>, Josh Whitehead <josh.whitehead@xxxxxxxxxxxxxxx>, Meng Xu <mengxu@xxxxxxxxxxxxx>, Jan Beulich <jbeulich@xxxxxxxx>, "sergey.dyasli@xxxxxxxxxx >> Sergey Dyasli" <sergey.dyasli@xxxxxxxxxx>, Roger Pau Monné <roger.pau@xxxxxxxxxx>
  • Delivery-date: Wed, 24 Jul 2019 14:55:13 +0000
  • Ironport-sdr: YDM2oIZEMC66S2j01jc5ucaKKl9b5GkKTLhVo6A1akIE5PQLU1aBvdLlcxbHwgSBtwK68X9i1e ixowW1YZD90lT7C+8I5JCKINDt0NfrMcMYslL8qcUZzLifwv7ClvOHavWo+aQENMO2LcxWSmFK SN65rVcr2xuCAwOGJQ/c8OpCcOT3zPcEEz7xiRVgFUo80gPDMCF5+3DSv8SgZhlWhb89el/FBk wtEBWN9UlJ2KpjnIQruJLjSwZjMfdW6qrwWRUum9Gla/HFERaaDj3sgEwccIWFJfwLpL+Jnq+O Bdg=
  • List-id: Xen developer discussion <xen-devel.lists.xenproject.org>
  • Openpgp: preference=signencrypt

On 24/07/2019 10:13, Juergen Gross wrote:
> The fix is a one-liner. :-)
> 
> diff --git a/xen/common/schedule.c b/xen/common/schedule.c
> index f0bc5b3161..da9efb147f 100644
> --- a/xen/common/schedule.c
> +++ b/xen/common/schedule.c
> @@ -2207,6 +2207,7 @@ static struct sched_unit 
> *sched_wait_rendezvous_in(struct sched_unit *prev,
>          if ( unlikely(!scheduler_active) )
>          {
>              ASSERT(is_idle_unit(prev));
> +            atomic_set(&prev->next_task->rendezvous_out_cnt, 0);
>              prev->rendezvous_in_cnt = 0;
>          }
>      }

Even with that applied, I'm still seeing it :(

(XEN) [  311.223780] Watchdog timer detects that CPU1 is stuck!

(XEN) [  311.229413] ----[ Xen-4.13.0  x86_64  debug=y   Not tainted ]----

(XEN) [  311.236002] CPU:    1

(XEN) [  311.238774] RIP:    e008:[<ffff82d0802408a8>] 
sched_context_switched+0x92/0x101

(XEN) [  311.246575] RFLAGS: 0000000000000202   CONTEXT: hypervisor

(XEN) [  311.252556] rax: 0000000000000002   rbx: ffff83081cc635b0   rcx: 
0000000000000001

(XEN) [  311.260530] rdx: ffff83081cc63634   rsi: ffff83081cc8f000   rdi: 
ffff83081cc8f000

(XEN) [  311.268502] rbp: ffff83081cc87df0   rsp: ffff83081cc87dd0   r8:  
0000000000000000

(XEN) [  311.276474] r9:  ffff83081cc62000   r10: ffff83081cc62000   r11: 
ffff83081cc6b000

(XEN) [  311.284448] r12: ffff83081cc8f000   r13: ffff83081cc8f000   r14: 
ffff83081cc61e80

(XEN) [  311.292422] r15: ffff82d0805e2260   cr0: 000000008005003b   cr4: 
00000000001526e0

(XEN) [  311.300395] cr3: 00000000dd4ac000   cr2: 0000559b05a94048

(XEN) [  311.306288] fsb: 0000000000000000   gsb: ffff8880a3940000   gss: 
0000000000000000

(XEN) [  311.314262] ds: 002b   es: 002b   fs: 0000   gs: 0000   ss: e010   cs: 
e008

(XEN) [  311.321716] Xen code around <ffff82d0802408a8> 
(sched_context_switched+0x92/0x101):

(XEN) [  311.329862]  85 c0 74 08 f3 90 8b 02 <85> c0 75 f8 49 8b 44 24 10 66 
81 38 ff 7f 75 05

(XEN) [  311.338269] Xen stack trace from rsp=ffff83081cc87dd0:

(XEN) [  311.343904]    ffff83081cc8f000 ffff83081cc8f000 0000000000000000 
ffff83081cc635b0

(XEN) [  311.351963]    ffff83081cc87e28 ffff82d080240996 ffff83081cc61e98 
ffff82d08060a4a8

(XEN) [  311.360022]    ffff83081cc61e98 ffff82d08060a4a8 ffff83081cc635b0 
ffff83081cc87e80

(XEN) [  311.368083]    ffff82d080240f7a 0000000000000001 ffff83081cc8f000 
00000047588837ec

(XEN) [  311.376142]    000000011cc87ec0 ffff82d0805c3a00 ffff82d0805c3980 
ffffffffffffffff

(XEN) [  311.384205]    ffff82d0805d3980 ffff82d0805e2260 ffff83081cc87eb0 
ffff82d08024274a

(XEN) [  311.392263]    0000000000000001 ffff82d0805c3a00 0000000000000001 
0000000000000001

(XEN) [  311.400324]    ffff83081cc87ec0 ffff82d0802427bf ffff83081cc87ef0 
ffff82d080279a1d

(XEN) [  311.408385]    ffff83081cc8f000 ffff83081cc8f000 0000000000000001 
ffff83081cc635b0

(XEN) [  311.416443]    ffff83081cc87df0 ffff88809ee1ba00 ffff88809ee1ba00 
0000000000000000

(XEN) [  311.424504]    0000000000000000 0000000000000005 ffff88809ee1ba00 
0000000000000246

(XEN) [  311.432563]    aaaaaaaaaaaaaaaa 0000000000000000 000000000001ca00 
0000000000000000

(XEN) [  311.440625]    ffffffff810013aa ffffffff8203c190 deadbeefdeadf00d 
deadbeefdeadf00d

(XEN) [  311.448685]    0000010000000000 ffffffff810013aa 000000000000e033 
0000000000000246

(XEN) [  311.456747]    ffffc900400bfeb0 000000000000e02b 000000000000beef 
000000000000beef

(XEN) [  311.464807]    000000000000beef 000000000000beef 0000e01000000001 
ffff83081cc8f000

(XEN) [  311.472864]    000000379c665d00 00000000001526e0 0000000000000000 
0000000000000000

(XEN) [  311.480926]    0000060000000000 0000000000000000

(XEN) [  311.486041] Xen call trace:

(XEN) [  311.489332]    [<ffff82d0802408a8>] sched_context_switched+0x92/0x101

(XEN) [  311.496266]    [<ffff82d080240996>] 
schedule.c#sched_context_switch+0x7f/0x160

(XEN) [  311.503980]    [<ffff82d080240f7a>] schedule.c#sched_slave+0x28f/0x2b5

(XEN) [  311.510999]    [<ffff82d08024274a>] softirq.c#__do_softirq+0x61/0x8c

(XEN) [  311.517846]    [<ffff82d0802427bf>] do_softirq+0x13/0x15

(XEN) [  311.523653]    [<ffff82d080279a1d>] domain.c#idle_loop+0x52/0xa7

(XEN) [  311.530152]

(XEN) [  311.532144] CPU0 @ e008:ffff82d08024334d 
(stop_machine.c#stopmachine_wait_state+0x19/0x24)

(XEN) [  311.540899] CPU5 @ e008:ffff82d080243398 
(stop_machine.c#stopmachine_action+0x40/0x93)

(XEN) [  311.549307] CPU3 @ e008:ffff82d08024339e 
(stop_machine.c#stopmachine_action+0x46/0x93)

(XEN) [  311.557712] CPU4 @ e008:ffff82d08024339e 
(stop_machine.c#stopmachine_action+0x46/0x93)

(XEN) [  311.566119] CPU7 @ e008:ffff82d08024339e 
(stop_machine.c#stopmachine_action+0x46/0x93)

(XEN) [  311.574526] CPU2 @ e008:ffff82d080243398 
(stop_machine.c#stopmachine_action+0x40/0x93)

(XEN) [  311.582931] CPU6 @ e008:ffff82d080243398 
(stop_machine.c#stopmachine_action+0x40/0x93)

(XEN) [  311.591919]

(XEN) [  311.593914] ****************************************

(XEN) [  311.599374] Panic on CPU 1:

(XEN) [  311.602669] FATAL TRAP: vector = 2 (nmi)

(XEN) [  311.607088] [error_code=0000]

(XEN) [  311.610641] ****************************************

(XEN) [  311.616101]

(XEN) [  311.618095] Reboot in five seconds...

(XEN) [  311.622254] Executing kexec image on cpu1

(XEN) [  311.627534] Shot down all CPUs

_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxxx
https://lists.xenproject.org/mailman/listinfo/xen-devel

 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.