
Re: [Xen-devel] [Xen-users] kernel 3.9.2 - xen 4.2.2/4.3rc1 => BUG unable to handle kernel paging request netif_poll+0x49c/0xe8



Ping

--

Best regards,

Eugene Istomin

 

 


On Saturday, May 18, 2013 01:06:56 PM Eugene Istomin wrote:

Hello,

 

Do you need any other logs?

--

Best regards,

Eugene Istomin

 


On Friday, May 17, 2013 04:00:07 PM Eugene Istomin wrote:

Bump, here it is:

 

 

 

template:/home/local # iperf -s

------------------------------------------------------------

Server listening on TCP port 5001

TCP window size: 85.3 KByte (default)

------------------------------------------------------------

 

 

 

[ 4] local 10.251.2.201 port 5001 connected with 10.251.2.202 port 23902

[ ID] Interval Transfer Bandwidth

[ 4] 0.0-10.1 sec 38.5 MBytes 32.0 Mbits/sec

[ 5] local 10.251.2.201 port 5001 connected with 10.251.2.202 port 23903

[ 124.555698] BUG: unable to handle kernel paging request at ffff880078453000

[ 124.555760] IP: [<ffffffffa001a75c>] netif_poll+0x49c/0xe80 [xennet]

[ 124.555813] PGD a83067 PUD a93067 PMD 7fc2f067 PTE 8010000078453065

[ 124.555866] Oops: 0003 [#1] SMP

[ 124.555894] Modules linked in: af_packet hwmon domctl crc32_pclmul crc32c_intel ghash_clmulni_intel joydev aesni_intel ablk_helper cryptd lrw aes_x86_64 xts gf128mul autofs4 scsi_dh_emc scsi_dh_alua scsi_dh_rdac scsi_dh_hp_sw scsi_dh xenblk cdrom xennet ata_generic ata_piix

[ 124.556126] CPU 0

[ 124.556143] Pid: 0, comm: swapper/0 Not tainted 3.9.2-4.756ee56-xen #1

[ 124.556190] RIP: e030:[<ffffffffa001a75c>] [<ffffffffa001a75c>] netif_poll+0x49c/0xe80 [xennet]

[ 124.556252] RSP: e02b:ffff88007b403d18 EFLAGS: 00010286

[ 124.556286] RAX: ffff88007da5f650 RBX: ffff880078452ec0 RCX: ffff880078453000

[ 124.556332] RDX: ffff880078876810 RSI: ffff880078452ec0 RDI: ffff880078451580

[ 124.556377] RBP: ffff88007802ac80 R08: 00000000000009e0 R09: 0000000000000000

[ 124.556423] R10: 0000000000000000 R11: 0000000000000001 R12: 0000000000000011

[ 124.556468] R13: 0000000000005afa R14: ffff88007b403dd8 R15: ffff880078440800

[ 124.556518] FS: 00007faae5316700(0000) GS:ffff88007b400000(0000) knlGS:0000000000000000

[ 124.556569] CS: e033 DS: 0000 ES: 0000 CR0: 000000008005003b

[ 124.557637] CR2: ffff880078453000 CR3: 0000000079f61000 CR4: 0000000000002660

[ 124.558714] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000

[ 124.559681] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400

[ 124.559681] Process swapper/0 (pid: 0, threadinfo ffffffff808b6000, task ffffffff808c1960)

[ 124.559681] Stack:

[ 124.559681] ffff8800786afb80 000000007953e510 000000000000ad00 ffff88007b403db8

[ 124.559681] ffff88007b403d98 0000001200005ae9 ffff880078441d50 0000000000000000

[ 124.559681] ffff880078440858 ffff88007b40e878 0000000100000012 0000004000005afb

[ 124.559681] Call Trace:

[ 124.559681] [<ffffffff80441070>] net_rx_action+0x170/0x2c0

[ 124.559681] [<ffffffff80036dbe>] __do_softirq+0xee/0x230

[ 124.559681] [<ffffffff80037085>] irq_exit+0x95/0xa0

[ 124.559681] [<ffffffff803933ed>] evtchn_do_upcall+0x2ad/0x2f0

[ 124.559681] [<ffffffff80534d5e>] do_hypervisor_callback+0x1e/0x30

[ 124.559681] [<ffffffff800033aa>] HYPERVISOR_sched_op_new+0xa/0x20

[ 124.559681] [<ffffffff8000e671>] xen_idle+0x41/0x110

[ 124.559681] [<ffffffff8000e7ef>] cpu_idle+0xaf/0x110

[ 124.559681] [<ffffffff80944b1f>] start_kernel+0x424/0x42f

[ 124.559681] Code: 44 21 ea 48 8d 54 d0 40 8b 87 d8 00 00 00 44 0f bf 42 06 44 0f b7 4a 02 48 8b 44 01 30 49 63 cc 48 83 c1 03 48 c1 e1 04 48 01 f1 <48> 89 01 44 89 49 08 44 89 41 0c 48 8b 08 80 e5 80 0f 85 54 09

[ 124.559681] RIP [<ffffffffa001a75c>] netif_poll+0x49c/0xe80 [xennet]

[ 124.559681] RSP <ffff88007b403d18>

[ 124.559681] CR2: ffff880078453000

[ 124.559681] ---[ end trace d5028239de5a7a42 ]---

[ 124.559681] Kernel panic - not syncing: Fatal exception in interrupt
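For reference, the faulting address in the trace above can be cross-checked against the register dump. Decoding the tail of the "Code:" bytes by hand (49 63 cc, 48 83 c1 03, 48 c1 e1 04, 48 01 f1, <48> 89 01) gives roughly: movslq %r12d,%rcx; add $3,%rcx; shl $4,%rcx; add %rsi,%rcx; mov %rax,(%rcx). That decoding is an assumption of this sketch, not disassembler output posted in the thread, and the function name below is hypothetical. The Python only redoes the arithmetic with the RSI, R12 and CR2 values printed in the dump.

```python
PAGE_SIZE = 4096  # assumption: 4 KiB pages

def faulting_store_address(rsi: int, r12: int) -> int:
    """Recompute rcx = rsi + ((r12 + 3) << 4), truncated to 64 bits.

    r12 is small and positive in these dumps, so the sign extension
    done by movslq does not change its value.
    """
    return (rsi + ((r12 + 3) << 4)) & 0xFFFFFFFFFFFFFFFF

# Values copied from the register dump above.
rsi = 0xffff880078452ec0
r12 = 0x11
cr2 = 0xffff880078453000

addr = faulting_store_address(rsi, r12)
print(hex(addr))              # address the store would target
print(addr == cr2)            # does it match the reported fault address?
print(addr % PAGE_SIZE == 0)  # does it sit on a page boundary?
```

If the decoding is right, the computed address equals RCX and CR2 and lies on the first byte of the page following the one RSI points into. Which data structure is being indexed cannot be told from the trace alone.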

 

--

Best regards,

Eugene Istomin

 


On Friday, May 17, 2013 03:52:31 PM Eugene Istomin wrote:

Jan,

 

After about 10 seconds of running (the speed is just 33 Mbit/s!), here is a new oops:

 

template:/home/local # iperf -s

------------------------------------------------------------

Server listening on TCP port 5001

TCP window size: 85.3 KByte (default)

------------------------------------------------------------

[ 4] local 10.251.2.201 port 5001 connected with 10.251.2.202 port 61806

[ ID] Interval Transfer Bandwidth

[ 4] 0.0- 5.4 sec 21.6 MBytes 33.5 Mbits/sec

[ 5] local 10.251.2.201 port 5001 connected with 10.251.2.202 port 61807

[ 5] 0.0-10.0 sec 39.5 MBytes 33.0 Mbits/sec

[ 4] local 10.251.2.201 port 5001 connected with 10.251.2.202 port 61808

[ 4] 0.0-10.0 sec 149 MBytes 124 Mbits/sec

[ 5] local 10.251.2.201 port 5001 connected with 10.251.2.202 port 61809

[ 118.230386] BUG: unable to handle kernel paging request at ffff8800791e4000

[ 118.230426] IP: [<ffffffffa001a75c>] netif_poll+0x49c/0xe80 [xennet]

[ 118.230501] PGD a83067 PUD a93067 PMD 7fc29067 PTE 80100000791e4065

[ 118.230534] Oops: 0003 [#1] SMP

[ 118.230552] Modules linked in: af_packet hwmon domctl crc32_pclmul joydev crc32c_intel ghash_clmulni_intel aesni_intel ablk_helper cryptd lrw aes_x86_64 xts gf128mul autofs4 scsi_dh_emc scsi_dh_alua scsi_dh_rdac scsi_dh_hp_sw scsi_dh xenblk cdrom xennet ata_generic ata_piix

[ 118.230690] CPU 0

[ 118.230700] Pid: 0, comm: swapper/0 Not tainted 3.9.2-4.756ee56-xen #1

[ 118.230729] RIP: e030:[<ffffffffa001a75c>] [<ffffffffa001a75c>] netif_poll+0x49c/0xe80 [xennet]

[ 118.230766] RSP: e02b:ffff88007b403d18 EFLAGS: 00010286

[ 118.230787] RAX: ffff88007da89500 RBX: ffff8800791e3ec0 RCX: ffff8800791e4000

[ 118.230814] RDX: ffff880078446150 RSI: ffff8800791e3ec0 RDI: ffff880078434880

[ 118.230841] RBP: ffff88007852db80 R08: 0000000000000ba8 R09: 0000000000000000

[ 118.230867] R10: 0000000000000000 R11: 0000000000000001 R12: 0000000000000011

[ 118.230895] R13: 0000000000025322 R14: ffff88007b403dd8 R15: ffff880078470800

[ 118.230925] FS: 00007ff5457b37c0(0000) GS:ffff88007b400000(0000) knlGS:0000000000000000

[ 118.230956] CS: e033 DS: 0000 ES: 0000 CR0: 000000008005003b

[ 118.230978] CR2: ffff8800791e4000 CR3: 0000000078c97000 CR4: 0000000000002660

[ 118.231629] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000

[ 118.232272] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400

[ 118.232906] Process swapper/0 (pid: 0, threadinfo ffffffff808b6000, task ffffffff808c1960)

[ 118.233554] Stack:

[ 118.234194] ffff88007b403d38 ffffffff80060845 ffff88007b403d68 ffff88007b403db8

[ 118.234373] ffff88007b403d98 0000001200025311 ffff880078471d50 0000000000000000

[ 118.234373] ffff880078470858 ffff88007b40e878 0000000100000012 0000004000025323

[ 118.234373] Call Trace:

[ 118.234373] [<ffffffff80441070>] net_rx_action+0x170/0x2c0

[ 118.234373] [<ffffffff80036dbe>] __do_softirq+0xee/0x230

[ 118.234373] [<ffffffff80037085>] irq_exit+0x95/0xa0

[ 118.234373] [<ffffffff803933ed>] evtchn_do_upcall+0x2ad/0x2f0

[ 118.234373] [<ffffffff80534d5e>] do_hypervisor_callback+0x1e/0x30

[ 118.234373] [<ffffffff800033aa>] HYPERVISOR_sched_op_new+0xa/0x20

[ 118.234373] [<ffffffff8000e671>] xen_idle+0x41/0x110

[ 118.234373] [<ffffffff8000e7ef>] cpu_idle+0xaf/0x110

[ 118.234373] [<ffffffff80944b1f>] start_kernel+0x424/0x42f
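As a reading aid for the "Oops: 0003" and "PTE" lines in the trace above, here is a minimal sketch that decodes the x86 page-fault error code and the low flag bits of the page-table entry. The bit meanings are the architectural x86-64 ones; the helper names are hypothetical and are not part of any kernel or Xen tool.

```python
def decode_pf_error(code: int) -> dict:
    """Decode the x86 page-fault error code printed after 'Oops:'."""
    return {
        "protection_violation": bool(code & 0x1),  # 0 would mean page not present
        "write": bool(code & 0x2),                 # 0 would mean a read access
        "user_mode": bool(code & 0x4),             # 0 means the fault came from kernel mode
        "reserved_bit": bool(code & 0x8),
        "instruction_fetch": bool(code & 0x10),
    }

def decode_pte(pte: int) -> dict:
    """Decode a few architectural flag bits of an x86-64 PTE."""
    return {
        "present": bool(pte & (1 << 0)),
        "writable": bool(pte & (1 << 1)),
        "user": bool(pte & (1 << 2)),
        "accessed": bool(pte & (1 << 5)),
        "dirty": bool(pte & (1 << 6)),
        "no_execute": bool(pte & (1 << 63)),
    }

# Values copied from the trace above.
err = decode_pf_error(0x0003)
pte = decode_pte(0x80100000791e4065)
print(err)
print(pte)
```

Read this way, error code 0003 describes a kernel-mode write to a page that is present, and the PTE shows that page mapped without the writable bit, so the two lines of the trace agree with each other.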

 

 

I am restarting the Dom0 server now and will try to capture the call trace for the first case.

 

--

Best regards,

Eugene Istomin

 

 


On Friday, May 17, 2013 01:36:38 PM Jan Beulich wrote:

> >>> On 17.05.13 at 14:30, Eugene Istomin <e.istomin@xxxxxxx> wrote:

> >> That's quite big a jump.

> >

> > Kernels from 3.5 to 3.8 had OCFS2 bugs; in 3.9 all OCFS2 patches are

> > merged.

> >

> >> Did you look at these before sending - there's no sign of an oops

> >> anywhere afaict.

> >

> > Yes, there is no oops in the logs. The oops shows up only in the xl console,

> > which looks strange to me.

>

> That likely depends on some configuration setting within the guest.

>

> > [ 221.826637] BUG: unable to handle kernel paging request at ffff880078b85000

> > [ 221.826674] IP: [<ffffffffa001a75c>] netif_poll+0x49c/0xe80 [xennet]

> > [ 221.826703] PGD a83067 PUD a93067 PMD 7fc2c067 PTE 8010000078b85065

> > [ 221.826732] Oops: 0003 [#1] SMP

> > [ 221.826748] Modules linked in: af_packet hwmon domctl crc32_pclmul crc32c_intel ghash_clmulni_intel joydev aesni_intel ablk_helper cryptd lrw aes_x86_64 xts gf128mul autofs4 scsi_dh_emc scsi_dh_alua scsi_dh_rdac scsi_dh_hp_sw scsi_dh xenblk cdrom xennet ata_generic ata_piix

> > [ 221.826875] CPU 0

> > [ 221.826885] Pid: 0, comm: swapper/0 Not tainted 3.9.2-4.756ee56-xen #1

> > [ 221.826911] RIP: e030:[<ffffffffa001a75c>] [<ffffffffa001a75c>] netif_poll+0x49c/0xe80 [xennet]

> > [ 221.826945] RSP: e02b:ffff88007b403d18 EFLAGS: 00010286

> > [ 221.826964] RAX: ffff88007da9b438 RBX: ffff880078b84ec0 RCX: ffff880078b85000

> > [ 221.826989] RDX: ffff880078bdf7c0 RSI: ffff880078b84ec0 RDI: ffff880078536580

> > [ 221.827014] RBP: ffff880078479a80 R08: 0000000000000248 R09: 0000000000000000

> > [ 221.827039] R10: 0000000000000000 R11: 0000000000000001 R12: 0000000000000011

> > [ 221.827064] R13: 00000000000030f0 R14: ffff88007b403dd8 R15: ffff880078be8800

>

> Sorry, but as said before, this is incomplete (stack dump, no call stack).

>

> Jan







_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxx
http://lists.xen.org/xen-devel

 

