[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-users] Xen 4.12.1/4.13.0-rc2 unable to load module coretemp



On Sun, Nov 17, 2019 at 7:18 AM Bobbi Sanchez <bobbisanchez@xxxxxx> wrote:
>
> Am 16.11.19 um 04:21 schrieb Rishi:
> > On Sat, Nov 16, 2019 at 4:29 AM Bobbi Sanchez <bobbisanchez@xxxxxx> wrote:
> >> Hi,
> >>
> >>
> >> i am using (Debian10) Kernel 5.3.7-amd64 self compiled with XEN-dom0 
> >> Options activated and also Xen from source Version 4.12.1 and 4.13.0-rc2 
> >> self compiled, everything seems to work find but i am not able to load the 
> >> coretemp.ko module, not possible under Xen4.12.1 and 4.13.0-rc2!
> >>
> >> If i dont load the xen.gz at boot, so that i only run Kernel 5.3.7 
> >> coretemp is no problem to load!
> >>
> >>
> >> Thats the output if i try to load the coretemp module:
> >>
> >> sudo modprobe -vvvvv coretemp
> >> modprobe: INFO: ../libkmod/libkmod.c:364 kmod_set_log_fn() custom logging 
> >> function 0x56006c91c150 registered
> >> modprobe: DEBUG: ../libkmod/libkmod-index.c:755 index_mm_open() 
> >> file=/lib/modules/5.3.7-4-xen0/modules.dep.bin
> >> modprobe: DEBUG: ../libkmod/libkmod-index.c:755 index_mm_open() 
> >> file=/lib/modules/5.3.7-4-xen0/modules.alias.bin
> >> modprobe: DEBUG: ../libkmod/libkmod-index.c:755 index_mm_open() 
> >> file=/lib/modules/5.3.7-4-xen0/modules.symbols.bin
> >> modprobe: DEBUG: ../libkmod/libkmod-index.c:755 index_mm_open() 
> >> file=/lib/modules/5.3.7-4-xen0/modules.builtin.bin
> >> modprobe: DEBUG: ../libkmod/libkmod-module.c:556 
> >> kmod_module_new_from_lookup() input alias=coretemp, normalized=coretemp
> >> modprobe: DEBUG: ../libkmod/libkmod-module.c:562 
> >> kmod_module_new_from_lookup() lookup modules.dep coretemp
> >> modprobe: DEBUG: ../libkmod/libkmod.c:574 kmod_search_moddep() use mmaped 
> >> index 'modules.dep' modname=coretemp
> >> modprobe: DEBUG: ../libkmod/libkmod.c:402 kmod_pool_get_module() get 
> >> module name='coretemp' found=(nil)
> >> modprobe: DEBUG: ../libkmod/libkmod.c:410 kmod_pool_add_module() add 
> >> 0x56006d416f20 key='coretemp'
> >> modprobe: DEBUG: ../libkmod/libkmod-module.c:202 
> >> kmod_module_parse_depline() 0 dependencies for coretemp
> >> modprobe: DEBUG: ../libkmod/libkmod-module.c:583 
> >> kmod_module_new_from_lookup() lookup coretemp=0, list=0x56006d4166d0
> >> modprobe: DEBUG: ../libkmod/libkmod.c:501 lookup_builtin_file() use mmaped 
> >> index 'modules.builtin' modname=coretemp
> >> modprobe: DEBUG: ../libkmod/libkmod-module.c:1750 
> >> kmod_module_get_initstate() could not open 
> >> '/sys/module/coretemp/initstate': No such file or directory
> >> modprobe: DEBUG: ../libkmod/libkmod-module.c:1760 
> >> kmod_module_get_initstate() could not open '/sys/module/coretemp': No such 
> >> file or directory
> >> modprobe: DEBUG: ../libkmod/libkmod-module.c:1393 
> >> kmod_module_get_options() modname=snd_pcsp mod->name=coretemp 
> >> mod->alias=(null)
> >> modprobe: DEBUG: ../libkmod/libkmod-module.c:1393 
> >> kmod_module_get_options() modname=snd_usb_audio mod->name=coretemp 
> >> mod->alias=(null)
> >> modprobe: DEBUG: ../libkmod/libkmod-module.c:1393 
> >> kmod_module_get_options() modname=cx88_alsa mod->name=coretemp 
> >> mod->alias=(null)
> >> modprobe: DEBUG: ../libkmod/libkmod-module.c:1393 
> >> kmod_module_get_options() modname=snd_atiixp_modem mod->name=coretemp 
> >> mod->alias=(null)
> >> modprobe: DEBUG: ../libkmod/libkmod-module.c:1393 
> >> kmod_module_get_options() modname=snd_intel8x0m mod->name=coretemp 
> >> mod->alias=(null)
> >> modprobe: DEBUG: ../libkmod/libkmod-module.c:1393 
> >> kmod_module_get_options() modname=snd_via82xx_modem mod->name=coretemp 
> >> mod->alias=(null)
> >> modprobe: DEBUG: ../libkmod/libkmod-module.c:1393 
> >> kmod_module_get_options() modname=loop mod->name=coretemp mod->alias=(null)
> >> modprobe: DEBUG: ../libkmod/libkmod-module.c:1393 
> >> kmod_module_get_options() modname=bonding mod->name=coretemp 
> >> mod->alias=(null)
> >> modprobe: DEBUG: ../libkmod/libkmod-module.c:1393 
> >> kmod_module_get_options() modname=dummy mod->name=coretemp 
> >> mod->alias=(null)
> >> modprobe: DEBUG: ../libkmod/libkmod-module.c:1750 
> >> kmod_module_get_initstate() could not open 
> >> '/sys/module/coretemp/initstate': No such file or directory
> >> modprobe: DEBUG: ../libkmod/libkmod-module.c:1760 
> >> kmod_module_get_initstate() could not open '/sys/module/coretemp': No such 
> >> file or directory
> >> modprobe: DEBUG: ../libkmod/libkmod-module.c:744 kmod_module_get_path() 
> >> name='coretemp' 
> >> path='/lib/modules/5.3.7-4-xen0/kernel/drivers/hwmon/coretemp.ko'
> >> modprobe: DEBUG: ../libkmod/libkmod-module.c:744 kmod_module_get_path() 
> >> name='coretemp' 
> >> path='/lib/modules/5.3.7-4-xen0/kernel/drivers/hwmon/coretemp.ko'
> >> insmod /lib/modules/5.3.7-4-xen0/kernel/drivers/hwmon/coretemp.ko
> >> modprobe: DEBUG: ../libkmod/libkmod-module.c:744 kmod_module_get_path() 
> >> name='coretemp' 
> >> path='/lib/modules/5.3.7-4-xen0/kernel/drivers/hwmon/coretemp.ko'
> >> modprobe: INFO: ../libkmod/libkmod-module.c:886 
> >> kmod_module_insert_module() Failed to insert module 
> >> '/lib/modules/5.3.7-4-xen0/kernel/drivers/hwmon/coretemp.ko': No such 
> >> device
> >> modprobe: ERROR: could not insert 'coretemp': No such device
> >> modprobe: DEBUG: ../libkmod/libkmod-module.c:468 kmod_module_unref() 
> >> kmod_module 0x56006d416f20 released
> >> modprobe: DEBUG: ../libkmod/libkmod.c:418 kmod_pool_del_module() del 
> >> 0x56006d416f20 key='coretemp'
> >> modprobe: INFO: ../libkmod/libkmod.c:331 kmod_unref() context 
> >> 0x56006d4164e0 released
> >>
> >>
> >> The Module is there:
> >>
> >> ls -hal /lib/modules/5.3.7-4-xen0/kernel/drivers/hwmon/ | grep coretemp
> >> -rw-r--r--  1 root root  21K Nov 15 00:55 coretemp.ko
> >>
> >>
> >> Thats the xl info:
> >>
> >> sudo xl info
> >> host                   : nanadag
> >> release                : 5.3.7-4-xen0
> >> version                : #0 SMP PREEMPT Fri Nov 15 00:55:59 CET 2019
> >> machine                : x86_64
> >> nr_cpus                : 4
> >> max_cpu_id             : 3
> >> nr_nodes               : 1
> >> cores_per_socket       : 4
> >> threads_per_core       : 1
> >> cpu_mhz                : 2397.616
> >> hw_caps                : 
> >> bfebfbff:0000e3bd:20100800:00000001:00000000:00000000:00000000:00000000
> >> virt_caps              : pv hvm shadow
> >> total_memory           : 7101
> >> free_memory            : 128
> >> sharing_freed_memory   : 0
> >> sharing_used_memory    : 0
> >> outstanding_claims     : 0
> >> free_cpus              : 0
> >> xen_major              : 4
> >> xen_minor              : 13
> >> xen_extra              : .0-rc
> >> xen_version            : 4.13.0-rc
> >> xen_caps               : xen-3.0-x86_64 xen-3.0-x86_32p hvm-3.0-x86_32 
> >> hvm-3.0-x86_32p hvm-3.0-x86_64
> >> xen_scheduler          : credit2
> >> xen_pagesize           : 4096
> >> platform_params        : virt_start=0xffff800000000000
> >> xen_changeset          :
> >> xen_commandline        : placeholder
> >> cc_compiler            : gcc (Debian 8.3.0-6) 8.3.0
> >> cc_compile_by          : benutzer
> >> cc_compile_domain      : dag
> >> cc_compile_date        : Fri Nov 15 22:15:27 CET 2019
> >> build_id               : ded65a38fabc4057e67d85d1bf6bf4165e88b6ca
> >> xend_config_format     : 4
> >>
> >>
> >> uname -a
> >> Linux nanadag 5.3.7-4-xen0 #0 SMP PREEMPT Fri Nov 15 00:55:59 CET 2019 
> >> x86_64 GNU/Linux
> >>
> >> and here from cat /proc/cpuinfo
> >>
> >> processor       : 0
> >> vendor_id       : GenuineIntel
> >> cpu family      : 6
> >> model           : 15
> >> model name      : Intel(R) Core(TM)2 Quad CPU    Q6600  @ 2.40GHz
> >> stepping        : 11
> >> cpu MHz         : 2397.616
> >> cache size      : 4096 KB
> >> physical id     : 0
> >> siblings        : 4
> >> core id         : 0
> >> cpu cores       : 4
> >> apicid          : 0
> >> initial apicid  : 0
> >> fpu             : yes
> >> fpu_exception   : yes
> >> cpuid level     : 10
> >> wp              : yes
> >> flags           : fpu de tsc msr pae mce cx8 apic sep mca cmov pat clflush 
> >> acpi mmx fxsr sse sse2 ht syscall nx lm constant_tsc rep_good nopl cpuid 
> >> pni monitor est ssse3 cx16 hypervisor lahf_lm
> >> bugs            : null_seg cpu_meltdown spectre_v1 spectre_v2 
> >> spec_store_bypass l1tf mds swapgs
> >> bogomips        : 4795.23
> >> clflush size    : 64
> >> cache_alignment : 64
> >> address sizes   : 36 bits physical, 48 bits virtual
> >> power management:
> >> --- snip ---
> >>
> >>
> >> What can i do to ge the coretemp module loaded ?
> >>
> >> Best Regards
> >>
> >>
> >> _______________________________________________
> >> Xen-users mailing list
> >> Xen-users@xxxxxxxxxxxxxxxxxxxx
> >> https://lists.xenproject.org/mailman/listinfo/xen-users
> > This change of behavior was introduced with
> > https://xenbits.xen.org/gitweb/?p=xen.git;a=commitdiff;h=72e038450d3d5de1a39f0cfa2d2b0f9b3d43c6c6
> > It was intended to hide Thermal info from PV guests but seems it got
> > hidden from Dom0 as well.
> >
> > I was having same problem and there are 2 ways through which it could
> > be solved. Modify Xen to not hide EAX or modify linux Dom0 kernel.
> > To modify dom0 kernel, you'd have to edit
> > https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git/tree/arch/x86/kernel/cpu/common.c?h=v5.3.7#n913
> > -               c->x86_capability[CPUID_6_EAX] = cpuid_eax(0x00000006);
> > +               c->x86_capability[CPUID_6_EAX] = 
> > native_cpuid_eax(0x00000006);
> >
> > _______________________________________________
> > Xen-users mailing list
> > Xen-users@xxxxxxxxxxxxxxxxxxxx
> > https://lists.xenproject.org/mailman/listinfo/xen-users
>
> Thanks for your information,
>
> i tried to change common.c inside of the kernel 5.3.7 sources but now dmesg 
> prints this while booting the system with xen 4.13.0-rc2:
>
> [   13.090126] coretemp coretemp.0: Using relative temperature scale!
> [   13.090208] coretemp coretemp.0: Using relative temperature scale!
> [   13.090219] sysfs: cannot create duplicate filename 
> '/devices/platform/coretemp.0/hwmon/hwmon0/temp2_label'
> [   13.090221] CPU: 1 PID: 14 Comm: cpuhp/1 Not tainted 5.3.7-3-xen0-amd64 #0
> [   13.090223] Hardware name:  /DP965LT, BIOS 
> MQ96510J.86A.1761.2009.0326.0001 03/26/2009
> [   13.090224] Call Trace:
> [   13.090234]  dump_stack+0x5c/0x80
> [   13.090237]  sysfs_warn_dup.cold.5+0x17/0x23
> [   13.090240]  sysfs_add_file_mode_ns+0x144/0x170
> [   13.090242]  internal_create_group+0x117/0x380
> [   13.090248]  ? coretemp_cpu_offline+0x1d0/0x1d0 [coretemp]
> [   13.090250]  create_core_data+0x34d/0x550 [coretemp]
> [   13.090253]  ? create_core_data+0x550/0x550 [coretemp]
> [   13.090256]  coretemp_cpu_online+0x114/0x16f [coretemp]
> [   13.090259]  ? finish_task_switch+0x77/0x230
> [   13.090261]  cpuhp_invoke_callback+0x94/0x540
> [   13.090264]  ? __schedule+0x2be/0x630
> [   13.090266]  ? sort_range+0x20/0x20
> [   13.090267]  cpuhp_thread_fun+0xb0/0x110
> [   13.090269]  smpboot_thread_fn+0xc5/0x160
> [   13.090271]  kthread+0x113/0x130
> [   13.090273]  ? __kthread_parkme+0x70/0x70
> [   13.090275]  ret_from_fork+0x35/0x40
> [   13.090283] coretemp coretemp.0: Adding Core 1 failed
> [   13.100454] coretemp coretemp.0: Using relative temperature scale!
> [   13.100501] coretemp coretemp.0: Using relative temperature scale!
> [   13.100510] sysfs: cannot create duplicate filename 
> '/devices/platform/coretemp.0/hwmon/hwmon0/temp2_label'
> [   13.100513] CPU: 3 PID: 24 Comm: cpuhp/3 Not tainted 5.3.7-3-xen0-amd64 #0
> [   13.100514] Hardware name:  /DP965LT, BIOS 
> MQ96510J.86A.1761.2009.0326.0001 03/26/2009
> [   13.100515] Call Trace:
> [   13.100524]  dump_stack+0x5c/0x80
> [   13.100527]  sysfs_warn_dup.cold.5+0x17/0x23
> [   13.100531]  sysfs_add_file_mode_ns+0x144/0x170
> [   13.100532]  internal_create_group+0x117/0x380
> [   13.100536]  ? coretemp_cpu_offline+0x1d0/0x1d0 [coretemp]
> [   13.100539]  create_core_data+0x34d/0x550 [coretemp]
> [   13.100542]  ? create_core_data+0x550/0x550 [coretemp]
> [   13.100544]  coretemp_cpu_online+0x114/0x16f [coretemp]
> [   13.100547]  ? finish_task_switch+0x77/0x230
> [   13.100549]  cpuhp_invoke_callback+0x94/0x540
> [   13.100552]  ? __schedule+0x2be/0x630
> [   13.100554]  ? sort_range+0x20/0x20
> [   13.100555]  cpuhp_thread_fun+0xb0/0x110
> [   13.100557]  smpboot_thread_fn+0xc5/0x160
> [   13.100559]  kthread+0x113/0x130
> [   13.100561]  ? __kthread_parkme+0x70/0x70
> [   13.100563]  ret_from_fork+0x35/0x40
> [   13.100571] coretemp coretemp.0: Adding Core 3 failed
>
>
> lsmod shows that coretemp is loaded
>
> $ lsmod | grep coretemp
> coretemp               20480  0
>
> but lmsensors doenst show temperature, even if i try sudo sensors-detect
>
> $ sensors
> No sensors found!
> Make sure you loaded all the kernel drivers you need.
> Try sensors-detect to find out which these are.
>
> if i try to
>
> $ sudo rmmod coretemp
>
> the input hangs and cannot be breaked/canceled, even not CTRL+C or CTRL+Z, or 
> CTRL+D doesnt work, and then on another console dmesg prints this:
>
> [  425.193352] BUG: kernel NULL pointer dereference, address: 0000000000000188
> [  425.207238] #PF: supervisor read access in kernel mode
> [  425.217475] #PF: error_code(0x0000) - not-present page
> [  425.227713] PGD 0 P4D 0
> [  425.232759] Oops: 0000 [#1] SMP NOPTI
> [  425.240056] CPU: 0 PID: 13 Comm: cpuhp/0 Not tainted 5.3.7-3-xen0-amd64 #0
> [  425.253757] Hardware name:  /DP965LT, BIOS 
> MQ96510J.86A.1761.2009.0326.0001 03/26/2009
> [  425.269541] RIP: e030:sysfs_remove_group+0xc/0x80
> [  425.278919] Code: 66 66 90 48 89 f2 be 01 00 00 00 e9 5e fc ff ff 66 66 2e 
> 0f 1f 84 00 00 00 00 00 0f 1f 00 66 66 66 66 90 41 54 55 48 89 f5 53 <48> 8b 
> 36 48 8b 5f 30 48 85 f6 74 40 31 d2 49 89 fc 48 89 df e8 1b
> [  425.316343] RSP: e02b:ffffc9004009fe08 EFLAGS: 00010216
> [  425.326752] RAX: 0000000000000000 RBX: ffff8881b6aa6828 RCX: 
> 0000000000000000
> [  425.340982] RDX: 0000000000000007 RSI: 0000000000000188 RDI: 
> ffff8881b6018008
> [  425.355201] RBP: 0000000000000188 R08: 0000000000000000 R09: 
> ffff8881b9a2aa00
> [  425.369422] R10: 0000000000007ff0 R11: 0000000000000400 R12: 
> 0000000000000000
> [  425.383641] R13: ffff8881b6aa6818 R14: 0000000000010340 R15: 
> ffff8881b6aa6828
> [  425.397873] FS:  00007f5c585e8940(0000) GS:ffff8881be800000(0000) 
> knlGS:0000000000000000
> [  425.414000] CS:  e030 DS: 0000 ES: 0000 CR0: 0000000080050033
> [  425.425450] CR2: 0000000000000188 CR3: 00000001b58d4000 CR4: 
> 0000000000000660
> [  425.439675] Call Trace:
> [  425.444552]  coretemp_cpu_offline+0x120/0x1d0 [coretemp]
> [  425.455140]  ? coretemp_remove+0x60/0x60 [coretemp]
> [  425.464862]  cpuhp_invoke_callback+0x94/0x540
> [  425.473543]  ? __schedule+0x2be/0x630
> [  425.480840]  ? sort_range+0x20/0x20
> [  425.487789]  cpuhp_thread_fun+0xb0/0x110
> [  425.495604]  smpboot_thread_fn+0xc5/0x160
> [  425.503592]  kthread+0x113/0x130
> [  425.510021]  ? __kthread_parkme+0x70/0x70
> [  425.518009]  ret_from_fork+0x35/0x40
> [  425.525131] Modules linked in: bridge stp llc mei_me iTCO_wdt sg mei 
> serio_raw iTCO_vendor_support pcspkr evdev button xen_acpi_processor 
> xen_netback xen_blkback xen_gntalloc coretemp(-) ip_tables x_tables autofs4 
> ext4 crc32c_generic crc16 mbcache jbd2 sd_mod ata_generic pata_acpi i2c_i801 
> ahci lpc_ich libahci skge uhci_hcd ehci_pci pata_marvell ehci_hcd libata 
> usbcore e1000e
> [  425.591494] CR2: 0000000000000188
> [  425.598096] ---[ end trace e7bf697ded5c8bcf ]---
> [  425.607296] RIP: e030:sysfs_remove_group+0xc/0x80
> [  425.616668] Code: 66 66 90 48 89 f2 be 01 00 00 00 e9 5e fc ff ff 66 66 2e 
> 0f 1f 84 00 00 00 00 00 0f 1f 00 66 66 66 66 90 41 54 55 48 89 f5 53 <48> 8b 
> 36 48 8b 5f 30 48 85 f6 74 40 31 d2 49 89 fc 48 89 df e8 1b
> [  425.654094] RSP: e02b:ffffc9004009fe08 EFLAGS: 00010216
> [  425.664507] RAX: 0000000000000000 RBX: ffff8881b6aa6828 RCX: 
> 0000000000000000
> [  425.678729] RDX: 0000000000000007 RSI: 0000000000000188 RDI: 
> ffff8881b6018008
> [  425.692951] RBP: 0000000000000188 R08: 0000000000000000 R09: 
> ffff8881b9a2aa00
> [  425.707184] R10: 0000000000007ff0 R11: 0000000000000400 R12: 
> 0000000000000000
> [  425.721404] R13: ffff8881b6aa6818 R14: 0000000000010340 R15: 
> ffff8881b6aa6828
> [  425.735627] FS:  00007f5c585e8940(0000) GS:ffff8881be800000(0000) 
> knlGS:0000000000000000
> [  425.751754] CS:  e030 DS: 0000 ES: 0000 CR0: 0000000080050033
> [  425.763204] CR2: 0000000000000188 CR3: 00000001b58d4000 CR4: 
> 0000000000000660
>
>
> and lsmod shows now:
>
> $ lsmod | grep coretemp
> coretemp               20480  -1
>
> Well lets see may the other way what you talked about the change the xen 
> source may work... lets see. I will try this way now
>
> Best Regards
>
>
> _______________________________________________
> Xen-users mailing list
> Xen-users@xxxxxxxxxxxxxxxxxxxx
> https://lists.xenproject.org/mailman/listinfo/xen-users

Sure, modifying from Xen should create more stable effect.
Before that, can you share # cpuid -r -1 ?
We need to look for 0x00000006 0x00: eax=0x00000077

_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxxx
https://lists.xenproject.org/mailman/listinfo/xen-users

 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.