Current-Users archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: cpu temperature readings



    Date:        Wed, 28 Jun 2023 15:08:17 +0900
    From:        Masanobu SAITOH <msaitoh%execsw.org@localhost>
    Message-ID:  <1b1763d8-f565-612c-9336-9fb71d496da5%execsw.org@localhost>

  | Please test the following diff:
  | 	https://www.netbsd.org/~msaitoh/coretemp-20230628-0.dif

Done that, doesn't seem to make any difference.   dmesg -t | grep Tjmax
is now:

coretemp0 at cpu0: thermal sensor, 1 C resolution, Tjmax=100
coretemp1 at cpu1: thermal sensor, 1 C resolution, Tjmax=100
coretemp2 at cpu2: thermal sensor, 1 C resolution, Tjmax=100
coretemp3 at cpu3: thermal sensor, 1 C resolution, Tjmax=100
coretemp4 at cpu4: thermal sensor, 1 C resolution, Tjmax=100
coretemp5 at cpu5: thermal sensor, 1 C resolution, Tjmax=100
coretemp6 at cpu6: thermal sensor, 1 C resolution, Tjmax=100
coretemp7 at cpu7: thermal sensor, 1 C resolution, Tjmax=100
coretemp8 at cpu8: thermal sensor, 1 C resolution, Tjmax=100
coretemp9 at cpu9: thermal sensor, 1 C resolution, Tjmax=100
coretemp10 at cpu10: thermal sensor, 1 C resolution, Tjmax=100
coretemp11 at cpu11: thermal sensor, 1 C resolution, Tjmax=100
coretemp12 at cpu12: thermal sensor, 1 C resolution, Tjmax=100
coretemp13 at cpu13: thermal sensor, 1 C resolution, Tjmax=100
coretemp14 at cpu14: thermal sensor, 1 C resolution, Tjmax=100
coretemp15 at cpu15: thermal sensor, 1 C resolution, Tjmax=100

I did a full power off, for long enough, for the BIOS (or cpu) to
decide that the "economy" cores (cpu8..15) should be resurrected.
That's unrelated to the patch (they were shown as missing in the BIOS
before the power off state, and were back again, in the BIOS, after it).

The envstat behaviour wrt changing the CPU target frequency seems to
be the same, at initial boot the coretemp values were in the same range
(a bit higher, but all that booting means some work was done) as the
BIOS was showing (system high 30's to about 40, BIOS mid 30's - but the
BIOS just shows "cpu temperature" not one for each core).   When the
cpu frequency target is changed to 3400, all the core temp values drop
to lower than room air temp (which according to my probably inaccurate
desk lamp, is currently 22.5, the coretemp values are all in the 15-18
range at the minute).

I wonder if perhaps something should cause sc_tjmax_fixed to be re-calculated
when the CPU frequency is changed, as it is now with the patch installed,
it looks like that is intended to be able to happen, except that once
sc_tjmax_fixed is set, it is never set again (and so, behaves just the
same as the old sc_tjmax).



I am still getting "not configured" (from both GENERIC and my kernel) for:

Intel 600 Series PCH-H I2C 0 (miscellaneous serial bus, revision 0x11) at pci0 dev 21 function 0 not configured
Intel 600 Series PCH-H SPI (FLASH) (miscellaneous serial bus, revision 0x11) at pci0 dev 31 function 5 not configured

I (temporarily) enabled a bunch of iic related config lines in my kernel
config, none of that made any difference at all (wrt what sensor devices
were found) - I did boot a generic as well (from today's sources, with
that patch applied) that behaved just the same as my custom kernel in all
material respects (there are quite a few dmesg diffs, but none of it looks
to be in any way related to anything - most is just because of drives,
SATA and USB attached) configuring themselves in different orders, and of
course the raidframe mod counters are different.

lspci says of those two "not configured" devices:

jacaranda# lspci -nn -vv -s 0:15.0
00:15.0 Serial bus controller [0c80]: Intel Corporation Alder Lake-S PCH I2C Controller #0 [8086:7acc] (rev 11)
        Subsystem: ASRock Incorporation Device [1849:7acc]
        Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Interrupt: pin A routed to IRQ 255
        Region 0: Memory at <unassigned> (64-bit, non-prefetchable) [disabled]
        Capabilities: [80] Power Management version 3
                Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
                Status: D3 NoSoftRst+ PME-Enable- DSel=0 DScale=0 PME-
        Capabilities: [90] Vendor Specific Information: Len=14 <?>

jacaranda# lspci -nn -vv -s 0:1f.5
00:1f.5 Serial bus controller [0c80]: Intel Corporation Alder Lake-S PCH SPI Controller [8086:7aa4] (rev 11)
        Subsystem: ASRock Incorporation Device [1849:7aa4]
        Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
        Status: Cap- 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 0
        Region 0: Memory at fe010000 (32-bit, non-prefetchable)

One more which might be relevant is:

Intel 600 Series PCH-H HECI 1 (miscellaneous communications, revision 0x11) at pci0 dev 22 function 0 not configured

jacaranda# lspci -nn -vv -s 0:16.0
00:16.0 Communication controller [0780]: Intel Corporation Alder Lake-S PCH HECI Controller #1 [8086:7ae8] (rev 11)
        Subsystem: ASRock Incorporation Device [1849:7ae8]
        Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 0
        Interrupt: pin A routed to IRQ 255
        Region 0: Memory at 405a11d000 (64-bit, non-prefetchable)
        Region 1: Memory at <unassigned> (32-bit, non-prefetchable)
        Capabilities: [50] Power Management version 3
                Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot+,D3cold-)
                Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=0 PME-
        Capabilities: [8c] MSI: Enable- Count=1/1 Maskable- 64bit+
                Address: 0000000000000000  Data: 0000
        Capabilities: [a4] Vendor Specific Information: Len=14 <?>

Apart from those:

AWAC (ACPI000E) at acpi0 not configured
PMAX (INT3533) at acpi0 not configured
acpiwmibus at acpiwmi0 not configured
acpiwmibus at acpiwmi1 not configured
PEPD (INT33A1) at acpi0 not configured
TPM (MSFT0101) at acpi0 not configured

plus the Gauss Newton Algorithm processor, WiFi, & Thunderbolt (NHCI USB),
and in GENERIC only, wsbell, are all "not configured" (my kernel includes
wsbell).   None of that seems relevant or important.

The motherboard is an AsRock Z690 Taichi.

kre



Home | Main Index | Thread Index | Old Index