Port-sparc64 archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: envstat drivers on SUN Ultra-45?



On Wed, Feb 11, 2026 at 03:52:20PM +0100, Julian Coleman wrote:
> Hi,
> 
> After looking at the V240 OFW patches, I had a chance to get back to the
> U45.  I currently have patches to fix up the OFW errors and drivers for
> the monitoring.  The ADT7462 driver needs the voltage logic and limits
> added, but it's enough for me to see:
> 
>   zeus# envstat -c etc/envsys.conf
>   zeus# envstat
>                       Current  CritMax  WarnMax  WarnMin  CritMin  Unit
>   [adt7462sm0]
>           cpu0-fan:      6108                                       RPM
>           cpu1-fan:     11020                                       RPM
>            pci-fan:      3229                                       RPM
>        system-fan3:      3216                                       RPM
>        system-fan4:      3187                                       RPM
>     adt7462-sensor:    47.000                                      degC
>        cpu0-sensor:    55.250                                      degC
>        cpu1-sensor:    75.500                                      degC
>          mb-sensor:    45.750                                      degC
>   [lm75a0]
>         psu-sensor:    34.125   80.000                             degC
>   [lm95221ts0]
>     lm95221-sensor:    34.000                                      degC
>        fire-sensor:    54.375                                      degC
>     lsi1064-sensor:    62.000                                      degC
> 
> If you'd like to try it out, I've put the patch and other files at:
> 
>   https://ftp.netbsd.org/pub/NetBSD/misc/jdc/sparc64/

Thanks, grabbed & tested.
 
> "u45.diff" is the patch file, "netbsd" and "netbsd.gdb" is the kernel with
> these and "u45-envsys.conf" is the sensor names for /etc/envsys.conf.
> "u45.dmesg" is the full dmesg, but relevant lines are:

Just grabbed kernel image and envsys config, applied and rebooted:

 ==================================================================
galadriel# envstat 
                    Current  CritMax  WarnMax  WarnMin  CritMin  Unit
[adt7462sm0]
        cpu0-fan:     11416                                       RPM
        cpu1-fan:     11612                                       RPM
         pci-fan:         0                                       RPM
     system-fan3:         0                                       RPM
     system-fan4:         0                                       RPM
  adt7462-sensor:    60.750                                      degC
     cpu0-sensor:    64.500                                      degC
     cpu1-sensor:    60.250                                      degC
       mb-sensor:    52.500                                      degC
[lm75a0]
      psu-sensor:    39.250   80.000                             degC
[lm95221ts0]
  lm95221-sensor:    54.000                                      degC
     fire-sensor:    64.750                                      degC
  lsi1064-sensor:    83.125                                      degC
 ==================================================================

And yes, the 3x fan group (one FRU, two fans pointed at CPUs and memory
banks, one at the PCI(e,-X) cards) not running is something I suspected
(system thermal issues), but it's nice to be able to check for it, now
I can figure out how to fix it. I think I misrouted at least one cable
when I replaced the mainboard, thus causing mechanical alignment issues
for the fan group connector.

>   [     1.000000] bq4802rtc0 at ebus0 addr 100000-10000f: real time clock
>   [     1.000000] seeprom2 at iic0 addr 0x57: psu-fru-prom: size 4096
>   [     1.000000] adt7462sm0 at iic0 addr 0x58: ADT7462 system monitor: rev. 0x4
>   [     1.000000] adt7462sm0: 5 fans, 4 temperatures, 0 voltages
>   [     1.000000] lm95221ts0 at iic0 addr 0x2b: LM95221 temperature sensor
>   [     1.000000] lm75a0 at iic0 addr 0x4f: LM75A temperature sensor
> 
> I also patched the psu-fru-prom, but I don't see anything useful if I try:
> 
>   dd if=/dev/seeprom2 bs=1 skip=0 count=4096 | hexdump -C
> 
> whereas I can see the part number and location from one of the others:
> 
>   dd if=/dev/seeprom0 bs=1 skip=7168 count=1024 | hexdump -C
> 
> I'd be interested to see what numbers you get for cpu1 temperature and fan
> speed, because I wonder if my cpu1 needs new thermal paste.

One other reason for the high thermal readouts besides the non-running
fan group in my machine is that ambient temperature is ~ 30 degC, which
is probably close to the upper end of the machine design envelope ;-)

Thank you very much for your work improving the sensor support for the
Ultra-45, I very much appreciate it.

Kind regards,
            Alex.
-- 
"Opportunity is missed by most people because it is dressed in overalls and
 looks like work."                                      -- Thomas A. Edison


Home | Main Index | Thread Index | Old Index