Port-macppc archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Spontaneous panic on macppc NetBSD 9.99.100 built 9/21.



Hi,

From the first few hours of updating NetBSD 9.3 to 9.99.100 built 9/21.

Machine is a 15" PowerBook5,4 g4 1.5GHz aluminium

uname -a
NetBSD macppc.labs.local 9.99.100 NetBSD 9.99.100 (GENERIC) #0: Wed
Sep 21 01:33:53 UTC 2022
mkrepro%mkrepro.NetBSD.org@localhost:/usr/src/sys/arch/macppc/compile/GENERIC
macppc

For 9.2 and 9.3 I used to get a fairly reliable system hang (requiring
hard power off) while decompressing large tarballs locally on disk,
including 9.2 to 9.3 sysupgrade run. It happened randomly, but I got
through an upgrade.

For 9.99 this has not happened after several tarballs, including twice
sysupgrade to 9.99, however

I have experienced the following while installing packages through pkgin.
To be clear, I've installed a bunch of pkgin successfully. Then during
one of them, the machine spontaneously rebooted.

The system rebooted fine,  and the partitions mounted fine, but
unknown to me there was filesystem corruption lurking in the /usr/pkg
directory, so when I went to reinstall the packages, I got another
reboot.

I didn't capture the first dmesg, but I did get the filesystem panic
shown below (#1b) .

find -ls produced a file list, aborting at the corrupted directory.

/usr/pkg/pkgdb/qt5-qtbase-5.15.2nb8
/usr/pkg/pkgdb/qt5-qtbase-5.15.2nb8/B<binary junk.....>

The subsequent fsck relinked the contents into lost+found.  A further
run of pkgin to try installing the package again worked fine.

dmesg reboot #1b

[  3905.295318] panic: /usr: bad dir ino 649970 at offset 42568: null entry

[  3905.295318] cpu0: Begin traceback...
[  3905.295318] 0x10305bc0: at vpanic+0x158
[  3905.305321] 0x10305bf0: at panic+0x50
[  3905.305321] 0x10305c30: at ufs_lookup+0x418
[  3905.305321] 0x10305ce0: at VOP_LOOKUP+0x4c
[  3905.305321] 0x10305d00: at lookup_once+0x238
[  3905.305321] 0x10305d50: at namei_tryemulroot.constprop.0+0x4a8
[  3905.305321] 0x10305e20: at namei+0x58
[  3905.305321] 0x10305e50: at do_sys_unlinkat.isra.0+0x168
[  3905.305321] 0xM-2M^^M^MM^Fe=^[^M: at syscall+0x294
[  3905.305321] 0x10305f20: user SC trap #10 by 0xfd8c01c0: srr1=0xd032
[  3905.305321]             r1=0xffffe360 cr=0x22004442 xer=0 ctr=0xfd8c01b8
[  3905.305321] cpu0: End traceback...
[  3905.305321] dumpsys: TBD
[  3905.305321] rebooting

The subsequent fsck relinked the contents into lost+found.  A further
run of pkgin to try installing the package again worked fine.

I was then installing another package, and it did it again (#2),

dmesg reboot #2

[  2573.658974] panic: pmap_pvo_find_va: returning NULL for kernel
pmap, va: 0x4911000

[  2573.658974] cpu0: Begin traceback...
[  2573.658974] 0x10003ba0: at vpanic+0x158
[  2573.658974] 0x10003bd0: at panic+0x50
[  2573.658974] 0x10003c10: at pmap_pvo_find_va+0xe4
[  2573.658974] 0x10003c20: at pmap_extract+0xc8
[  2573.658974] 0x10003c40: at kauai_dma_init+0x1dc
[  2573.658974] 0x10003ca0: at wdc_ata_bio_start+0x6e4
[  2573.658974] 0x10003d00: at ata_xfer_start+0x34
[  2573.658974] 0x10003d20: at atastart+0x188
[  2573.658974] 0x10003d50: at __wdccommand_intr+0x20c
[  2573.658974] 0x10003d90: at wdcintr+0xcc
[  2573.658974] 0x10003db0: at intr_deliver.constprop.0+0x94
[  2573.658974] 0x10003dd0: at pic_handle_intr+0xe8
[  2573.658974] 0x10003e10: at trapstart+0x6b0
[  2573.658974] 0x10003ee0: at idle_loop+0xac
[  2573.658974] 0x10003f20: at cpu_lwp_bootstrap+0xc
[  2573.658974] saved LR(0xa5655aa1) is invalid.cpu0: End traceback...
[  2573.658974] dumpsys: TBD
[  2573.658974] rebooting


The take aways are,

The kernel panic above on an otherwise clean filesystem
The system booting up without detecting filesystem corruption.

The recovery for both of these was to boot into single user and
manually force a fsck,

Boot into single mode,
ofw> boot hd:2,ofwboot.xcf hd:0,/netbsd -s

Manually force fsck, since the clean journal check already passed,
(/) fsck -fy /dev/wd0a
(/usr) fsck -fy /dev/wd0f
(/var) fsck -fy /dev/wd0e

exit and boot up normally.

[     1.000000] NetBSD 9.99.100 (GENERIC) #0: Wed Sep 21 01:33:53 UTC 2022
[     1.000000]
mkrepro%mkrepro.NetBSD.org@localhost:/usr/src/sys/arch/macppc/compile/GENERIC
[     1.000000] total memory = 1024 MB
[     1.000000] oea_startup: failed to allocate DEAD ZONE: error=12
[     1.000000] avail memory = 977 MB
[     1.000000] timecounter: Timecounters tick every 10.000 msec
[     1.000000] Kernelized RAIDframe activated
[     1.000000] found openpic PIC at 80040000
[     1.000000] OpenPIC Version 1.2: Supports 4 CPUs and 64 interrupt sources.
[     1.000000] bootpath: /pci@f4000000/ata-6@d/disk@0:0,/netbsd
[     1.000000] mainbus0 (root)
[     1.000000] cpu0 at mainbus0: 7447A (Revision 1.1), ID 0 (primary)
[     1.000000] cpu0: HID0
0x84d0c1bc<EMCP,TBEN,HIGH_BAT_EN,NAP,DPM,ICE,DCE,XBSEN,SGE,BTIC,LRSTK,FOLD,BHT>,
powersave: 1
[     1.000000] cpu0: 1504.64 MHz, 512KB L2 cache no parity
[     1.000000] memory0 at mainbus0: len=256
[     1.000000] spdmem0 at memory0
[     1.000000] spdmem0: DDR SDRAM, no parity or ECC, 1GB, 333MHz (PC-2700)
[     1.000000] spdmem0: 13 rows, 11 cols, 2 ranks, 4 banks/chip,
6.0ns cycle time
[     1.000000] spdmem0: tAA-tRCD-tRP-tRAS: 1-3-3-7
[     1.000000] spdmem0: voltage SSTL 2.5V, refresh time 7.8us (self-refreshing)
[     1.000000] spdmem1 at memory0
[     1.000000] spdmem1: DDR SDRAM, no parity or ECC, 1GB, 333MHz (PC-2700)
[     1.000000] spdmem1: 13 rows, 11 cols, 2 ranks, 4 banks/chip,
6.0ns cycle time
[     1.000000] spdmem1: tAA-tRCD-tRP-tRAS: 1-3-3-7
[     1.000000] spdmem1: voltage SSTL 2.5V, refresh time 7.8us (self-refreshing)
[     1.000000] uni_n0 at mainbus0 address 0xf8000000
[     1.000000] hw-clock at uni_n0 not configured
[     1.000000] ki2c0 at uni_n0 address 0xf8001000
[     1.000000] iic0 at ki2c0: I2C bus
[     1.000000] dbcool0 at iic0 addr 0xae
[     1.000000] dbcool0: ADT7463 dBCool(tm) Controller (rev 0x06, stepping 0x02)
[     1.000000] lmu0 at iic0 addr 0x42: ambient light sensor
[     1.000000] cereal (cereal) at iic0 addr 0xe0 not configured
[     1.000000] uninorth0 at mainbus0
[     1.000000] pci0 at uninorth0 bus 0
[     1.000000] pci0: i/o space, memory space enabled
[     1.000000] pchb0 at pci0 dev 11 function 0
[     1.000000] pchb0: Apple Computer UniNorth AGP Bridge (rev. 0x00)
[     1.000000] radeonfb0 at pci0 dev 16 function 0: ATI Technologies
Radeon Mobility 9600/9700 (M10/11) NP (rev. 0x00)
[     1.000000] trying to read disabled BIOS...
[     1.000000] radeonfb0: No video BIOS, using default clocks
[     1.000000] radeonfb0: refclk = 27.000 MHz, refdiv = 12 minpll =
125000, maxpll = 400000
[     1.000000] radeonfb0: 64 MB aperture at 0xb8000000, 64 KB
registers at 0xb0000000
[     1.000000] radeonfb0: display 0: initial virtual resolution
1280x854 at 8 bpp
[     1.000000] radeonfb0: using 32 MB per display
[     1.000000] radeonfb0: port 0: physical 1280x854 60Hz
[     1.000000] radeonfb0: port 1: physical 1280x854 60Hz
[     1.000000] wsdisplay0 at radeonfb0 kbdmux 1: console (fb, vt100 emulation)
[     1.000000] wsmux1: connecting to wsdisplay0
[     1.000000] drm at radeonfb0 not configured
[     1.000000] uninorth1 at mainbus0
[     1.000000] pci1 at uninorth1 bus 0
[     1.000000] pci1: i/o space, memory space enabled
[     1.000000] pchb1 at pci1 dev 11 function 0
[     1.000000] pchb1: Apple Computer UniNorth Host-PCI Bridge (rev. 0x00)
[     1.000000] bwi0 at pci1 dev 18 function 0: Broadcom Wireless
[     1.000000] bwi0: interrupting at irq 52
[     1.000000] bwi0: BBP id 0x4306, BBP rev 0x3, BBP pkg 0
[     1.000000] bwi0: MAC: rev 5
[     1.000000] bwi0: PHY type 2, rev 2, ver 2
[     1.000000] bwi0: RF manu 0x17f, type 0x2050, rev 2
[     1.000000] bwi0: autoconfiguration error: invalid antenna gain in sprom
[     1.000000] cbb0 at pci1 dev 19 function 0: Texas Instruments
PCI1510 PCI-CardBus Bridge (rev. 0x00)
[     1.000000] obio0 at pci1 dev 23 function 0: addr 0x80000000
[     1.000000] obio0: enabling Intrepid CPU speed control
[     1.000000] hiclock: 1500
[     1.000000] speed: 1504642
[     1.000000] zsc0 at obio0 irq 22,23
[     1.000000] zstty0 at zsc0 channel 0
[     1.000000] zstty1 at zsc0 channel 1
[     1.000000] snapper0 at obio0 offset 0x10000: irq 30,1,2
[     1.000000] pmu0 at obio0 offset 0x16000 irq 47:  rev. 12
[     1.000000] pmu0: initializing IIC bus
[     1.000000] iic1 at pmu0: I2C bus
[     1.000000] pmu0: initializing RTC
[     1.000000] pmu0: initializing ADB
[     1.000000] nadb0 at pmu0: Apple Desktop Bus
[     1.000000] pmu0: power-mgt not configured
[     1.000000] smartbat0 at pmu0 addr 0: smart battery
[     1.000000]  flags: 5 AC present
[     1.000000] ki2c1 at obio0
[     1.000000] iic2 at ki2c1: I2C bus
[     1.000000] cereal (cereal) at iic2 addr 0xe0 not configured
[     1.000000] deq0 at iic2 addr 0x35 Audio Codec (tas3004)
[     1.000000] wdc0 at obio0 offset 0x20000 irq 24, level triggered:
DMA transfer
[     1.000000] atabus0 at wdc0 channel 0
[     1.000000] model PowerBook5,4
[     1.000000] snapper0: codec: TAS3004
[     1.000000] audio0 at snapper0: playback, capture, full duplex
[     1.000000] audio0: slinear_be:16 -> slinear_be:16 2ch 48000Hz,
blk 12288 bytes (64ms) for playback
[     1.000000] audio0: slinear_be:16 <- slinear_be:16 2ch 48000Hz,
blk 12288 bytes (64ms) for recording
[     1.000000] spkr0 at audio0: PC Speaker (synthesized)
[     1.000000] wsbell at spkr0 not configured
[     1.000000] ohci0 at pci1 dev 24 function 0: Apple Computer
Intrepid USB Controller (rev. 0x00)
[     1.000000] ohci0: autoconfiguration error: memory access is disabled
[     1.000000] ohci1 at pci1 dev 25 function 0: Apple Computer
Intrepid USB Controller (rev. 0x00)
[     1.000000] ohci1: autoconfiguration error: memory access is disabled
[     1.000000] ohci2 at pci1 dev 26 function 0: Apple Computer
Intrepid USB Controller (rev. 0x00)
[     1.000000] ohci2: interrupting at irq 29
[     1.000000] ohci2: OHCI version 1.0, legacy support
[     1.000000] usb0 at ohci2: USB revision 1.0
[     1.000000] ohci3 at pci1 dev 27 function 0: NEC USB Host
Controller (rev. 0x43)
[     1.000000] ohci3: interrupting at irq 63
[     1.000000] ohci3: OHCI version 1.0
[     1.000000] usb1 at ohci3: USB revision 1.0
[     1.000000] ohci4 at pci1 dev 27 function 1: NEC USB Host
Controller (rev. 0x43)
[     1.000000] ohci4: interrupting at irq 63
[     1.000000] ohci4: OHCI version 1.0
[     1.000000] usb2 at ohci4: USB revision 1.0
[     1.000000] ehci0 at pci1 dev 27 function 2: NEC USB2 Host
Controller (rev. 0x04)
[     1.000000] ehci0: 32-bit DMA
[     1.000000] ehci0: interrupting at irq 63
[     1.000000] ehci0: EHCI version 1.0
[     1.000000] ehci0: 2 companion controllers, 3 ports each: ohci3 ohci4
[     1.000000] usb3 at ehci0: USB revision 2.0
[     1.000000] cbb0: cacheline 0x8 lattimer 0x10
[     1.000000] cbb0: bhlc 0x21008
[     1.000000] cbb0: interrupting at irq 53
[     1.000000] cardslot0 at cbb0
[     1.000000] cardbus0 at cardslot0: bus 1
[     1.000000] pcmcia0 at cardslot0
[     1.000000] uninorth2 at mainbus0
[     1.000000] pci2 at uninorth2 bus 0
[     1.000000] pci2: i/o space, memory space enabled
[     1.000000] pchb2 at pci2 dev 11 function 0
[     1.000000] pchb2: Apple Computer UniNorth Host-PCI Bridge (rev. 0x00)
[     1.000000] wdc1 at pci2 dev 13 function 0: interrupting at irq 39
[     1.000000] atabus1 at wdc1 channel 0
[     1.000000] fwohci0 at pci2 dev 14 function 0: Apple Computer
UniNorth Firewire (rev. 0x81)
[     1.000000] fwohci0: interrupting at irq 40
[     1.000000] fwohci0: OHCI version 1.10 (ROM=0)
[     1.000000] fwohci0: No. of Isochronous channels is 8.
[     1.000000] fwohci0: EUI64 00:0d:93:ff:fe:ae:bc:8e
[     1.000000] fwohci0: autoconfiguration error: invalid speed 7 (fixed to 3).
[     1.000000] fwohci0: Phy 1394a available S800, 3 ports.
[     1.000000] fwohci0: Link S800, max_rec 4096 bytes.
[     1.000000] ieee1394if0 at fwohci0: IEEE1394 bus
[     1.000000] fwip0 at ieee1394if0: IP over IEEE1394
[     1.000000] fwohci0: Initiate bus reset
[     1.000000] fwohci0: Clear PME# now
[     1.000000] gem0 at pci2 dev 15 function 0: Apple Computer GMAC
Ethernet (rev. 0x80)
[     1.000000] gem0: interrupting at irq 41
[     1.000000] makphy0 at gem0 phy 0: Marvell 88E1111 Gigabit PHY, rev. 1
[     1.000000] makphy0: 10baseT, 10baseT-FDX, 100baseTX,
100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
[     1.000000] gem0: Ethernet address 00:0d:93:ae:bc:8e, 10KB RX
fifo, 4KB TX fifo
[     1.000000] vmmask 737e0000 schedmask 7ffe0000 highmask 7ffe0000
[     1.000000] timecounter: Timecounter "mftb" frequency 18432000 Hz
quality 100
[     1.000024] fwohci0: BUS reset
[     1.000024] fwohci0: node_id=0xc800ffc0, gen=2, CYCLEMASTER mode
[     1.000024] ieee1394if0: 1 nodes, maxhop <= 0 cable IRM irm(0) (me)
[     1.000024] ieee1394if0: bus manager 0
[     1.000024] timecounter: Timecounter "clockinterrupt" frequency
100 Hz quality 0
[     1.249990] uhub0 at usb0: NetBSD (0x0000) OHCI root hub (0x0000),
class 9/0, rev 1.00/1.00, addr 1
[     1.249990] uhub0: 2 ports with 2 removable, self powered
[     1.259993] atapibus0 at atabus0: 2 targets
[     1.269993] uhub1 at usb1: NetBSD (0x0000) OHCI root hub (0x0000),
class 9/0, rev 1.00/1.00, addr 1
[     1.269993] uhub1: 3 ports with 3 removable, self powered
[     1.269993] cd0 at atapibus0 drive 0: <MATSHITADVD-R   UJ-825, ,
DAM5> cdrom removable
[     1.269993] cd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA
mode 4 (Ultra/66)
[     1.269993] cd0(wdc0:0:0): using PIO mode 4, DMA mode 2 (using DMA)
[     1.289991] uhub2 at usb2: NetBSD (0x0000) OHCI root hub (0x0000),
class 9/0, rev 1.00/1.00, addr 1
[     1.289991] uhub2: 2 ports with 2 removable, self powered
[     1.299993] adbkbd0 at nadb0 addr 2: iBook keyboard
[     1.299993] wskbd0 at adbkbd0 mux 1
[     1.299993] wskbd0: connecting to wsdisplay0
[     1.299993] wsmouse0 at adbkbd0 mux 0
[     1.299993] adbms0 at nadb0 addr 3: uhub3 at usb3: NetBSD (0x0000)
EHCI root hub (0x0000), class 9/0, rev 2.00/1.00, addr 1
[     1.309993] uhub3: 5 ports with 5 removable, self powered
[     1.319992] IPsec: Initialized Security Association Processing.
[     1.319992] WARNING: system needs entropy for security; see entropy(7)
[     1.319992] EMP trackpadaes: BearSSL aes_ct
[     1.329998] chacha: Portable C ChaCha
[     1.379990]  <tpad> 4-button, 400 dpi
[     1.379990] wsmouse1 at adbms0 mux 0
[     1.379990] adbbt0 at nadb0 addr 7: button device
[     2.039991] AC Power: AC adapter online.
[     2.859992] ubt0 at uhub0 port 1
[     2.859992] ubt0: Apple Computer (0x05ac) Bluetooth (0x8203), rev
1.10/5.26, addr 2
[     4.229995] wd0 at atabus1 drive 0
[     4.229995] wd0: <TS64GPSD320>
[     4.229995] wd0: drive supports 1-sector PIO transfers, LBA48 addressing
[     4.229995] wd0: 61057 MB, 132322 cyl, 15 head, 63 sec, 512
bytes/sect x 125044736 sectors
[     4.239997] wd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA
mode 6 (Ultra/133)
[     4.239997] wd0(wdc1:0:0): using PIO mode 4, DMA mode 2, Ultra-DMA
mode 5 (Ultra/100) (using DMA)
[     4.239997] swwdog0: software watchdog initialized
[     4.239997] WARNING: 4 errors while detecting hardware; check system log.
[     4.239997] boot device: wd0
[     4.239997] root on wd0a dumps on wd0b
[     4.249999] /: replaying log to memory
[     4.249999] root file system type: ffs
[     4.249999] kern.module.path=/stand/macppc/9.99.100/modules
[    31.060079] smartbat0: normal capacity on 'Battery charge state'
[    98.860357] entropy: ready

/dev/wd0a on / type ffs (log, local)
/dev/wd0e on /var type ffs (log, local)
/dev/wd0f on /usr type ffs (log, local)
kernfs on /kern type kernfs (local)
ptyfs on /dev/pts type ptyfs (local)
procfs on /proc type procfs (local)
tmpfs on /var/shm type tmpfs (local)


Thanks for continuing to support this port.
Hope this helps,


-- 

Robert Fielding


Home | Main Index | Thread Index | Old Index