NetBSD-Bugs archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: kern/57621: Updated 10.0_BETA macppc MP kernel prone to hangs



>> Recently, I frequently observe similar stalls on Mac mini G4 (UP).
>>
>> I can work around the problem by this patch:
> 
> Excellent!
> I will apply this to my local tree and check the fix from this
> coming evening my time.

Unfortunately, that did not result in a complete success, and the
build got stuck overnight, at

      compile  libcrypto/x25519_ref10.o
      build  libcrypto/librumpkern_crypto.so.0.0
      build  libcrypto/librumpkern_crypto.a

I now have an 'ld' stuck in 'tstile' (probably building the
.so.0.0), together with 'vdrain'.  Output from 'crash':

crash> ps
PID    LID S CPU     FLAGS       STRUCT LWP *               NAME WAIT
12110 12110 3   1       180           1a41ea00             pickup kqueue
4483  4483 3   0         0           102ee400                 ld tstile
26251 26251 3   1       180           15d106c0           collect2 wait
23329 23329 3   1       180           13cff100   powerpc--netbsd- wait
24864 24864 3   0       180           181c3400                 sh wait
21433 21433 3   0       180           1a41e100             nbmake poll
14503 14503 3   0       180           19661a00                 sh wait
4702  4702 3   0       180           181c3a00             nbmake poll
5602  5602 3   0       180           19661700                 sh wait
25075 25075 3   1       180           19661100             nbmake poll
9198  9198 3   0       180           22089340             nbmake wait
8760  8760 3   1       180           13cffd00                 sh wait
22443 22443 3   0       180           21f5e3c0             nbmake poll
27361 27361 3   0       180           1ae133c0                 sh wait
6730  6730 3   0       180           1e36a640             nbmake poll
1460  1460 3   0       180           1073f340             systat ttyraw
897  > 897 7   0       100           1a41ed00              crash
10856 10856 3   1       180           19996640               tcsh pause
9905  9905 3   1       180           181c3d00               tcsh pause
10297>10297 7   1       100           19661d00               sshd
9285  9285 3   1       180           19996040               sshd poll
6102  6102 3   1       180           1a95f0c0                top select
1826  1826 3   1       180           1e36ac40                 sh wait
2345  2345 3   0       180           1df8dc80             nbmake poll
2611  2611 3   1       180           1df8d080                 sh wait
2335  2335 3   0       180           1df8d980             nbmake poll
2399  2399 3   1       180           1e36a940                 sh wait
2683  2683 3   1       180           1df8d680             nbmake poll
1074  1074 3   0       180           12175d00                 sh wait
1072  1072 3   1       180           181c3100               tail kqueue
361    361 3   1       180           10d63cc0               tcsh pause
1108  1108 3   1       180           12175700               tcsh pause
1092  1092 3   1       180           1073f940               tcsh pause
1089  1089 3   0       180           10d87680               tcsh pause
1151  1151 3   0       180           15d10cc0               tcsh pause
850    850 3   0       180           12175400              xterm select
959    959 3   1       180           1073f040               tcsh pause
962    962 3   0       180           15d109c0               sshd poll
952    952 3   0       180           12175100              xterm select
942    942 3   0       180           102eea00              xterm select
949    949 3   0       180           10d639c0               sshd poll
1251  1251 3   0       180           102ee700               tcsh pause
943    943 3   1       180           10d87c80               sshd poll
704    704 3   1       180           1073fc40               sshd poll
928    928 3   0       180           10d87980               tcsh pause
1122  1122 3   0       180           102eed00               sshd poll
983    983 3   1       180           15d103c0              getty ttyraw
981    981 3   1       180           15d100c0              getty nanoslp
813    813 3   0       180           13cd7c80              getty nanoslp
986    986 3   1       180           13cd7980              getty nanoslp
985    985 3   0       180           13cd7680              getty ttyraw
998    998 3   1       180           12b77640               cron nanoslp
974    974 3   1       180           12b77c40               sshd poll
971    971 3   0       180           10d630c0              inetd kqueue
957    957 3   1       180           10d87380               qmgr kqueue
828    828 3   0       180           12175a00             master kqueue
723    723 3   1       180           10d636c0               sshd poll
604    604 3   0       180           12b77940             powerd kqueue
567    567 3   0       180           12b77340               ntpd pause
318    318 3   0       180           10d87080            syslogd kqueue
1        1 3   0       180           102af0c0               init wait
0      198 3   1       200           1022f100            physiod physiod
0      166 3   1       200           102ee100          pooldrain pooldrain
0      165 3   1       200           102afcc0            ioflush syncer
0      164 3   0       200           102af9c0           pgdaemon pgdaemon
0      126 3   0       200           7f87cc80          swwreboot swwreboot
0      124 3   1       200           102af6c0          atapibus0 sccomp
0      122 3   0       200           102196c0               usb0 usbevt
0      121 3   0       200           102af3c0               usb1 usbevt
0      120 3   0       200           1022f400             npfgc0 npfgcw
0      119 3   1       200           1028cc80            rt_free rt_free
0      118 3   1       200           1028c980              unpgc unpgc
0      117 3   0       200           1028c680    key_timehandler key_timehandler

0      116 3   1       200           1028c380    icmp6_wqinput/1 icmp6_wqinput
0      115 3   0       200           1028c080    icmp6_wqinput/0 icmp6_wqinput
0      114 3   1       200           1027ec40          nd6_timer nd6_timer
0      113 3   1       200           1027e940    carp6_wqinput/1 carp6_wqinput
0      112 3   0       200           1027e640    carp6_wqinput/0 carp6_wqinput
0      111 3   1       200           1027e340     carp_wqinput/1 carp_wqinput
0      110 3   0       200           1027e040     carp_wqinput/0 carp_wqinput
0      109 3   1       200           1022fd00     icmp_wqinput/1 icmp_wqinput
0      108 3   0       200           102199c0     icmp_wqinput/0 icmp_wqinput
0      107 3   0       200           1022fa00           rt_timer rt_timer
0      106 3   0       200           10219cc0        vmem_rehash vmem_rehash
0       97 3   0       200           102193c0            lmtemp0 lmtemp0
0       96 3   1       200           102190c0            dbcool0 dbcool0
0       30 3   0       200           7f87c980          entbutler entropy
0       29 3   1       380           7f87c680           fw0probe ieee1394
0       28 3   1       240           7f87c380            atabus2 atath
0       27 3   0       200           7f87c080         usbtask-dr usbtsk
0       26 3   0       200           7f901c40         usbtask-hc usbtsk
0       25 3   1       240           7f901940            atabus1 atath
0       24 3   1       200           7f901640            atabus0 atath
0       23 3   0       200           7f901340                pmu wait
0       22 3   1       200           7f901040            xcall/1 xcall
0       21 1   1       200           7f90ed00          softser/1
0       20 1   1     40200           7f90ea00          softclk/1
0       19 1   1       200           7f90e700          softbio/1
0       18 1   1       200           7f90e400          softnet/1
0       17 1   1       201           7f90e100             idle/1
0       16 3   0       200           7f91acc0             sysmon smtaskq
0       15 3   0       200           7f91a9c0         pmfsuspend pmfsuspend
0       14 3   0       200           7f91a6c0           pmfevent pmfevent
0       13 3   0       200           7f91a3c0         sopendfree sopendfr
0       12 3   0       200           7f91a0c0             ifwdog ifwdog
0       11 3   0       200           7fb26c80            iflnkst iflnkst
0       10 3   0       200           7fb26980           nfssilly nfssilly
0        9 3   1       240           7fb26680             vdrain tstile
0        8 3   1       200           7fb26380          modunload mod_unld
0        7 3   0       200           7fb26080            xcall/0 xcall
0        6 1   0       200           7fb30c40          softser/0
0        5 1   0     40200           7fb30940          softclk/0
0        4 1   0       200           7fb30640          softbio/0
0        3 1   0     40200           7fb30340          softnet/0
0        2 1   0       201           7fb30040             idle/0
0        0 3   0       200             c32dc0            swapper uvm
crash> bt/a 102ee400
trace: pid 4483 lid 4483 at 0x1b623930
0x1b623990: at cpu_switchto+0x28
0x1b6239a0: at mi_switch+0x140
0x1b6239e0: at sleepq_block+0xe0
0x1b623a00: at turnstile_block+0x284
0x1b623a60: at rw_enter+0x124
0x1b623aa0: at cache_lookup+0xe8
0x1b623ad0: at ufs_lookup+0xd4
0x1b623b80: at VOP_LOOKUP+0x4c
0x1b623ba0: at lookup_once+0x1fc
0x1b623bf0: at namei_tryemulroot.constprop.0+0x4a8
0x1b623cc0: at namei+0x58
0x1b623cf0: at vn_open+0xfc
0x1b623e10: at do_open+0xf0
0x1b623e60: at do_sys_openat+0x9c
0x1b623ea0: at sys_open+0x2c
0x1b623ec0: at syscall+0x294
0x1b623f20: user SC trap #5 by 0xfdc70e18: srr1=0xd032
            r1=0xffffda20 cr=0x84000444 xer=0x20000000 ctr=0xfdc70e10
crash> 
crash> bt/a 7fb26680
trace: pid 0 lid 9 at 0x1001fc60
0x1001fcc0: at cpu_switchto+0x28
0x1001fcd0: at mi_switch+0x140
0x1001fd10: at sleepq_block+0xe0
0x1001fd30: at turnstile_block+0x284
0x1001fd90: at rw_enter+0x124
0x1001fdd0: at cache_purge1+0x198
0x1001fe00: at vcache_reclaim+0xc8
0x1001fe80: at vrecycle.part.0+0xc0
0x1001feb0: at vdrain_thread+0x384
0x1001ff20: at cpu_lwp_bootstrap+0xc
saved LR(0x55aa55a6) is invalid.
crash> 

Some of the threads which are not waiting:

crash> bt/a 7f90ed00
trace: pid 0 lid 21 at 0x1004fe40
0x1004fea0: at cpu_switchto+0x28
0x1004feb0: at mi_switch+0x140
0x1004fef0: at softint_thread+0x190
0x1004ff20: at cpu_lwp_bootstrap+0xc
saved LR(0xaa55aa51) is invalid.
crash> bt/a 7f90ea00
trace: pid 0 lid 20 at 0x1004be40
0x1004bea0: at cpu_switchto+0x28
0x1004beb0: at mi_switch+0x140
0x1004bef0: at softint_thread+0x190
0x1004bf20: at cpu_lwp_bootstrap+0xc
saved LR(0xaa55aa51) is invalid.
crash> bt/a 7f90e700
trace: pid 0 lid 19 at 0x10047e40
0x10047ea0: at cpu_switchto+0x28
0x10047eb0: at mi_switch+0x140
0x10047ef0: at softint_thread+0x190
0x10047f20: at cpu_lwp_bootstrap+0xc
saved LR(0xaa55aa51) is invalid.
crash> bt/a 7f90e400
trace: pid 0 lid 18 at 0x10043e40
0x10043ea0: at cpu_switchto+0x28
0x10043eb0: at mi_switch+0x140
0x10043ef0: at softint_thread+0x190
0x10043f20: at cpu_lwp_bootstrap+0xc
saved LR(0xaa55aa51) is invalid.
crash> bt/a 7f90e100
trace: pid 0 lid 17 at 0x1003fdc0
0x1003fe20: at cpu_switchto+0x28
0x1003fe30: at mi_switch+0x140
0x1003fe70: at idle_loop+0x10c
0x1003feb0: at cpu_spinup_trampoline+0x3c
saved LR(0x902e) is invalid.
crash> bt/a 7fb30c40
trace: pid 0 lid 6 at 0x10013e40
0x10013ea0: at cpu_switchto+0x28
0x10013eb0: at mi_switch+0x140
0x10013ef0: at softint_thread+0x190
0x10013f20: at cpu_lwp_bootstrap+0xc
saved LR(0x55aa55a6) is invalid.
crash> bt/a 7fb30940
trace: pid 0 lid 5 at 0x1000fe40
0x1000fea0: at cpu_switchto+0x28
0x1000feb0: at mi_switch+0x140
0x1000fef0: at softint_thread+0x190
0x1000ff20: at cpu_lwp_bootstrap+0xc
saved LR(0x55aa55a6) is invalid.
crash> bt/a 7fb30640
trace: pid 0 lid 4 at 0x1000be40
0x1000bea0: at cpu_switchto+0x28
0x1000beb0: at mi_switch+0x140
0x1000bef0: at softint_thread+0x190
0x1000bf20: at cpu_lwp_bootstrap+0xc
saved LR(0x55aa55a6) is invalid.
crash> bt/a 7fb30340
trace: pid 0 lid 3 at 0x10007e40
0x10007ea0: at cpu_switchto+0x28
0x10007eb0: at mi_switch+0x140
0x10007ef0: at softint_thread+0x190
0x10007f20: at cpu_lwp_bootstrap+0xc
saved LR(0x55aa55a6) is invalid.
crash> bt/a 7fb30040
trace: pid 0 lid 2 at 0x10003e30
0x10003e90: at cpu_switchto+0x28
0x10003ea0: at mi_switch+0x140
0x10003ee0: at idle_loop+0x10c
0x10003f20: at cpu_lwp_bootstrap+0xc
saved LR(0x55aa55a6) is invalid.
crash>

but as near as I can tell these do not provide a 'smoking gun'.

'top' says:

load averages:  0.05,  0.01,  0.00;               up 0+09:04:53        08:23:16
62 processes: 61 sleeping, 1 on CPU
CPU states:  0.0% user,  0.0% nice,  0.0% system,  0.0% interrupt,  100% idle
Memory: 1124M Act, 560M Inact, 14M Wired, 23M Exec, 1600M File, 24M Free
Swap: 2000M Total, 2000M Free / Pools: 241M Used

and is still running.

Regards,

Havard


Home | Main Index | Thread Index | Old Index