NetBSD-Bugs archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

kern/54736: pthread t_kill test locks up the kernel



>Number:         54736
>Category:       kern
>Synopsis:       pthread t_kill test locks up the kernel
>Confidential:   no
>Severity:       critical
>Priority:       high
>Responsible:    kern-bug-people
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Wed Dec 04 18:20:00 +0000 2019
>Originator:     Martin Husemann
>Release:        NetBSD 9.99.19
>Organization:
The NetBSD Foundation, Inc.
>Environment:
System: NetBSD gethsemane.duskware.de 9.99.19 NetBSD 9.99.19 (GETHSEMANE) #9: Wed Dec 4 14:51:10 CET 2019 martin%seven-days-to-the-wolves.aprisoft.de@localhost:/work/src/sys/arch/macppc/compile/GETHSEMANE macppc
Architecture: powerpc
Machine: macppc
>Description:

Running atf tests on macppc locks up (untill the watchdog reboots the
machine):


lib/libpthread/t_fpu (396/836): 1 test cases
    fpu: [0.024678s] Passed.
[0.025354s]

lib/libpthread/t_join (397/836): 1 test cases
    pthread_join: [0.023626s] Passed.
[0.024253s]

lib/libpthread/t_kill (398/836): 1 test cases
    simple: 
[.. locks up immediately ..]

[ 1864.0196291] swwdog: 60 second timer expired
[ 1864.0196291] panic: watchdog timer expired
[ 1864.0196291] cpu0: Begin traceback...
[ 1864.0196291] 0x1000fe20: at vpanic+0x144
[ 1864.0496531] 0x1000fe50: at panic+0x50
[ 1864.0596500] 0x1000fea0: at swwdog_panic+0x94
[ 1864.0596500] 0x1000feb0: at callout_softclock+0x284
[ 1864.0796635] 0x1000fee0: at softint_dispatch+0xac
[ 1864.0896652] 0x1000ff20: at softint_fast_dispatch+0xdc
[ 1864.0996705] saved LR(0xfffffb79) is invalid.cpu0: End traceback...
[ 1864.0996705] halting CPU 1

This is reproducable on my machine.

Disabling the watchdog upfront avoids the reset

db{0}> ps
PID    LID S CPU     FLAGS       STRUCT LWP *               NAME WAIT
685  >   3 7   0         0           5d25e660             t_kill
685  >   2 7   1         0           5d25ec20             t_kill
685      1 2   0         0           5e905620             t_kill
508      1 2   1         0           5f451180            atf-run
478      1 3   0        80           5f451460               tcsh pause
520      1 3   0        80           5f451a20               tcsh pause
511      1 3   0        80           5d25e0a0               sshd select
467      1 3   1        80           5e6e7c00               sshd poll
362      1 2   0         0           5e6e7920              getty
503      1 3   1        80           5e6e7640              getty nanoslp
517      1 3   0        80           5e6e7360              getty nanoslp
470      1 3   1        80           5eee4020              getty nanoslp
491      1 3   0        80           5f7540e0              getty nanoslp
477      1 2   1         0           5ed21bc0               cron
402      1 3   1        80           5e905060              inetd kqueue
431      1 3   1        80           5e905be0               sshd select
300      1 3   0        80           5e905900               ntpd netio
404      1 3   0        80           5e6e7080             powerd kqueue
411      1 2   0   1000000           5e905340               ntpd
281      1 2   1         0           5ed21600            syslogd
206      1 3   0        80           5eee4300             dhcpcd kqueue
1        1 3   0        80           5f6f4140               init wait
0       66 3   1       200           5f6bb160            physiod physiod
0       65 3   1       200           5f6bbce0           aiodoned aiodoned
0       64 3   0       200           5f6bba00          pooldrain pooldrain
0       63 2   0       200           5f6bb720            ioflush
0       62 3   0       200           5f6bb440           pgdaemon pgdaemon
0       59 3   1       200           5f6f4700          swwreboot swwreboot
0       58 3   0       200           5f7c5c40               usb1 usbevt
0       57 3   1       200           5f6f4cc0          atapibus0 sccomp
0       55 3   0       200           5f6f49e0               usb0 usbevt
0       53 2   1       200           5f6f4420            npfgc-0
0       52 3   1       200           5f712ca0            rt_free rt_free
0       51 3   1       200           5f7129c0              unpgc unpgc
0       50 2   0       200           5f7126e0    key_timehandler
0       49 3   1       200           5f712400    icmp6_wqinput/1 icmp6_wqinput
0       48 3   0       200           5f712120    icmp6_wqinput/0 icmp6_wqinput
0       47 2   1       200           5f753c80          nd6_timer
0       46 3   1       200           5f7539a0    carp6_wqinput/1 carp6_wqinput
0       45 3   0       200           5f7536c0    carp6_wqinput/0 carp6_wqinput
0       44 3   1       200           5f7546a0     carp_wqinput/1 carp_wqinput
0       43 3   0       200           5f754c60     carp_wqinput/0 carp_wqinput
0       42 3   1       200           5f7543c0     icmp_wqinput/1 icmp_wqinput
0       41 3   0       200           5f754980     icmp_wqinput/0 icmp_wqinput
0       40 2   0       200           5f753100           rt_timer
0       39 3   0       200           5f7533e0        vmem_rehash vmem_rehash
0       30 3   1       280           5f7c5960           fw0probe ieee1394
0       29 3   1       200           5f7c5680         usbtask-dr usbtsk
0       28 3   0       200           5f7c53a0         usbtask-hc usbtsk
0       27 3   0       200           5f7c50c0            atabus2 atath
0       26 3   1       200           5f7f5c20            atabus1 atath
0       25 3   0       200           5f7f5940            atabus0 atath
0       24 2   1       200           5f7f5660               iic1
0       23 2   0       200           5f7f5380                pmu
0       22 2   0       200           5f7f50a0               iic0
0       21 3   1       200           5f800c00            xcall/1 xcall
0       20 1   1       200           5f800920          softser/1
0       19 1   1       200           5f800640          softclk/1
0       18 1   1       200           5f800360          softbio/1
0       17 1   1       200           5f800080          softnet/1
0       16 1   1       201           5f814be0             idle/1
0       15 3   0       200           5f814900             sysmon smtaskq
0       14 3   0       200           5f814620         pmfsuspend pmfsuspend
0       13 3   1       200           5f814340           pmfevent pmfevent
0       12 3   0       200           5f814060         sopendfree sopendfr
0       11 3   0       200           5fb27bc0           nfssilly nfssilly
0       10 2   0       200           5fb278e0            cachegc
0        9 3   0       200           5fb27600             vdrain vdrain
0        8 3   1       200           5fb27320          modunload mod_unld
0        7 3   0       200           5fb27040            xcall/0 xcall
0        6 1   0       200           5fb31ba0          softser/0
0        5 1   0       200           5fb318c0          softclk/0
0        4 1   0       200           5fb315e0          softbio/0
0        3 1   0       200           5fb31300          softnet/0
0        2 1   0       201           5fb31020             idle/0
0        1 2   1       200             b8a580            swapper


Unfortunately trying to get backtraces kills the kernel:

db{0}> bt
0x102feea0: at zsc_intr_hard+0x74
0x102feec0: at zshard+0x18
0x102feed0: at intr_deliver.isra.1+0x90
0x102feef0: at pic_handle_intr+0x178
0x102fef20: at trapstart+0x6b0
[  77.1787888] trap: kernel read DSI trap @ 0xfd5effd4 by 0x12a434 (DSISR 0x40000000, err=14), lr 0x12a8c0
[  77.1787888] panic: trap
[  77.1787888] cpu0: Begin traceback...
[  77.1787888] 0x102fe930: at vpanic+0x144
[  77.1787888] 0x102fe960: at panic+0x50
[  77.1787888] 0x102fe9b0: at trap+0x100
[  77.1787888] 0x102fea70: kernel DSI read trap @ 0xfd5effd4 by db_stack_trace_print+0x11c: srr1=0x32
[  77.1787888]             r1=0x102feb40 cr=0x20244244 xer=0 ctr=0x10a1fc dsisr=0x40000000
[  77.1787888] 0x102feb40: at db_stack_trace_print+0x5a4
[  77.1787888] 0x102febc0: at db_command+0x124
[  77.1787888] 0x102fec60: at db_command_loop+0xd4
[  77.1787888] 0x102fed40: at db_trap+0xe0
[  77.1787888] 0x102fed70: at kdb_trap+0x120
[  77.1787888] 0x102fedb0: at trapstart+0x95c
[  77.1787888] saved LR(0x3a7342b6) is invalid.cpu0: End traceback...
[  77.1787888] halting CPU 1



>How-To-Repeat:
s/a

>Fix:
n/a



Home | Main Index | Thread Index | Old Index