Subject: send-pr? supposed lock-up in network stack
To: None <port-hp700@netbsd.org>
From: Rudi Ludwig <rudihl@gmx.de>
List: port-hp700
Date: 10/06/2005 21:17:32
hello all:

as previously posted I run a 712/60 - 32 MB with
NetBSD 3.99.8 diskless mode.

So I typically log in with two xterms one for top
to see about swap and another to do the work.

Already happened 3 times, that the terminals froze
with top showing 0% interrupt and 0% idle.

Trying to ping the 712, it is not responding. But
the power-switch is still recognized. It prints
to the console but the shut-down hangs at unmounting
the filesystems, no surprise since they are on the
network.

So I suspect that there is a lock-up somewhere in
the network code.

Would it make sense to post this as a PR? Or goes
port-hp700 as work in progress without tracking?

Is there a way to further narrow the problem down?
My next will be to set up a serial console connection
and see wether this continues to work.


Rudi


encl.
	dmesg of the machine
	typical "frozen" top


Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005
    The NetBSD Foundation, Inc.  All rights reserved.
Copyright (c) 1982, 1986, 1989, 1991, 1993
    The Regents of the University of California.  All rights reserved.

NetBSD 3.99.8 (GENERIC) #0: Sat Sep 10 04:25:01 UTC 2005
	builds@works.netbsd.org:/home/builds/ab/HEAD/hp700/200509090000Z-obj/home/builds/ab/HEAD/src/sys/arch/hp700/compile/GENERIC
HP9000/712/60 (Gecko)
real mem = 32768 KB (73728 reserved for PROM, 20876 KB used by NetBSD)
avail mem = 20496 KB
mainbus0 (root) [flex fff80000]
pdc0 at mainbus0
cpu0 at mainbus0 hpa 0xfffbe000 path 8 irq 31 ipl 0: PA7100LC (Hummingbird) rev 6
cpu0: PCX-L, PA-RISC 1.1c, lev 1, cat A, 60 MHz clk
cpu0: shadows, 32K/32K D/I caches, 64 shared TLB, 8 shared BTLB
cpu0: PCX-L (CMOS-26B) floating point, rev 1
mem0 at mainbus0 hpa 0xfffbf000 path 9: viper rev 0, ctrl 40400102<eisa_prf> size 32MB
"GIO Graphics" at mainbus0 (type 0xa, sv 0x85) hpa 0xf8000000 path 1 not configured
lasi0 at mainbus0 hpa 0xf0000000 path 2 irq 28: rev 3.0
gsc0 at lasi0
osiop0 at gsc0 hpa 0xf0106000 path 2/0/1 irq 9 ipl 1: NCR53C710 rev 2, 40MHz, SCSI ID 7
scsibus0 at osiop0: 8 targets, 8 luns per target
iee0 at gsc0 hpa 0xf0107000 path 2/0/2 irq 8 ipl 2: Intel 82596CA address 08:00:09:9d:9e:ea
com0 at gsc0 hpa 0xf0105000 path 2/0/4 irq 5 ipl 3: ns16550a, working fifo
lpt0 at gsc0 hpa 0xf0102000 path 2/0/6 irq 7 ipl 4
harmony0 at gsc0 hpa 0xf0104000 path 2/0/8 irq 13 ipl 5: rev 18
audio0 at harmony0: full duplex
"floppy controller" at gsc0 (type 0xa, sv 0x83) hpa 0xf010a000 path 2/0/10 not configured
gsckbc0 at gsc0 hpa 0xf0108000 path 2/0/11 irq 26 ipl 6: keyboard
gsckbc1 at gsc0 hpa 0xf0108100 path 2/0/12: mouse
biomask 00000182 netmask 00000186 ttymask 000003de
Kernelized RAIDframe activated
scsibus0: waiting 2 seconds for devices to settle...
sd0 at scsibus0 target 6 lun 0: <IBM, DDRS-34560W, S97B> disk fixed
sd0: 4357 MB, 8387 cyl, 5 head, 212 sec, 512 bytes/sect x 8925000 sectors
sd0: sync (100.00ns offset 8), 8-bit (10.000MB/s) transfers
sd0: fabricating a geometry
sd0: no disk label
boot device: iee0
root on iee0
nfs_boot: trying DHCP/BOOTP
nfs_boot: BOOTP next-server: 192.168.2.2
nfs_boot: my_name=gecko
nfs_boot: my_addr=192.168.2.4
nfs_boot: my_mask=255.255.255.0
root on steinlaus:/export/hppa/gecko/root
root file system type: nfs
WARNING: clock lost 12911 days -- CHECK AND RESET THE DATE!
sd0: no disk label


load averages:  1.35,  1.42,  1.04                                     19:45:41
34 processes:  1 runnable, 32 sleeping, 1 on processor
CPU states: 95.1% user,  0.0% nice,  4.4% system,  0.0% interrupt,  0.5% idle
Memory: 11M Act, 5436K Inact, 484K Wired, 2720K Exec, 1984K File, 396K Free
Swap: 64M Total, 8508K Used, 56M Free

  PID USERNAME PRI NICE   SIZE   RES STATE      TIME   WCPU    CPU COMMAND
 1518 root      61    0  5152K   13M RUN        0:37 93.20% 80.62% cc1
    5 root      10    0     0K  764K nfsidl     0:11  1.03%  1.03% [nfsio]
  474 root      28    0   196K  392K CPU        0:11  0.93%  0.93% top
 1109 root      10    0   232K  144K wait       0:01  1.53%  0.73% <sh>
    6 root      10    0     0K  764K nfsidl     0:08  0.44%  0.44% [nfsio]
  350 root       2    0   308K  140K select     0:08  0.00%  0.00% <sshd>
  703 root      10    0  1352K  140K wait       0:07  0.00%  0.00% <make>
   76 root      10    0  1144K  140K wait       0:06  0.00%  0.00% <make>
  413 root       2    0   340K  484K select     0:05  0.00%  0.00% sshd
  103 root       2    0   344K  332K select     0:04  0.00%  0.00% <sshd>
  409 root       2    0   340K  332K select     0:04  0.00%  0.00% <sshd>
    9 root     -18    0     0K  764K pgdaemon   0:03  0.00%  0.00% [pagedaemon]
    7 root      10    0     0K  764K nfsidl     0:02  0.00%  0.00% [nfsio]
 1078 root      10    0   664K  144K wait       0:02  0.00%  0.00% <make>
    8 root      10    0     0K  764K nfsidl     0:01  0.00%  0.00% [nfsio]
  408 root      10    0   232K  160K nanoslee   0:01  0.00%  0.00% <cron>
  106 root       3    0   256K  136K ttyin      0:01  0.00%  0.00% <csh>