Subject: make -j 3 hang on amd64
To: None <current-users@netbsd.org>
From: Kurt Schreiner <ks@ub.uni-mainz.de>
List: current-users
Date: 11/29/2005 16:40:56
Hi,

I just tried to build a distribution for vax on my dual-opteron system.
This worked for a while and then stalled. /usr/src is a symbolic link
to /u/NetBSD/src, which is a union mount:

/dev/sd0a on / type ffs (local)
/dev/sd0i on /var type ffs (noatime, local)
/dev/sd0h on /usr type ffs (noatime, local)
/dev/sd0j on /opt type ffs (noatime, local)
/dev/sd0k on /home type ffs (noatime, soft dependencies, local)
/dev/sd1h on /u type ffs (noatime, soft dependencies, NFS exported, local)
tmpfs on /tmp type tmpfs (nosuid, nodev, local)
kernfs on /kern type kernfs (local)
<above>:/u/NetBSD/lpkgsrc on /u/NetBSD/pkgsrc type union (nosuid, nodev, local, mounted by ks)
<above>:/u/NetBSD/lsrc on /u/NetBSD/src type union (nosuid, nodev, local, mounted by ks)

The process table shows some processes hanging in "D" state:
("df" and "ls" were run after the make stopped responding.)

 77   692   240     0  18  0 2344  1716 pause    Ss   ttyp0 0:00.06 -tcsh 
 77  6335   692     0  28  0  120   844 -        R+   ttyp0 0:00.00 ps axl 
 77 14416   692     0  -2  0   60   696 vnlock   D    ttyp0 0:00.00 df -k 
 77 17940   692     0  -2  0   72   820 vnlock   D    ttyp0 0:00.00 ls -CF -a -ol /usr/src/ 
 77   286 19261 52645  -2  0  204   944 vnlock   D    ttyp1 0:00.00 /u/NetBSD/arch/vax/TOOLS/bin/nb
 77   334  6579 52645  -2  0  664  1448 vnlock   D    ttyp1 0:00.01 /u/NetBSD/arch/vax/TOOLS/bin/nb
 77   610 15349 52645  -2  0  204   944 vnlock   D    ttyp1 0:00.00 /u/NetBSD/arch/vax/TOOLS/bin/nb
 77  1954 15333     0   2  0  460  1288 poll     S    ttyp1 0:00.14 /u/NetBSD/arch/vax/TOOLS/bin/nb
 77  2767 20854 25914  10  0  180  1024 wait     I    ttyp1 0:00.01 sh 
 77  6485 27995     0   2  0 4876  5704 poll     S    ttyp1 0:00.19 /u/NetBSD/arch/vax/TOOLS/bin/nb
 77  6579 22944 52645  10  0  184  1028 wait     I    ttyp1 0:00.00 sh 
 77 10219 25114     0   2  0   40   660 piperd   I+   ttyp1 0:00.10 tee -a /var/tmp/mkvax-051129.15
 77 13911 15418     0   2  0  836  1676 poll     S    ttyp1 0:00.12 /u/NetBSD/arch/vax/TOOLS/bin/nb
 77 13938 19395     0   2  0  504  1336 poll     S    ttyp1 0:00.09 /u/NetBSD/arch/vax/TOOLS/bin/nb
 77 14886 25114  2387  10  0  188   528 wait     I+   ttyp1 0:00.00 sh NBscripts/build-netbsd -d -j
 77 15333  6485 52645  10  0  184  1028 wait     I    ttyp1 0:00.00 sh 
 77 15349  1954 52645  10  0  184  1028 wait     I    ttyp1 0:00.00 sh 
 77 15418 26621 25914  10  0  188  1032 wait     I    ttyp1 0:00.00 /bin/sh -c _makedirtarget() {  
 77 18002 13938 39254  10  0  184  1028 wait     I    ttyp1 0:00.00 sh 
 77 19261 25770 52645  10  0  184  1028 wait     I    ttyp1 0:00.00 sh 
 77 19262 28193  2387  10  0  188  1032 wait     I+   ttyp1 0:00.00 sh NBscripts/build-netbsd -d -j
 77 19395 13911 25914  10  0  184  1028 wait     I    ttyp1 0:00.00 sh 
 77 20854 25515     0   2  0  836  1676 poll     S+   ttyp1 0:00.11 /u/NetBSD/arch/vax/TOOLS/bin/nb
 77 20855 13911 50633  10  0  184  1028 wait     I    ttyp1 0:00.00 sh 
 77 22944 25529     0   2  0  500  1328 poll     S    ttyp1 0:00.01 /u/NetBSD/arch/vax/TOOLS/bin/nb
 77 25114 19262  2387  10  0  188   812 wait     I+   ttyp1 0:00.00 sh NBscripts/build-netbsd -d -j
 77 25515 14886 25914  10  0  312  1168 wait     I+   ttyp1 0:00.02 sh ./build.sh -N 1 -u -j 3 -U -
 77 25529 28403 52645  10  0  184  1028 wait     I    ttyp1 0:00.00 sh 
 77 25770 20855     0   2  0 3272  4116 poll     S    ttyp1 0:00.08 /u/NetBSD/arch/vax/TOOLS/bin/nb
 77 26621  2767 25914  10  0  836  1672 wait     I    ttyp1 0:00.02 /u/NetBSD/arch/vax/TOOLS/bin/nb
 77 27995 13911 50633  10  0  184  1028 wait     I    ttyp1 0:00.00 sh 
 77 28193 19109     0  18  0 2260  1656 pause    Is   ttyp1 0:00.03 -tcsh 
 77 28403 18002     0   2  0  692  1540 poll     S    ttyp1 0:00.03 /u/NetBSD/arch/vax/TOOLS/bin/nb
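
To pick the stuck processes out of a listing like the one above, a small
filter can help. The following is only a sketch: it assumes the column
layout shown here (PID in the 2nd column, wait channel in the 9th, state
in the 10th), and the function name blocked_processes is made up for
illustration. It groups the PIDs of all processes in "D" state by wait
channel.

```python
from collections import defaultdict

def blocked_processes(ps_lines):
    """Group PIDs of processes in "D" state by their wait channel.

    Assumes the 13-column "ps axl" layout seen in the listing:
    PID is field 2, wait channel is field 9, state is field 10.
    """
    by_wchan = defaultdict(list)
    for line in ps_lines:
        # Split into at most 13 fields so the command keeps its spaces.
        fields = line.split(None, 12)
        if len(fields) < 13 or not fields[1].isdigit():
            continue  # header line or something we cannot parse
        pid, wchan, stat = int(fields[1]), fields[8], fields[9]
        if stat.startswith("D"):
            by_wchan[wchan].append(pid)
    return dict(by_wchan)

# A few lines copied from the listing above.
sample = [
    " 77   692   240     0  18  0 2344  1716 pause    Ss   ttyp0 0:00.06 -tcsh ",
    " 77 14416   692     0  -2  0   60   696 vnlock   D    ttyp0 0:00.00 df -k ",
    " 77 17940   692     0  -2  0   72   820 vnlock   D    ttyp0 0:00.00 ls -CF -a -ol /usr/src/ ",
    " 77   286 19261 52645  -2  0  204   944 vnlock   D    ttyp1 0:00.00 /u/NetBSD/arch/vax/TOOLS/bin/nb",
    " 77  2767 20854 25914  10  0  180  1024 wait     I    ttyp1 0:00.01 sh ",
]

print(blocked_processes(sample))
```

With the sample lines this reports the three processes waiting on
"vnlock". Feeding it the complete "ps axl" output shows at a glance
whether everything that is blocked waits on the same channel.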

This is the 2nd or 3rd time this has happened, and I wonder how to debug it...

Any ideas anyone?

Kurt

PS: I'll try to get a dump of the machine...
-- 
===============================================================================
Dipl.-Inf. Kurt Schreiner               Tel.  : +49 6131 39 25134
Universitaetsbibliothek Mainz           Fax.  : +49 6131 39 24159 o. 23976
Jakob Welder Weg 6, 55128 Mainz         eMail : ks at ub.uni-mainz.de
===============================================================================