NetBSD-Users archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]
Re: Bind ending up in Parked state.
Thus spake Christos Zoulas, on Tue, Mar 10, 2015 at 04:08:34PM +0000:
> >I hadn't thought of that at all.
> >
> >I tried -current, that broke the machine rather violently :) ended up in a
> >db{6}: prompt without any USB support. After a bit of fiddling around, I
> >realized that starting over would probably be the fastest way of going. It
> Would be nice to know the trace of that and the steps to reproduce :-)
The steps to reproduce the db{6} - or what I did was to fetch amd64 binaries
from http://nyftp.netbsd.org/pub/NetBSD-daily/HEAD/201503060730Z/amd64/binary/sets/ - then run
mkdir /I
cd /I
echo â??mget *gzâ?? | ftp -Va ftp://nyftp.netbsd.org/pub/NetBSD-daily/HEAD/201503060730Z/amd64/binary/sets/
cd /
progress -z -f /I/modules.tgz pax -rpe
progress -z -f /I/kern-GENERIC.tgz pax -rpe
reboot
After a while I ventured into the datacenter, even though it is a HP-G5 it
shouldn't take this long to get back to life. Here I noticed it was hanging
with a db{6} - The log leading up to this was:
panic: cnopen: no console device
fatal breakpoint trap in supervisor mode
trap type 1 code 0 rip ffffffff8028994d cs 8 rflags 2467 cr2 7f7ff6c08be8 ilevel 0 rsp fffffe811db66ad8
curlwp 0xfffffe843b1538a0 pid 2.1 lowest kstack 0xfffffe811db632c0
Stopped in pid 2.1 (init) at netbsd:breakpoint +0x5: leave
db{6}>
The server had an USB keyboard attached, but it didn't seem to respond.
Fortunately I had an old PS2 keyboard that I connected up and entered "c",
this resulted in the server doing a dump from 632 to 0, and a reboot. Here it
ended up in the same spot, with the same error.
Tired and a slight feeling of frustration, boot on my NetBSD install USB stick
to get a 6.1.5 kernel and modules back. Alas, when the server was rebooted into
the 6.1.5 kernel, it just kept rebooting right after the boot menu. I could
see the first 2 seconds of swirly lines after I pressed enter on the menu, then
it hung up on me and rebooted.
I ended up reinstalling the machine, and getting it back to a 6.1.5. This is
where the machine is currently.
>
> >is now happily chomping away, with a lot less load on the system overall.
> >Without --disable-threads it would ever so often spike all 8 Cores, and it
> >wasn't uncommon for it to have a load >2.0 - now it is sitting comfortably
> >around 0.32. And it doesn't seem to have any impact on the system.
>
> Still I would like to know what is taking all this time... ktrace it
> and then kdump -R to display relative timestamps.
>
> >I still see
> >
> > 3417 named 43 0 713M 503M parked/5 3:34 19.82% 19.82% named
>
> Hit 't' to switch to the thread view to get more details about what's going
> on for each individual thread (for the threaded named).
Never get too excited about things too soon it seems ;-) The Named just broke
down again. I've started it up and is currently running ktrace against it.
Currently a top (with t) shows the following regarding named:
127 threads: 35 idle, 85 sleeping, 7 on CPU
CPU0 states: 4.6% user, 0.0% nice, 14.6% system, 5.6% interrupt, 75.2% idle
CPU1 states: 5.6% user, 0.0% nice, 19.0% system, 0.0% interrupt, 75.4% idle
CPU2 states: 5.6% user, 0.0% nice, 5.4% system, 0.0% interrupt, 89.0% idle
CPU3 states: 4.2% user, 0.0% nice, 10.4% system, 0.0% interrupt, 85.4% idle
CPU4 states: 7.6% user, 0.0% nice, 12.6% system, 0.0% interrupt, 79.8% idle
CPU5 states: 3.4% user, 0.0% nice, 10.0% system, 0.2% interrupt, 86.4% idle
CPU6 states: 4.0% user, 0.0% nice, 5.2% system, 0.2% interrupt, 90.6% idle
CPU7 states: 5.0% user, 0.0% nice, 8.4% system, 0.0% interrupt, 86.6% idle
Memory: 4494M Act, 6052K Wired, 24M Exec, 3905M File, 11G Free
Swap: 16G Total, 16G Free
PID LID USERNAME PRI STATE TIME WCPU CPU NAME COMMAND
23874 11 named 34 CPU/6 0:47 21.00% 21.00% - named
23874 6 named 39 parked/1 0:28 17.09% 17.09% - named
23874 5 named 41 parked/2 0:27 14.55% 14.55% - named
23874 4 named 42 parked/3 0:27 12.94% 12.94% - named
23874 8 named 40 parked/4 0:25 12.89% 12.89% - named
23874 9 named 39 parked/6 0:25 12.79% 12.79% - named
23874 2 named 40 parked/7 0:26 12.70% 12.70% - named
23874 7 named 39 parked/5 0:26 11.43% 11.43% - named
23874 3 named 39 parked/5 0:27 9.67% 9.67% - named
23874 1 named 85 sigwai/7 0:00 0.00% 0.00% - named
23874 10 named 43 parked/1 0:00 0.00% 0.00% - named
anycast-dns-01# ldd /usr/pkg/sbin/named
/usr/pkg/sbin/named:
-llwres.141 => /usr/pkg/lib/liblwres.so.141
-ljson-c.2 => /usr/pkg/lib/libjson-c.so.2
-lc.12 => /usr/lib/libc.so.12
-lpthread.1 => /usr/lib/libpthread.so.1
-lgcc_s.1 => /usr/lib/libgcc_s.so.1
-lxml2.2 => /usr/pkg/lib/libxml2.so.2
-lz.1 => /usr/lib/libz.so.1
-llzma.1 => /usr/lib/liblzma.so.1
-lm.0 => /usr/lib/libm.so.0
-lbind9.140 => /usr/pkg/lib/libbind9.so.140
-lisccfg.140 => /usr/pkg/lib/libisccfg.so.140
-ldns.146 => /usr/pkg/lib/libdns.so.146
-lisc.142 => /usr/pkg/lib/libisc.so.142
-lgssapi.10 => /usr/lib/libgssapi.so.10
-lkrb5.26 => /usr/lib/libkrb5.so.26
-lhx509.5 => /usr/lib/libhx509.so.5
-lasn1.9 => /usr/lib/libasn1.so.9
-lcom_err.7 => /usr/lib/libcom_err.so.7
-lcrypt.1 => /usr/lib/libcrypt.so.1
-lcrypto.8 => /usr/lib/libcrypto.so.8
-lroken.19 => /usr/lib/libroken.so.19
-lutil.7 => /usr/lib/libutil.so.7
-lwind.0 => /usr/lib/libwind.so.0
-lheimbase.1 => /usr/lib/libheimbase.so.1
-lheimntlm.4 => /usr/lib/libheimntlm.so.4
-lcrypto.1.0.0 => /usr/pkg/lib/libcrypto.so.1.0.0
-lisccc.140 => /usr/pkg/lib/libisccc.so.140
even with --disable-threads it still links into phtreads it seems?
Best regards
Søren P. Skou
--
GPG Key: 8E58ACB3
Home |
Main Index |
Thread Index |
Old Index