NetBSD-Bugs archive
Re: port-arm/56842
The following reply was made to PR port-arm/56842; it has been noted by GNATS.
From: Jim Spath <jspath55%gmail.com@localhost>
To: gnats-bugs%netbsd.org@localhost
Cc:
Subject: Re: port-arm/56842
Date: Fri, 20 May 2022 09:20:51 -0400
Dave:
Thank you for the feedback. I also have NetBSD running on a Pi3 and a Pi4;
this is my first time getting a Zero 2W working. The other systems are
running current:
NetBSD [pi3] 9.99.82 NetBSD 9.99.82 (GENERIC64) #0: Tue Apr 27 05:40:29 UTC
2021 mkrepro%mkrepro.NetBSD.org@localhost:/usr/src/sys/arch/evbarm/compile/GENERIC64
evbarm
NetBSD [pi4] 9.99.93 NetBSD 9.99.93 (GENERIC64) #0: Sun Jan 2 23:46:21 UTC
2022 mkrepro%mkrepro.NetBSD.org@localhost:/usr/src/sys/arch/evbarm/compile/GENERIC64
evbarm
Neither of those systems, nor an earlier 9.2 system, has shown cron hangs, and
the identical script runs on all of them.
Your questions:
- what does ps(1) tell you about the state of cron?
Nothing useful to me yet; but see below for results from top.
USER      PID %CPU %MEM   VSZ   RSS TTY   STAT STARTED    TIME COMMAND
root    18233  0.0  0.4  6728  1824 ?     Ss   Mon05PM 0:00.00 /usr/sbin/cron
- You might try enabling extra debugging information with the -x option.
I tried one iteration with debug flags and captured logs but saw nothing
useful there.
- What happens if you disable particular cron entries, like the script
meant to run every minute?
I will try lowering the frequency, after doing a reboot and seeing if/when
the issue recurs. It seems this might be a “slow leak” that will take
patience to track.
I investigated further and found hangs on both top and vmstat, at varying
times.
For vmstat, the first line (summary) is returned, but then nothing:
n0b:jim> date ; vmstat 1 10
Tue May 17 13:09:14 UTC 2022
 procs   memory      page                    disks   faults          cpu
 r b     avm    fre  flt  re  pi  po  fr  sr l0 n0    in   sy  cs us sy id
 1 0  304608  88784   23   0   0   0   0   0  0  0  8882   44  14  0  1 99
^C
n0b:jim> date
Tue May 17 13:09:45 UTC 2022
That stall is inconsistent though, as the results today are nominal:
n0b:jim> date
Fri May 20 12:58:26 UTC 2022
n0b:jim> vmstat 1 3
 procs   memory      page                    disks   faults          cpu
 r b     avm    fre  flt  re  pi  po  fr  sr l0 n0    in   sy  cs us sy id
 1 0  310320  82568   22   0   0   0   0   0  0  0  8870   43  13  0  1 99
 0 0  310320  82568    0   0   0   0   0   0  0  0  8826   32  11  0  1 99
 0 0  310320  82568    0   0   0   0   0   0  0  0  8902   30  10  0  1 99
n0b:jim> date
Fri May 20 12:58:36 UTC 2022
n0b:jim>
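To narrow down when a stalled vmstat stops producing lines, one option is to stamp each output line with the wall-clock time as it arrives. The following is only a sketch; the helper name stamp_lines is made up for illustration. If vmstat buffers its output when writing to a pipe, the stamps show when lines reach the reader rather than when they were generated.

```shell
#!/bin/sh
# Hypothetical helper (not part of NetBSD): prefix every line read from
# stdin with the current wall-clock time, so gaps between samples show up.
stamp_lines() {
    while IFS= read -r line; do
        printf '%s %s\n' "$(date +%H:%M:%S)" "$line"
    done
}

# Assumed usage:
#   vmstat 1 10 | stamp_lines
```

A gap of much more than one second between consecutive stamps would mark the point where the sampling stalled.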
The top command starts up, displays some data, but then does not refresh.
The data are incomplete (values are all 0):
load averages: 0.01, 0.02, 0.00; up 11+21:37:48 13:06:53
46 processes: 44 sleeping, 2 on CPU
CPU0 states: 0.0% user, 0.0% nice, 0.0% system, 0.0% interrupt, 0.0% idle
CPU1 states: 0.0% user, 0.0% nice, 0.0% system, 0.0% interrupt, 0.0% idle
CPU2 states: 0.0% user, 0.0% nice, 0.0% system, 0.0% interrupt, 0.0% idle
CPU3 states: 0.0% user, 0.0% nice, 0.0% system, 0.0% interrupt, 0.0% idle
Memory: 298M Act, 104K Inact, 12M Wired, 15M Exec, 259M File, 86M Free
Swap:
Like vmstat, top worked later, except that one core (CPU2) still shows all zeroes.
load averages: 0.01, 0.02, 0.00; up 14+21:32:34 13:01:39
50 processes: 48 sleeping, 2 on CPU
CPU0 states: 0.0% user, 0.0% nice, 0.0% system, 1.6% interrupt, 98.4% idle
CPU1 states: 0.0% user, 0.0% nice, 0.0% system, 0.0% interrupt, 100% idle
CPU2 states: 0.0% user, 0.0% nice, 0.0% system, 0.0% interrupt, 0.0% idle
CPU3 states: 0.0% user, 0.0% nice, 0.0% system, 0.0% interrupt, 100% idle
Memory: 303M Act, 96K Inact, 12M Wired, 15M Exec, 262M File, 80M Free
Swap:
However, cron commands have not run since.
My next steps will be:
1. Reboot, taking note of initial state
2. Try adding a swap device (have seen some odd Pi behavior with 0 swap)
3. Decrease the cron job frequency
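
To record when cron stops firing after the reboot, a heartbeat file may help. This is a minimal sketch and not something already running on the Zero 2W: the path /var/tmp/cron.heartbeat, the 120 second threshold, and the function names are all assumptions. The idea is that a once-a-minute cron entry writes the epoch time to a file, and a check run by hand (or from a loop outside cron, since cron is the thing that hangs) reports how old that stamp is.

```shell
#!/bin/sh
# Assumed crontab entry that writes the heartbeat. The % must be escaped,
# because cron turns an unescaped % in a command into a newline:
#   * * * * * date +\%s > /var/tmp/cron.heartbeat

# Print the age in seconds of the epoch stamp stored in file $1.
# $2 optionally overrides "now" (epoch seconds); it defaults to date +%s.
heartbeat_age() {
    stamp=$(cat "$1" 2>/dev/null) || return 1
    # Reject an empty or non-numeric stamp instead of doing bad arithmetic.
    case $stamp in
        ''|*[!0-9]*) return 1 ;;
    esac
    now=${2:-$(date +%s)}
    echo $((now - stamp))
}

# Succeed (exit status 0) when the heartbeat in $1 is older than $2 seconds,
# or when it is missing or unreadable. $3 optionally overrides "now".
heartbeat_stale() {
    age=$(heartbeat_age "$1" "$3") || return 0
    [ "$age" -gt "$2" ]
}

# Assumed usage, run from a login shell rather than from cron:
#   heartbeat_stale /var/tmp/cron.heartbeat 120 && echo "cron looks stuck"
```

The last stamp left in the file then gives the approximate time of the final successful cron run, which can be compared against the top and vmstat observations.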
Jim