NetBSD-Bugs archive


Re: kern/52700: Testbed VMs hang on wd1 access since recent ATA changes



The following reply was made to PR kern/52700; it has been noted by GNATS.

From: Jaromír Doleček <jaromir.dolecek%gmail.com@localhost>
To: gnats-bugs%netbsd.org@localhost
Cc: mlelstv%netbsd.org@localhost, gnats-admin%netbsd.org@localhost, netbsd-bugs%netbsd.org@localhost, 
	Andreas Gustafsson <gson%gson.org@localhost>
Subject: Re: kern/52700: Testbed VMs hang on wd1 access since recent ATA changes
Date: Sun, 5 Nov 2017 14:57:10 +0100

 --94eb2c05f42cdb57c8055d3cb67b
 Content-Type: text/plain; charset="UTF-8"
 
 There was a fix for wdbiorestart(), but that should not affect normal
 operation.
 
 I had a brief look at wd.c. It seems that after the dksubrification,
 wddone() no longer calls ata_channel_start(), which will likely cause I/O
 to stall on machines with a port multiplier. This should of course be
 fixed, but I don't think it causes this issue.
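
 To make the suspected stall concrete, here is a toy model, not the actual
 wd.c or ATA code. The Channel class, its methods, and run() are all
 hypothetical names; start() only stands in for the role that
 ata_channel_start() plays here, and done() for a completion handler such as
 wddone(). It assumes one channel shared by several drives, with a single
 transfer active at a time.

```python
from collections import deque


class Channel:
    """Toy model of one ATA channel shared by several drives (hypothetical)."""

    def __init__(self, restart_on_done):
        self.queue = deque()   # transfers waiting for the channel
        self.active = None     # transfer currently in progress, if any
        self.restart_on_done = restart_on_done
        self.completed = []

    def submit(self, drive, block):
        """Queue a transfer and try to start it."""
        self.queue.append((drive, block))
        self.start()

    def start(self):
        """Begin the next queued transfer if the channel is idle."""
        if self.active is None and self.queue:
            self.active = self.queue.popleft()

    def done(self):
        """Completion handler; returns False when nothing was active."""
        if self.active is None:
            return False
        self.completed.append(self.active)
        self.active = None
        if self.restart_on_done:
            # Without this call, queued transfers wait until something
            # else happens to kick the channel.
            self.start()
        return True


def run(restart_on_done):
    """Submit three transfers, then deliver completions until idle."""
    ch = Channel(restart_on_done)
    for drive, block in [("wd0", 1), ("wd1", 2), ("wd1", 3)]:
        ch.submit(drive, block)
    while ch.done():
        pass
    return ch.completed, list(ch.queue)


if __name__ == "__main__":
    print("restart in done():", run(True))
    print("no restart in done():", run(False))
```

 With the restart in place all three transfers complete. Without it, only
 the first one does, and the two wd1 transfers stay queued until a later
 submit happens to restart the channel.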
 
 Jaromir
 
 2017-11-05 14:40 GMT+01:00 Martin Husemann <martin%duskware.de@localhost>:
 
 > The following reply was made to PR kern/52700; it has been noted by GNATS.
 >
 > From: Martin Husemann <martin%duskware.de@localhost>
 > To: gnats-bugs%NetBSD.org@localhost
 > Cc:
 > Subject: Re: kern/52700: Testbed VMs hang on wd1 access since recent ATA
 >  changes
 > Date: Sun, 5 Nov 2017 14:36:10 +0100
 >
 >  On Sun, Nov 05, 2017 at 01:20:01PM +0000, Andreas Gustafsson wrote:
 >  > >Number:         52700
 >  > >Category:       kern
 >  > >Synopsis:       Testbed VMs hang on wd1 access since recent ATA changes
 >  > >Confidential:   no
 >  > >Severity:       critical
 >  > >Priority:       high
 >  > >Responsible:    kern-bug-people
 >  > >State:          open
 >  > >Class:          sw-bug
 >  > >Submitter-Id:   net
 >  > >Arrival-Date:   Sun Nov 05 13:20:00 +0000 2017
 >  > >Originator:     Andreas Gustafsson
 >  > >Release:        NetBSD-current, source date >= 2017.11.01.19.34.46
 >  > >Organization:
 >  >
 >  > >Environment:
 >  > System: NetBSD
 >  > Architecture: i386
 >  > Machine: i386
 >  > >Description:
 >  >
 >  > The test results at
 >  >
 >  >   http://releng.netbsd.org/b5reports/i386/commits-2017.11.html#2017.11.01.19.15.31
 >  >
 >  > include "ATF output: raw, xml, html" links as expected up to
 >  > and including source date 2017.11.01.19.15.31:
 >  >
 >  >   build: OK with 483383 lines of log, install: OK, tests: 6004 passed, 173 skipped, 123 expected_failure, 8 failed, ATF output: raw, xml, html
 >  >
 >  > but they are missing from the next run, of source date
 >  > 2017.11.01.21.13.26, and subsequent ones:
 >  >
 >  >   build: OK with 484342 lines of log, install: OK, tests: 6004 passed, 173 skipped, 123 expected_failure, 8 failed
 >  >
 >  > Since the ATF output files are exported from the test VM using a
 >  > secondary virtual hard disk (wd1), it seems likely that this is
 >  > related to the ATA changes committed between these two dates:
 >  >
 >  >   commit 2017.11.01.19.34.45 mlelstv src/sys/dev/files.dev 1.4
 >
 >  There have been follow-up fixes on 2017.11.03; didn't those make this
 >  work again?
 >
 >  Martin
 >
 >
 
 --94eb2c05f42cdb57c8055d3cb67b--
 


