tech-kern archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]
Re: SunFire v100 / Acer M5229 IDE DMA error workaround
On Wed, Oct 29, 2008 at 12:22:13PM -0500, David Young wrote:
> On Wed, Oct 29, 2008 at 12:05:39PM -0400, Rafal Boni wrote:
> > Folks:
> > I've been taunted by the IDE interface on my SunFire V100 for a long
> > (LOOONG!) time with messages along the lines of:
> >
> > wdNN: DMA error writing fsbn xxxx of xxxx-yyy (wdN bn pppp; cn ccc tn
> > tt sn ss), retrying
> > wdN: soft error (corrected)
> >
[...]
> > ---8<------8<------8<------8<------8<------8<------8<------8<------8<---
> > Index: pci/pciide_common.c
> > ===================================================================
> > RCS file: /cvsroot/src/sys/dev/pci/pciide_common.c,v
> > retrieving revision 1.38
> > diff -u -p -r1.38 pciide_common.c
> > --- pci/pciide_common.c 18 Mar 2008 20:46:37 -0000 1.38
> > +++ pci/pciide_common.c 29 Oct 2008 15:30:21 -0000
> > @@ -737,7 +738,9 @@ pciide_dma_finish(v, channel, drive, for
> > ATADEBUG_PRINT(("pciide_dma_finish: status 0x%x\n", status),
> > DEBUG_XFERS);
> >
> > - if (force == WDC_DMAEND_END && (status & IDEDMA_CTL_INTR) == 0)
> > + /* XXXrkb: From FreeBSD; should probably add an evcnt here */
> > + if (force == WDC_DMAEND_END &&
> > + ((status & (IDEDMA_CTL_INTR | IDEDMA_CTL_ACT)) != IDEDMA_CTL_INTR))
> > return WDC_DMAST_NOIRQ;
>
> I have a hunch that this is not necessary. After you introduce the new
> bus_space_write_1() call, below, does the condition IDEDMA_CTL_INTR &&
> IDEDMA_CTL_ACT ever occur?
The above is in fact necessary; the bus_space_write_1() on the DMA control
register I added in pciide_dma_finish() may, however, not be. That's left
over from try N-1 after noticing that both FreeBSD and OpenBSD wrote to the
DMA status register there to clear it's sticky bits.
My testing with just the status-clear in pciide_dma_finish() didn't help
the issue I'm seeing, it merely changed the status values reported later,
but they still contained both IDEDMA_CTL_INTR & IDEDMA_CTL_ACT. In fact,
the reason I resorted to this is that the debug messages I added showed
that both IDEDMA_CTL_INTR & IDEDMA_CTL_ACT were set *everytime* the system
reported an IDE DMA error.
>
> > /* stop DMA channel */
> > @@ -752,6 +755,9 @@ pciide_dma_finish(v, channel, drive, for
> > BUS_DMASYNC_POSTREAD : BUS_DMASYNC_POSTWRITE);
> > bus_dmamap_unload(sc->sc_dmat, dma_maps->dmamap_xfer);
> >
> > + /* Clear status bits */
> > + bus_space_write_1(sc->sc_dma_iot, cp->dma_iohs[IDEDMA_CTL], 0, status);
> > +
>
> I may be missing something, but by my reading of a PCI IDE controller
> spec that I scrounged off the web, it is important to acknowledge the
> interrupt in this way. ISTM that the code should already acknowledge
> the interrupt by calling pciide_irqack(). Not so?
It probably is; as I said, I noticed that both Free- and OpenBSD do it
in both places, so I thought it was worth trying. I'll double-check to
make sure this change isn't necessary for things to work with the above
change.
> Note that this write may not be flushed to the device, and the
> interrupt deasserted, until a second call to pciide_dma_finish() calls
> bus_space_read_1(, cp->dma_iohs[IDEDMA_CTL], ). In other words, you
> may take two interrupts per DMA completed.
Hmm, this is a theory that's worth exploring more, however, I would expect
the bus_space calls to handle any necessary flushing.
Thanks,
--rafal
--
Time is an illusion; lunchtime, doubly so. |/\/\| Rafal Boni
-- Ford Prefect |\/\/|
rafal%pobox.com@localhost
Home |
Main Index |
Thread Index |
Old Index