netbsd-bugs: Re: kern/36716: cd(4) problem with transfers exceeding 65535 bytes

Subject: Re: kern/36716: cd(4) problem with transfers exceeding 65535 bytes
To: gnats-bugs@NetBSD.org, Izumi Tsutsui <tsutsui@ceres.dti.ne.jp>
From: Stephen M. Rumble <stephen.rumble@utoronto.ca>
List: netbsd-bugs
Date: 08/01/2007 12:29:59
Quoting Izumi Tsutsui <tsutsui@ceres.dti.ne.jp>:

> The following reply was made to PR kern/36716; it has been noted by GNATS.
>
> From: Izumi Tsutsui <tsutsui@ceres.dti.ne.jp>
> To: gnats-bugs@NetBSD.org
> Cc: kern-bug-people@NetBSD.org, gnats-admin@NetBSD.org,
> 	netbsd-bugs@NetBSD.org, tsutsui@ceres.dti.ne.jp
> Subject: Re: kern/36716: cd(4) problem with transfers exceeding 65535 bytes
> Date: Wed, 1 Aug 2007 21:26:54 +0900
>
>  rumble@ephemeral.org wrote:
>
>  > >How-To-Repeat:
>  > It should be easily repeatable with an atapi cdrom when used
>  > with a disklabel where d_secsize != 2048.
>
>  I saw the similar problem on sgimips with CD-ROM which had sgivolhdr:
>  http://mail-index.netbsd.org/port-sgimips/2005/09/12/0000.html
>  http://mail-index.netbsd.org/port-sgimips/2005/09/14/0000.html
>
>  It seems the proper workaround to fix readdisklabel(9) to
>  always return d_secsize = 2048 for CDs, but I'm not sure.

If I understand things correctly, the problem is that non-2048b  
sectors weren't working properly and your solution was to use 2048b  
sectors and update the disklabel offsets/lengths in memory, yes? If  
that's the case, I think it's only hiding the problem. We need  
d_secsize == 512 for EFS images, and it should 'just work'.

If you're able to repeat the tests, I'd be interested to know if my  
aforementioned cd(4) bounce patch resolves the issue for you.

The problem as I see it is that cd(4) will make requests > MAXPHYS  
when bounce buffers are used and we need to round up one more sector  
to accommodate unaligned accesses. Is sending such a request to scsipi  
an invalid thing to do? If it's not permitted, it should be asserted  
somewhere, and I think my patch is the proper solution.

If, however, it is entirely valid, we have problems elsewhere. So long  
as in cd.c:796 nblks is <= 32 (corresponding to a maximum transfer of  
MAXPHYS), the underlying subsystems seem happy. However, if b_bcount  
is > (2^16), we can't just cap nblks, rather we appear to need to  
split up requests. I'm assuming nblks is passed to the device, and  
b_bcount is used to initiate DMA transfers until everything is read.  
Is it illegal to send a device a read command that cannot be handled  
in one DMA?

What's really confusing to me is that, in the atapi case, if we leave  
everything else alone (letting nblks be 33, which is 66k) and instead  
limit the DMA transfer size in atapi_wdc.c to 0xfffd, everything  
works. However, if we permit transfers of 0xfffe or 0xffff, it breaks.  
If we cap nblks at 32, then 0xffff-sized transfers are fine in that  
interrupts don't get lost, but we don't end up getting all of the  
requested data from the drive.

It seems that restricting the DMA size in atapi_wdc.c just happens to  
work by chance and wouldn't fix the real problem (which appears to  
exist for SCSI as well).

So, in short, questions I'd love to have answered:
   1) Is issuing requests > MAXPHYS to scsipi inherently bad or unsupported?
   2) To satisfy my curiosity, can anybody tell me why limiting the ATAPI DMA
      transfer size to 0xfffd or lower avoids the problem, while limiting to
      0xfffe or 0xffff does not?

Steve