Subject: Re: data corruption (Re: Binding more than one IP to a NIC)
To: Izumi Tsutsui <tsutsui@ceres.dti.ne.jp>
From: Markus W Kilbinger <kilbi@rad.rwth-aachen.de>
List: port-cobalt
Date: 11/09/2004 00:57:15
>>>>> "Izumi" == Izumi Tsutsui <tsutsui@ceres.dti.ne.jp> writes:

First: Thanks for all infos!

    Izumi> Well, AFAIK the problem on cobalt PCI implementation
    Izumi> affects memory mapped PCI devices (like siop), so I'm not
    Izumi> sure if it fixes your "data corruption" problem. (patch
    Izumi> attached, including MI changes filed in kern/27423)

No, the data corruption problem remains, but my qube2 became much more
stable with your pci related patches! Beside the data corruption
problems now I 'only' see panics under heavy load of the following
kind:

  trap: TLB miss (load or instr. fetch) in kernel mode
  status=0x2403, cause=0x8808, epc=0x80229214, vaddr=0xc874e000
  pid=196 cmd=ttcp usp=0x7fffda10 ksp=0xc8749b08
  Stopped in pid 196.1 (ttcp) at  netbsd:r5k_pdcache_wb_range_32+0x58: cache   0
  x19,0x1a0(a0)
  db> t
  r5k_pdcache_wb_range_32+58 (c874de60,c874e3e0,5ea,5ea) ra 8022e468 sz 0
  8022e3d8+90 (c874de60,c874e3e0,5ea,5ea) ra 0 sz 0
  User-level: pid 196.1
  db>

No silent lockups as before (til now)...

    Izumi> On the other hand, there is a patch for MIPS r5k cache:
    Izumi> http://mail-index.netbsd.org/port-sgimips/2003/01/06/0001.html

Seems to be already almost in -current...

And http://mail-index.netbsd.org/port-mips/2003/04/27/0000.html does
also not prevent these panics and te data corruption.

    Izumi> Anyway, I also have a NetBSD/arc machine (R4400) running
    Izumi> 2.0G as file server and it has no data corruption for 2
    Izumi> months, so I don't think there is MIPS _common_ VM issue,
    Izumi> but I also see some problem reports on r5k O2. (I wonder if
    Izumi> current mips pmap handles 2-way set associative
    Izumi>  cache on r5k CPUs)

So, if I've read the lists correctly there _is_ some r5k related
cache/pmap problem left which causes similar problems on sgimips...

Is anybody working on that?

Thanks again,

Markus.