Subject: couldn't ping cpus
To: None <port-sparc@netbsd.org>
From: Manuel Bouyer <bouyer@antioche.eu.org>
List: port-sparc
Date: 01/19/2003 16:41:51
Hi,
I've a SMP problem on my dual-CPU ss20, running a kernel as of today:
bootpath: /iommu@f,e0000000/sbus@f,e0001000/espdma@f,400000/esp@f,800000/sd@3,0
mainbus0 (root): SUNW,SPARCstation-20
cpu0 at mainbus0: mid 8: TMS390Z50 v0 or TMS390Z55 @ 50 MHz, on-chip FPU
cpu0: physical 20K instruction (64 b/l), 16K data (32 b/l): cache enabled
cpu1 at mainbus0: mid 10: TMS390Z50 v0 or TMS390Z55 @ 50 MHz, on-chip FPU
cpu1: physical 20K instruction (64 b/l), 16K data (32 b/l): cache enabled

when running a build.sh -j4, it hang with
xcall(cpu1,0xfoo): couldn't ping cpus: cpu0
After that the box is dead, doesn't even react to stop-A.
Needs a power cycle.
I don't have LOCKDEBUG compiled in, only DIAGNOSTIC.
Do other peoples still see this problem on MP supersparc ?
>From messages on this list I though it was solved ...

If this is relevant, at the end of boot I see:
root on sd0a dumps on sd0b
root file system type: ffs
cpu0: booting secondary processors: cpu1
bogus interrupt ipl 0x6 pc=0xf00d942c npc=0xf00d9418 psr=404001c2<S,PS>
bogus interrupt ipl 0x6 pc=0xf00d941c npc=0xf00d9420 psr=404001c2<S,PS>
bogus interrupt ipl 0x6 pc=0xf00d9420 npc=0xf00d9424 psr=400001c2<S,PS>

>From experiment with systat, ipl 6 seems to be related to network.

-- 
Manuel Bouyer <bouyer@antioche.eu.org>
     NetBSD: 23 ans d'experience feront toujours la difference
--