Subject: multiprocessor AS2100 problems
To: None <tech-smp@netbsd.org>
From: Murray Armfield <murray.armfield@arconsulting.com.au>
List: tech-smp
Date: 06/07/2002 11:36:04
Hi folks,
=09I have a four CPU AS2100 4/275 system. I have got current as of yester=
day=20
running on it, using a MP kernel. It only works with one CPU enabled. If =
I=20
enable any more (ie 2 CPU's or 4 CPU's) I consistantly get panics just af=
ter=20
probing root file system type. I have it currently set up to only go sing=
le=20
user.

=09I would be quite happy to test anything anyone might like on this mach=
ine.

This is dmesg... (+panic + backtrace) starting in SRM.

P00>>>set boot_osflags s
P00>>>set cpu_enabled 3
P00>>>inint
inint: No such command
P00>>>init


VMS PALcode V5.56-7, OSF PALcode V1.45-12

starting console on CPU 0
probing hose 0, PCI
probing PCI-to-EISA bridge, bus 1
bus 0, slot 0 -- ewa -- DECchip 21040-AA
bus 0, slot 1 -- pka -- NCR 53C810
bus 1, slot 2 -- vga -- ISA VGA
bus 0, slot 7 -- pkb -- NCR 53C810
bus 0, slot 8 -- fwa -- DEC PCI FDDI
starting console on CPU 1
os_type: UNIX - console CIPCA driver not started
Memory Testing and Configuration Status
Module   Size    Base Addr   Intlv Mode  Intlv Unit  Status
------   -----   ---------   ----------  ----------  ------
  2      128MB   00000000      2-Way         0       Passed
  3      128MB   00000000      2-Way         1       Passed
Total Bad Pages 0
Testing the System
Testing the Disks (read only)
Testing the Network
AlphaServer 2100 Console V5.3-2, built on Oct 16 1998 at 11:32:56
P00>>>boot
(boot dka0.0.0.1.0 -flags s)
block 0 of dka0.0.0.1.0 is a valid boot block
reading 15 blocks from dka0.0.0.1.0
bootstrap code read in
base =3D 200000, image_start =3D 0, image_bytes =3D 1e00
initializing HWRPB at 2000
initializing page table at fff0000
initializing machine state
setting affinity to the primary CPU
jumping to bootstrap code

NetBSD/alpha 1.5ZC FFS Primary Bootstrap
Jumping to entry point...

NetBSD/alpha 1.5ZC Secondary Bootstrap, Revision 1.13
(mthomas@matt01.allegronetworks.com, Mon May 13 19:58:04 UTC 2002)

VMS PAL rev: 0x4000700010538
OSF PAL rev: 0x4000c0002012d
Switch to OSF PAL code succeeded.

Boot flags: s
3400664+324504 [197760+110336]=3D0x3d8eb0

Entering netbsd at 0xfffffc0000301410...
[ using 309056 bytes of netbsd ELF symbol table ]
Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002
    The NetBSD Foundation, Inc.  All rights reserved.
Copyright (c) 1982, 1986, 1989, 1991, 1993
    The Regents of the University of California.  All rights reserved.

NetBSD 1.6A (MERCURY) #2: Thu Jun  6 15:55:35 EST 2002
    admin@mercuryrising:/usr/src/sys/arch/alpha/compile/MERCURY
AlphaServer 2100 4/275, 274MHz, s/n ay55014441
8192 byte page size, 2 processors.
total memory =3D 256 MB
(2120 KB reserved for PROM, 253 MB used by NetBSD)
avail memory =3D 232 MB
using 1637 buffers containing 13096 KB of memory
mainbus0 (root)
cpu0 at mainbus0: ID 0 (primary), 21064A-2
cpu0: Architecture extensions: fffffc0000688770<PAT,MVI>
cpu1 at mainbus0: ID 1, 21064A-2
cpu1: Architecture extensions: fffffc0000008300<PAT,MVI>
ttwoga0 at mainbus0
ttwopci0 at ttwoga0 hose 0: T2 Gate Array rev. 2
sableio0 at ttwopci0 bus 0: Sable STDIO module
pckbc0 at sableio0 port 0x60
pckbd0 at pckbc0 (kbd slot)
pckbc0: kbd slot interrupting at T2 irq 6
wskbd0 at pckbd0 (mux ignored)
pms0 at pckbc0 (aux slot)
pckbc0: aux slot interrupting at T2 irq 3
wsmouse0 at pms0 (mux ignored)
fdc0 at sableio0 port 0x3f0
fdc0: interrupting at T2 irq 7
fd0 at fdc0 drive 0: 1.44MB, 80 cyl, 2 head, 18 sec
com0 at sableio0 port 0x3f8: ns16550a, working fifo
com0: console
com0: interrupting at T2 irq 15
com1 at sableio0 port 0x2f8: ns16550a, working fifo
com1: interrupting at T2 irq 8
lpt0 at sableio0 port 0x3bc
lpt0: interrupting at T2 irq 9
pci0 at ttwopci0 bus 0
pci0: i/o space, memory space enabled
tlp0 at pci0 dev 0 function 0: DECchip 21040 Ethernet, pass 2.3
tlp0: interrupting at T2 irq 2
tlp0: Ethernet address 08:00:2b:e7:35:ec
tlp0: 10baseT, 10baseT-FDX, 10base5, manual
siop0 at pci0 dev 1 function 0: Symbios Logic 53c810 (fast scsi)
siop0: interrupting at T2 irq 1
scsibus0 at siop0: 8 targets, 8 luns per target
pceb0 at pci0 dev 2 function 0: Intel 82375EB/SB PCI-EISA Bridge (PCEB) (=
rev.=20
0x03)
siop1 at pci0 dev 7 function 0: Symbios Logic 53c810 (fast scsi)
siop1: interrupting at T2 irq 4
scsibus1 at siop1: 8 targets, 8 luns per target
fpa0 at pci0 dev 8 function 0: DEC DEFPA PCI FDDI SAS Controller
fpa0: FDDI address 00:00:f8:4a:87:9b, FW=3D3.20, HW=3D0, SMT V7.2
fpa0: FDDI Port =3D S (PMD =3D Unshielded Twisted Pair)
fpa0: interrupting at T2 irq 5
eisa0 at pceb0
isa0 at pceb0
pcppi0 at isa0 port 0x61
midi0 at pcppi0: PC speaker
spkr0 at pcppi0
isabeep0 at pcppi0
mcclock0 at isa0 port 0x70-0x71: mc146818 or compatible
scsibus0: waiting 2 seconds for devices to settle...
sd0 at scsibus0 target 0 lun 0: <DEC, RZ1CF-CF (C) DEC, 1614> SCSI2 0/dir=
ect=20
fixed
sd0: 4091 MB, 3708 cyl, 20 head, 113 sec, 512 bytes/sect x 8380080 sector=
s
sd0: sync (100.0ns offset 8), 8-bit (10.000MB/s) transfers, tagged queuei=
ng
sd1 at scsibus0 target 1 lun 0: <DEC, RZ29B    (C) DEC, 0014> SCSI2 0/dir=
ect=20
fixed
sd1: 4091 MB, 3708 cyl, 20 head, 113 sec, 512 bytes/sect x 8380080 sector=
s
sd1: sync (100.0ns offset 8), 8-bit (10.000MB/s) transfers, tagged queuei=
ng
sd2 at scsibus0 target 2 lun 0: <DEC, RZ2CA-LA (C) DEC, N1H1> SCSI2 0/dir=
ect=20
fixed
sd2: 4091 MB, 3708 cyl, 20 head, 113 sec, 512 bytes/sect x 8380080 sector=
s
sd2: sync (100.0ns offset 8), 8-bit (10.000MB/s) transfers, tagged queuei=
ng
sd3 at scsibus0 target 3 lun 0: <DEC, RZ29B    (C) DEC, 0014> SCSI2 0/dir=
ect=20
fixed
sd3: 4091 MB, 3708 cyl, 20 head, 113 sec, 512 bytes/sect x 8380080 sector=
s
sd3: sync (100.0ns offset 8), 8-bit (10.000MB/s) transfers, tagged queuei=
ng
cd0 at scsibus0 target 6 lun 0: <DEC, RRD45   (C) DEC, 1645> SCSI2 5/cdro=
m=20
removable
cd0: async, 8-bit transfers
scsibus1: waiting 2 seconds for devices to settle...
sd4 at scsibus1 target 0 lun 0: <DEC, RZ29B    (C) DEC, 0016> SCSI2 0/dir=
ect=20
fixed
sd4: 4091 MB, 3708 cyl, 20 head, 113 sec, 512 bytes/sect x 8380080 sector=
s
sd4: sync (100.0ns offset 8), 8-bit (10.000MB/s) transfers, tagged queuei=
ng
sd5 at scsibus1 target 1 lun 0: <DEC, RZ29B    (C) DEC, 0014> SCSI2 0/dir=
ect=20
fixed
sd5: 4091 MB, 3708 cyl, 20 head, 113 sec, 512 bytes/sect x 8380080 sector=
s
sd5: sync (100.0ns offset 8), 8-bit (10.000MB/s) transfers, tagged queuei=
ng
sd6 at scsibus1 target 2 lun 0: <DEC, RZ29B    (C) DEC, 0014> SCSI2 0/dir=
ect=20
fixed
sd6: 4091 MB, 3708 cyl, 20 head, 113 sec, 512 bytes/sect x 8380080 sector=
s
sd6: sync (100.0ns offset 8), 8-bit (10.000MB/s) transfers, tagged queuei=
ng
IPsec: Initialized Security Association Processing.
root on sd0a dumps on sd0b
root file system type: ffs

unexpected machine check:

    mces    =3D 0x1
    vector  =3D 0x660
    param   =3D 0xfffffc0000006400
    pc      =3D 0xfffffc000030073c
    ra      =3D 0xfffffc000030072c
    code    =3D 0x10000008a
    curproc =3D 0x0

panic: machine check
Stopped at      cpu_Debugger+0x4:       ret     zero,(ra)
db{1}> trace
cpu_Debugger() at cpu_Debugger+0x4
panic() at panic+0x168
machine_check() at machine_check+0x25c
dec_2100_a500_machine_check() at dec_2100_a500_machine_check+0x40
interrupt() at interrupt+0x228
XentInt() at XentInt+0x1c
--- interrupt (from ipl 0) ---
idle() at idle+0x64
idle() at idle+0x54
--- root of call graph ---
db{1}>


Regards,
=09Murray Armfield