Subject: *ISA* Ethernet driver -> CPU Death
To: None <port-alpha@NetBSD.ORG>
From: Trevor Mendez <tmendez@BBN.COM>
List: port-alpha
Date: 01/25/1996 11:36:17
I've been working on an ISA ethernet driver.  The DE205 is detected and
attached properly.  When I tried to ifconfig, etc., the system crashed pretty
hard and would not reboot any of my NetBSD kernels -- not even the "safe" ones.
It kept halting unexpectedly at many different stages of the boot.  After
several power-cycles and reboot attempts I decided to swap back in my OSF/1
disk...  The result seems to have more debugging info.  Any ideas?  Thanks.
/tdm


----- Forwarded message # 1:

          25 Jan 96 11:27 EST
Date:     Thu, 25 Jan 96 11:19:42 EST
From:     Trevor Mendez <tmendez@BBN.COM>
To:       alpha-osf-managers@ornl.gov
cc:       tmendez@BBN.COM
Subject:  Boot Failure with AXPpci 33

Folks,
     I've got an AXPpci 33 system on which I run 2 operating systems by
swapping the SCSI disks.  Seems like NetBSD hacking may have hosed my system so
that I can no longer get OSF/1 to boot (see below).  Any ideas?  Thanks in advance.
/tdm

Trevor Mendez
BBN STD
Cambridge, MA
(617)873-3271
tmendez@bbn.com

System Info:	AXPpci 33 233 MHz.
		64 MB RAM, 1 MB cache
		Floppy, CD-ROM, SCSI hard disk
		tga video, PCI Ethernet, ISA Ethernet

----- Begin Console Output
starting console on CPU 0
initialized idle PCB
initializing semaphores
initializing heap
initial heap 1c0c0
memory low limit = 100000
heap = 1c0c0, 17fc0
initializing driver structures
initializing idle process PID
XDELTA not enabled.
initializing file system
initializing 8259s
initializing timer data structures
lowering IPL
CPU 0 speed is 4.29 ns (233MHz)

Powerup process has started
CPU ID = 0
Initialize configuration locks etc.
Configure the memory
Initialising ISA/PCI interrupts.
Configure the PCI Bus
Start tt class, port (graphics, keyboard, then serial) drivers
entering idle loop
initializing keyboard

** mouse error **
change stdin/out/err channels from nl to tt
Memory size = 64Mbytes
testing memory from 800000 to 3e16000 ...
* Memory error at address:2029418  Wrote:aaaaaaaaaaaaaaaa Read:ff2fffff
* Memory error at address:2129418  Wrote:aaaaaaaaaaaaaaaa Read:ffefffffff2fffff
* Memory error at address:2229418  Wrote:aaaaaaaaaaaaaaaa Read:ff2fffff
* Memory error at address:2329418  Wrote:aaaaaaaaaaaaaaaa Read:ff2fffff
* Memory error at address:2429418  Wrote:aaaaaaaaaaaaaaaa Read:ff2fffff
* Memory error at address:2529418  Wrote:aaaaaaaaaaaaaaaa Read:ffefffffff2fffff
* Memory error at address:2629418  Wrote:aaaaaaaaaaaaaaaa Read:ff2fffff
* Memory error at address:2729418  Wrote:aaaaaaaaaaaaaaaa Read:ff2fffff
* Memory error at address:2829418  Wrote:aaaaaaaaaaaaaaaa Read:ff8fffffdf0fffff
* Memory error at address:2929418  Wrote:aaaaaaaaaaaaaaaa Read:ff7fffffff0fffff
* Memory error at address:2a29418  Wrote:aaaaaaaaaaaaaaaa Read:ff2fffffff0fffff
Start driver phase 4
access NVRAM
Build this processor's slot in the hwrpb
Backup Cache size = 1Mb
Init SCSI class driver
Start driver phase 5
AXPpci33 Common Console X4.3-3114, built on Sep 13 1995 at 04:15:18

CPU 0 booting

warning -- main memory zone is not free
>>>boot
(boot dka0.0.0.6.0 -flags a)
block 0 of dka0.0.0.6.0 is a valid boot block
reading 16 blocks from dka0.0.0.6.0
bootstrap code read in
base = 10e000, image_start = 0, image_bytes = 2000
initializing HWRPB at 2000
initializing page table at 100000
initializing machine state
setting affinity to the primary CPU
jumping to bootstrap code

OSF boot - Mon Jul 24 21:56:39 EDT 1995

Loading vmunix ...
Current PAL Revision <0x1000000010530>
Switching to OSF PALcode Succeeded
New PAL Revision <0x1000000020123>
Loading into KSEG Address Space

Sizes:
text = 4986240
data = 1281824
bss  = 2580864
Starting at 0xfffffc00002392a0

[ ... ]

fd0 at fdi0 unit 0
le0 at isa0
le0: DEC LeMAC Ethernet Interface, hardware address: 08-00-2B-93-B4-74 
tga0 at pci0 slot 8
PCXAL keyboard, language English (American)

tga0: depth 8, map size 4MB, 1280x1024
tu0: DECchip 21041-AA: Revision: 1.1
tu0 at pci0 slot 12
tu0: DEC TULIP Ethernet Interface, hardware address: 00-00-F8-00-27-3C
tu0: console mode: selecting AUI (10Base5) port: no carrier
lvm0: configured.
lvm1: configured.
lvm2: configured.
Checking local filesystems
/sbin/ufs_fsck -p
Alpha AXPpci33 correctable error type 0x630.
  retry = 1
  mchk_code     = 0x86   frame_rev      = 0x1
              esr = 0x6ff4daf900020013
              ear = 0x0000000083c29419
          dc_stat = 0x0000000000000003
            stat0 = 0x000000c2000000c2
            stat1 = 0x0000007100000071
           bc_tag = 0x0000000000000000
Alpha AXPpci33 correctable error type 0x630.
  retry = 1
  mchk_code     = 0x86   frame_rev      = 0x1
              esr = 0x6ff4daf900020013
              ear = 0x0000000083c29419
          dc_stat = 0x0000000000000003
            stat0 = 0x000000c2000000c2
            stat1 = 0x0000007100000071
           bc_tag = 0x0000000000000000
/dev/rrz0a: INCORRECT BLOCK COUNT I=13377 (2 should be 0) (CORRECTED)
/dev/rrz0a: UNREF FILE I=13377  OWNER=root MODE=100644
/dev/rrz0a: SIZE=0 MTIME=Jan 25 11:03 1996  (CLEARED)
/dev/rrz0a: 751 files, 30163 used, 33172 free (108 frags, 4133 blocks, 0.2% fragmentation)
/dev/rrz0f: 15800 files, 190918 used, 19249 free (1713 frags, 2192 blocks, 0.8% fragmentation)
/dev/rrz0d: LINK COUNT FILE I=25691  OWNER=root MODE=100644
/dev/rrz0d: SIZE=1092 MTIME=Jan 25 11:03 1996  COUNT 2 SHOULD BE 1 (ADJUSTED)
/dev/rrz0d: FREE BLK COUNT(S) WRONG IN SUPERBLK (SALVAGED)
/dev/rrz0d: BLK(S) MISSING IN BIT MAPS (SALVAGED)
/dev/rrz0d: SUMMARY INFORMATION BAD (SALVAGED)
/dev/rrz0d: 3838 files, 91252 used, 13866 free (386 frags, 1685 blocks, 0.4% fragmentation)
/dev/rrz0e: 2 files, 8505 used, 54830 free (14 frags, 6852 blocks, 0.0% fragmentation)
Mounting / (root)
Mounting local filesystems
/dev/rz0a on / type ufs (rw)
/proc on /proc type procfs (rw)
/dev/rz0f on /bsdusr type ufs (rw)
/dev/rz0d on /usr type ufs (rw)
/dev/rz0e on /bsdroot type ufs (rw)
Jan 25 11:08:21 update: started
The system is coming up. Please wait...
Checking for crash dumps
Found an incomplete partial dump.
System went down at Thu Jan 25 11:03:36 1996
Dump will be omitted, noAlpha AXPpci33 machine check type 0x670.
Machine check abort
  retry = 0
  mchk_code     = 0x88   frame_rev      = 0x1
      paltemp[ 1] = 0x001a82f800000004
      paltemp[ 2] = 0x0000000000000001
      paltemp[ 3] = 0x0000000000000000
      paltemp[ 4] = 0x0000000000000040
      paltemp[ 5] = 0x0000000000000000
      paltemp[ 6] = 0x0000000000004200
      paltemp[ 7] = 0x0000000000000400
      paltemp[ 8] = 0x0000000000000000
      paltemp[ 9] = 0xfffffc0000442190
      paltemp[10] = 0x0000000000000000
      paltemp[11] = 0xfffffc0000442530
      paltemp[12] = 0xfffffc0000442560
      paltemp[13] = 0xfffffc00004425c0
      paltemp[14] = 0xfffffc0000442330
      paltemp[15] = 0xfffffc0000442040
      paltemp[16] = 0x0000000000000000
      paltemp[17] = 0x000000011fffeec0
      paltemp[18] = 0xffffffff83adb230
      paltemp[19] = 0xfffffc0000827310
      paltemp[20] = 0x0000000000000000
      paltemp[21] = 0x0032323a3a3e3e3e
      paltemp[22] = 0x0000000011223344
      paltemp[23] = 0x0000000000000000
      paltemp[24] = 0x0000000000010000
      paltemp[25] = 0x0000000000000000
      paltemp[26] = 0x0000000000000000
      paltemp[27] = 0x0000000000210000
      paltemp[28] = 0xfffffffc00000000
      paltemp[29] = 0x0000000000000001
      paltemp[30] = 0x0000000001569a58
      paltemp[31] = 0xfffffc000044e318
         exc_addr = 0xfffffc000044e318
          exc_sum = 0x0000000000000000
         exc_mask = 0x0000000000000000
            iccsr = 0x00000003505f0000
         pal_base = 0x0000000000014000
             hier = 0x0000000000001c70
             hirr = 0x0000000000000042
           mm_csr = 0x0000000000001fcb
          dc_stat = 0x0000000000000003
          dc_addr = 0x00000007ffffffff
         abox_ctl = 0x0000000000000428
          lca_esr = 0x6ff4daf900020015
          lca_ear = 0x0000000083d29419
          lca_car = 0x000000006020e591
        lca_stat0 = 0x000000c3000000c3
        lca_stat1 = 0x0000002000000020
               va = 0x0000000000000000
           bc_tag = 0x0000000000000000
 SIO       status = 0x0000000000000200
 SIO          rev = 0x0000000000000003
 SIO        ubcsa = 0x00000000000000ff
 SIO        ubcsb = 0x00000000000000ff
 SIO         pirq = 0x000000000b090a80
 SIO    irq_mask0 = 0x0000000000000081
 SIO    irq_mask1 = 0x00000000000000e1
 SIO    irq_edge0 = 0x0000000000000000
 SIO    irq_edge1 = 0x000000000000000e
 SIO     irq_irr0 = 0x0000000000000000
 SIO     irq_irr1 = 0x0000000000000000
 SIO     irq_isr0 = 0x0000000000000000
 SIO     irq_isr1 = 0x0000000000000000
 SIO   nmi_status = 0x0000000000000030
 SIO  dma_status0 = 0x0000000000000000
 SIO  dma_status1 = 0x0000000000000000
panic (cpu 0): Machine check - Hardware error
syncing disks... t enough 6 space on device
The saving of the cra
DUMP: 131072 blocks available for dumping.
DUMP: 19973 required for a partial dump.
DUMP: 0x800001 is the primary swap with 131071, start our last 19972
    : of dump at 111099, going to end (real end is one more, for header)
DUMP.prom: dev SCSI 0 6 0 0 0 0 0, block 131072
DUMP: Header to 0x800001 at 131071 (0x1ffff)
DUMP.prom: dev SCSI 0 6 0 0 0 0 0, block 131072
DUMP: Dump to 0x800001: .........: End 0x800001
DUMP.prom: dev SCSI 0 6 0 0 0 0 0, block 131072
DUMP: Header to 0x800001 at 131071 (0x1ffff)
succeeded
ebp10_isa_handler_dis(key=6)

halted CPU 0

halt code = 5
HALT instruction executed
PC = fffffc0000442b50

----- End of forwarded messages
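
One way to look for a pattern in the "Memory error" lines of the console
output is to parse them and compare the failing addresses.  The following is
a minimal sketch in Python 3, not part of the original message; the names
LOG, PATTERN, parse_errors and summarize are made up for illustration, and
the log lines are copied verbatim from the console output above.

```python
import re

# "Memory error" lines copied verbatim from the console output.
LOG = """\
* Memory error at address:2029418  Wrote:aaaaaaaaaaaaaaaa Read:ff2fffff
* Memory error at address:2129418  Wrote:aaaaaaaaaaaaaaaa Read:ffefffffff2fffff
* Memory error at address:2229418  Wrote:aaaaaaaaaaaaaaaa Read:ff2fffff
* Memory error at address:2329418  Wrote:aaaaaaaaaaaaaaaa Read:ff2fffff
* Memory error at address:2429418  Wrote:aaaaaaaaaaaaaaaa Read:ff2fffff
* Memory error at address:2529418  Wrote:aaaaaaaaaaaaaaaa Read:ffefffffff2fffff
* Memory error at address:2629418  Wrote:aaaaaaaaaaaaaaaa Read:ff2fffff
* Memory error at address:2729418  Wrote:aaaaaaaaaaaaaaaa Read:ff2fffff
* Memory error at address:2829418  Wrote:aaaaaaaaaaaaaaaa Read:ff8fffffdf0fffff
* Memory error at address:2929418  Wrote:aaaaaaaaaaaaaaaa Read:ff7fffffff0fffff
* Memory error at address:2a29418  Wrote:aaaaaaaaaaaaaaaa Read:ff2fffffff0fffff
"""

# Hypothetical pattern matching the line layout seen in this particular log.
PATTERN = re.compile(
    r"Memory error at address:([0-9a-f]+)\s+Wrote:([0-9a-f]+)\s+Read:([0-9a-f]+)"
)


def parse_errors(text):
    """Return a list of (address, wrote, read) tuples as integers."""
    return [tuple(int(g, 16) for g in m.groups()) for m in PATTERN.finditer(text)]


def summarize(errors):
    """Return the set of address strides and the set of low 20 address bits."""
    addrs = [addr for addr, _, _ in errors]
    strides = {b - a for a, b in zip(addrs, addrs[1:])}
    low_bits = {addr & 0xFFFFF for addr in addrs}
    return strides, low_bits


if __name__ == "__main__":
    strides, low_bits = summarize(parse_errors(LOG))
    print("strides :", sorted(hex(s) for s in strides))
    print("low bits:", sorted(hex(b) for b in low_bits))
```

On the lines above, every failing address is exactly 0x100000 (1 MB) after
the previous one and all share the low 20 bits 0x29418.  The 1 MB spacing
happens to equal the "Backup Cache size = 1Mb" the console reports, but the
log alone does not settle whether the cache, the memory, or something else
is at fault.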