Subject: Upgrade install from 1.2.1 to 1.3.1 fails with hung disk on AHA3940W?
To: None <port-i386@NetBSD.ORG>
From: Greg Earle <earle@isolar.Tujunga.CA.US>
List: port-i386
Date: 04/30/1998 01:31:20
Ouch.  I decided that my trusty no-name Pentium 133 tower server should be
my test mule for my 1.2.1 -> 1.3.1 upgrade (SPARCstation next), so I set
off to do it.

The Pentium has an Adaptec 3940W SCSI controller with two channels; there
is one internal 2.1 Gb disk on channel A, and an external HP ScanJet IIc
also on channel A.  In later tests I removed the scanner from the bus, so
it's not relevant here.

I booted the 1.3.1 install floppy and told "sysinst" I wanted to do an Upgrade.

It fsck'ed the disks OK, then as soon as it got to the point where it says

	Populating filesystems with bootstrapping binaries and config files...

the system locked up hard.  Disk drive (and front panel "Disk" LED) stuck on,
solid.  :-(

Upon reboot I found that "/etc" had been moved to "/etc.old" and a new "/etc"
had been created with 6 files in it (ifconfig, rc, etc.).  I was able to
fix this without much trouble.  A full reboot later on showed that my existing
"/bin/cat" had gotten blown away - it had been turned into a 0-length file.
I was able to recover that easily enough (but not before some weirdness in
the boot without having a "cat"; no hostname set, and "rwhod" complained:

	rwhod[112]: malformed host name from fdb19580

where "fdb19580 is the PC's own IP address in hex, only backward!   Weird).

Thus, alas, I'm stuck.  This is infinutely repeatable - it always happens
in the same fashion, at the same place in the cycle, with the same results:
new /etc, 6 files, "/bin/cat" toast.

The system is more or less perfectly fine running under 1.2.1.  I'll throw
in the 1.2.1 boot messages if it helps at all.  See below.

I don't really know what to think here - a problem with the 1.3.1 aha driver
vs. 1.2.1, perhaps?  All I know is that the Upgrade process is able to mount
the root partition onto "/mnt" OK, and it even gets those 6 files into the
new proto-/etc OK, but as soon as it goes to touch /bin/cat et al., kablooey.
Insta-freeze.

Any suggestions/ideas welcome.  Is it worth my while to try a plain 1.3
boot floppy instead of this 1.3.1 boot floppy?  Did the Adaptec driver
change at all between 1.3 and 1.3.1?

Regards,

	- Greg

P.S. Please cc: me on any & all replies, as I'm not on the port-i386 list.
     (Sorry, but this box just hums along and my main box is a SPARCstation.)

-----------------------------------------------------------------------------

NetBSD 1.2.1 (PCNETBSD4ME) #1: Tue Sep 16 02:59:43 PDT 1997
    root@pcnetbsd4me:/usr/src/1.2.1/usr/src/sys/arch/i386/compile/PCNETBSD4ME
CPU: Pentium (GenuineIntel 586-class CPU)
real mem  = 66711552
avail mem = 60719104
using 839 buffers containing 3436544 bytes of memory
mainbus0 (root)
isa0 at mainbus0
com0 at isa0 port 0x3f8-0x3ff irq 4: ns8250 or ns16450, no fifo
com1 at isa0 port 0x2f8-0x2ff irq 3: ns8250 or ns16450, no fifo
lpt0 at isa0 port 0x378-0x37f irq 7
npx0 at isa0 port 0xf0-0xff: using exception 16
pc0 at isa0 port 0x60-0x6f irq 1: color
fdc0 at isa0 port 0x3f0-0x3f7 irq 6 drq 2
fd0 at fdc0 drive 0: 1.44MB 80 cyl, 2 head, 18 sec
fd1 at fdc0 drive 1: 1.44MB 80 cyl, 2 head, 18 sec
pci0 at mainbus0 bus 0: configuration mode 1
vendor 0x8086 product 0x122d (class bridge, subclass host, revision 0x01) at pci0 dev 0 function 0 not configured
vendor 0x8086 product 0x122e (class bridge, subclass ISA, revision 0x02) at pci0 dev 7 function 0 not configured
de0 at pci0 dev 13 function 0: DC21140 [10-100Mb/s] pass 1.1
de0: Ethernet address 00:00:c0:73:ac:bf
de0: enabling 10baseT UTP port
de0: interrupting at irq 10
ppb0 at pci0 dev 14 function 0: vendor 0x1011 product 0x0001 (rev. 0x02)
pci1 at ppb0 bus 1
ahc0 at pci1 dev 4 function 0
ahc0: interrupting at irq 10
ahc0: aic7870 Channel A, SCSI Id=7, 16 SCBs
scsibus0 at ahc0
ahc0: target 1 synchronous at 10.0MHz, offset = 0xf
sd0 at scsibus0 targ 1 lun 0: <SEAGATE, ST32430N, 0300> SCSI2 0/direct fixed
sd0: 2049MB, 3992 cyl, 9 head, 116 sec, 512 bytes/sec
ahc0:A:2: refuses syncronous negotiation. Using asyncronous transfers
ss0 at scsibus0 targ 2 lun 0: <HP, C1750A, 3226> SCSI2 3/processor fixed
ss0: HP ScanJet IIc
ahc1 at pci1 dev 5 function 0
ahc1: interrupting at irq 11
ahc1: aic7870 Channel B, SCSI Id=7, 16 SCBs
scsibus1 at ahc1
vendor 0x1002 product 0x4758 (class display, subclass VGA, revision 0x01) at pci0 dev 15 function 0 not configured
biomask c40 netmask c40 ttymask cda
changing root device to sd0a
/dev/sd0a: file system not clean; please fsck(8)
lpt0: out of paper
lpd[120]: restarted
Aperture driver for XFree86 version 1.4
init: kernel security level changed from 0 to 1

Apr 30 00:45:26 pcnetbsd4me /netbsd: duplicate IP address 8095b1fd sent from ethernet address 08:00:20:21:31:17