Subject: kernel crash and scsi-disk/hang
To: None <current-users@NetBSD.ORG>
From: Andrew Wheadon <andrew@wipux2.wifo.uni-mannheim.de>
List: current-users
Date: 01/22/1995 11:57:32
I'm having quite a few kernel crashes recently (about 2-3 per week)
and the accounting always seems to be corrupted when people
are using irc or screen-3.6, so I presume it's crashing when they
are running these programs for some time. I find it a bit frustrating
that when I do pop in to see what has happened the kernel has 
usually rebooted instead of staying in the Debugger to tell me
where it happened. (I have DDB and DIAGNOSTIC options in my configfile)

Anyway it sometimes seems to reboot while the slower of my two scsi-disks
(sd0) is being accessed, at least the light is still on both the 
disk and the controller while the other peripherals (sd1 and st0 are off).

This causes it to fail to find any scsi-peripherals and thus
ask for a boot-disk and then just wait until i come into work
and turn sd0 off and turn it on again, after this it finds 
them nicely on both a hard and a soft-reboot.

I can imagine that sd0 is being accessed too fast, at least 
our Novell-server has the same problem when the speed is set
too high on it's adaptec, but I don't know how to set the ncr
to different speeds for different devices and it seems to do
it itself anyway.

SO:	How do I make the kernel stop in debugger (for ever) 
	instead of rebooting.
	How do I set the speed for different devices on the
	scsi-bus.
	How do I cause the scsi-bus to be reset properly on
	reboot.
	How do I find out which program is crashing my system
	or better still stop programs from being able to crash
	my System, without disabling user-access of the machine ?

I get no coredump when the machine crashes anyway.

Here is dmesg:

NetBSD 1.0A (WIPUX) #0: Fri Jan 20 09:43:25 MET 1995
    toor@wipux2:/src/src/sys/arch/i386/compile/WIPUX
CPU: Pentium (GenuineIntel 586-class CPU)
real mem  = 33161216
avail mem = 29646848
using 430 buffers containing 1761280 bytes of memory
isa0 (root)
npx0 at isa0 port 0xf0-0xff: using exception 16
WARNING: Pentium FDIV bug detected!
vt0 at isa0 port 0x60-0x6f irq 1: et4000, 80/132 col, color, 8 scr, mf2-kbd, [R3.00]
com0 at isa0 port 0x3f8-0x3ff irq 4: ns82450 or ns16450, no fifo
com1 at isa0 port 0x2f8-0x2ff irq 3: ns82450 or ns16450, no fifo
lpt0 at isa0 port 0x378-0x37f: polled
fdc0 at isa0 port 0x3f0-0x3f7 irq 6 drq 2
fd0 at fdc0 drive 0: 1.44MB 80 cyl, 2 head, 18 sec
ed0 at isa0 port 0x300-0x31f iomem 0xdc000-0xdffff irq 10: address 00:00:c0:c0:43:a0, type SMC8216/SMC8216C (16-bit) aui
root device eisa not configured
pci0 (root): configuration mode 2
pci0 bus 0 device 0: identifier 04a38086 class 06000011 not configured
pci0 bus 0 device 2: identifier 04828086 class 00000003 not configured
ncr0 at pci0 bus 0 device 5
pci_map_mem: memory mapped at c0000000-c0000fff
pci_map_int: pin A mapped to line 11
ncr0: restart (scsi reset).
ncr0 scanning for targets 0..6 ($Revision: 1.10 $)
scsibus0 at ncr0
probe(ncr0:0:0): 225ns (4 Mb/sec) offset 8.
ncr0 targ 0 lun 0: <FUJITSU, M2266S-512, 0020> SCSI2 0/direct fixed
sd0 at scsibus0: 1029MB, 1658 cyl, 15 head, 84 sec, 512 bytes/sec
probe(ncr0:1:0): 200ns (5 Mb/sec) offset 8.
ncr0 targ 1 lun 0: <FUJITSU, M2694S-512, 0124> SCSI2 0/direct fixed
sd1 at scsibus0sd1(ncr0:1:0): FAST SCSI-2 100ns (10 Mb/sec) offset 8.
: 1033MB, 1819 cyl, 15 head, 77 sec, 512 bytes/sec
ncr0 targ 5 lun 0: <HP, HP35470A, 1109> SCSI2 1/sequential removable
st0 at scsibus0: st0(ncr0:5:0): 200ns (5 Mb/sec) offset 8.
drive empty
biomask 840 netmask 41a ttymask 1a
changing root device to sd0a
sd1(ncr0:1:0): sd1(ncr0:1:0):
	^- these two messages appear while fsck is running.

this is my config file:
#	$NetBSD: GENERIC,v 1.7 1994/12/11 16:20:52 mycroft Exp $
#
#	GENERIC -- everything that's currently supported
#

machine		i386		# architecture, used by config; REQUIRED

options		I386_CPU	# CPU classes; at least one is REQUIRED
options		I486_CPU
options		I586_CPU
#options		MATH_EMULATE	# floating point emulation

options		DUMMY_NOPS	# speed hack; recommended
options		XSERVER,UCONSOLE
options		MACHINE_NONCONTIG

maxusers	32		# estimated number of users
options		TIMEZONE=0	# time zone to adjust RTC time by
options		DST=0		# daylight savings time used by RTC

options		SWAPPAGER	# paging; REQUIRED
options		VNODEPAGER	# mmap() of files
options		DEVPAGER	# mmap() of devices

options		DDB		# in-kernel debugger
options		DIAGNOSTIC	# internally consistency checks
#options		KTRACE		# system call tracing, a la ktrace(1)

options		SYSVMSG		# System V-like message queues
options		SYSVSEM		# System V-like semaphores
options		SYSVSHM		# System V-like memory sharing
#options	SHMMAXPGS=1024	# 1024 pages is the default

options		COMPAT_NOMID	# compatibility with 386BSD, BSDI, NetBSD 0.8,
options		COMPAT_09	# NetBSD 0.9,
options		COMPAT_10	# NetBSD 1.0,
options		COMPAT_43	# and 4.3BSD
options		TCP_COMPAT_42	# TCP bug compatibility with 4.2BSD

options	COMPAT_SVR4	# binary compatibility with SVR4
#options	COMPAT_IBCS2	# binary compatibility with SCO and ISC

options		USER_LDT	# user-settable LDT; used by WINE
options		LKM		# loadable kernel modules

options		FFS,QUOTA	# UFS and quotas
#options		LFS		# log-structured file system
options		MFS		# memory file system

options		NFSCLIENT	# Network File System client
options		NFSSERVER	# Network File System server

options		CD9660		# ISO 9660 + Rock Ridge file system
options		MSDOSFS		# MS-DOS file system
#options		FDESC		# /dev/fd
options		FIFO		# FIFOs; RECOMMENDED
#options		KERNFS		# /kern
#options		NULLFS		# loopback file system
#options		PORTAL		# ?
#options		PROCFS		# /proc
#options		UMAPFS		# NULLFS + uid and gid remapping
#options		UNION		# union file system

#options	GATEWAY		# packet forwarding
options		INET		# IP + ICMP + TCP + UDP
#options		NS		# XNS
#options		ISO,TPIP	# OSI
#options		EON		# OSI tunneling over IP
#options		CCITT,LLC,HDLC	# X.25

config		netbsd	swap generic
options		GENERIC
options		WIPUX

isa0	at root
#eisa0	at root
pci0	at root

npx0	at isa? port 0xf0 irq 13	# math coprocessor

vt0	at isa? port 0x60 irq 1		# virtual PC console device

com0	at isa? port 0x3f8 irq 4	# standard PC serial ports
com1	at isa? port 0x2f8 irq 3

#lpt0	at isa? port 0x378 irq 7	# standard PC parallel ports
lpt0	at isa? port 0x378 		# polled ports (safer)
#lpt1	at isa? port 0x278

ncr*	at pci? bus ? dev ?		# NCR 538XX SCSI controllers
scsibus* at ncr?
# ahb*	at eisa? slot ? irq ?		# Adaptec 174[024] SCSI controllers
# scsibus* at ahb?
aha0	at isa? port 0x330 irq ? drq ?	# Adaptec 154[02] SCSI controllers
scsibus* at aha?
sd*	at scsibus? target ? lun ?	# SCSI disk drives
st*	at scsibus? target ? lun ?	# SCSI tape drives
cd*	at scsibus? target ? lun ?	# SCSI CD-ROM drives
ch*	at scsibus? target ? lun ?	# SCSI autochangers

fdc0	at isa? port 0x3f0 irq 6 drq 2	# standard PC floppy controllers
fd*	at fdc? drive ?

ed0	at isa? port 0x300 iomem 0xdc000 irq 10	# WD/SMC, 3C503, and NE[12]000
#ed1	at eisa? slot ? iomem 0xd0000 irq ?	# Eisa/SMC Card in ISA Mode ?

#sb0	at isa? port 0x220 irq 7 drq 1	# SoundBlaster
#spkr0	at isa? ...

pseudo-device	loop	1		# network loopback
pseudo-device	bpfilter 8		# packet filter
pseudo-device	sl	2		# CSLIP
pseudo-device	ppp	2		# PPP
pseudo-device	tun	2		# network tunneling over tty

pseudo-device	pty	64		# pseudo-terminals
pseudo-device	tb	1		# tablet line discipline
pseudo-device	vn	4		# paging to files

Well it might be the Fifos but I didn't think irc used fifos.
I've switched to using an mfs-mounted /tmp but it didn't help
much and some people are defining their own SCREENDIR.
Cheerio
-- 
The cost of living hasn't affected it's popularity. (unknown)
current	release=doc host=wipux2.wifo.uni-mannheim.de \ "NetBSD-current mirror"
hostbase=/mit/ftp/pub/NetBSD base=/usr prefix=/usr backup delete use-rel-suffix