Subject: DMA error while dumping?
To: None <port-i386@netbsd.org>
From: Jeff Rizzo <riz@redcrowgroup.com>
List: port-i386
Date: 11/09/2004 09:38:02
This has been going on for me for a while, but it's now getting to the
top of my annoyance list, so I thought I'd see what I can do about it... :)

I'm running 2.99.10 from mid-October, and I can't for the life of me get 
a crash dump; I use 'reboot 0x104' from the db> prompt, and it starts 
dumping, but invariably craps out after a random number of pages like so:

db> reboot 0x104

dumping to dev 0,9 offset 525727
dump 767 766 765 764 763 762 761 760 759 758 757 756 755 754 753 752 751 750 749
 748 747 746 745 744 743 742 741 740 739 738 737 736 735 734 733 732 731 730 729
 728 727 726 725 724 723 722 721 720 719 718 717 716 715 714 713 712 711 710 709
 708 707 706 705 704 703 702 701 700 699 698 697 696 695 694 693 692 691 690 689
 688 687 686 685 684 683 682 681 680 679 678 677 676 675 674 673 672 671 670 669
 668 667 666 665 664 663 662 661 660 659 658 657 656 655 654 653 652 651 650 649
 648 647 646 645 644 643 642 641 640 639 638 637 636 635 634 633 632 631 630 629
 628 627 626 625 624 623 622 621 620 619 618 617 616 615 614 613 612 611 610 609
 608 607 606 605 604 603 602 601 600 599 598 597 596 595 594 593 592 591 590 589
 wddump: DMA error
i/o error

Sometimes it gets much farther than this; sometimes not as far.  I've 
tried limiting the amount of memory the kernel sees to get smaller dumps 
in the hopes of getting one I can use, but it _always_ dies, or yields a 
dump that gdb doesn't recognize.
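
To compare attempts, it may help to turn the countdown into a single figure for how far the dump got before the error. Below is a minimal Python sketch, not anything from the kernel or from savecore. It assumes the countdown is in megabytes remaining, which the starting value of 767 matching "total memory = 767 MB" in the boot messages suggests but does not prove. The helper name dump_progress is made up for illustration, and the sample text is the first and last rows of the failed dump shown above.

```python
import re

# First and last rows of the countdown from the failed dump in this post;
# the rows in between count down by one without gaps.
console = """\
dump 767 766 765 764 763 762 761 760 759 758 757 756 755 754 753 752 751 750 749
 608 607 606 605 604 603 602 601 600 599 598 597 596 595 594 593 592 591 590 589
 wddump: DMA error
i/o error
"""

def dump_progress(text):
    """Return (first, last) countdown values printed before the first error line."""
    values = []
    for line in text.splitlines():
        if "error" in line:
            break  # stop at "wddump: DMA error"
        values.extend(int(tok) for tok in re.findall(r"\b\d+\b", line))
    return values[0], values[-1]

first, last = dump_progress(console)
# Assumption: each step of the counter corresponds to 1 MB written.
written = first - last
print(f"started at {first}, stopped at {last}: roughly {written} MB written")
```

Running this over the console log of several attempts would show whether the failure point is really random or clusters around particular offsets.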

Does this smell like a hardware problem?  Or something else?  This is 
the only system I have this trouble on, but it's also the only one I'm 
running -current on.  Any suggestions?

I've included a copy of the boot messages in case the chipset/controller 
info yields any clues...
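
For anyone skimming the boot messages, the relevant detail is which controller the dump partition hangs off. The following Python sketch pulls that out of the log; it is only an illustration, the function and pattern names are hypothetical, and the sample lines are copied from the boot messages in this post with the mail-wrapped lines joined back together.

```python
import re

# Lines copied from the boot messages in this post (wrapped lines joined).
dmesg = """\
wd0(hptide1:0:0): using PIO mode 4, Ultra-DMA mode 4 (Ultra/66) (using DMA data transfers)
wd1(pdcide0:0:0): using PIO mode 4, Ultra-DMA mode 5 (Ultra/100) (using DMA data transfers)
wd2(pdcide0:1:0): using PIO mode 4, Ultra-DMA mode 5 (Ultra/100) (using DMA data transfers)
root on wd1a dumps on wd1b
"""

# "wdN(controller:channel:drive): using <modes> (using DMA data transfers)"
ATTACH = re.compile(r"^(wd\d+)\((\w+):(\d+):(\d+)\): using (.*?) \(using")
# "... dumps on wdNx"
DUMPS = re.compile(r"dumps on (wd\d+)([a-p])")

def dump_device_info(text):
    """Return ((disk, partition), attachment dict) for the dump device, if found."""
    disks = {}
    dump = None
    for line in text.splitlines():
        m = ATTACH.match(line)
        if m:
            disk, ctrl, chan, drive, modes = m.groups()
            disks[disk] = {
                "controller": ctrl,
                "channel": int(chan),
                "drive": int(drive),
                "modes": modes,
            }
        m = DUMPS.search(line)
        if m:
            dump = (m.group(1), m.group(2))
    info = disks.get(dump[0]) if dump else None
    return dump, info

dump, info = dump_device_info(dmesg)
print(dump, info)
```

On the log in this post it reports the dump partition as wd1b, attached to pdcide0 channel 0 and negotiated at Ultra-DMA mode 5. That says nothing about the cause by itself, but it narrows down which driver and transfer mode are in play when wddump fails.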

Thanks in advance.

+j

NetBSD 2.99.10 (DESKTOPDEV) #8: Mon Nov  8 21:24:46 PST 2004
        riz@desktop.boogers.sf.ca.us:/scratch/netbsd/current/src/sys/arch/i386/compile/DESKTOPDEV
total memory = 767 MB
avail memory = 744 MB
BIOS32 rev. 0 found at 0xf0f50
mainbus0 (root)
cpu0 at mainbus0: (uniprocessor)
cpu0: AMD Athlon Model 4 (Thunderbird) (686-class), 1009.08 MHz, id 0x642
cpu0: features c1c7f9ff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR>
cpu0: features c1c7f9ff<PGE,MCA,CMOV,PAT,PSE36,PN,MMXX,MMX>
cpu0: features c1c7f9ff<FXSR,3DNOW2,3DNOW>
cpu0: "AMD Athlon(tm) Processor"
cpu0: I-cache 64 KB 64B/line 2-way, D-cache 64 KB 64B/line 2-way
cpu0: L2 cache 256 KB 64B/line 16-way
cpu0: ITLB 16 4 KB entries fully associative, 8 4 MB entries fully associative
cpu0: DTLB 24 4 KB entries fully associative, 8 4 MB entries 4-way
cpu0: 8 page colors
pci0 at mainbus0 bus 0: configuration mode 1
pci0: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
pchb0 at pci0 dev 0 function 0
pchb0: VIA Technologies VT8363 KT133 System Controller (rev. 0x02)
agp0 at pchb0: aperture at 0xe4000000, size 0x10000000
ppb0 at pci0 dev 1 function 0: VIA Technologies VT8363 KT133 PCI to AGP Bridge (rev. 0x00)
pci1 at ppb0 bus 1
pci1: i/o space, memory space enabled
vga1 at pci1 dev 0 function 0: Nvidia Corporation RIVA TNT2 Model 64 (rev. 0x15)
wsdisplay0 at vga1 kbdmux 1
wsmux1: connecting to wsdisplay0
pcib0 at pci0 dev 4 function 0
pcib0: VIA Technologies VT82C686A (Apollo KX133) PCI-ISA Bridge (rev. 0x22)
viaide0 at pci0 dev 4 function 1
viaide0: VIA Technologies VT82C686A (Apollo KX133) ATA66 controller
viaide0: bus-master DMA support present
viaide0: primary channel configured to compatibility mode
viaide0: primary channel interrupting at irq 14
atabus0 at viaide0 channel 0
viaide0: secondary channel configured to compatibility mode
viaide0: secondary channel interrupting at irq 15
atabus1 at viaide0 channel 1
uhci0 at pci0 dev 4 function 2: VIA Technologies VT83C572 USB Controller (rev. 0x10)
uhci0: interrupting at irq 9
usb0 at uhci0: USB revision 1.0
uhub0 at usb0
uhub0: VIA Technologies UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhci1 at pci0 dev 4 function 3: VIA Technologies VT83C572 USB Controller (rev. 0x10)
uhci1: interrupting at irq 9
usb1 at uhci1: USB revision 1.0
uhub1 at usb1
uhub1: VIA Technologies UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
viapm0 at pci0 dev 4 function 4
viaenv0 at viapm0: disabled
fwohci0 at pci0 dev 9 function 0: Texas Instruments TSB12LV23 IEEE 1394 Host Controller (rev. 0x00)
fwohci0: interrupting at irq 9
fwohci0: OHCI 1.0
fwohci0: fatal: no global UID ROM
eap0 at pci0 dev 10 function 0: Ensoniq AudioPCI 97 (rev. 0x02)
eap0: interrupting at irq 5
eap0: ac97: Asahi Kasei AK4540 codec; no 3D stereo
eap0: ac97: ext id 4d00<SECONDARY01,AC97_23,AC97_22,LDAC>
audio0 at eap0: full duplex, mmap, independent
unknown vendor 0xe159 product 0x0001 (miscellaneous network) at pci0 dev 11 function 0 not configured
tlp0 at pci0 dev 12 function 0: DECchip 21140A Ethernet, pass 2.2
tlp0: interrupting at irq 11
tlp0: Ethernet address 00:80:c8:27:0e:17
lxtphy0 at tlp0 phy 0: LXT970 10/100 media interface, rev. 1
lxtphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
hptide0 at pci0 dev 13 function 0
hptide0: Triones/Highpoint HPT366 IDE Controller
hptide0: bus-master DMA support present
hptide0: primary channel wired to native-PCI mode
hptide0: using irq 9 for native-PCI interrupt
atabus2 at hptide0 channel 0
hptide1 at pci0 dev 13 function 1
hptide1: Triones/Highpoint HPT366 IDE Controller
hptide1: bus-master DMA support present
hptide1: primary channel wired to native-PCI mode
hptide1: using irq 9 for native-PCI interrupt
atabus3 at hptide1 channel 0
pdcide0 at pci0 dev 17 function 0
pdcide0: Promise Ultra100/ATA Bus Master IDE Accelerator (rev. 0x02)
pdcide0: bus-master DMA support present
pdcide0: primary channel configured to native-PCI mode
pdcide0: using irq 10 for native-PCI interrupt
atabus4 at pdcide0 channel 0
pdcide0: secondary channel configured to native-PCI mode
atabus5 at pdcide0 channel 1
isa0 at pcib0
lpt0 at isa0 port 0x378-0x37b irq 7
com0 at isa0 port 0x3f8-0x3ff irq 4: ns16550a, working fifo
com0: console
com1 at isa0 port 0x2f8-0x2ff irq 3: ns16550a, working fifo
pckbc0 at isa0 port 0x60-0x64
pckbd0 at pckbc0 (kbd slot)
pckbc0: using irq 1 for kbd slot
wskbd0 at pckbd0 mux 1
wskbd0: connecting to wsdisplay0
pmsprobe: reset error 5
pcppi0 at isa0 port 0x61
sysbeep0 at pcppi0
npx0 at isa0 port 0xf0-0xff: using exception 16
fdc0 at isa0 port 0x3f0-0x3f7 irq 6 drq 2
apm0 at mainbus0: Power Management spec V1.2
fdcintr: got probe interrupt
fdcintr: got probe interrupt
fd0 at fdc0 drive 0: 1.44MB, 80 cyl, 2 head, 18 sec
fdcintr: got probe interrupt
Zapata Telephony Interface Registered on major 196
IPsec: Initialized Security Association Processing.
atapibus0 at atabus0: 2 targets
cd0 at atapibus0 drive 0: <SONY    DVD RW DRU-500A, DA211F3E, 1.0f> cdrom removable
cd0: 32-bit data port
cd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 2 (Ultra/33)
cd0(viaide0:0:0): using PIO mode 4, Ultra-DMA mode 2 (Ultra/33) (using DMA data transfers)
atapibus1 at atabus1: 2 targets
cd1 at atapibus1 drive 0: <Hewlett-Packard CD-Writer Plus 9100, YMPAR3F8AV, 1.0c > cdrom removable
cd1: 32-bit data port
cd1: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 2 (Ultra/33)
cd1(viaide0:1:0): using PIO mode 4, Ultra-DMA mode 2 (Ultra/33) (using DMA data transfers)
uhub2 at uhub0 port 2
uhub3 at uhub1 port 2
uhub2: Texas Instruments General Purpose USB Hub, class 9/0, rev 1.10/1.01, addr 2
uhub3: ALCOR Generic USB Hub, class 9/0, rev 1.10/1.00, addr 2
uhub2: 2 ports with 0 removable, bus powered
uhub3: 4 ports with 4 removable, self powered
uhidev0 at uhub2 port 1 configuration 1 interface 0
uhidev0: Logitech USB Receiver, rev 1.10/27.04, addr 3, iclass 3/1
ums0 at uhidev0: 7 buttons and Z dir.
wsmouse0 at ums0 mux 0
uhidev1 at uhub2 port 1 configuration 1 interface 1
uhidev1: Logitech USB Receiver, rev 1.10/27.04, addr 3, iclass 3/0
uhidev1: 17 report ids
uhid0 at uhidev1 reportid 16: input=6, output=6, feature=0
uhid1 at uhidev1 reportid 17: input=19, output=19, feature=0
wd0 at atabus3 drive 0: <QUANTUM FIREBALLP AS40.0>
wd0: drive supports 16-sector PIO transfers, LBA addressing
wd0: 39203 MB, 79651 cyl, 16 head, 63 sec, 512 bytes/sect x 80288480 sectors
wd0: 32-bit data port
wd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 5 (Ultra/100)
wd0(hptide1:0:0): using PIO mode 4, Ultra-DMA mode 4 (Ultra/66) (using DMA data transfers)
wd1 at atabus4 drive 0: <WDC WD2500JB-00FUA0>
wd1: drive supports 16-sector PIO transfers, LBA48 addressing
wd1: 232 GB, 484521 cyl, 16 head, 63 sec, 512 bytes/sect x 488397168 sectors
wd1: 32-bit data port
wd1: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 5 (Ultra/100)
wd1(pdcide0:0:0): using PIO mode 4, Ultra-DMA mode 5 (Ultra/100) (using DMA data transfers)
wd2 at atabus5 drive 0: <WDC WD1000BB-75CHE0>
wd2: drive supports 16-sector PIO transfers, LBA addressing
wd2: 95396 MB, 193821 cyl, 16 head, 63 sec, 512 bytes/sect x 195371568 sectors
wd2: 32-bit data port
wd2: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 5 (Ultra/100)
wd2(pdcide0:1:0): using PIO mode 4, Ultra-DMA mode 5 (Ultra/100) (using DMA data transfers)
boot device: wd1
root on wd1a dumps on wd1b
mountroot: trying smbfs...
mountroot: trying msdos...
mountroot: trying cd9660...
mountroot: trying ntfs...
mountroot: trying nfs...
mountroot: trying lfs...
mountroot: trying ext2fs...
mountroot: trying ffs...
root file system type: ffs

-- 
Jeff Rizzo                                         riz@redcrowgroup.com
Red Crow Group LLC                                 http://www.redcrowgroup.com/
+1 415 550 0310