Subject: "panic: extent_free: region not found" on AS4100, 1.6.x with mlx(4)
To: NetBSD port-alpha List <port-alpha@NetBSD.org>
From: Greg A. Woods <woods@weird.com>
List: tech-kern
Date: 10/24/2005 12:41:27
--Multipart_Mon_Oct_24_12:40:55_2005-2
Content-Type: multipart/signed;
 boundary="pgp-sign-Multipart_Mon_Oct_24_12:40:55_2005-1"; micalg=pgp-sha1;
 protocol="application/pgp-signature"
Content-Transfer-Encoding: 7bit

--pgp-sign-Multipart_Mon_Oct_24_12:40:55_2005-1
Content-Type: text/plain; charset=US-ASCII

These two panics, which appear to be releated, have happened to my new
AS4100 running NetBSD 1.6.x, now with three CPUs, and of course a mlx(4)
(DCA960) RAID controller:

	extent `mcpcia pci sgmap' (0x0 - 0x3fffffff), flags = 0x2
	     0x0 - 0x1fff
	     0x2000 - 0x3fff
	     0x4000 - 0x5fff
	     0x6000 - 0x7fff
	     0x8000 - 0x9fff
	     0xa000 - 0xbfff
	     0xc000 - 0xdfff
	     0xe000 - 0xffff
	     0x10000 - 0x11fff
	     0x12000 - 0x13fff
	     0x14000 - 0x15fff
	     0x16000 - 0x17fff
	     0x18000 - 0x1dfff
	extent_free: start 0x18000, end 0x1bfff
	panic: extent_free: region not found
	Stopped at      cpu_Debugger+0x4:       ret     zero,(ra)
	db{0}> trace
	cpu_Debugger() at cpu_Debugger+0x4
	panic() at panic+0x160
	extent_free() at extent_free+0x320
	pci_sgmap_pte64_unload() at pci_sgmap_pte64_unload+0x108
	mcpcia_bus_dmamap_unload_sgmap() at mcpcia_bus_dmamap_unload_sgmap+0x38
	mlx_ccb_unmap() at mlx_ccb_unmap+0xa8
	ld_mlx_handler() at ld_mlx_handler+0xe4
	mlx_intr() at mlx_intr+0xec
	alpha_shared_intr_dispatch() at alpha_shared_intr_dispatch+0x6c
	kn300_iointr() at kn300_iointr+0x54
	interrupt() at interrupt+0x33c
	XentInt() at XentInt+0x1c
	--- interrupt (from ipl 0) ---
	idle() at idle+0x78
	idle() at idle+0x54
	--- root of call graph ---
	db{0}> reboot
	syncing disks... tlp0: receive ring overrun
	
	CPU 0: fatal kernel trap:
	
	CPU 0    trap entry = 0x2 (memory management fault)
	CPU 0    a0         = 0x1a4
	CPU 0    a1         = 0x1
	CPU 0    a2         = 0x0
	CPU 0    pc         = 0xfffffc00004a3bd4
	CPU 0    ra         = 0xfffffc00003fa598
	CPU 0    pv         = 0xfffffc00004a3860
	CPU 0    curproc    = 0x0
	
	panic: trap
	Stopped at      cpu_Debugger+0x4:       ret     zero,(ra)
	db{0}> reboot
	cpu1: shutting down...
	cpu2: shutting down...
	rebooting...

	[[ and still a hard reset was needed ]]

and the second one:

	extent `mcpcia pci sgmap' (0x0 - 0x3fffffff), flags = 0x2
	     0x0 - 0x1fff
	     0x2000 - 0x3fff
	     0x4000 - 0x5fff
	     0x6000 - 0x7fff
	     0x8000 - 0x9fff
	     0xa000 - 0xdfff
	extent_free: start 0xa000, end 0xbfff
	panic: extent_free: region not found
	Stopped at      cpu_Debugger+0x4:       ret     zero,(ra)
	db{0}> trace
	cpu_Debugger() at cpu_Debugger+0x4
	panic() at panic+0x160
	extent_free() at extent_free+0x320
	pci_sgmap_pte64_unload() at pci_sgmap_pte64_unload+0x108
	mcpcia_bus_dmamap_unload_sgmap() at mcpcia_bus_dmamap_unload_sgmap+0x38
	mlx_ccb_unmap() at mlx_ccb_unmap+0xa8
	ld_mlx_handler() at ld_mlx_handler+0xe4
	mlx_intr() at mlx_intr+0xec
	alpha_shared_intr_dispatch() at alpha_shared_intr_dispatch+0x6c
	kn300_iointr() at kn300_iointr+0x54
	interrupt() at interrupt+0x33c
	XentInt() at XentInt+0x1c
	--- interrupt (from ipl 0) ---
	idle() at idle+0x78
	idle() at idle+0x54
	--- root of call graph ---
	db{0}> 


Is there any further information I can try to find if this happens
again?

Unfortunately I doubt I'll be able to get a kernel core dump, especially
since the default dump device is one of the logical devices attached to
the mlx(4) controller.


--pgp-sign-Multipart_Mon_Oct_24_12:40:55_2005-1
Content-Type: application/pgp-signature
Content-Transfer-Encoding: 7bit

-----BEGIN PGP SIGNATURE-----
Version: PGPfreeware 5.0i for non-commercial use
MessageID: KkBYolzulu6M1srWPxD0dCFAMfvWVf9A

iQA/AwUBQ10OtmJ7XxTCWceFEQLhAgCfXPjr+kL1rVAepKa63VKp2uWxWzQAn1A4
IwtUBMKmT2MAZkAsu06Ekzsj
=cq8z
-----END PGP SIGNATURE-----

--pgp-sign-Multipart_Mon_Oct_24_12:40:55_2005-1--

--Multipart_Mon_Oct_24_12:40:55_2005-2
Content-Type: text/plain; charset=US-ASCII

-- 
						Greg A. Woods

H:+1 416 218-0098  W:+1 416 489-5852 x122  VE3TCP  RoboHack <woods@robohack.ca>
Planix, Inc. <woods@planix.com>          Secrets of the Weird <woods@weird.com>

--Multipart_Mon_Oct_24_12:40:55_2005-2--