Port-sparc64 archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: How to get a crash dump with recursive panic?



On Mon, 9 Jun 2014, Darren Reed wrote:

> In testing out ipfilter on sparc64, I see a bunch of "Alignment error"
> messages like these:
> 
> Alignment error: pid=24522.1 comm=ipfstat dsfsr=00000000:00800001
> dsfar=ffffffff:fea0c252 isfsr=00000000:00808000 pc=10e3b0
> Alignment error: pid=22537.1 comm=ipfstat dsfsr=00000000:00800001
> dsfar=ffffffff:fea02252 isfsr=00000000:00808000 pc=10e3b0
> Alignment error: pid=6845.1 comm=ipfstat dsfsr=00000000:00800001
> dsfar=ffffffff:fea02252 isfsr=00000000:00808000 pc=10e3b0
> 
> Followed by a panic like this:
> 
> trap type 0x34: cpu 0, pc=109faac npc=109fab0 pstate=0x820006<PRIV,IE>
> Skipping crash dump on recursive panic
> panic: mem address not aligned
> cpu0: Begin traceback...
> cpu0: End traceback...
> cpu1: shutting down
> cpu0: rebooting
> 
> All that I can do is:
> (gdb) x/i 0x109faac
>    0x109faac <ipf_fixskip+44>:  ldx  [ %g4 + 0x20 ], %g4
> 
> Further tips anyone?

What's the previous panic look like?  (I wonder if we have an SMP bug in 
vpanic()...)

Trap type 0x34 is an alignment trap.  The instruction in question is 
trying to load an 8-byte integer pointed to by %g4+0x20 into %g4.  You can 
enable DDB and dump the registers to find the contents of %g4.  That 
should not be 8-byte aligned.

Beyond that it's a question of debugging the ipfilter code.

That ipfstat is getting unaligned accesses implies some data structure is 
unaligned.  You can slap gdb on it to find out what, or you can break into 
DDB and set the TDB_STOPSIG bit in trapdebug to have the kernel break into 
DDB on each unaligned access and debug it from there. 

Eduardo


Home | Main Index | Thread Index | Old Index