Subject: Re: locking problem?!?
To: Nicolas Joly <njoly@pasteur.fr>
From: Andrew Doran <ad@netbsd.org>
List: current-users
Date: 03/15/2007 16:30:06
> On Thu, Mar 15, 2007 at 02:21:46PM +0100, Kurt Schreiner wrote:
>> Hi,
>>
>> since around Sunday I have this one reproducible (just rsync a large chunk
>> of data and wait for disk activity on the receiving machine. "works" on an
>> P4 2.8 w/ HT enabled, an oldish PII 400 dualboard and on my w2100z dual opteron
>> system).
>> All three machines hang hard w/ a "normal" kernel (having to press "Reset"),
>> compiling w/ LOCKDEBUG I get the following (on amd, sources cvs update'd
>> ca. 11:00 UTC today):
>
> Likewise here, since about 2 days, on my dual amd64 workstation
> (GENERIC.MP+DIAGNOSTIC+LOCKDEBUG kernel) ... But with a slightly
> different trace :
>
> kernel: page fault trap, code=0
> Stopped in pid 13777.1 (sshd) at        netbsd:db_nextframe+0xf8:  movq    0x70(%rax),%rsi
>
> db{0}> mach cpu 0
> using CPU 0
> db{0}>
> db_nextframe() at netbsd:db_nextframe+0xf8
> db_stack_trace_print() at netbsd:db_stack_trace_print+0x5ec
> _simple_lock() at netbsd:_simple_lock+0x146
> pool_cache_get_paddr() at netbsd:pool_cache_get_paddr+0x4a
> bge_newbuf_std() at netbsd:bge_newbuf_std+0xba
> bge_rxeof() at netbsd:bge_rxeof+0x43f
> bge_intr() at netbsd:bge_intr+0x1fe
> intr_biglock_wrapper() at netbsd:intr_biglock_wrapper+0x35
> Xintr_ioapic_level11() at netbsd:Xintr_ioapic_level11+0xdb
> --- interrupt ---
> pcg_put() at netbsd:pcg_put+0x1b
> pool_cache_put_paddr() at netbsd:pool_cache_put_paddr+0x217
> sodopendfreel() at netbsd:sodopendfreel+0xf9
> sodopendfree() at netbsd:sodopendfree+0x19
> sosend() at netbsd:sosend+0x3d
> soo_write() at netbsd:soo_write+0x70
> dofilewrite() at netbsd:dofilewrite+0x120
> sys_write() at netbsd:sys_write+0x12b
> syscall_plain() at netbsd:syscall_plain+0x288
> kernel: page fault trap, code=0
> db{0}> mach cpu 1
> using CPU 1
> db{0}> bt
> spllower() at netbsd:spllower+0x20
> trap() at netbsd:trap+0x7a4
> Stopped in pid 13777.1 (sshd) at        netbsd:breakpoint+0x5:  leave
>
> --
> Nicolas Joly
>
> Biological Software and Databanks.
> Institut Pasteur, Paris.

OK, sorry about that. It should be fixed by revision 1.137 of uipc_socket.c.

Andrew