Current-Users archive
Re: Filesystem tests crashing host
hi,
> Andreas Gustafsson <gson%gson.org@localhost> wrote:
>> Jukka Ruohonen wrote:
>> > On Thu, Apr 14, 2011 at 06:27:19AM -0700, Paul Goyette wrote:
>> > > The fault is coming from the following code at lines 817-821 of
>> > > src/sys/kern/kern_descrip.c (rev. 1.212, in which christos@ touched
>> > > the close-on-exec stuff)
>> > >
>> > >     if (fp->f_ops != NULL) {
>> > >             error = (*fp->f_ops->fo_close)(fp);
>> > >     } else {
>> > >             error = 0;
>> > >     }
>> >
>> > Now that this is diagnosed, any idea how to fix this regression that
>> > broke the whole test infrastructure?
>>
>> I still say revert rmind's changes of 2011.04.11.22.31.43, because
>> that's when the failures started. My logs show six test runs between
>> christos' change to kern_descrip.c (at 2011.04.10.15.45.33) and rmind's
>> changes, and none of those test runs panicked; after rmind's changes,
>> every single test run has panicked.
>
> The problem is not diagnosed. It cannot be reproduced on real hardware,
> and I do not see how f_ops can become invalid when using a semaphore.
> Even if we assume that it can, the semaphore code would have to actually
> be *used* in the first place. However, it seems that neither the failing
> ATF tests nor ATF itself uses semaphores. Can somebody prove
> me wrong on this?
>
> Perhaps a simple printf("f_type = %d\n", fp->f_type) would hint at what
> type of descriptor is actually failing. Also, a wild guess: can one
> reproduce the problem with the following changes reverted?
>
> http://mail-index.netbsd.org/source-changes/2011/04/10/msg020849.html
doesn't atf leave gdb-able core files for tests that fail in a way like this?
YAMAMOTO Takashi
>
> --
> Mindaugas
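
For reference, the printf suggested in the quoted message could be placed just before the f_ops check. What follows is only a user-space sketch, not the kernel code: struct file and struct fileops are cut-down stand-ins, and closef_sketch, demo_close, demo_ops and demo_close_calls are hypothetical names made up for illustration.

```c
#include <stdio.h>
#include <stddef.h>

/* Cut-down, hypothetical stand-ins for the kernel structures. */
struct file;

struct fileops {
	int (*fo_close)(struct file *);
};

struct file {
	int f_type;                   /* descriptor type tag */
	const struct fileops *f_ops;  /* may be NULL */
};

/* Counts how often the demo close routine was reached (illustration only). */
static int demo_close_calls;

static int
demo_close(struct file *fp)
{
	(void)fp;
	demo_close_calls++;
	return 0;
}

static const struct fileops demo_ops = { demo_close };

/*
 * Mirrors the quoted close path, with the diagnostic printed
 * before f_ops is dereferenced, so the type is logged even if
 * the call through f_ops is the one that faults.
 */
static int
closef_sketch(struct file *fp)
{
	int error;

	printf("f_type = %d\n", fp->f_type);
	if (fp->f_ops != NULL) {
		error = (*fp->f_ops->fo_close)(fp);
	} else {
		error = 0;
	}
	return error;
}
```

Printing before the dereference matters here: if the print came after the call through f_ops, a crash inside that call would lose the one line of output that identifies the descriptor.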