Current-Users archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: ZFS disaster on -current



#  » crash -M netbsd.19.core -N netbsd.19
Crash version 9.99.68, image version 9.99.68.
crash: _kvm_kvatop(0)
Kernel compiled without options LOCKDEBUG.
System panicked: trap
Backtrace from time of crash is available.
crash> bt
_KERNEL_OPT_NARCNET() at 0
?() at ffff90819ac60000
sys_reboot() at sys_reboot
vpanic() at vpanic+0x15b
snprintf() at snprintf
startlwp() at startlwp
calltrap() at calltrap+0x19
vdev_open() at vdev_open+0x9e
vdev_open_children() at vdev_open_children+0x39
vdev_root_open() at vdev_root_open+0x33
vdev_open() at vdev_open+0x9e
spa_load() at spa_load+0x38e
spa_tryimport() at spa_tryimport+0x86
zfs_ioc_pool_tryimport() at zfs_ioc_pool_tryimport+0x41
zfsdev_ioctl() at zfsdev_ioctl+0x8c1
nb_zfsdev_ioctl() at nb_zfsdev_ioctl+0x38
VOP_IOCTL() at VOP_IOCTL+0x44
vn_ioctl() at vn_ioctl+0xa5
sys_ioctl() at sys_ioctl+0x550
syscall() at syscall+0x26e
--- syscall (number 54) ---
syscall+0x26e:

On Wed, 24 Jun 2020 at 12:04, Chavdar Ivanov <ci4ic4%gmail.com@localhost> wrote:
>
> Yes, I do. Should I be looking for something specific?
>
>  I've uploaded it here, if it is of interest -
> https://send.firefox.com/download/74761aa43c6c54b3/#PbaGxtDN81Hzk2VUjefozw
> .
>
> BTW I repeated the test on a pvh guest of XCP-NG, it panics the same way.
>
> Chavdar
>
> On Wed, 24 Jun 2020 at 12:07, Jaromír Doleček <jaromir.dolecek%gmail.com@localhost> wrote:
> >
> > By chance, do you have the kernel crash dump from the original panic
> > which happened yesterday? The subsequent ones might be a result of the
> > first one.
> >
> > The messages about redzone don't mean anything beyond that there is no
> > overflow protection for items on the pool.
> >
> > Jaromir
> >
> > Le mer. 24 juin 2020 à 11:34, Chavdar Ivanov <ci4ic4%gmail.com@localhost> a écrit :
> > >
> > > Hi,
> > >
> > > On
> > >
> > > NetBSD ymir 9.99.68 NetBSD 9.99.68 (GENERIC) #1: Tue Jun 23 22:53:46
> > > BST 2020  sysbuild@ymir:/home/sysbuild/amd64/obj/home/sysbuild/src/sys/arch/amd64/compile/GENERIC
> > > amd64
> > >
> > > I suddenly got a panic with ZFS; it took place with the previous
> > > kernel, so it was something with the module. In single user I disabled
> > > zfs in /etc/rc.conf and was able to complete boot, but obviously
> > > without my two pools.
> > >
> > > 'modload solaris' didn't show any problem.
> > >
> > > I set aside the contents of /etc/zfs and did 'modload zfs', which resulted in:
> > >
> > > .....
> > >
> > > WARNING: ZFS on NetBSD is under development
> > > pool redzone disabled for 'zio_buf_4096'
> > > pool redzone disabled for 'zio_data_buf_4096'
> > > pool redzone disabled for 'zio_buf_8192'
> > > pool redzone disabled for 'zio_data_buf_8192'
> > > pool redzone disabled for 'zio_buf_16384'
> > > pool redzone disabled for 'zio_data_buf_16384'
> > > pool redzone disabled for 'zio_buf_32768'
> > > pool redzone disabled for 'zio_data_buf_32768'
> > > pool redzone disabled for 'zio_buf_65536'
> > > pool redzone disabled for 'zio_data_buf_65536'
> > > pool redzone disabled for 'zio_buf_131072'
> > > pool redzone disabled for 'zio_data_buf_131072'
> > > pool redzone disabled for 'zio_buf_262144'
> > > pool redzone disabled for 'zio_data_buf_262144'
> > > pool redzone disabled for 'zio_buf_524288'
> > > pool redzone disabled for 'zio_data_buf_524288'
> > > pool redzone disabled for 'zio_buf_1048576'
> > > pool redzone disabled for 'zio_data_buf_1048576'
> > > pool redzone disabled for 'zio_buf_2097152'
> > > pool redzone disabled for 'zio_data_buf_2097152'
> > > pool redzone disabled for 'zio_buf_4194304'
> > > pool redzone disabled for 'zio_data_buf_4194304'
> > > pool redzone disabled for 'zio_buf_8388608'
> > > pool redzone disabled for 'zio_data_buf_8388608'
> > > pool redzone disabled for 'zio_buf_16777216'
> > > pool redzone disabled for 'zio_data_buf_16777216'
> > >
> > > I have no idea what that means, it is a first for me, ZFS otherwise
> > > has been very reliable on this hardware so far, inasmuch as I have the
> > > mercurial repo on a zfs and build from it from time to time (the panic
> > > is from the last cvs update from yesterday, though).
> > >
> > > Subsequent 'zpool import' repeated the panic (without getting me into
> > > the debugger, though):
> > >
> > >
> > > ZFS filesystem version: 5
> > > uvm_fault(0xffffa97e4c3e1610, 0x0, 1) -> e
> > > fatal page fault in supervisor mode
> > > trap type 6 code 0 rip 0xffffffff81d49882 cs 0x8 rflags 0x10286 cr2
> > > 0xa0 ilevel 0 rsp 0xffffde819c16d760
> > > curlwp 0xffffa97e3a41e140 pid 17394.17394 lowest kstack 0xffffde819c16a2c0
> > > panic: trap
> > > cpu0: Begin traceback...
> > > vpanic() at netbsd:vpanic+0x152
> > > snprintf() at netbsd:snprintf
> > > startlwp() at netbsd:startlwp
> > > alltraps() at netbsd:alltraps+0xc3
> > > vdev_open() at zfs:vdev_open+0x9e
> > > vdev_open_children() at zfs:vdev_open_children+0x39
> > > vdev_root_open() at zfs:vdev_root_open+0x33
> > > vdev_open() at zfs:vdev_open+0x9e
> > > spa_load() at zfs:spa_load+0x38e
> > > spa_tryimport() at zfs:spa_tryimport+0x86
> > > zfs_ioc_pool_tryimport() at zfs:zfs_ioc_pool_tryimport+0x41
> > > zfsdev_ioctl() at zfs:zfsdev_ioctl+0x8c1
> > > nb_zfsdev_ioctl() at zfs:nb_zfsdev_ioctl+0x38
> > > VOP_IOCTL() at netbsd:VOP_IOCTL+0x44
> > > vn_ioctl() at netbsd:vn_ioctl+0xa5
> > > sys_ioctl() at netbsd:sys_ioctl+0x550
> > > syscall() at netbsd:syscall+0x26e
> > > --- syscall (number 54) ---
> > > netbsd:syscall+0x26e:
> > > cpu0: End traceback...
> > >
> > > The above panic did not leave a crash dump.
> > >
> > > When I had /etc/zfs populated before, I also got a crash dump (with
> > > 'reboot 0x104'), as follows:
> > >
> > > # crash -M netbsd.18.core -N netbsd.18
> > > Crash version 9.99.68, image version 9.99.68.
> > > crash: _kvm_kvatop(0)
> > > Kernel compiled without options LOCKDEBUG.
> > > System panicked: reboot forced via kernel debugger
> > > Backtrace from time of crash is available.
> > > crash> bt
> > > _KERNEL_OPT_NARCNET() at 0
> > > _KERNEL_OPT_NARCNET() at 0
> > > sys_reboot() at sys_reboot
> > > db_fncall() at db_fncall
> > > db_command() at db_command+0x127
> > > db_command_loop() at db_command_loop+0xa6
> > > db_trap() at db_trap+0xe6
> > > kdb_trap() at kdb_trap+0xe1
> > > trap() at trap+0x2b7
> > > --- trap (number 6) ---
> > > vdev_disk_open.part.4() at vdev_disk_open.part.4+0x49a
> > > vdev_open() at vdev_open+0x9e
> > > vdev_open_children() at vdev_open_children+0x39
> > > vdev_root_open() at vdev_root_open+0x33
> > > vdev_open() at vdev_open+0x9e
> > > spa_load() at spa_load+0x38e
> > > spa_load_best() at spa_load_best+0x58
> > > spa_open_common() at spa_open_common+0xc2
> > > pool_status_check.part.25() at pool_status_check.part.25+0x1e
> > > zfsdev_ioctl() at zfsdev_ioctl+0x80e
> > > nb_zfsdev_ioctl() at nb_zfsdev_ioctl+0x38
> > > VOP_IOCTL() at VOP_IOCTL+0x44
> > > vn_ioctl() at vn_ioctl+0xa5
> > > sys_ioctl() at sys_ioctl+0x550
> > > syscall() at syscall+0x26e
> > > --- syscall (number 54) ---
> > > syscall+0x26e:
> > > .....
> > >
> > > Any idea what is going on? I've restarted a build, but the cvs log
> > > doesn't show anything relevant as far as I can see.
> > >
> > >
> > > Chavdar
> > >
> > >
> > >
> > > --
> > > ----
>
>
>
> --
> ----



-- 
----


Home | Main Index | Thread Index | Old Index