NetBSD-Bugs archive


Re: kern/56479: Crash in ata_recovery_resume



The following reply was made to PR kern/56479; it has been noted by GNATS.

From: Andreas Gustafsson <gson%gson.org@localhost>
To: gnats-bugs%netbsd.org@localhost
Cc: 
Subject: Re: kern/56479: Crash in ata_recovery_resume
Date: Sun, 30 Jan 2022 12:36:36 +0200

 It happened again last night.  The machine logged 63 "device timeout
 writing fsbn" errors to the console and then panicked.  Here's the end
 of the console output:
 
 [ 2760684.070946] wd3d: device timeout writing fsbn 7013874752 of 7013874752-7013874879 (wd3 bn 7013874752; cn 6958209 tn 1 sn 17), xfer 818, retry 1
 [ 2760684.229029] wd3d: device timeout writing fsbn 7013874624 of 7013874624-7013874751 (wd3 bn 7013874624; cn 6958208 tn 15 sn 15), xfer 780, retry 1
 [ 2760684.388150] wd3d: device timeout writing fsbn 7013874368 of 7013874368-7013874431 (wd3 bn 7013874368; cn 6958208 tn 11 sn 11), xfer 6e8, retry 1
 [ 2760684.547271] wd3d: device timeout writing fsbn 7013874240 of 7013874240-7013874367 (wd3 bn 7013874240; cn 6958208 tn 9 sn 9), xfer f8, retry 1
 [ 2760684.703272] wd3d: device timeout writing fsbn 7013873984 of 7013873984-7013874047 (wd3 bn 7013873984; cn 6958208 tn 5 sn 5), xfer 650, retry 1
 [ 2760684.860314] wd3d: device timeout writing fsbn 7013873856 of 7013873856-7013873983 (wd3 bn 7013873856; cn 6958208 tn 3 sn 3), xfer 5b8, retry 1
 [ 2760685.017356] wd3d: device timeout writing fsbn 7013873472 of 7013873472-7013873535 (wd3 bn 7013873472; cn 6958207 tn 12 sn 60), xfer 520, retry 1
 [ 2760685.176477] wd3d: device timeout writing fsbn 7013873344 of 7013873344-7013873471 (wd3 bn 7013873344; cn 6958207 tn 10 sn 58), xfer 60, retry 1
 [ 2760685.334559] wd3d: device timeout writing fsbn 7013873216 of 7013873216-7013873343 (wd3 bn 7013873216; cn 6958207 tn 8 sn 56), xfer 488, retry 1
 [ 2760685.492640] wd3d: device timeout writing fsbn 7013873088 of 7013873088-7013873151 (wd3 bn 7013873088; cn 6958207 tn 6 sn 54), xfer 3f0, retry 1
 [ 2760685.650722] wd3d: device timeout writing fsbn 7013872832 of 7013872832-7013872895 (wd3 bn 7013872832; cn 6958207 tn 2 sn 50), xfer 358, retry 1
 [ 2760685.808804] wd3d: device timeout writing fsbn 7013872704 of 7013872704-7013872831 (wd3 bn 7013872704; cn 6958207 tn 0 sn 48), xfer 2c0, retry 1
 [ 2760685.966886] wd3d: device timeout writing fsbn 7013872576 of 7013872576-7013872639 (wd3 bn 7013872576; cn 6958206 tn 14 sn 46), xfer 228, retry 1
 [ 2760686.126008] wd3d: device timeout writing fsbn 7013872192 of 7013872192-7013872255 (wd3 bn 7013872192; cn 6958206 tn 8 sn 40), xfer 190, retry 1
 [ 2760686.284088] wd3d: device timeout writing fsbn 7013872064 of 7013872064-7013872127 (wd3 bn 7013872064; cn 6958206 tn 6 sn 38), xfer 350, retry 1
 [ 2760686.442170] wd3d: device timeout writing fsbn 7013871936 of 7013871936-7013872063 (wd3 bn 7013871936; cn 6958206 tn 4 sn 36), xfer 2b8, retry 1
 [ 2760686.600252] wd3d: device timeout writing fsbn 7013871808 of 7013871808-7013871871 (wd3 bn 7013871808; cn 6958206 tn 2 sn 34), xfer 220, retry 1
 [ 2760686.758333] wd3d: device timeout writing fsbn 7013871680 of 7013871680-7013871807 (wd3 bn 7013871680; cn 6958206 tn 0 sn 32), xfer 188, retry 1
 [ 2760686.916416] wd3d: device timeout writing fsbn 7013871552 of 7013871552-7013871615 (wd3 bn 7013871552; cn 6958205 tn 14 sn 30), xfer f0, retry 1
 [ 2760687.074496] wd3d: device timeout writing fsbn 7013871424 of 7013871424-7013871551 (wd3 bn 7013871424; cn 6958205 tn 12 sn 28), xfer 58, retry 1
 [ 2760687.232577] wd3d: device timeout writing fsbn 7013871296 of 7013871296ry 1
 [ 2770780.456401] uvm_fault(0xffffffff81585c20, 0x0, 1) -> e
 [ 2770780.517013] fatal page fault in supervisor mode
 [ 2770780.577041] trap type 6 code 0 rip 0xffffffff8027bd27 cs 0x8 rflags 0x10286 cr2 0x10 ilevel 0 rsp 0xffff8e849432cf00
 [ 2770780.707101] curlwp 0xffff848a0f738900 pid 0.310 lowest kstack 0xffff8e849432a2c0
 [ 2770780.797143] panic: trap
 [ 2770780.827157] cpu19: Begin traceback...
 [ 2770780.877183] vpanic() at netbsd:vpanic+0x160
 [ 2770780.937208] snprintf() at netbsd:snprintf
 [ 2770780.987231] startlwp() at netbsd:startlwp
 [ 2770781.037257] alltraps() at netbsd:alltraps+0xbb
 [ 2770781.097281] ata_thread_run() at netbsd:ata_thread_run+0x249
 [ 2770781.167313] atabus_thread() at netbsd:atabus_thread+0x208
 [ 2770781.237347] cpu19: End traceback...
 
 [ 2770781.287371] dumping to dev 0,1 (offset=8, size=16753263):
 [ 2770781.347398] dump
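
The timeout lines above follow a regular pattern, so they can be counted and inspected mechanically. The following is a minimal Python sketch, not part of the original report. The regular expression is inferred from the lines quoted in this message rather than from the kernel source, and the names `TIMEOUT_RE` and `parse_timeouts` are made up for illustration. Lines that do not match completely, such as the truncated one just before the panic, are skipped.

```python
import re

# Pattern inferred from the console lines quoted in this message
# (an assumption, not taken from the NetBSD sources).
TIMEOUT_RE = re.compile(
    r"\[\s*(?P<time>\d+\.\d+)\]\s+(?P<part>\w+): device timeout writing "
    r"fsbn (?P<fsbn>\d+) of (?P<first>\d+)-(?P<last>\d+) "
    r"\((?P<disk>\w+) bn (?P<bn>\d+); cn (?P<cn>\d+) tn (?P<tn>\d+) "
    r"sn (?P<sn>\d+)\), xfer (?P<xfer>[0-9a-f]+), retry (?P<retry>\d+)"
)

INT_FIELDS = ("fsbn", "first", "last", "bn", "cn", "tn", "sn", "retry")


def parse_timeouts(text):
    """Return one dict per fully matching 'device timeout writing' line."""
    events = []
    for line in text.splitlines():
        m = TIMEOUT_RE.search(line)
        if m is None:
            continue  # unrelated or truncated line
        event = m.groupdict()
        event["time"] = float(event["time"])  # kernel timestamp in seconds
        for name in INT_FIELDS:
            event[name] = int(event[name])
        # "xfer" is printed in hex in the log; it is kept as a string here.
        events.append(event)
    return events


if __name__ == "__main__":
    import sys

    events = parse_timeouts(sys.stdin.read())
    print(len(events), "timeout lines parsed")
    for prev, cur in zip(events, events[1:]):
        # Time between consecutive timeout messages, in seconds.
        print(f"{cur['fsbn']}: +{cur['time'] - prev['time']:.6f}s")
```

Feeding a saved copy of the console output to the script on standard input prints the number of matching lines and the spacing between consecutive messages.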
 
 Backtrace according to gdb:
 
 (gdb) bt
 #0  0xffffffff80222aaa in cpu_reboot ()
 #1  0xffffffff80994a96 in vpanic ()
 #2  0xffffffff80994b47 in panic ()
 #3  0xffffffff80224aed in trap ()
 #4  0xffffffff8021d56b in alltraps ()
 #5  0xffffffff8027bd27 in ata_recovery_resume ()
 #6  0xffffffff80279912 in ata_thread_run ()
 #7  0xffffffff8027a236 in atabus_thread ()
 #8  0xffffffff80209747 in lwp_trampoline ()
 #9  0x0000000000000000 in ?? ()
 
 The SMART error log of the disk in question is empty.  It uses SMR, so I
 would not be surprised if it's sometimes slow to respond.  Here's the
 dmesg output for the controller and disk:
 
 [     1.092336] ahcisata1 at pci0 dev 31 function 2: vendor 8086 product 8d02 (rev. 0x05)
 [     1.092336] ahcisata1: 64-bit DMA
 [     1.092336] ahcisata1: AHCI revision 1.30, 6 ports, 32 slots, CAP 0xcb30ff45<EMS,PSC,SSC,PMD,ISS=0x3=Gen3,SCLO,SAL,SSS,SNCQ,S64A>
 [     1.092336] ahcisata1: interrupting at msi5 vec 0
 [     1.092336] atabus4 at ahcisata1 channel 0
 [     1.092336] atabus5 at ahcisata1 channel 1
 [     1.092336] atabus6 at ahcisata1 channel 2
 [     1.092336] atabus7 at ahcisata1 channel 3
 [     1.092336] atabus8 at ahcisata1 channel 4
 [     1.092336] atabus9 at ahcisata1 channel 5
 [     3.251001] ahcisata1 port 1: device present, speed: 6.0Gb/s
 [     6.532170] wd3(ahcisata1:1:0): using PIO mode 4, DMA mode 2, Ultra-DMA mode 6 (Ultra/133) (using DMA), NCQ (31 tags)
 [...]
 [     5.921953] wd3: <ST5000LM000-2AN170>
 [     5.961967] wd3: drive supports 1-sector PIO transfers, LBA48 addressing
 [     5.961967] wd3: 4657 GB, 9690021 cyl, 16 head, 63 sec, 512 bytes/sect x 9767541168 sectors (0 bytes/physsect; first aligned sector: 8)
 [     6.422131] wd3: GPT GUID: 31a830f6-dace-4062-ab89-f6911c261385
 [     6.422131] dk2 at wd3: "65183669-9719-47da-9e2a-09a1f9a7bf6d", 9767538688 blocks at 2048, type: ffs
 [     6.532170] wd3: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 6 (Ultra/133), WRITE DMA FUA, NCQ (32 tags)
 [     6.532170] wd3(ahcisata1:1:0): using PIO mode 4, DMA mode 2, Ultra-DMA mode 6 (Ultra/133) (using DMA), NCQ (31 tags)
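
The cn/tn/sn values in the timeout messages are consistent with the geometry wd3 reports in the dmesg output above (16 heads, 63 sectors per track). The arithmetic can be sketched in Python as follows. This is not part of the original report; the function name `bn_to_chs` is hypothetical, and the zero-based sector number is inferred from the logged values rather than from the driver source.

```python
def bn_to_chs(bn, heads=16, sectors_per_track=63):
    """Split a block number into (cylinder, head, sector).

    The default geometry is the one wd3 reports in dmesg. The sector
    number is zero-based, which is an assumption that matches the
    values seen in the console log.
    """
    cylinder, rest = divmod(bn, heads * sectors_per_track)
    head, sector = divmod(rest, sectors_per_track)
    return cylinder, head, sector


if __name__ == "__main__":
    # First timeout line: wd3 bn 7013874752; cn 6958209 tn 1 sn 17
    print(bn_to_chs(7013874752))  # (6958209, 1, 17)
```

The same function reproduces the other logged triples, for example bn 7013874624 gives cn 6958208, tn 15, sn 15.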
 
 -- 
 Andreas Gustafsson, gson%gson.org@localhost
 

