NetBSD-Bugs archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: kern/56686: wd(4) device timeouts



The following reply was made to PR kern/56686; it has been noted by GNATS.

From: Andreas Gustafsson <gson%gson.org@localhost>
To: mlelstv%serpens.de@localhost (Michael van Elst)
Cc: gnats-bugs%netbsd.org@localhost
Subject: Re: kern/56686: wd(4) device timeouts
Date: Fri, 4 Feb 2022 15:45:00 +0200

 Michael van Elst wrote:
 >  > [     5.921953] wd3: <ST5000LM000-2AN170>
 >  
 >  ST5000LM000 is a SMR disk, the timeouts might be real and
 >  the driver might need to wait longer for such hardware.
 
 Agreed.  Where is the timeout defined, and does it take the queue
 length into account?
 
 >  The 'uncorrectable data error' comes from the drive, so
 >  that's probably real too.
 
 Probably, but I don't see it in the SMART error log:
 
 smartctl 7.2 2020-12-30 r5155 [NetBSD 9.2 amd64] (local build)
 Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org
 
 === START OF INFORMATION SECTION ===
 Model Family:     Seagate Barracuda 2.5 5400
 Device Model:     ST5000LM000-2AN170
 Serial Number:    WCJ46G52
 LU WWN Device Id: 5 000c50 0d44f62ec
 Firmware Version: 0001
 User Capacity:    5,000,981,078,016 bytes [5.00 TB]
 Sector Sizes:     512 bytes logical, 4096 bytes physical
 Rotation Rate:    5526 rpm
 Form Factor:      2.5 inches
 Device is:        In smartctl database [for details use: -P show]
 ATA Version is:   ACS-3 T13/2161-D revision 5
 SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
 Local Time is:    Fri Feb  4 15:41:50 2022 EET
 SMART support is: Available - device has SMART capability.
 SMART support is: Enabled
 
 === START OF READ SMART DATA SECTION ===
 SMART overall-health self-assessment test result: PASSED
 
 General SMART Values:
 Offline data collection status:  (0x00)	Offline data collection activity
 					was never started.
 					Auto Offline Data Collection: Disabled.
 Self-test execution status:      (   0)	The previous self-test routine completed
 					without error or no self-test has ever 
 					been run.
 Total time to complete Offline 
 data collection: 		(    0) seconds.
 Offline data collection
 capabilities: 			 (0x73) SMART execute Offline immediate.
 					Auto Offline data collection on/off support.
 					Suspend Offline collection upon new
 					command.
 					No Offline surface scan supported.
 					Self-test supported.
 					Conveyance Self-test supported.
 					Selective Self-test supported.
 SMART capabilities:            (0x0003)	Saves SMART data before entering
 					power-saving mode.
 					Supports SMART auto save timer.
 Error logging capability:        (0x01)	Error logging supported.
 					General Purpose Logging supported.
 Short self-test routine 
 recommended polling time: 	 (   1) minutes.
 Extended self-test routine
 recommended polling time: 	 ( 824) minutes.
 Conveyance self-test routine
 recommended polling time: 	 (   2) minutes.
 SCT capabilities: 	       (0x30a5)	SCT Status supported.
 					SCT Data Table supported.
 
 SMART Attributes Data Structure revision number: 10
 Vendor Specific SMART Attributes with Thresholds:
 ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
   1 Raw_Read_Error_Rate     0x000f   072   062   006    Pre-fail  Always       -       74443112
   3 Spin_Up_Time            0x0003   096   096   000    Pre-fail  Always       -       0
   4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       37
   5 Reallocated_Sector_Ct   0x0033   100   100   010    Pre-fail  Always       -       0
   7 Seek_Error_Rate         0x000f   082   060   045    Pre-fail  Always       -       166162157
   9 Power_On_Hours          0x0032   092   092   000    Old_age   Always       -       7150 (25 38 0)
  10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
  12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       36
 183 SATA_Downshift_Count    0x0032   100   100   000    Old_age   Always       -       0
 184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
 187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
 188 Command_Timeout         0x0032   100   099   000    Old_age   Always       -       34360262665
 189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
 190 Airflow_Temperature_Cel 0x0022   072   046   040    Old_age   Always       -       28 (Min/Max 22/33)
 191 G-Sense_Error_Rate      0x0032   099   099   000    Old_age   Always       -       2867
 192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       3
 193 Load_Cycle_Count        0x0032   001   001   000    Old_age   Always       -       223651
 194 Temperature_Celsius     0x0022   028   054   000    Old_age   Always       -       28 (0 19 0 0 0)
 195 Hardware_ECC_Recovered  0x001a   079   064   000    Old_age   Always       -       74443112
 197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
 198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
 199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
 240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       2389 (212 48 0)
 241 Total_LBAs_Written      0x0000   100   253   000    Old_age   Offline      -       35963005615
 242 Total_LBAs_Read         0x0000   100   253   000    Old_age   Offline      -       14641645157
 254 Free_Fall_Sensor        0x0032   100   100   000    Old_age   Always       -       0
 
 SMART Error Log Version: 1
 No Errors Logged
 
 SMART Self-test log structure revision number 1
 No self-tests have been logged.  [To run self-tests, use: smartctl -t]
 
 SMART Selective self-test log data structure revision number 1
  SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
     1        0        0  Not_testing
     2        0        0  Not_testing
     3        0        0  Not_testing
     4        0        0  Not_testing
     5        0        0  Not_testing
 Selective self-test flags (0x0):
   After scanning selected spans, do NOT read-scan remainder of disk.
 If Selective self-test is pending on power-up, resume after 0 minute delay.
 
 -- 
 Andreas Gustafsson, gson%gson.org@localhost
 


Home | Main Index | Thread Index | Old Index