NetBSD-Bugs archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: kern/56978: nvme hangs under very heavy loads



The following reply was made to PR kern/56978; it has been noted by GNATS.

From: Paul Goyette <paul%whooppee.com@localhost>
To: gnats-bugs%netbsd.org@localhost
Cc: kern-bug-people%netbsd.org@localhost
Subject: Re: kern/56978: nvme hangs under very heavy loads
Date: Wed, 24 Aug 2022 08:31:34 -0700 (PDT)

 +-------+
 | NVME0 |
 +-------+
 
 Controller Capabilities/Features
 ================================
 Vendor ID:                  144d
 Subsystem Vendor ID:        144d
 Serial Number:              S3EWNX0K108171P
 Model Number:               Samsung SSD 960 PRO 512GB
 Firmware Version:           2B6QCXP7
 Recommended Arb Burst:      2
 IEEE OUI Identifier:        38 25 00
 Multi-Interface Cap:        00
 Max Data Transfer Size:     2097152
 Controller ID:              0x02
 
 Admin Command Set Attributes
 ============================
 Security Send/Receive:       Supported
 Format NVM:                  Supported
 Firmware Activate/Download:  Supported
 Namespace Management:        Not Supported
 Abort Command Limit:         8
 Async Event Request Limit:   4
 Number of Firmware Slots:    3
 Firmware Slot 1 Read-Only:   No
 Per-Namespace SMART Log:     Yes
 Error Log Page Entries:      64
 Number of Power States:      5
 
 NVM Command Set Attributes
 ==========================
 Submission Queue Entry Size
    Max:                       64
    Min:                       64
 Completion Queue Entry Size
    Max:                       16
    Min:                       16
 Number of Namespaces:        1
 Compare Command:             Supported
 Write Uncorrectable Command: Supported
 Dataset Management Command:  Supported
 Write Zeroes Command:        Supported
 Features Save/Select Field:  Supported
 Reservation:                 Not Supported
 Volatile Write Cache:        Present
 Autonomous Power State Transitions:    Supported
 Error Information Log
 =====================
 Entry 01
 =========
   Error count:           17
   Submission queue ID:   1
   Command ID:            65535
   Status:
    Phase tag:            0
    Status code:          0
    Status code type:     0
    More:                 0
    DNR:                  0
   Error location:        0
   LBA:                   0
   Namespace ID:          0
   Vendor specific info:  0
   Command specific info: 0
 Entry 02
 =========
   Error count:           16
   Submission queue ID:   1
   Command ID:            65535
   Status:
    Phase tag:            0
    Status code:          0
    Status code type:     0
    More:                 0
    DNR:                  0
   Error location:        0
   LBA:                   0
   Namespace ID:          0
   Vendor specific info:  0
   Command specific info: 0
 Entry 03
 =========
   Error count:           15
   Submission queue ID:   0
   Command ID:            0
   Status:
    Phase tag:            0
    Status code:          11
    Status code type:     0
    More:                 1
    DNR:                  0
   Error location:        4
   LBA:                   0
   Namespace ID:          65535
   Vendor specific info:  0
   Command specific info: 0
 Entry 04
 =========
   Error count:           14
   Submission queue ID:   0
   Command ID:            0
   Status:
    Phase tag:            0
    Status code:          9
    Status code type:     1
    More:                 1
    DNR:                  0
   Error location:        40
   LBA:                   0
   Namespace ID:          65535
   Vendor specific info:  0
   Command specific info: 0
 Entry 05
 =========
   Error count:           13
   Submission queue ID:   0
   Command ID:            0
   Status:
    Phase tag:            0
    Status code:          9
    Status code type:     1
    More:                 1
    DNR:                  0
   Error location:        40
   LBA:                   0
   Namespace ID:          65535
   Vendor specific info:  0
   Command specific info: 0
 Entry 06
 =========
   Error count:           12
   Submission queue ID:   0
   Command ID:            0
   Status:
    Phase tag:            0
    Status code:          9
    Status code type:     1
    More:                 1
    DNR:                  0
   Error location:        40
   LBA:                   0
   Namespace ID:          65535
   Vendor specific info:  0
   Command specific info: 0
 Entry 07
 =========
   Error count:           11
   Submission queue ID:   0
   Command ID:            0
   Status:
    Phase tag:            0
    Status code:          9
    Status code type:     1
    More:                 1
    DNR:                  0
   Error location:        40
   LBA:                   0
   Namespace ID:          65535
   Vendor specific info:  0
   Command specific info: 0
 Entry 08
 =========
   Error count:           10
   Submission queue ID:   0
   Command ID:            0
   Status:
    Phase tag:            0
    Status code:          9
    Status code type:     1
    More:                 1
    DNR:                  0
   Error location:        40
   LBA:                   0
   Namespace ID:          65535
   Vendor specific info:  0
   Command specific info: 0
 Entry 09
 =========
   Error count:           9
   Submission queue ID:   0
   Command ID:            0
   Status:
    Phase tag:            0
    Status code:          9
    Status code type:     1
    More:                 1
    DNR:                  0
   Error location:        40
   LBA:                   0
   Namespace ID:          65535
   Vendor specific info:  0
   Command specific info: 0
 Entry 10
 =========
   Error count:           8
   Submission queue ID:   0
   Command ID:            0
   Status:
    Phase tag:            0
    Status code:          11
    Status code type:     0
    More:                 1
    DNR:                  0
   Error location:        4
   LBA:                   0
   Namespace ID:          65535
   Vendor specific info:  0
   Command specific info: 0
 Entry 11
 =========
   Error count:           7
   Submission queue ID:   0
   Command ID:            0
   Status:
    Phase tag:            0
    Status code:          11
    Status code type:     0
    More:                 1
    DNR:                  0
   Error location:        4
   LBA:                   0
   Namespace ID:          65535
   Vendor specific info:  0
   Command specific info: 0
 Entry 12
 =========
   Error count:           6
   Submission queue ID:   0
   Command ID:            0
   Status:
    Phase tag:            0
    Status code:          11
    Status code type:     0
    More:                 1
    DNR:                  0
   Error location:        4
   LBA:                   0
   Namespace ID:          65535
   Vendor specific info:  0
   Command specific info: 0
 Entry 13
 =========
   Error count:           5
   Submission queue ID:   0
   Command ID:            0
   Status:
    Phase tag:            0
    Status code:          9
    Status code type:     1
    More:                 1
    DNR:                  0
   Error location:        40
   LBA:                   0
   Namespace ID:          65535
   Vendor specific info:  0
   Command specific info: 0
 Entry 14
 =========
   Error count:           4
   Submission queue ID:   0
   Command ID:            0
   Status:
    Phase tag:            0
    Status code:          9
    Status code type:     1
    More:                 1
    DNR:                  0
   Error location:        40
   LBA:                   0
   Namespace ID:          65535
   Vendor specific info:  0
   Command specific info: 0
 Entry 15
 =========
   Error count:           3
   Submission queue ID:   0
   Command ID:            0
   Status:
    Phase tag:            0
    Status code:          9
    Status code type:     1
    More:                 1
    DNR:                  0
   Error location:        40
   LBA:                   0
   Namespace ID:          65535
   Vendor specific info:  0
   Command specific info: 0
 Entry 16
 =========
   Error count:           2
   Submission queue ID:   0
   Command ID:            0
   Status:
    Phase tag:            0
    Status code:          9
    Status code type:     1
    More:                 1
    DNR:                  0
   Error location:        40
   LBA:                   0
   Namespace ID:          65535
   Vendor specific info:  0
   Command specific info: 0
 Entry 17
 =========
   Error count:           1
   Submission queue ID:   0
   Command ID:            0
   Status:
    Phase tag:            0
    Status code:          11
    Status code type:     0
    More:                 1
    DNR:                  0
   Error location:        4
   LBA:                   0
   Namespace ID:          65535
   Vendor specific info:  0
   Command specific info: 0
 SMART/Health Information Log
 ============================
 Critical Warning State:         0x00
   Available spare:               0
   Temperature:                   0
   Device reliability:            0
   Read only:                     0
   Volatile memory backup:        0
 Temperature:                    320 K, 46.85 C, 116.33 F
 Available spare:                100
 Available spare threshold:      10
 Percentage used:                38
 Data units (512 byte) read:     22777887 
 Data units (512 byte) written:  331218824 
 Host read commands:             648551789 
 Host write commands:            12123039245 
 Controller busy time (minutes): 9012 
 Power cycles:                   45 
 Power on hours:                 34906 
 Unsafe shutdowns:               43 
 Media errors:                   0 
 No. error info log entries:     17 
 Warning Temp Composite Time:    0
 Error Temp Composite Time:      0
 Temperature Sensor 1:           320 K, 46.85 C, 116.33 F
 Temperature Sensor 2:           336 K, 62.85 C, 145.13 F
 
 +-------+
 | NVME1 |
 +-------+
 
 Controller Capabilities/Features
 ================================
 Vendor ID:                  144d
 Subsystem Vendor ID:        144d
 Serial Number:              S6S2NS0T528809P
 Model Number:               Samsung SSD 970 EVO Plus 2TB
 Firmware Version:           4B2QEXM7
 Recommended Arb Burst:      2
 IEEE OUI Identifier:        38 25 00
 Multi-Interface Cap:        00
 Max Data Transfer Size:     524288
 Controller ID:              0x06
 
 Admin Command Set Attributes
 ============================
 Security Send/Receive:       Supported
 Format NVM:                  Supported
 Firmware Activate/Download:  Supported
 Namespace Management:        Not Supported
 Abort Command Limit:         8
 Async Event Request Limit:   4
 Number of Firmware Slots:    3
 Firmware Slot 1 Read-Only:   No
 Per-Namespace SMART Log:     Yes
 Error Log Page Entries:      64
 Number of Power States:      5
 
 NVM Command Set Attributes
 ==========================
 Submission Queue Entry Size
    Max:                       64
    Min:                       64
 Completion Queue Entry Size
    Max:                       16
    Min:                       16
 Number of Namespaces:        1
 Compare Command:             Supported
 Write Uncorrectable Command: Supported
 Dataset Management Command:  Supported
 Write Zeroes Command:        Not Supported
 Features Save/Select Field:  Supported
 Reservation:                 Not Supported
 Volatile Write Cache:        Present
 Autonomous Power State Transitions:    Supported
 Error Information Log
 =====================
 No error entries found
 SMART/Health Information Log
 ============================
 Critical Warning State:         0x00
   Available spare:               0
   Temperature:                   0
   Device reliability:            0
   Read only:                     0
   Volatile memory backup:        0
 Temperature:                    322 K, 48.85 C, 119.93 F
 Available spare:                100
 Available spare threshold:      10
 Percentage used:                0
 Data units (512 byte) read:     1354369 
 Data units (512 byte) written:  13787261 
 Host read commands:             40158360 
 Host write commands:            282418753 
 Controller busy time (minutes): 158 
 Power cycles:                   3 
 Power on hours:                 446 
 Unsafe shutdowns:               2 
 Media errors:                   0 
 No. error info log entries:     0 
 Warning Temp Composite Time:    0
 Error Temp Composite Time:      0
 Temperature Sensor 1:           322 K, 48.85 C, 119.93 F
 Temperature Sensor 2:           333 K, 59.85 C, 139.73 F
 
 +-------+
 | NVME2 |
 +-------+
 
 Controller Capabilities/Features
 ================================
 Vendor ID:                  144d
 Subsystem Vendor ID:        144d
 Serial Number:              S6WRNS0T114070H
 Model Number:               Samsung SSD 980 PRO with Heatsink 2TB
 Firmware Version:           4B2QGXA7
 Recommended Arb Burst:      2
 IEEE OUI Identifier:        38 25 00
 Multi-Interface Cap:        00
 Max Data Transfer Size:     524288
 Controller ID:              0x06
 
 Admin Command Set Attributes
 ============================
 Security Send/Receive:       Supported
 Format NVM:                  Supported
 Firmware Activate/Download:  Supported
 Namespace Management:        Not Supported
 Abort Command Limit:         8
 Async Event Request Limit:   4
 Number of Firmware Slots:    3
 Firmware Slot 1 Read-Only:   No
 Per-Namespace SMART Log:     Yes
 Error Log Page Entries:      64
 Number of Power States:      5
 
 NVM Command Set Attributes
 ==========================
 Submission Queue Entry Size
    Max:                       64
    Min:                       64
 Completion Queue Entry Size
    Max:                       16
    Min:                       16
 Number of Namespaces:        1
 Compare Command:             Supported
 Write Uncorrectable Command: Supported
 Dataset Management Command:  Supported
 Write Zeroes Command:        Not Supported
 Features Save/Select Field:  Supported
 Reservation:                 Not Supported
 Volatile Write Cache:        Present
 Autonomous Power State Transitions:    Supported
 Error Information Log
 =====================
 No error entries found
 SMART/Health Information Log
 ============================
 Critical Warning State:         0x00
   Available spare:               0
   Temperature:                   0
   Device reliability:            0
   Read only:                     0
   Volatile memory backup:        0
 Temperature:                    322 K, 48.85 C, 119.93 F
 Available spare:                100
 Available spare threshold:      10
 Percentage used:                0
 Data units (512 byte) read:     64309 
 Data units (512 byte) written:  3493433 
 Host read commands:             2683385 
 Host write commands:            61951123 
 Controller busy time (minutes): 29 
 Power cycles:                   1 
 Power on hours:                 42 
 Unsafe shutdowns:               0 
 Media errors:                   0 
 No. error info log entries:     0 
 Warning Temp Composite Time:    0
 Error Temp Composite Time:      0
 Temperature Sensor 1:           322 K, 48.85 C, 119.93 F
 Temperature Sensor 2:           327 K, 53.85 C, 128.93 F
 
 
 +--------------------+--------------------------+----------------------+
 | Paul Goyette       | PGP Key fingerprint:     | E-mail addresses:    |
 | (Retired)          | FA29 0E3B 35AF E8AE 6651 | paul%whooppee.com@localhost    |
 | Software Developer | 0786 F758 55DE 53BA 7731 | pgoyette%netbsd.org@localhost  |
 | & Network Engineer |                          | pgoyette99%gmail.com@localhost |
 +--------------------+--------------------------+----------------------+
 


Home | Main Index | Thread Index | Old Index