Hello everyone!
Every now and then I hear a (really) not very reassuring "click" coming from the hard drive cage of my case. Right after that "click", I see this:
[1027791.826359] ata2.00: exception Emask 0x10 SAct 0x8000000 SErr 0x40c0000 action 0xe frozen
[1027791.826367] ata2.00: irq_stat 0x00000040, connection status changed
[1027791.826369] ata2: SError: { CommWake 10B8B DevExch }
[1027791.826375] ata2.00: failed command: READ FPDMA QUEUED
[1027791.826376] ata2.00: cmd 60/58:d8:08:83:4e/02:00:02:00:00/40 tag 27 ncq dma 307200 in
res 50/00:58:08:83:4e/00:02:02:00:00/40 Emask 0x10 (ATA bus error)
[1027791.826383] ata2.00: status: { DRDY }
[1027791.826387] ata2: hard resetting link
[1027793.850442] ata2: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
[1027793.854264] ata2.00: configured for UDMA/133
[1027793.854485] sd 1:0:0:0: [sdb] tag#27 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=2s
[1027793.854491] sd 1:0:0:0: [sdb] tag#27 Sense Key : Illegal Request [current]
[1027793.854494] sd 1:0:0:0: [sdb] tag#27 Add. Sense: Unaligned write command
[1027793.854497] sd 1:0:0:0: [sdb] tag#27 CDB: Read(10) 28 00 02 4e 83 08 00 02 58 00
[1027793.854499] I/O error, dev sdb, sector 38699784 op 0x0:(READ) flags 0x80700 phys_seg 65 prio class 0
[1027793.854530] ata2: EH complete
[1027794.290459] ata2.00: exception Emask 0x10 SAct 0xf00000 SErr 0x40c0000 action 0xe frozen
[1027794.290466] ata2.00: irq_stat 0x00000040, connection status changed
[1027794.290469] ata2: SError: { CommWake 10B8B DevExch }
[1027794.290474] ata2.00: failed command: READ FPDMA QUEUED
[1027794.290476] ata2.00: cmd 60/68:a0:00:22:40/00:00:02:00:00/40 tag 20 ncq dma 53248 in
res 50/00:48:c0:22:40/00:00:02:00:00/40 Emask 0x10 (ATA bus error)
[1027794.290482] ata2.00: status: { DRDY }
[1027794.290485] ata2.00: failed command: READ FPDMA QUEUED
[1027794.290486] ata2.00: cmd 60/30:a8:78:22:40/00:00:02:00:00/40 tag 21 ncq dma 24576 in
res 50/00:48:c0:22:40/00:00:02:00:00/40 Emask 0x10 (ATA bus error)
[1027794.290492] ata2.00: status: { DRDY }
[1027794.290494] ata2.00: failed command: READ FPDMA QUEUED
[1027794.290496] ata2.00: cmd 60/08:b0:b0:22:40/00:00:02:00:00/40 tag 22 ncq dma 4096 in
res 50/00:48:c0:22:40/00:00:02:00:00/40 Emask 0x10 (ATA bus error)
[1027794.290501] ata2.00: status: { DRDY }
[1027794.290503] ata2.00: failed command: READ FPDMA QUEUED
[1027794.290505] ata2.00: cmd 60/48:b8:c0:22:40/00:00:02:00:00/40 tag 23 ncq dma 36864 in
res 50/00:48:c0:22:40/00:00:02:00:00/40 Emask 0x10 (ATA bus error)
[1027794.290510] ata2.00: status: { DRDY }
[1027794.290514] ata2: hard resetting link
[1027796.427544] ata2: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
[1027796.433004] ata2.00: configured for UDMA/133
[1027796.433318] sd 1:0:0:0: [sdb] tag#20 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=2s
[1027796.433323] sd 1:0:0:0: [sdb] tag#20 Sense Key : Illegal Request [current]
[1027796.433326] sd 1:0:0:0: [sdb] tag#20 Add. Sense: Unaligned write command
[1027796.433329] sd 1:0:0:0: [sdb] tag#20 CDB: Read(10) 28 00 02 40 22 00 00 00 68 00
[1027796.433330] I/O error, dev sdb, sector 37757440 op 0x0:(READ) flags 0x80700 phys_seg 13 prio class 0
[1027796.433344] sd 1:0:0:0: [sdb] tag#21 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=2s
[1027796.433346] sd 1:0:0:0: [sdb] tag#21 Sense Key : Illegal Request [current]
[1027796.433348] sd 1:0:0:0: [sdb] tag#21 Add. Sense: Unaligned write command
[1027796.433351] sd 1:0:0:0: [sdb] tag#21 CDB: Read(10) 28 00 02 40 22 78 00 00 30 00
[1027796.433352] I/O error, dev sdb, sector 37757560 op 0x0:(READ) flags 0x80700 phys_seg 6 prio class 0
[1027796.433358] sd 1:0:0:0: [sdb] tag#22 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=2s
[1027796.433360] sd 1:0:0:0: [sdb] tag#22 Sense Key : Illegal Request [current]
[1027796.433363] sd 1:0:0:0: [sdb] tag#22 Add. Sense: Unaligned write command
[1027796.433365] sd 1:0:0:0: [sdb] tag#22 CDB: Read(10) 28 00 02 40 22 b0 00 00 08 00
[1027796.433366] I/O error, dev sdb, sector 37757616 op 0x0:(READ) flags 0x80700 phys_seg 1 prio class 0
[1027796.433377] sd 1:0:0:0: [sdb] tag#23 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=2s
[1027796.433379] sd 1:0:0:0: [sdb] tag#23 Sense Key : Illegal Request [current]
[1027796.433381] sd 1:0:0:0: [sdb] tag#23 Add. Sense: Unaligned write command
[1027796.433384] sd 1:0:0:0: [sdb] tag#23 CDB: Read(10) 28 00 02 40 22 c0 00 00 48 00
[1027796.433385] I/O error, dev sdb, sector 37757632 op 0x0:(READ) flags 0x80700 phys_seg 9 prio class 0
[1027796.433389] ata2: EH complete
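For anyone reading the log above, here is a minimal Python sketch that decodes two of its fields: the SErr bitmask and the Read(10) CDB. The helper names (decode_serror, decode_read10) are my own and not part of any tool. The bit names are my understanding of the labels the Linux libata driver prints, and the CDB layout assumes a plain 10-byte READ(10) with 512-byte logical sectors, as smartctl reports for this drive.

```python
# Bit positions of the SATA SError register, labelled the way the kernel
# log prints them (assumption: these match libata's names).
SERR_BITS = {
    0: "RecovData", 1: "RecovComm", 8: "UnrecovData", 9: "Persist",
    10: "Proto", 11: "HostInt", 16: "PHYRdyChg", 17: "PHYInt",
    18: "CommWake", 19: "10B8B", 20: "Dispar", 21: "BadCRC",
    22: "Handshk", 23: "LinkSeq", 24: "TrStaTrns", 25: "UnrecFIS",
    26: "DevExch",
}


def decode_serror(value):
    """Return the names of the bits set in an SErr value, lowest bit first."""
    return [name for bit, name in sorted(SERR_BITS.items()) if value & (1 << bit)]


def decode_read10(cdb_hex):
    """Return (start LBA, block count) from a READ(10) CDB given as hex bytes."""
    cdb = bytes.fromhex(cdb_hex.replace(" ", ""))
    if len(cdb) != 10 or cdb[0] != 0x28:
        raise ValueError("not a 10-byte READ(10) CDB")
    lba = int.from_bytes(cdb[2:6], "big")     # bytes 2..5: logical block address
    blocks = int.from_bytes(cdb[7:9], "big")  # bytes 7..8: transfer length
    return lba, blocks


if __name__ == "__main__":
    print(decode_serror(0x40C0000))
    # ['CommWake', '10B8B', 'DevExch']
    lba, blocks = decode_read10("28 00 02 4e 83 08 00 02 58 00")
    print(lba, blocks, blocks * 512)
    # 38699784 600 307200
```

The decoded values line up with the log: the three SError flags printed by the kernel, the sector number in the "I/O error" line, and the "dma 307200" size of the failed command.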
But what I don't understand is that the (few) tools I know don't seem to tell me that my HDD is dead.
So here is the smartctl output:
sudo smartctl -s on -a /dev/sdb
smartctl 7.3 2022-02-28 r5338 [x86_64-linux-5.18.18-200.fc36.x86_64] (local build)
Copyright (C) 2002-22, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Model Family: HGST Travelstar 7K1000
Device Model: HGST HTS721010A9E630
Serial Number: JR1000BN1WKMJE
LU WWN Device Id: 5 000cca 8d8da9fb4
Firmware Version: JB0OA3U0
User Capacity: 1000204886016 bytes [1,00 TB]
Sector Sizes: 512 bytes logical, 4096 bytes physical
Rotation Rate: 7200 rpm
Form Factor: 2.5 inches
Device is: In smartctl database 7.3/5319
ATA Version is: ATA8-ACS T13/1699-D revision 6
SATA Version is: SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is: Thu Sep 1 18:14:15 2022 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF ENABLE/DISABLE COMMANDS SECTION ===
SMART Enabled.
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status:  (0x80) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                 (  45) seconds.
Offline data collection
capabilities:                    (0x5b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        ( 165) minutes.
SCT capabilities:              (0x003d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000b   100   100   062    Pre-fail  Always       -       0
  2 Throughput_Performance  0x0005   100   100   040    Pre-fail  Offline      -       0
  3 Spin_Up_Time            0x0007   253   253   033    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0012   096   096   000    Old_age   Always       -       7252
  5 Reallocated_Sector_Ct   0x0033   100   100   005    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000b   100   100   067    Pre-fail  Always       -       0
  8 Seek_Time_Performance   0x0005   100   100   040    Pre-fail  Offline      -       0
  9 Power_On_Hours          0x0012   040   040   000    Old_age   Always       -       26429
 10 Spin_Retry_Count        0x0013   100   100   060    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       796
191 G-Sense_Error_Rate      0x000a   100   100   000    Old_age   Always       -       0
192 Power-Off_Retract_Count 0x0032   099   099   000    Old_age   Always       -       224
193 Load_Cycle_Count        0x0012   099   099   000    Old_age   Always       -       14375
194 Temperature_Celsius     0x0002   176   176   000    Old_age   Always       -       34 (Min/Max 14/45)
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0022   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0008   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x000a   200   200   000    Old_age   Always       -       0
223 Load_Retry_Count        0x000a   100   100   000    Old_age   Always       -       0
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed without error 00% 26201 -
# 2 Short offline Completed without error 00% 26195 -
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
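To check the attribute table above without reading it by eye, here is a rough Python sketch. It assumes the table is pasted as plain text in the column order smartctl prints (ID#, name, flag, value, worst, thresh, type, updated, when_failed, raw). The function names and the choice of watched IDs (5, 197, 198, 199: the reallocated, pending, uncorrectable and CRC error counters) are my own, not something smartctl defines.

```python
# Attribute IDs whose raw counter I chose to watch (my own selection).
WATCHED_IDS = {5, 197, 198, 199}


def parse_attributes(text):
    """Map attribute ID -> (name, raw value) for every attribute row in text."""
    attrs = {}
    for line in text.splitlines():
        parts = line.split(None, 9)  # 10 columns; the last one may contain spaces
        if len(parts) < 10 or not parts[0].isdigit():
            continue  # header or unrelated line
        raw_tokens = parts[9].split()
        if not raw_tokens or not raw_tokens[0].isdigit():
            continue  # raw value is not a plain integer
        attrs[int(parts[0])] = (parts[1], int(raw_tokens[0]))
    return attrs


def nonzero_watched(text):
    """Return (id, name, raw) for watched attributes whose raw value is not 0."""
    attrs = parse_attributes(text)
    return [(i, *attrs[i]) for i in sorted(WATCHED_IDS)
            if i in attrs and attrs[i][1] != 0]


SAMPLE = """\
5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 0
194 Temperature_Celsius 0x0002 176 176 000 Old_age Always - 34 (Min/Max 14/45)
197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 0
"""

if __name__ == "__main__":
    print(nonzero_watched(SAMPLE))  # prints []
```

With the values posted above, all four counters are 0, so the list comes back empty.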
The output of the long test:
sudo smartctl -l selftest /dev/sdb
smartctl 7.3 2022-02-28 r5338 [x86_64-linux-5.18.18-200.fc36.x86_64] (local build)
Copyright (C) 2002-22, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF READ SMART DATA SECTION ===
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed without error 00% 26433 -
# 2 Short offline Completed without error 00% 26430 -
# 3 Extended offline Completed without error 00% 26201 -
# 4 Short offline Completed without error 00% 26195 -
So, my question: can a hard drive be failing regardless of what tools such as smartctl report? Do you see anything in the results that seems to indicate my drive is in bad shape?
What do you think, please?
PS: just in case, I swapped the SATA cable, same symptoms. I haven't tried a different SATA port on the motherboard, but I don't have much faith in that.