• [deleted]

Here is the line that keeps coming back in the file /var/log/messages:
Sep  2 22:30:14 localhost smartd[2010]: Device: /dev/sda, 1 Offline uncorrectable sectors
Is it serious??? :-o
Excerpt from «man smartd.conf»:
SMARTD_FAILTYPE
      gives  the  reason  for  the  warning or message email.  The possible values that it takes and their meanings are:
[...]
      OfflineUncorrectableSector: during off-line testing, or self-testing, one or more disk sectors could not be read.
.
Put plainly, you would be well advised to run a serious integrity test on your disk before you risk a really nasty surprise.
  • [deleted]

How / with what do I run the integrity test?
Hi.

Back up the personal data you have on your hard drive as soon as possible...

If your hard drive is /dev/sda, try:
# smartctl -a /dev/sda

to collect some info. What results do you get?

++
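
To answer the "how / with what" question: one common option is the drive's built-in extended self-test, started with smartctl -t long and read back later with smartctl -l selftest. Below is a minimal Python sketch of that, assuming smartmontools is installed, the script runs as root, and the drive answers SMART commands. The helper names and the runner parameter are illustrative only, not part of smartmontools.

```python
import subprocess
from typing import Callable, List

# A runner takes an argument list and returns the command's text output.
Runner = Callable[[List[str]], str]


def run_command(argv: List[str]) -> str:
    """Run a command and return its standard output as text."""
    result = subprocess.run(argv, capture_output=True, text=True, check=False)
    return result.stdout


def start_long_self_test(device: str, runner: Runner = run_command) -> str:
    """Ask the drive to start an extended (long) self-test."""
    return runner(["smartctl", "-t", "long", device])


def read_self_test_log(device: str, runner: Runner = run_command) -> str:
    """Fetch the self-test log; meaningful once the test has finished."""
    return runner(["smartctl", "-l", "selftest", device])
```

A typical sequence would be start_long_self_test("/dev/sda"), wait for the duration smartctl announces, then read_self_test_log("/dev/sda"). Do the backup first, as advised above.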
.
For what it's worth, have a look at the man page at http://smartmontools.sourceforge.net/man/smartd.8.html

When you read there that "The purpose of SMART is to monitor the reliability of the hard drive and predict drive failures", you can only take every warning it issues seriously. After that, as Eddy33 rightly says, you have to assess the nature of the risk, but as soon as disks are involved, it is never trivial.
Hi.

Send us the results once your backup is done...

For my work PCs running win$, there is a free tool, speedfan, with a SMART tab for checking the state of a hard drive: http://www.almico.com/speedfan.php


++
  • [deleted]

I just installed speedfan under Windows, and everything shows green!
  • [deleted]

I just ran a scandisk under Windows XP with both boxes ticked, and it found nothing!
hmm...
and are the error counters in speedfan at 0?
If so, you can sleep soundly.


++
  • [deleted]

I downloaded the powermax tool (boot CD) from the maxtor site to run a basic test of the hard drive. It found nothing!

Otherwise, here is what the command returns:
[root@localhost ~]# smartctl -a /dev/sda
smartctl version 5.36 [i686-redhat-linux-gnu] Copyright (C) 2002-6 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

Device: ATA      Maxtor 6Y250M0   Version: YAR5
Serial number: Y65TLSYE
Device type: disk
Local Time is: Sun Sep  3 20:38:30 2006 CEST
Device does not support SMART

Error Counter logging not supported

[GLTSD (Global Logging Target Save Disable) set. Enable Save with '-S on']
Device does not support Self Test logging
[root@localhost ~]# smartctl -S on /dev/sda2
smartctl version 5.36 [i686-redhat-linux-gnu] Copyright (C) 2002-6 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

Enable autosave (clear GLTSD bit) failed
weird, isn't it?
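
A possible explanation, offered with caution: on kernels of that period, SATA drives handled by libata show up as SCSI-style /dev/sdX devices, and smartctl 5.36 then queries them as SCSI unless told otherwise with -d ata. A "Device: ATA ..." header followed by "Device does not support SMART" is the usual sign. The Python sketch below only inspects smartctl's text output and proposes the command to retry; the function names are made up for illustration.

```python
from typing import List


def looks_like_sata_seen_as_scsi(smartctl_output: str) -> bool:
    """Return True when the output shows an ATA drive queried as SCSI."""
    lines = [line.strip() for line in smartctl_output.splitlines()]
    vendor_is_ata = any(line.startswith("Device: ATA") for line in lines)
    no_smart = any(line == "Device does not support SMART" for line in lines)
    return vendor_is_ata and no_smart


def suggested_command(device: str, smartctl_output: str) -> List[str]:
    """Build the smartctl call to try next, adding '-d ata' if it seems needed."""
    argv = ["smartctl", "-a"]
    if looks_like_sata_seen_as_scsi(smartctl_output):
        argv += ["-d", "ata"]
    return argv + [device]
```

With the output pasted above, this would suggest retrying as smartctl -a -d ata /dev/sda (on the whole disk, not on a partition such as /dev/sda2).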
  • [deleted]

I checked in the BIOS, and SMART is enabled there all the same!

As for SpeedFan, I can't find the error counter... There is a Warn column telling me there are lots of warnings for certain options.
  • [deleted]

If I run the "Perform an in-depth online analysis of this hard disk" test, I get a results web page with this:
Your hard disk is a Maxtor 6Y250M0 with firmware YAR51HW0.
The average temperature for this hard disk is 44C (MIN=28C MAX=61C) and yours is 62C.
Your hard disk's S.M.A.R.T. attributes are now being analyzed and a full report about the reliability, health and status of your hard disk is generated:
Your hard disk is not below any attribute threshold. This is good.
Your hard disk was never below any attribute threshold. This is good.
Your hard disk is now being compared to real data used to define normal values for your specific hard disk model. This way, the analysis can automatically use proper operating ranges. The images give you an idea of how each attribute is within such range. Current and raw values are shown for easier reference for experienced users. There are 1047 hard disk models in the current archive.

    Attribute                           Current  Raw    Overall
 2  Spin Up Time                        178      20092  Normal
10  Start/Stop Count                    253      215    Very good
10  Reallocated Sector Count            253      5      Very good
10  Read Channel Margin                 253      0      Very good
10  Seek Error Rate                     253      0      Very good
 9  Seek Time Performance               252      37791  Very good
10  Power On Hours Count                250      5624   Very good
10  Spin Retry Count                    253      0      Very good
10  Calibration Retry Count             253      0      Very good
 6  Power Cycle Count                   252      589    Good
10  Power Off Retract Count             253      0      Very good
10  Load Cycle Count                    253      0      Very good
10  Hardware ECC Recovered              253      4675   Very good
10  Reallocated Event Count             253      0      Very good
10  Current Pending Sector              253      0      Very good
10  Offline Uncorrectable Sector Count  253      0      Very good
10  Ultra DMA CRC Error Rate            199      0      Very good
10  Write Error Rate                    253      0      Very good
10  Soft Read Error Rate                253      36     Very good
10  TA Increase Count                   253      0      Very good
10  Run Out Cancel                      253      2      Very good
10  Shock Count Write Opern             253      0      Very good
10  Shock Rate Write Opern              253      0      Very good
10  Spin High Current                   253      0      Very good
10  Spin Buzz                           253      0      Very good
 1  Offline Seek Performance            193      0      Normal
10  Unknown attribute 99                253      0      Very good
10  Unknown attribute 100               253      0      Very good
10  Unknown attribute 101               253      0      Very good

All of the attributes of your hard disk have normal values. This is good.

The overall fitness for this drive is 92%.
The overall performance for this drive is 92%.
Well, that looks OK...

For speedfan, the first column should show green OKs, and in the last column (raw) the "count" and "error" counters should be at 0.

++
5 months later
Yep, same problem here too

and e2fsck does nothing about it, still this damned message
Jan 22 23:39:42 localhost smartd[3233]: Device: /dev/hdb, 36 Offline uncorrectable sectors
5 days later
For info, I get this message (apparently every half hour, though not all the time either) in /var/log/messages
Jan 28 04:37:03 localhost smartd[3117]: Device: /dev/hdb, 36 Offline uncorrectable sectors 
Jan 28 05:07:02 localhost smartd[3117]: Device: /dev/hdb, 36 Offline uncorrectable sectors 
Jan 28 05:37:02 localhost smartd[3117]: Device: /dev/hdb, 36 Offline uncorrectable sectors 
Jan 28 06:07:02 localhost smartd[3117]: Device: /dev/hdb, 36 Offline uncorrectable sectors 
Jan 28 06:37:02 localhost smartd[3117]: Device: /dev/hdb, 36 Offline uncorrectable sectors 
Jan 28 07:07:03 localhost smartd[3117]: Device: /dev/hdb, 36 Offline uncorrectable sectors 
Jan 28 07:37:02 localhost smartd[3117]: Device: /dev/hdb, 36 Offline uncorrectable sectors 
Jan 28 08:07:02 localhost smartd[3117]: Device: /dev/hdb, 36 Offline uncorrectable sectors 
Jan 28 08:37:02 localhost smartd[3117]: Device: /dev/hdb, 36 Offline uncorrectable sectors 
Jan 28 09:07:02 localhost smartd[3117]: Device: /dev/hdb, 36 Offline uncorrectable sectors 
Jan 28 09:37:03 localhost smartd[3117]: Device: /dev/hdb, 36 Offline uncorrectable sectors 
Jan 28 10:07:02 localhost smartd[3117]: Device: /dev/hdb, 36 Offline uncorrectable sectors
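
To check from lines like these whether the sector count is stable or climbing, and how far apart the messages are, here is a small Python sketch. It assumes the syslog format shown above; since syslog lines carry no year, the year argument is supplied by the caller, and the function names are illustrative. The half-hour spacing would be consistent with smartd's default polling interval of 1800 seconds.

```python
import re
from datetime import datetime

# Matches lines such as:
# Jan 28 04:37:03 localhost smartd[3117]: Device: /dev/hdb, 36 Offline uncorrectable sectors
LINE_RE = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+\s+\d{2}:\d{2}:\d{2})\s+\S+\s+smartd\[\d+\]:\s+"
    r"Device:\s+(?P<device>[^,\s]+),\s+(?P<count>\d+)\s+Offline uncorrectable sectors"
)


def parse_smartd_lines(text, year):
    """Return (timestamp, device, count) tuples for each matching log line."""
    records = []
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if match is None:
            continue  # not a smartd "Offline uncorrectable" line
        stamp = datetime.strptime(
            f"{year} {match.group('stamp')}", "%Y %b %d %H:%M:%S"
        )
        records.append((stamp, match.group("device"), int(match.group("count"))))
    return records


def summarize(records):
    """Report first and last counts, whether the count grew, and gaps in minutes."""
    counts = [count for _, _, count in records]
    gaps = [
        round((later[0] - earlier[0]).total_seconds() / 60)
        for earlier, later in zip(records, records[1:])
    ]
    return {
        "first": counts[0],
        "last": counts[-1],
        "growing": counts[-1] > counts[0],
        "gaps_minutes": gaps,
    }
```

A count that stays flat means no new unreadable sectors were found between polls; a rising one would be the more worrying sign.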
Running smartctl -a /dev/hdb gives me this:
smartctl version 5.36 [i386-redhat-linux-gnu] Copyright (C) 2002-6 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Model Family:     Maxtor DiamondMax 10 family
Device Model:     Maxtor 6B200P0
Serial Number:    B40SFJPH
Firmware Version: BAH41B70
User Capacity:    203,928,109,056 bytes
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   7
ATA Standard is:  ATA/ATAPI-7 T13 1532D revision 0
Local Time is:    Sun Jan 28 10:28:26 2007 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever 
                                        been run.
Total time to complete Offline 
data collection:                 (1622) seconds.
Offline data collection
capabilities:                    (0x5b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        No General Purpose Logging support.
Short self-test routine 
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        (  82) minutes.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  3 Spin_Up_Time            0x0027   206   205   063    Pre-fail  Always       -       14576
  4 Start_Stop_Count        0x0032   253   253   000    Old_age   Always       -       124
  5 Reallocated_Sector_Ct   0x0033   251   250   063    Pre-fail  Always       -       25
  6 Read_Channel_Margin     0x0001   253   253   100    Pre-fail  Offline      -       0
  7 Seek_Error_Rate         0x000a   253   252   000    Old_age   Always       -       0
  8 Seek_Time_Performance   0x0027   247   220   187    Pre-fail  Always       -       42521
  9 Power_On_Minutes        0x0032   210   210   000    Old_age   Always       -       793h+24m
 10 Spin_Retry_Count        0x002b   253   252   157    Pre-fail  Always       -       0
 11 Calibration_Retry_Count 0x002b   253   252   223    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   253   253   000    Old_age   Always       -       107
192 Power-Off_Retract_Count 0x0032   253   253   000    Old_age   Always       -       0
193 Load_Cycle_Count        0x0032   253   253   000    Old_age   Always       -       0
194 Temperature_Celsius     0x0032   044   253   000    Old_age   Always       -       35
195 Hardware_ECC_Recovered  0x000a   253   252   000    Old_age   Always       -       13114
196 Reallocated_Event_Count 0x0008   245   245   000    Old_age   Offline      -       8
197 Current_Pending_Sector  0x0008   253   252   000    Old_age   Offline      -       0
198 Offline_Uncorrectable   0x0008   217   217   000    Old_age   Offline      -       36
199 UDMA_CRC_Error_Count    0x0008   199   199   000    Old_age   Offline      -       0
200 Multi_Zone_Error_Rate   0x000a   253   252   000    Old_age   Always       -       0
201 Soft_Read_Error_Rate    0x000a   253   252   000    Old_age   Always       -       1
202 TA_Increase_Count       0x000a   253   243   000    Old_age   Always       -       0
203 Run_Out_Cancel          0x000b   253   252   180    Pre-fail  Always       -       0
204 Shock_Count_Write_Opern 0x000a   253   252   000    Old_age   Always       -       0
205 Shock_Rate_Write_Opern  0x000a   253   252   000    Old_age   Always       -       0
207 Spin_High_Current       0x002a   253   252   000    Old_age   Always       -       0
208 Spin_Buzz               0x002a   253   252   000    Old_age   Always       -       0
209 Offline_Seek_Performnce 0x0024   240   240   000    Old_age   Offline      -       166
210 Unknown_Attribute       0x0032   253   252   000    Old_age   Always       -       0
211 Unknown_Attribute       0x0032   253   252   000    Old_age   Always       -       0
212 Unknown_Attribute       0x0032   253   253   000    Old_age   Always       -       0

SMART Error Log Version: 1
ATA Error Count: 2728 (device log contains only the most recent five errors)
        CR = Command Register [HEX]
        FR = Features Register [HEX]
        SC = Sector Count Register [HEX]
        SN = Sector Number Register [HEX]
        CL = Cylinder Low Register [HEX]
        CH = Cylinder High Register [HEX]
        DH = Device/Head Register [HEX]
        DC = Device Command Register [HEX]
        ER = Error register [HEX]
        ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 2728 occurred at disk power-on lifetime: 8780 hours (365 days + 20 hours)
  When the command that caused the error occurred, the device was in an unknown state.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  5a 4a 70 6f 96 78 e0

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  e0 03 70 6f 96 78 e0 00      23:49:46.219  STANDBY IMMEDIATE
  e0 03 08 67 96 78 e0 00      23:49:44.798  STANDBY IMMEDIATE
  e0 03 01 00 00 00 e0 00      23:49:44.711  STANDBY IMMEDIATE
  e0 03 08 67 96 78 e0 00      23:49:41.882  STANDBY IMMEDIATE
  e0 03 01 00 00 00 e0 00      23:49:41.866  STANDBY IMMEDIATE

Error 2727 occurred at disk power-on lifetime: 8780 hours (365 days + 20 hours)
  When the command that caused the error occurred, the device was in an unknown state.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  5a 4a 01 00 00 00 e0

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  e0 03 01 00 00 00 e0 00      23:49:44.711  STANDBY IMMEDIATE
  e0 03 08 67 96 78 e0 00      23:49:41.882  STANDBY IMMEDIATE
  e0 03 01 00 00 00 e0 00      23:49:41.866  STANDBY IMMEDIATE
  e0 03 08 5f 96 78 e0 00      23:49:41.779  STANDBY IMMEDIATE
  e0 03 78 67 96 78 e0 00      23:49:40.298  STANDBY IMMEDIATE

Error 2726 occurred at disk power-on lifetime: 8780 hours (365 days + 20 hours)
  When the command that caused the error occurred, the device was in an unknown state.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  5a 4a 08 5f 96 78 e0

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  e0 03 08 5f 96 78 e0 00      23:49:41.779  STANDBY IMMEDIATE
  e0 03 78 67 96 78 e0 00      23:49:40.298  STANDBY IMMEDIATE
  e0 03 80 5f 96 78 e0 00      23:49:37.505  STANDBY IMMEDIATE
  e0 03 80 df 95 78 e0 00      23:49:37.501  STANDBY IMMEDIATE
  e0 03 80 5f 95 78 e0 00      23:49:37.498  STANDBY IMMEDIATE

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]


SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
What do you think?
What state is the disk in? It's my ext3 partition for downloads, so inevitably it gets quite heavy use..

edit: the 'funny' thing is that since 3:37 this morning I have had no more messages in the console
(I should add that the same software was running at 3 in the morning as now, apart from gnome radio to listen to FMR 8-) )
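
On the "what state is the disk in?" question, the overall self-assessment says PASSED, but several sector-related raw values in the attribute table are not zero. The Python sketch below pulls those out of pasted smartctl -a output. The choice of IDs to watch (5, 196, 197, 198) is an assumption about which attributes matter most here, and the function names are illustrative.

```python
# Attribute IDs treated as sector-health counters (an assumption, see above).
WATCHED_IDS = {5, 196, 197, 198}


def parse_attributes(smartctl_output):
    """Map attribute ID to (name, raw value text) from the attribute table."""
    rows = {}
    for line in smartctl_output.splitlines():
        fields = line.split()
        # Attribute rows have 10 columns: a numeric ID, a name, a 0x... flag,
        # and the raw value in the last column.
        if (
            len(fields) >= 10
            and fields[0].isdigit()
            and fields[2].startswith("0x")
        ):
            rows[int(fields[0])] = (fields[1], fields[9])
    return rows


def nonzero_sector_counters(smartctl_output):
    """Return (id, name, raw) for watched attributes whose raw value is above 0."""
    flagged = []
    for attr_id, (name, raw) in sorted(parse_attributes(smartctl_output).items()):
        if attr_id in WATCHED_IDS and raw.isdigit() and int(raw) > 0:
            flagged.append((attr_id, name, int(raw)))
    return flagged
```

On the table above this would flag Reallocated_Sector_Ct (25), Reallocated_Event_Count (8) and Offline_Uncorrectable (36), while Current_Pending_Sector stays at 0. Together with the 2728 entries in the ATA error log, that argues for backing up and running a long self-test rather than relying on PASSED alone.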