SMART values 01 and C8 going up

Hello,

We’re using a set of WD1002FAEX drives in a RAID1 array in one of our file servers. On both drives we’re seeing the raw value of SMART attribute 01 going up, and on one of the drives the C8 value is also rising. SMART data for both disks is below. Is this anything to worry about?

All data on the disks is adequately backed up, but if the drives are on their way out we’d like to know in advance so we can replace them before we run into any downtime.

Many thanks in advance to anyone who can answer my question.

----------------------------------------------------------------------------
CrystalDiskInfo 5.0.0 (C) 2008-2012 hiyohiyo
                                Crystal Dew World : http://crystalmark.info/
----------------------------------------------------------------------------

    OS : Windows Server 2008 R2 Server Standard Edition (full installation) SP1 [6.1 Build 7601] (x64)
  Date : 2012/11/26 8:51:46

-- Controller Map ----------------------------------------------------------
 + Standard AHCI 1.0 Serial ATA Controller [ATA]
   - ATA Channel 0 (0)
   + ATA Channel 1 (1)
     - TSSTcorp CDDVDW SH-222BB ATA Device
 + Intel(R) Desktop/Workstation/Server Express Chipset SATA RAID Controller [SCSI]
   - OS
   - DATA
   - RD

-- Disk List ---------------------------------------------------------------
 (1) M4-CT064M4SSD2 : 64,0 GB [X/0/0, cs] - mi
 (2) M4-CT064M4SSD2 : 64,0 GB [X/0/1, cs] - mi
 (3) WDC WD1002FAEX-00Y9A0 : 1000,2 GB [X/0/2, cs]
 (4) WDC WD1002FAEX-00Y9A0 : 1000,2 GB [X/0/3, cs]
 (5) M4-CT256M4SSD2 : 256,0 GB [X/0/4, cs] - mi
 (6) M4-CT256M4SSD2 : 256,0 GB [X/0/5, cs] - mi

----------------------------------------------------------------------------
 (3) WDC WD1002FAEX-00Y9A0
----------------------------------------------------------------------------
           Model : WDC WD1002FAEX-00Y9A0
        Firmware : 05.01D05
   Serial Number : WD-WCAW33662374
       Disk Size : 1000,2 GB (8,4/137,4/1000,2)
     Buffer Size : Unknown
     Queue Depth : 32
    # of Sectors : 1953525168
   Rotation Rate : Unknown
       Interface : Serial ATA
   Major Version : ATA8-ACS
   Minor Version : ----
   Transfer Mode : SATA/600
  Power On Hours : 2219 hours
  Power On Count : 24 count
     Temparature : 19 C (66 F)
   Health Status : Good
        Features : S.M.A.R.T., 48bit LBA, NCQ
       APM Level : ----
       AAM Level : ----

-- S.M.A.R.T. --------------------------------------------------------------
ID Cur Wor Thr RawValues(6) Attribute Name
01 200 200 _51 000000000007 Read Error Rate
03 173 173 _21 0000000010ED Spin-Up Time
04 100 100 __0 00000000001A Start/Stop Count
05 200 200 140 000000000000 Reallocated Sectors Count
07 100 253 __0 000000000000 Seek Error Rate
09 _97 _97 __0 0000000008AB Power-On Hours
0A 100 253 __0 000000000000 Spin Retry Count
0B 100 253 __0 000000000000 Recalibration Retries
0C 100 100 __0 000000000018 Power Cycle Count
C0 200 200 __0 000000000008 Power-off Retract Count
C1 200 200 __0 000000000013 Load/Unload Cycle Count
C2 128 _96 __0 000000000013 Temperature
C4 200 200 __0 000000000000 Reallocation Event Count
C5 200 200 __0 000000000000 Current Pending Sector Count
C6 200 200 __0 000000000000 Uncorrectable Sector Count
C7 200 200 __0 000000000000 UltraDMA CRC Error Count
C8 200 200 __0 000000000002 Write Error Rate

----------------------------------------------------------------------------
 (4) WDC WD1002FAEX-00Y9A0
----------------------------------------------------------------------------
           Model : WDC WD1002FAEX-00Y9A0
        Firmware : 05.01D05
   Serial Number : WD-WCAW33654413
       Disk Size : 1000,2 GB (8,4/137,4/1000,2)
     Buffer Size : Unknown
     Queue Depth : 32
    # of Sectors : 1953525168
   Rotation Rate : Unknown
       Interface : Serial ATA
   Major Version : ATA8-ACS
   Minor Version : ----
   Transfer Mode : SATA/600
  Power On Hours : 2219 hours
  Power On Count : 24 count
     Temparature : 22 C (71 F)
   Health Status : Good
        Features : S.M.A.R.T., 48bit LBA, NCQ
       APM Level : ----
       AAM Level : ----

-- S.M.A.R.T. --------------------------------------------------------------
ID Cur Wor Thr RawValues(6) Attribute Name
01 200 200 _51 00000000001D Read Error Rate
03 173 173 _21 0000000010D4 Spin-Up Time
04 100 100 __0 00000000001A Start/Stop Count
05 200 200 140 000000000000 Reallocated Sectors Count
07 100 253 __0 000000000000 Seek Error Rate
09 _97 _97 __0 0000000008AB Power-On Hours
0A 100 253 __0 000000000000 Spin Retry Count
0B 100 253 __0 000000000000 Recalibration Retries
0C 100 100 __0 000000000018 Power Cycle Count
C0 200 200 __0 000000000008 Power-off Retract Count
C1 200 200 __0 000000000013 Load/Unload Cycle Count
C2 125 _96 __0 000000000016 Temperature
C4 200 200 __0 000000000000 Reallocation Event Count
C5 200 200 __0 000000000000 Current Pending Sector Count
C6 200 200 __0 000000000000 Uncorrectable Sector Count
C7 200 200 __0 000000000000 UltraDMA CRC Error Count
C8 200 200 __0 000000000000 Write Error Rate
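
A note on reading the dump above: the RawValues column is printed in hexadecimal, while Cur/Wor/Thr are normalized decimal values padded with underscores. The following is only a minimal Python sketch, assuming the row layout of this CrystalDiskInfo 5.0.0 output; the names `parse_smart_line` and `above_threshold` are made up for illustration. It turns a row into numbers and checks whether the normalized value is still above its threshold.

```python
def _padded_int(text):
    """Convert a decimal field padded with underscores, e.g. '_51' or '__0'."""
    return int(text.replace("_", "") or "0")


def parse_smart_line(line):
    """Parse one row such as '01 200 200 _51 000000000007 Read Error Rate'."""
    attr_id, cur, wor, thr, raw, name = line.split(None, 5)
    return {
        "id": attr_id,
        "current": _padded_int(cur),
        "worst": _padded_int(wor),
        "threshold": _padded_int(thr),
        "raw": int(raw, 16),  # raw column is hexadecimal in this dump
        "name": name.strip(),
    }


def above_threshold(attr):
    """True while the normalized value has not dropped to the threshold."""
    return attr["current"] > attr["threshold"]


# Rows copied from the dump for drive (4).
rows = [
    "01 200 200 _51 00000000001D Read Error Rate",
    "09 _97 _97 __0 0000000008AB Power-On Hours",
    "C8 200 200 __0 000000000000 Write Error Rate",
]
for row in rows:
    attr = parse_smart_line(row)
    print(attr["id"], attr["name"], attr["raw"], above_threshold(attr))
```

As a sanity check on the hex reading, the Power-On Hours raw value 8AB decodes to 2219, which matches the "Power On Hours : 2219 hours" line in the drive summary.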

You can try testing the drive with the DLG tool provided by WD.

WD DLG Tool

http://wdc.custhelp.com/app/answers/detail/a_id/940/session/L3RpbWUvMTM1NDEyNjM4MC9zaWQvZWg0Y2xxY2w%3D

Can the DLG tool scan drives that are part of a RAID array?

If it can, is it safe? Don’t want the tool “fixing” things and bringing the array out of sync.

You will need to take the drive out of the RAID array, and it is better to use the DOS version of the DLG tool so that it won’t compromise the data on the drive or the RAID.

Check the links below.

http://support.wdc.com/product/download.asp?groupid=612&sid=30&lang=en

http://wdc.custhelp.com/app/answers/detail/a_id/1329/related/1/session/L2F2LzEvdGltZS8xMzU0MjE2ODg4L3NpZC9DU2xKU3ZjbA%3D%3D

Interesting… I am also seeing the raw values for 0x01 (1) and 0xC8 (200) increasing, but the SMART current and worst values are still at a healthy 200. See the topic “WD10EFRX SMART-errors but status is OK” in this forum.

I’m wondering how old the drives are. I just got one recently (same model from an RMA)

Those SMART reports are harder to read than the output of some other SMART monitoring apps. Everyone is probably going to wonder about the integrity of the info you posted, hence the desire to get you to repeat the result with another program that outputs SMART data. And, I guess, the obvious: check for errors, which most likely isn’t going to turn up anything you’re not seeing already.

I have a Maxtor drive that has been plagued with those SMART errors, and I’m pretty sure it is related to some bad cables I have used (transmission errors, read or write). So that made me think: have you done any PC case cleaning lately? That assumes your answer to my first question (how old are the drives) is that they are not new. I’m getting at a potential cable/connection issue in a roundabout way.

If you can confirm from multiple sources that the SMART data is increasing on those values, I would be concerned myself, and after ruling out any variables like cables or drivers I would consider RMA’ing the drive, although I think those errors could also be connected to a driver issue. What OS are you running? If Windows, have a look in the Event Viewer for disk-related errors/warnings, and if there are any, maybe you could try changing settings or drivers.

Also, you aren’t running any cache software like “Romex FancyCache”, are you? It can use a filesystem driver to intercept write calls (deferred writes). Or anything else that could interfere with transmissions? For example, Raxco PerfectDisk has a default feature called “OptiWrite” that uses a filesystem driver to intercept write calls. I mention that because it’s common knowledge that these and similar programs can cause corruption.
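
For the Event Viewer check mentioned above, the same information can be pulled from a command prompt with the built-in `wevtutil` tool. This is a rough Python sketch, not a tested procedure for this server: it assumes the events of interest are logged in the System log under the provider name `disk`, and the function names are made up for illustration. On a machine using Intel RST, the RAID driver may log under a different provider name, so the `provider` argument may need changing.

```python
import subprocess


def disk_event_query(max_events=20, provider="disk"):
    """Build a wevtutil command that lists recent System-log events from one provider."""
    xpath = "*[System[Provider[@Name='{}']]]".format(provider)
    return [
        "wevtutil", "qe", "System",
        "/q:" + xpath,              # XPath filter on the event provider (assumed name)
        "/c:{}".format(max_events),  # maximum number of events to return
        "/rd:true",                  # read direction: newest events first
        "/f:text",                   # plain-text output instead of XML
    ]


def recent_disk_events(max_events=20, provider="disk"):
    """Run the query (Windows only) and return the text output."""
    result = subprocess.run(
        disk_event_query(max_events, provider),
        capture_output=True, text=True, check=True,
    )
    return result.stdout
```

Calling `recent_disk_events()` on the server would return the matching events as text; an empty result only means nothing was logged under that provider name.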

Taking it out of the array to run tests is going to be quite a PITA. Probably cheaper for the company to just replace them.

Drives are less than a year old now.

OS is Server 2008R2.

Not using any caching software; write cache on Intel RST is disabled.

No cleaning since it was put together.

Cables were new from the build; we’ll probably look into replacing them if/when we replace the drives.

Currently, drive 3 has:

10 on Read Error Rate (01)

01 on Write Error Rate (C8)

Drive 4 has:

30 on Read Error Rate (01)

00 on Write Error Rate (C8)

Current and worst values are unchanging as ever.

No other errors being reported

Intel RST isn’t reporting any errors either.
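
To put these figures next to the 2012/11/26 dump: the post does not say whether 10 and 30 are hexadecimal (as in the dump) or decimal, so the sketch below computes the change both ways rather than guessing. It is only an illustration; the helper name `raw_delta` is made up.

```python
# Raw values from the 2012/11/26 dump (hex strings as printed there).
earlier = {
    ("drive 3", "01"): "07",
    ("drive 3", "C8"): "02",
    ("drive 4", "01"): "1D",
    ("drive 4", "C8"): "00",
}

# Values quoted in the later post; their number base is not stated.
later = {
    ("drive 3", "01"): "10",
    ("drive 3", "C8"): "01",
    ("drive 4", "01"): "30",
    ("drive 4", "C8"): "00",
}


def raw_delta(before_hex, after_text, after_base):
    """Change in a raw value; after_base is 16 or 10 (an assumption either way)."""
    return int(after_text, after_base) - int(before_hex, 16)


for key in earlier:
    if_hex = raw_delta(earlier[key], later[key], 16)
    if_dec = raw_delta(earlier[key], later[key], 10)
    print(key, "change if hex:", if_hex, "change if decimal:", if_dec)
```

Under either reading the change in attribute 01 is small on both drives. Taken at face value, drive 3’s C8 raw value is lower than in the dump (01 versus 02), which suggests it may not be a simple ever-increasing counter on this model.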