Find a bad disk in ARRAY

Hi All,

There isn’t a category for this (unless I missed it), so it’s going here…

I have 8x WD40EFRX (4TB WD Red) SATA disks, all NEW

These setups ALL have the same issues outlined below

  • 8 disks in a single RAID6 volume and partition, PCIe hardware RAID controller, Windows Server 2008 installed on separate disk
  • 8 disk Windows software RAID5, PCIe SATA controller x2 (4x SATA each), Windows Server 2008 installed on separate disk
  • 8 disk ZFS volume, PCIe SATA controller x2 (4x SATA each), FreeNAS installed on separate disk

The hardware RAID controller uses SFF-8087 (Mini-SAS) to SATA cables, and the SATA controllers use standard cables… So different data cables
All securely connected
All NEW hardware
Clean software installs

Now, I think I have isolated the issue to one 4-disk set (case cage), but I still don’t know which disk(s) are causing the issue

  • 2x 4 disk RAID5 volumes and partitions, PCIe hardware RAID controller, Windows Server 2008. One volume ‘seems’ OK; the other has the same issue

The Issue
Copying 4TB of data (robocopy, xcopy, Windows copy, FTP and other methods) across the network. File sizes range from 1KB to 300GB, about 300,000 files

All files copy OK without error, but on some files the MD5 of the copy differs from the MD5 of the source…
If I rename the bad file and copy it again, 9 times out of 10 the MD5 is OK; the tenth time it needs another copy.
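
For anyone wanting to repeat this check, the MD5 comparison can be scripted. This is only a minimal sketch in Python, not the tool used above: the function names `md5_of` and `find_mismatches` are made up for illustration, and it assumes the source and destination trees are both reachable as paths from the machine running it.

```python
import hashlib
from pathlib import Path


def md5_of(path, chunk_size=1024 * 1024):
    """Return the hex MD5 of a file, reading in chunks so large files do not fill memory."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_mismatches(source_root, dest_root):
    """Compare each file under source_root with the same relative path under dest_root.

    Returns a list of (relative_path, reason) tuples for files that are
    missing from the destination or whose MD5 differs.
    """
    source_root, dest_root = Path(source_root), Path(dest_root)
    bad = []
    for src in sorted(source_root.rglob("*")):
        if not src.is_file():
            continue
        rel = src.relative_to(source_root)
        dst = dest_root / rel
        if not dst.is_file():
            bad.append((str(rel), "missing"))
        elif md5_of(src) != md5_of(dst):
            bad.append((str(rel), "md5 differs"))
    return bad
```

Running `find_mismatches` twice over the same pair of trees and comparing the two lists may also help: a file that fails on one pass and passes on the next would suggest the data on disk is fine and the corruption happens while reading.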

SMART is OK for all 8 disks
WD Data Lifeguard Diagnostic for Windows all quick tests OK

Any further diagnostics I can run to find the bad disk(s)?
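
One check that could be scripted, as a rough sketch only: write a file of random data directly onto each volume (no network involved), then read it back several times and compare MD5s. The function names here are hypothetical. Note the assumption that reads actually hit the disks: the OS file cache can serve reads from RAM, so the test file would probably need to be larger than installed memory for the result to mean much.

```python
import hashlib
import os


def write_pattern(path, size_bytes, chunk_size=1024 * 1024):
    """Write size_bytes of random data to path; return the MD5 of the data as written."""
    digest = hashlib.md5()
    remaining = size_bytes
    with open(path, "wb") as fh:
        while remaining > 0:
            chunk = os.urandom(min(chunk_size, remaining))
            fh.write(chunk)
            digest.update(chunk)
            remaining -= len(chunk)
        fh.flush()
        os.fsync(fh.fileno())  # ask the OS to push the data out to the device
    return digest.hexdigest()


def read_md5(path, chunk_size=1024 * 1024):
    """Return the hex MD5 of the file as read back from disk."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def readback_test(path, size_bytes, passes=5):
    """Write once, read back `passes` times; return (expected_md5, list_of_read_md5s)."""
    expected = write_pattern(path, size_bytes)
    results = [read_md5(path) for _ in range(passes)]
    return expected, results
```

As a rule of thumb rather than a firm diagnosis: if every read-back agrees with the others but not with the expected MD5, the damage likely happened on write; if the read-backs disagree with each other, the read path is the more likely suspect. Repeating the test with disks swapped between the good and bad cage may then show whether the fault follows a disk or stays with the cage.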

Many Thanks

I’d recommend a full test. Additionally, if all hard drives are completely healthy then you may be looking at a RAID controller or configuration issue.

if all hard drives are completely healthy then you may be looking at a RAID controller or configuration issue.

Hence the testing with both the RAID controller and the SATA controllers. The only other things in common are the mobo, PSU…

Memory (64GB) all tests OK, full ‘memtest86’ tests

…anyway, I will do the long tests and report back.

Thanks