My Cloud drops from network 2/3 minutes after boot since 5.27.157 update

Eire12 · October 29, 2023, 10:30pm

The My Cloud OS5 4TB Gen 2 device I have seems to drop off the network after 2/3 minutes since the most recent October firmware update.

The symptoms are similar to those I have had when indexing or Twonky was hogging resources, but I have disabled cloud access and uninstalled all apps, and it still lasts no more than 2/3 minutes before it runs into problems. There seems to be a CPU spike around the 3 minute mark, about when the problem occurs, but I can’t figure out what is causing it.

Has anyone else had this issue?

cat0w · October 29, 2023, 11:28pm

@Eire12

What are the LEDs doing, both front and back? Do you have a static IP address for your device? When you think it has dropped from the network have you checked its status on your Router? See example image from mine below.

OS5 User Manual

https://products.wdc.com/nas0s5/nasum/en/#t=MyCloud%2FConfiguringBasicSettings%2FNetwork.htm

WDMYCLOUD User Manual, the older manual.

Eire12 · October 30, 2023, 12:51am

For the LEDs, it blinks blue on boot and turns to solid blue thereafter. For ethernet, it is active blinking with hard drive activity, but after the ‘freeze’ seems to be inactive and the ethernet light just gives a little pulse every second or so. Though I can still ping the IP from cmd, so it is still ‘active’ in some way.

The IP was set to DHCP; I changed it to static to troubleshoot, but it made no difference.

On my router dashboard the device shows as present and connected, yet it still does not appear in Windows (ping in cmd aside).

The only ‘odd’ clue is that trying a network feature, like the firmware update options, gives a “check network connection” message … but I can only see this message over my working network connection … and the server is certainly working on the WD end, so the device is not handling network connections well for some reason.

cat0w · October 30, 2023, 12:13pm

@Eire12

You can do a manual update of the firmware. Here is a link to download the firmware and then you can perform a manual install.

Download Software, Firmware and Drivers for WD Products

I download the update, place the file on my desktop, and then go back to the dashboard to perform the manual update, selecting the file on my desktop when it says to.

Eire12 · October 30, 2023, 1:48pm

First, I really do appreciate the replies on this.

@cat0w - For the firmware update, I can get as far as here, but then it seems to hang permanently on this ‘upgrading’ screen with no movement on the progress bar.

@Cerberus - So I did just have enough time for the connection to be live to get an output from the SSH … it is as follows.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 193 193 051 Pre-fail Always - 137038
3 Spin_Up_Time 0x0027 203 172 021 Pre-fail Always - 6808
4 Start_Stop_Count 0x0032 083 083 000 Old_age Always - 17900
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0
9 Power_On_Hours 0x0032 047 047 000 Old_age Always - 39393
10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 263
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 207
193 Load_Cycle_Count 0x0032 189 189 000 Old_age Always - 33896
194 Temperature_Celsius 0x0022 130 080 000 Old_age Always - 22
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 3
198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 199 000 Old_age Always - 56114
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 93

The output is worse than I thought, though it could still be worse, and everything certainly worked perfectly until the day of the update. I guess I didn’t check before now because the dashboard SMART readout shows everything as fine. (Ignore the slightly low temperature; I put it in a cooler spot just in case the CPU was throttling when it hits 100%.)
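For reading a table like the one above, a minimal Python sketch along these lines can pick out the rows that tend to matter most. The watch list is an assumption chosen for illustration (it is not a WD or smartctl definition), and the function name `flag_smart_rows` is hypothetical. It assumes each attribute row has the ten whitespace-separated columns shown in the header.

```python
# Attributes whose non-zero raw value is commonly treated as a warning sign.
# This watch list is an illustrative assumption, not a vendor specification.
WATCHED = {
    "Reallocated_Sector_Ct",
    "Current_Pending_Sector",
    "Offline_Uncorrectable",
    "UDMA_CRC_Error_Count",
}

def flag_smart_rows(text):
    """Return {attribute_name: raw_value} for watched attributes with raw_value > 0."""
    flagged = {}
    for line in text.splitlines():
        fields = line.split()
        # Attribute rows start with a numeric ID and carry a hex FLAG in column 3.
        if len(fields) < 10 or not fields[0].isdigit() or not fields[2].startswith("0x"):
            continue
        name, raw = fields[1], fields[9]
        if name in WATCHED and raw.isdigit() and int(raw) > 0:
            flagged[name] = int(raw)
    return flagged
```

Fed the output above, this would report Current_Pending_Sector (3) and UDMA_CRC_Error_Count (56114). A non-zero UDMA_CRC_Error_Count usually points at the link between drive and controller rather than the platters, though the counter is cumulative and may reflect old events.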

It can’t be indexing though, can it? Cloud services are off. The dashboard made no mention of indexing percentages, just ‘connected’ or ‘disabled’ depending on my setting. It could be wrong though… is there another way to check if indexing is occurring?

trekrap · October 31, 2023, 12:24pm

Hello, My 40TB EX4100 w/ FW @ 5.26.300 does the same. My open incidents with ‘WD Support’ (apparently an oxymoron), dating from 22 October 2023, remain unresolved, with the assertion that the RAID 5 data is corrupted. However, all drives check healthy and LEDs are solid blue. Despite those facts, Support directed a manual FW update to 5.27.157. Any attempt gets the bogus msg ‘Unable to contact firmware update server.’ I have ~175 40GB files, or ~6TB of video, that I am trying to move off the device, but due to the constant timeout/dropoff I am having little luck getting it copied anywhere. Any insight into how to recover the data, or knowledge of how your problem was resolved, is appreciated.

cat0w · October 31, 2023, 2:06pm

@trekrap

Have you already looked in the sub-forum for your device to see if others with the same device have or have had this problem?

Latest My Cloud OS 5 Personal & Network Attached Storage/My Cloud EX Series topics - WD Community

trekrap · October 31, 2023, 5:17pm

Thx, good suggestion. I did search there and unless I missed something all are pretty old from 2020 and none come close to matching my EX4100’s behavior. Only @Eire12’s post describes what I’m seeing. Especially the consistently predictable ‘device locks-up’ and ‘drops off-line’ death spiral syndrome.

My 4 big problems are:
[1] Can’t successfully complete Manual Firmware Update. These steps were taken: (1) Download file ‘WDMyCloudEX4100_5.27.157_prod.bin’ from support page using Safari browser to Mac. (2) Select update button. Open file and observe histogram for download reaches 100%. (3) Receive Msg ‘Firmware file not found. Please Try Again’

[2] Can’t Check for Updates; Receive Msg ‘Unable to connect to the firmware update server. Please check the network connection and try again.’

[3] Can’t Enable Auto Update. Receive Msg ‘Unable to connect to the firmware update server. Please check the network connection and try again.’

[4] Can’t Perform 40 second reset. Steps followed: (1) Remove power cord from the device. (2) Press and hold reset while reconnecting power cord. (3) While holding reset, press and release power-on button.

Curious as to how a file issue can hamper the NAS’ ability to connect with the WD FW Update Server?

trekrap · October 31, 2023, 5:20pm

Forgot to mention: current tack is to sign in via SSH and run diagnostics… will report findings as soon as I determine why I can’t log in even though I carefully enter the pwd that I made up.

trekrap · October 31, 2023, 7:30pm

copy all; procedure complete. results uploaded

Summary: sda - sdb - sdc - sdd

Raw_Read_Error_Rate - 0 - 0 - 0 - 0
Current_Pending_Sector - 0 - 0 - 0 - 0
UDMA_CRC_Error_Count - 0 - 0 - 0 - 0
Multi_Zone_Error_Rate - reporting line is not present in the output

trekrap · October 31, 2023, 7:45pm

okay… meanwhile, should ‘smartctl -a -d ata /dev/sda’ (and likewise for sdb, sdc, sdd) be used instead for SATA drives? And how can the drive type be confirmed through the dashboard or SSH?

trekrap · October 31, 2023, 7:52pm

copy all; makes perfect sense. rebooting now as device death spiral took precedence… again.

trekrap · October 31, 2023, 7:54pm

pardon screenshot… thought summary would ease and avoid PITA ‘eye chart.’

trekrap · October 31, 2023, 7:58pm

root@Centarus ~ # cat /proc/mdstat
Personalities : [linear] [raid0] [raid1] [raid10] [raid6] [raid5] [raid4]
md1 : active raid5 sdc2[0] sda2[3] sdb2[2] sdd2[1]
      29286720960 blocks super 1.0 level 5, 64k chunk, algorithm 2 [4/4] [UUUU]
      bitmap: 0/2 pages [0KB], 524288KB chunk

md0 : active raid1 sda1[3] sdb1[2] sdd1[1] sdc1[0]
      2094080 blocks super 1.2 [4/4] [UUUU]
      bitmap: 0/1 pages [0KB], 65536KB chunk

unused devices: <none>
root@Centarus ~ #

completed with copy/paste report text as requested
Thank you for your continued assistance to make a sound problem determination

trekrap · October 31, 2023, 8:25pm

Agree. Thus far ‘no joy’ from attempts to update FW. Have examined the HTML source for the page in an attempt to understand where the device thinks the update server is or should be… no luck there.
Have strenuously requested to speak with domestic WD expert on firmware update and nothing but ‘crickets.’

Can reference tables or meta data (i.e., firmware update server URL) be ascertained via SSH layer commands?

Is there a debug mode whereby I can step through the execution logic used to manually retrieve and install the FW?

Early in my IT career as an IBM OS MVS Systems Programmer I spent many hours reading ‘core dumps’ and setting traps using S/370 instruction step mode to track down memory corruption caused by errant routines operating in privileged state (i.e., root level).

Thank you for helping find potential diagnostics needed to identify and confirm the root cause of this problem.

trekrap · November 1, 2023, 4:54pm

Hello, Thank you for additional diagnostic guidance. I was able to get the reports and have reformatted them (see post from this evening)

trekrap · November 1, 2023, 6:42pm

2023-11-01T04:00:00Z

Filesystem	Inodes	Used	Available	Use%	Mounted on
/dev/root	14,336	2,435	11,901	17%	/
devtmpfs	24,168	755	23,413	3%	/dev
mdev	24,168	755	23,413	3%	/dev
ubi0:config	-	-	-	0%	/usr/local/config
/dev/loop0	8,629	8,629	-	100%	/usr/local/modules
tmpfs	-	-	-	0%	/mnt
tmpfs	-	-	-	0%	/var/log
tmpfs	20,000	79	19,921	0%	/tmp
/dev/md0p1	35,200	10	35,190	0%	/usr/local/upload
/dev/sdc4	258,048	20	258,028	0%	/mnt/HD_c4
/dev/sdd4	258,048	19	258,029	0%	/mnt/HD_d4
/dev/sdb4	258,048	19	258,029	0%	/mnt/HD_b4
/dev/sda4	258,048	125	257,923	0%	/mnt/HD_a4
/dev/md1	457,605,120	306,116	457,299,004	0%	/mnt/HD/HD_a2

trekrap · November 1, 2023, 10:11pm

Filesystem	Size	Used	Available	Use%	Mounted on
/dev/root	54.2M	19.9M	31.5M	39%	/
devtmpfs	1017.6M	32.0K	1017.6M	0%	/dev
mdev	1017.6M	32.0K	1017.6M	0%	/dev
ubi0:config	12.1M	112.0K	11.3M	1%	/usr/local/config
/dev/loop0	163.6M	163.6M	0	100%	/usr/local/modules
tmpfs	1.0M	0	1.0M	0%	/mnt
tmpfs	40.0M	8.0M	32.0M	20%	/var/log
tmpfs	100.0M	8.8M	91.2M	9%	/tmp
/dev/md0p1	525.3M	4.0K	514.3M	0%	/usr/local/upload
/dev/sdc4	928.9M	56.0K	912.9M	0%	/mnt/HD_c4
/dev/sdd4	928.9M	52.0K	912.9M	0%	/mnt/HD_d4
/dev/sdb4	928.9M	52.0K	912.9M	0%	/mnt/HD_b4
/dev/sda4	928.9M	109.4M	803.5M	12%	/mnt/HD_a4
/dev/md1	27.2T	11.1T	15.8T	41%	/mnt/HD/HD_a2

This version is easier to inspect.
Is there a critical issue due to ‘/dev/loop0’ being exhausted?
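On the `/dev/loop0` question, a small sketch like the following lists every row whose Use% column reads 100%, so full filesystems can be looked at one by one. The function name `full_filesystems` is hypothetical. One assumption worth checking with `mount`: if `/dev/loop0` is a read-only image (squashfs, for example), it would be expected to report 100% permanently, because such images are built to exactly the size of their contents.

```python
def full_filesystems(df_text):
    """Return (filesystem, mount_point) pairs whose Use% column reads 100%."""
    full = []
    for line in df_text.splitlines():
        fields = line.split()
        # Skip blank lines, the header row, and anything without six columns.
        if len(fields) < 6 or fields[0] == "Filesystem":
            continue
        filesystem, use, mount = fields[0], fields[4], fields[5]
        if use == "100%":
            full.append((filesystem, mount))
    return full
```

Against the table above, only `/dev/loop0` mounted on `/usr/local/modules` is returned; the writable areas (`/tmp`, `/var/log`, `/usr/local/upload`, and the data volume) all show free space.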

trekrap · November 2, 2023, 2:47am

2023-11-01T04:00:00Z; Yes, I have a spare, albeit smaller capacity, SATA drive. I conclude capacity is irrelevant to the prescribed test case and will proceed in the morning (late night now on the East Coast). Continued thanks and appreciation for diagnostic guidance and analysis.

2023-11-03T04:00:00Z I was unable to re-format my spare HD on my Mac, so I decided to order the cheapest SATA device available. [Clarification on the Mac: apparently it sees and mounts NTFS formatted drives ‘Read Only.’ I no longer have a working WinTel box, so I decided to try an alternate approach.] Meanwhile, I ran the Dashboard ScanDisk utility and, as expected, it failed. However, on a lark I tried the manual firmware update and to my surprise it worked. I re-ran ScanDisk and it failed.
Not deterred I tried to copy another 42GB file off the device and after ~15 mins CPU spiked @100%. I tried to display offending process(s) but display would not cooperate and after about 15 mins I’m disappointed to report the device went offline leaving ~160 40+GB files that I need to copy.
Consequently, I’m left to wonder what will happen when I insert the one new hard drive; what difference might it make?
Thank you for additional diagnostic recommendations.
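Because each 40+GB copy is being cut off when the device goes offline, one option is a copy that picks up where the previous attempt stopped instead of starting over. The sketch below is a minimal illustration in Python, not a tested recovery tool: the function name `resume_copy` is hypothetical, and it assumes the partial destination file is an intact prefix of the source, which it does not verify.

```python
import os

def resume_copy(src, dst, chunk_size=4 * 1024 * 1024):
    """Copy src to dst, continuing from dst's current size if dst already exists."""
    offset = os.path.getsize(dst) if os.path.exists(dst) else 0
    with open(src, "rb") as fin, open(dst, "ab") as fout:
        fin.seek(offset)  # skip the bytes that an earlier attempt already wrote
        while True:
            chunk = fin.read(chunk_size)
            if not chunk:
                break
            fout.write(chunk)
            fout.flush()
    return os.path.getsize(dst)
```

Calling it again with the same source and destination after each reboot would append only the missing tail. Comparing a checksum of source and destination once a file completes would guard against the unverified-prefix assumption.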

P.S. WD Support reply to SSH scandisk (i.e., df -i and df -h) report details in block quote:

A question or two comes to mind. First, why are they consistently mute regarding the device locking up and going offline, a critical issue IMO? Second, why do they assert the files are corrupted given the SMARTCTL diagnostics show NO ERRORS?

Hello TIM,
Thank you for your reply.
I got the case reviewed by the engineering team and this is what could be done:
You could try running the command “dmesg” over putty and will be able to see the errors.
2023-10-28T16:14:51.134628-04:00 di=b2cj7JZ1ei warning kernel: [ 124.578329] EXT4-fs (md1): warning: mounting fs with errors, running e2fsck is recommended
2023-10-28T16:14:51.134628-04:00 di=b2cj7JZ1ei warning kernel: [ 124.578329] EXT4-fs (md1): warning: mounting fs with errors, running e2fsck is recommended
2023-10-28T16:20:09.778070-04:00 di=b2cj7JZ1ei err kernel: [ 443.235401] inconsistent data on disk
2023-10-28T16:20:09.778081-04:00 di=b2cj7JZ1ei err kernel: [ 443.239124] EXT4-fs: ext4_free_blocks:4838: aborting transaction: IO failure in __ext4_forget
Also you could try these steps mentioned:
Run File system check and below is the link with the steps:
My Cloud: Scan Disk File System Check and Repair
If no error, try performing a system only restore from dashboard
My Cloud: System, Quick and Full Restore a EX4100
If issue still persists or if there are any errors reported in file system check then take a complete backup of data and perform full factory restore.
My Cloud: File System Check Failed or Has Detected Errors
If you have any further questions, please reply to this email and we will be happy to assist you further.
Sincerely,
George D
Western Digital Customer Service and Support
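The kernel lines quoted in the email name a device in parentheses, and a short sketch like this one tallies EXT4-fs messages per device from saved `dmesg` or log text. The function name `ext4_messages_by_device` is hypothetical. It may also bear on the second question above: SMART attributes describe each physical drive, while these messages concern the ext4 filesystem on the md1 volume, so clean SMART output and filesystem errors are not necessarily in contradiction.

```python
import re
from collections import Counter

# Matches kernel messages such as "EXT4-fs (md1): warning: ..." and captures the device.
EXT4_DEVICE = re.compile(r"EXT4-fs \(([^)]+)\)")

def ext4_messages_by_device(log_text):
    """Count EXT4-fs kernel messages per device; messages naming no device go under None."""
    counts = Counter()
    for line in log_text.splitlines():
        if "EXT4-fs" not in line:
            continue
        match = EXT4_DEVICE.search(line)
        counts[match.group(1) if match else None] += 1
    return counts
```

On the four quoted lines this counts two messages for `md1` and one EXT4-fs message without a device name; the "inconsistent data on disk" line carries no EXT4-fs tag and is skipped.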

trekrap · November 4, 2023, 3:03pm

Hello and thanks,

My back is against the wall. I get it’s not a ‘good idea’ to copy the data; however, I need to copy my data to a more stable and higher capacity PR4100 NAS.

The sales pablum had me believing RAID 5 would enable me to recover any and all data across a drive failure. I inferred that file system issues could also be recoverable through a similar mechanism. Now it seems the only way to recover is a full backup of the entire 40TB EX4100 device. This is not a viable option due to the fact the device constantly locks up and goes off-line.
Is there any way to put EX4100 drives into PR4100 solely to copy off the files I need to recover?

The Dashboard ‘ScanDisk’ appears to repair nothing. No matter how many times I run it, thus far the results never vary. It doesn’t report what it finds, and it doesn’t repair anything. What am I missing?

Consequently, I examined and tried to run ‘e2fsck -p -f -C 0 /dev/sda’, and likewise for devices … /sdb, /sdc, and /sdd.

Should e2fsck also be run against /dev/md1?

Perhaps I don’t understand the proper syntax or have not properly conditioned the environment (i.e., assure device is not checking drive health). I sign into SSH as soon as possible due to the lock-up and timeout issue because I don’t know how long the scan/repair operation will take.
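On the question of the target: the df output earlier in the thread shows `/dev/md1` mounted at `/mnt/HD/HD_a2`, and the kernel messages quoted by WD Support name `md1`, so the ext4 filesystem appears to live on the array rather than on the raw member disks. e2fsck is also not meant to be run on a mounted filesystem. The sketch below, with hypothetical function names, checks `/proc/mounts`-style text and only then builds a read-only check command. It is an illustration under those assumptions, not a procedure confirmed for the EX4100.

```python
def is_mounted(device, mounts_text):
    """Return True if device appears as a mount source in /proc/mounts-style text."""
    for line in mounts_text.splitlines():
        fields = line.split()
        if fields and fields[0] == device:
            return True
    return False

def readonly_check_command(device, mounts_text):
    """Build a read-only e2fsck command line, or return None while the device is mounted."""
    if is_mounted(device, mounts_text):
        return None
    # -n opens the filesystem read-only and answers "no" to every prompt;
    # -f forces a check even if the filesystem is marked clean.
    return ["e2fsck", "-n", "-f", device]
```

With the volume still mounted, `readonly_check_command("/dev/md1", ...)` returns None, which is the cue to unmount first. A `-n` pass changes nothing on disk, so it gives a report of what a repair run would want to fix before committing to one.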

Thanks for help and further guidance.
