dslreports logo
 
    All Forums Hot Topics Gallery
spc
Search similar:


uniqs
1347
thms
join:2013-02-07

thms

Member

HDD lifecycle info needed.

hey guys,
i encountered a S.M.A.R.T. problem with my old machine and just wanted to know whether i should seriously considering a replacement for my HDD or if it's not that bad.

here are my smartmontool stats:

thanks in advance,
thms*

---------------------------------------------------

smartctl 6.0 2012-10-10 r3643 [i686-w64-mingw32-xp-sp3] (sf-6.0-1)
Copyright (C) 2002-12, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family: Seagate Barracuda ATA V
Device Model: ST360015A
Serial Number: 3KC01K0N
Firmware Version: 3.30
User Capacity: 60.022.480.896 bytes [60,0 GB]
Sector Size: 512 bytes logical/physical
Device is: In smartctl database [for details use: -P show]
ATA Version is: ATA/ATAPI-6 T13/1410D revision 2
Local Time is: Thu Feb 07 10:29:51 2013 WN
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status: (0x82) Offline data collection activity
was completed without error.
Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 426) seconds.
Offline data collection
capabilities: (0x5b) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
No Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
No General Purpose Logging support.
Short self-test routine
recommended polling time: ( 1) minutes.
Extended self-test routine
recommended polling time: ( 44) minutes.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000f 055 053 006 Pre-fail Always - 129182124
3 Spin_Up_Time 0x0003 100 100 000 Pre-fail Always - 0
4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 311
5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 0
7 Seek_Error_Rate 0x000f 087 060 030 Pre-fail Always - 486493864
9 Power_On_Hours 0x0032 079 079 000 Old_age Always - 18599
10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 094 094 020 Old_age Always - 7150
194 Temperature_Celsius 0x0022 031 061 000 Old_age Always - 31
195 Hardware_ECC_Recovered 0x001a 055 052 000 Old_age Always - 129182124
197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x003e 200 090 000 Old_age Always - 410
200 Multi_Zone_Error_Rate 0x0000 100 253 000 Old_age Offline - 0
202 Data_Address_Mark_Errs 0x0032 100 253 000 Old_age Always - 0

SMART Error Log Version: 1
ATA Error Count: 454 (device log contains only the most recent five errors)
CR = Command Register [HEX]
FR = Features Register [HEX]
SC = Sector Count Register [HEX]
SN = Sector Number Register [HEX]
CL = Cylinder Low Register [HEX]
CH = Cylinder High Register [HEX]
DH = Device/Head Register [HEX]
DC = Device Command Register [HEX]
ER = Error register [HEX]
ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 454 occurred at disk power-on lifetime: 3698 hours (154 days + 2 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
10 51 00 e8 ce fc f6 Error: IDNF at LBA = 0x06fccee8 = 117231336

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 00 e8 ce fc f6 00 02:17:32.895 READ DMA
c8 00 00 e8 cd fc f6 00 02:17:32.888 READ DMA
c8 00 00 e8 cc fc f6 00 02:17:32.883 READ DMA
c8 00 00 e8 cb fc f6 00 02:17:32.877 READ DMA
c8 00 00 e8 ca fc f6 00 02:17:32.872 READ DMA

Error 453 occurred at disk power-on lifetime: 3677 hours (153 days + 5 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
84 51 01 00 00 00 f0 Error: ICRC, ABRT 1 sectors at LBA = 0x00000000 = 0

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 01 00 00 00 f0 00 00:02:17.137 READ DMA
c4 00 01 00 00 00 f0 00 00:02:17.111 READ MULTIPLE
91 00 3f 01 00 00 bf 02 00:02:17.111 INITIALIZE DEVICE PARAMETERS [OBS-6]
00 00 00 00 00 00 00 06 00:02:16.378 NOP [Abort queued commands]
c8 00 01 00 00 00 f0 00 00:02:05.944 READ DMA

Error 452 occurred at disk power-on lifetime: 3677 hours (153 days + 5 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
84 51 01 00 00 00 f0 Error: ICRC, ABRT 1 sectors at LBA = 0x00000000 = 0

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 01 00 00 00 f0 00 00:00:12.073 READ DMA
f5 03 00 01 10 00 b0 00 00:00:10.886 SECURITY FREEZE LOCK
e3 03 00 01 10 00 b0 00 00:00:09.924 IDLE
ef 03 45 01 10 00 b0 00 00:00:09.913 SET FEATURES [Set transfer mode]
ef 03 0c 01 10 00 b0 00 00:00:09.913 SET FEATURES [Set transfer mode]

Error 451 occurred at disk power-on lifetime: 3666 hours (152 days + 18 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
84 51 01 00 00 00 f0 Error: ICRC, ABRT 1 sectors at LBA = 0x00000000 = 0

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 01 00 00 00 f0 00 00:00:12.355 READ DMA
f5 03 00 01 10 00 b0 00 00:00:11.193 SECURITY FREEZE LOCK
e3 03 00 01 10 00 b0 00 00:00:09.900 IDLE
ef 03 45 01 10 00 b0 00 00:00:09.889 SET FEATURES [Set transfer mode]
ef 03 0c 01 10 00 b0 00 00:00:09.889 SET FEATURES [Set transfer mode]

Error 450 occurred at disk power-on lifetime: 3591 hours (149 days + 15 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
84 51 01 00 00 00 f0 Error: ICRC, ABRT 1 sectors at LBA = 0x00000000 = 0

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 01 00 00 00 f0 00 00:00:13.386 READ DMA
c8 00 01 00 00 00 f0 00 00:00:12.953 READ DMA
f5 03 00 01 10 00 b0 00 00:00:11.805 SECURITY FREEZE LOCK
e3 03 00 01 10 00 b0 00 00:00:10.496 IDLE
ef 03 45 01 10 00 b0 00 00:00:10.482 SET FEATURES [Set transfer mode]

SMART Self-test log structure revision number 1
No self-tests have been logged. [To run self-tests, use: smartctl -t]

SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

---------------------------------------------------

koitsu
MVM
join:2002-07-16
Mountain View, CA
Humax BGW320-500

koitsu

MVM

I'll provide a brief analysis later tonight or tomorrow, time permitting.

P.S. -- Thank you for using smartmontools from the get-go. I appreciate it. In the future though, please post the output within a [code]
block so that it retains the formatting.
thms
join:2013-02-07

thms

Member

thank you very much, koitsu!
that yould be great.

koitsu
MVM
join:2002-07-16
Mountain View, CA
Humax BGW320-500

koitsu to thms

MVM

to thms
Your drive actually looks to be in okay condition. It's quite an old drive (the ST360015A was made in roughly 2001 to 2002), but has been used for less hours (18599) than some of my present-day server drives. That number -- 18599 -- matters. Keep reading.

Your SMART attributes look perfectly fine/normal for this model of drive. The only one of concern is attribute 199, which shows a count of 410 CRC-related errors, indicating physical transport issues between the disk and underlying PATA controller. I won't be going into an explanation of how to track those down because, probably much to your surprise, there's no reason to -- again, keep reading.

The SMART error log does show a very large number (454) of registered events. Due to space limitation in the HPA region of the drive, only the last 5 events are shown, and include a timestamp of sorts (power-on hours count) of when they occurred:

1. 3968 hours -- drive returned IDNF, indicating that the LBA requested by the controller was outside of the permitted LBA range of the drive. The controller requested the drive read LBA 117231336. The drive itself has a total of (60022480896 / 512), or 117231408, LBAs. I cannot explain this error -- it looks like there may be a portion of the drive at the very, very end which is not usable/not accessible for some reason. Otherwise this could be a firmware bug of some sort. I would not worry about it, especially since it hasn't recurred in almost 15000 hours.

2. 3677 hours -- Drive returned a CRC error when the controller attempted to read LBA 0, resulting in ABRT (abort) status. This is a result of the aforementioned CRC error count. However, like #1, this happened over 15000 hours ago, and has not recurred.

3. 3677 hours -- same as #2.

4. 3666 hours -- same as #2.

5. 3591 hours -- same as #2.

LBA 0 is "sector 0" in classic terms, so I can almost assure you that these situations were happening when the system was being booted up / during POST (when the PC BIOS goes looking for valid MBRs (or rather, PBRs -- primary boot records)).

Since these issues haven't recurred in over 15000 hours, I am inclined to believe this drive was either in a different machine when these issues occurred (i.e. something was physically wrong with the IDE controller, such as a damaged trace or a faulty/flaky pin), PATA cables have since been changed out/replaced, or something along those lines.

So in summary, your drive looks just fine to me. The errors in the error log happened quite some time ago, and I have absolutely no knowledge of the physical environment, cabling, usage pattern, etc. relating to that drive, so I can't provide insights there. I can only tell you what I see. I would not replace this drive, as purely from a SMART standpoint it seems to be fine.
thms
join:2013-02-07

thms

Member

without doubt - you rule!
THANKS A 1000 for your detailed review of my log file and the quick answer! nice to see that the dirve is not dying at all.

kudos to you **

thms

koitsu
MVM
join:2002-07-16
Mountain View, CA

koitsu

MVM

You're welcome. Cheers!