Tuesday, October 21, 2014
Search
  
Most Popular
Guides
PC Parts
WEB Reviews
Seagate Enterprise Capacity 3.5 V.3 4TB SAS 6Gb/s HDD Review
OCZ Vector 256GB SSD Review @ Custom PC Review
Gigabyte F2A85XM-D3H
NZXT Phantom 630
Auvio Bluetooth Portable Speaker Review
Corsair H90 CPU Cooler Review
BIOSTAR Hi-Fi Z77X (Intel Z77) Motherboard Review
Noctua NH-L9i Cooler Review on Technic3D
Breaking News
China Denies Apple's iCloud Hack Allegations
Samsung Galaxy KNOX Devices Approved for U.S. Government Classified Use
iPhone 6 Demand Help ARM's Growth
SK Hynix Develops High Density 16GB NVDIMM
Samsung Mass Produces 8-Gigabit DDR4 Based on 20 Nanometer Process Technology
Strong iPhone, Mac And App Store Sales Drive Apple's Record September Quarter Revenue And Earnings
Apple iOS 8.1 Available For Download
E FUN To Relase $179 Windows tablet
Home > Guides > PC Parts

Friday, April 20, 2007
Linux Hardware Monitoring

1. Introduction

- Introduction

So you have your new Linux-based system up and running. But you also need to monitor its status and be prepared for possible failures or determine causes of hardware instability. Modern PCs support a variety of tools to help you, and you'll be surprised to learn that with a few simple applications you can probe a number of functional parameters, like hard drive temperature, CPU fan speed and GPU clock. The list of tools shown here is by no means exhaustive, but should serve as a reasonable starting point for most desktop systems. There are more sophisticated tools to be found for server use, or ones which address highly specific needs.

- SMART status

Modern hard drives constantly monitor their functional parameters, including temperature, power-on hours, reallocated sector count, hardware ECC recovered data using a technology known as SMART. As a matter of fact, S.M.A.R.T. is an acronym for “Self Monitoring Analysis and Reporting Technology”. Recent research has shown that some of the SMART attributes may be useful for predicting hard drive failures. As a general rule, even though hard drives frequently fail abruptly, without previous indication of malfunction, the existence of a SMART error greatly increases the probability of failure. Specifically, scan errors, sector reallocations and sector probational counts mean that the drive is very likely to fail within 60 days.

Fortunately, you can use a simple set of tools called “smartmontools” that allow you to view this information, store it in a log file, periodically run automatic hard drive tests and even receive automatically generated email warnings in case of hard drive errors. Smartmontools can be obtained from Sourceforge (source version) but it is probably already included in your favorite distribution.

After the installation you can probe your hard drives for information with the following simple command:

  1. root@hagakure:~# smartctl -d ata -a /dev/sda
  2. smartctl version 5.36 [i686-pc-linux-gnu] Copyright (C) 2002-6 Bruce Allen
  3. Home page is http://smartmontools.sourceforge.net/
  4. === START OF INFORMATION SECTION ===
  5. Device Model: SAMSUNG SP1614C
  6. Serial Number: 0696J1FX906990
  7. Firmware Version: SW100-25
  8. User Capacity: 160,041,885,696 bytes
  9. Device is: In smartctl database [for details use: -P show]
  10. ATA Version is: 7
  11. ATA Standard is: ATA/ATAPI-7 T13 1532D revision 0
  12. Local Time is: Fri Apr 6 18:20:46 2007 EEST
  13. ==> WARNING: May need -F samsung2 disabled; see manual for details.
  14. SMART support is: Available - device has SMART capability.
  15. SMART support is: Enabled
  16. === START OF READ SMART DATA SECTION ===
  17. SMART overall-health self-assessment test result: PASSED
  18. [............ REMOVED TEXT.................]
  19. SMART Attributes Data Structure revision number: 16
  20. Vendor Specific SMART Attributes with Thresholds:
  21. ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
  22. 1 Raw_Read_Error_Rate 0x000b 100 100 051 Pre-fail Always - 9
  23. 3 Spin_Up_Time 0x0007 068 057 000 Pre-fail Always - 5632
  24. 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 458
  25. 5 Reallocated_Sector_Ct 0x0033 253 253 010 Pre-fail Always - 0
  26. 7 Seek_Error_Rate 0x000b 253 253 051 Pre-fail Always - 0
  27. 8 Seek_Time_Performance 0x0024 253 253 000 Old_age Offline - 0
  28. 9 Power_On_Half_Minutes 0x0032 098 098 000 Old_age Always - 11821h+44m
  29. 10 Spin_Retry_Count 0x0013 253 253 049 Pre-fail Always - 0
  30. 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 458
  31. 194 Temperature_Celsius 0x0022 193 112 000 Old_age Always - 15
  32. 195 Hardware_ECC_Recovered 0x000a 100 100 000 Old_age Always - 157734777
  33. 196 Reallocated_Event_Count 0x0012 253 253 000 Old_age Always - 0
  34. 197 Current_Pending_Sector 0x0033 253 253 010 Pre-fail Always - 0
  35. 198 Offline_Uncorrectable 0x0031 253 253 010 Pre-fail Offline - 0
  36. 199 UDMA_CRC_Error_Count 0x000b 100 100 051 Pre-fail Always - 0
  37. 200 Multi_Zone_Error_Rate 0x000b 100 100 051 Pre-fail Always - 0
  38. 201 Soft_Read_Error_Rate 0x000b 100 100 051 Pre-fail Always - 0
  39. [..............REMOVED TEXT...............]
  40. SMART Self-test log structure revision number 1
  41. Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
  42. # 1 Extended offline Completed without error 00% 11622 -
  43. # 2 Extended offline Completed without error 00% 11466 -
  44. # 3 Extended offline Completed without error 00% 11310 -
  45. # 4 Extended offline Completed without error 00% 11153 -
  46. # 5 Extended offline Completed without error 00% 10996 -
  47. # 6 Extended offline Completed without error 00% 10357 -
  48. # 7 Short offline Completed without error 00% 10355 -
  49. # 8 Short offline Completed without error 00% 8167 -
  50. # 9 Extended offline Completed without error 00% 7736 -
  51. #10 Extended offline Completed without error 00% 7621 -
  52. #11 Extended offline Completed without error 00% 6575 -
  53. #12 Short offline Completed without error 00% 6573 -
  54. Device does not support Selective Self Tests/Logging




Get RSS feed Easy Print E-Mail this Message


 
Home | News | All News | Reviews | Articles | Guides | Download | Expert Area | Forum | Site Info
Site best viewed at 1024x768+ - CDRINFO.COM 1998-2014 - All rights reserved -
Privacy policy - Contact Us .