[[fester:hvalid_hdd]]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
fester:hvalid_hdd [2017/06/24 15:42] – [Resuming A Session In tmux] add content danfester:hvalid_hdd [2017/06/24 17:46] (current) – [Non-Destructive Badblocks Test Using tmux] brief instructions dan
Line 419: Line 419:
 ==== Non-Destructive Badblocks Test Using tmux ==== ==== Non-Destructive Badblocks Test Using tmux ====
  
-I don’t know how to do this.+To do a non-destructive ''badblocks'' test, follow the instructions above, but replace -w with -n Example commands might look like:
  
-If someone wants to give me the information and detailed copyright free screen shots will be happy to include them in this guide or you could replace this or any section with your own?+''badblocks -ns /dev/da0'' 
 + 
 +or 
 + 
 +''badblocks -b 4096 -nsv /dev/da1'' 
 + 
 +This test is intended to be non-destructive--once it has completed, the data on your disk should be unharmed.  Even so, I'd discourage running this on a disk with important data unless you have a good, readily-accessible backup.
  
  
Line 481: Line 487:
  
 ===== Getting Your Test Results ===== ===== Getting Your Test Results =====
 +
 +Getting your test results is always a tense moment.
 +
 +(I remember such an instance in the doctor’s examination room after an unforgettable trip to Bognor Regis, often referred to as “The Riviera of the South West”. Unfortunately the doctor confirmed Fester had come back with more than just fond memories, but with the liberal application of a strong antibiotic cream Fester was as good as new in a couple of weeks.)
 +
 +Here is how to get your results.
 +
 +(Do not start this section until all HDD tests have been completed.)
 +
 +Open an SSH console and log in.
 +
 +We are going to issue a command to each HDD/SDD in succession that will interrogate and retrieve the results of the tests stored in each drives memory using SMART commands.
 +
 +At the command prompt type in the following command using the name of the first drive you want to interrogate (in Fester’s case this is ada0).
 +
 +''smartctl –a /dev/ada0''
 +
 +{{:fester:34575799200aec7f192df26e22dfbfbf.png}}
 +
 +This should produce the following screen with the test results. The window displaying the information has been maximised (1) so it is easier to read.
 +
 +{{:fester:748fe2ad03dbdf87fbd109dc8ee80c58.png}}
 +
 +At this point Fester copies the information and pastes it into a text editor for ease of use.
 +
 +If you want to do this then select the text in the SSH console by clicking with the left mouse button where you want to begin, hold it down and then highlight the text you want to include.
 +
 +When you have done this press the “Ctrl” button and the “v” button together. This keystroke will copy the highlighted text into the clip board.
 +
 +Open the text editor you wish to use (Fester uses Notepad in Windows) and paste it into the text into the editor.
 +
 +You now need to repeat this process for the next drive in your system.
 +
 +At the command prompt type in the following command using the name of the next drive you want to interrogate (in Fester’s case this is da0).
 +
 +''smartctl –a /dev/da0''
 +
 +This will produce the next set of results in the SSH console. Copy and paste as before (if you want to).
 +
 +Now repeat the process for the next drive and the next until all the drives have been interrogated and their data copied and pasted.
 +
 +(In this way you will build up a list of each drives test results in a single text file that can be saved for examination later.)
 +
 +In Festers case this would mean issuing the following commands in the SSH console.
 +
 +''smartctl –a /dev/da1''
 +
 +''smartctl –a /dev/da2''
 +
 +''smartctl –a /dev/da3''
 +
 +''smartctl –a /dev/da4''
 +
 +''smartctl –a /dev/da5''
 +
 +''smartctl –a /dev/da6''
 +
 +''smartctl –a /dev/da7''
 +
 +These commands produce copious amounts of information about the drives. If you want something a little less gregarious then use this command instead (don’t forget to change the drive name each time, and note the capital -A rather than the lowercase -a).
 +
 +''smartctl –A /dev/ada0''
 +
 +This should produce a screen that looks something like this (much more compact).
 +
 +{{:fester:7a146af501c56cb6375b83159bf9b42c.png}}
 +
 +So you have now gathered your results, but they make about as much sense as a bacon butty at a bar mitzvah.
 +
 +What now?
 +
  
 ==== Making Sense of SMART Data ==== ==== Making Sense of SMART Data ====
 +
 +When looking at SMART data from a SMART storage device certain entries in the table are not important in terms of data integrity and health. They just give general information (e.g. Model, serial number, etc) and other types of information that could be useful in certain circumstances.
 +
 +Other entries are very important and should immediately ring alarms bells if certain values are present.
 +
 +In terms of HDD/SDD hardware validation these are the entries in the SMART data you need to scrutinise.
 +
 +|  \\ __ID#__ \\  |  \\ __ATTRIBUTE_NAME__ \\  |  \\ __FLAG__ \\  |  \\ __VALUE__ \\  |  \\ __WORST__ \\  |  \\ __THRESH__ \\  |  \\ __TYPE__ \\  |  \\ __UPDATED__ \\  |  \\ __WHEN_FAILED__ \\  |  \\ __RAW_VALUE__ \\  |
 +|  \\ 1 \\ |  \\ Raw_Read_Error_Rate \\ |  \\ 0x002f \\ |  \\ 200 \\ |  \\ 200 \\ |  \\ 051 \\ |  \\ Prefail \\ |  \\ Always \\ |  \\ - \\ |  \\ 0 \\ |
 +|  \\ 5 \\ |  \\ Reallocated_Sector_Ct \\ |  \\ 0x0033 \\ |  \\ 200 \\ |  \\ 200 \\ |  \\ 140 \\ |  \\ Prefail \\ |  \\ Always \\ |  \\ - \\ |  \\ 0 \\ |
 +|  \\ 7 \\ |  \\ Seek_Error_Rate \\ |  \\ 0x002e \\ |  \\ 200 \\ |  \\ 200 \\ |  \\ 000 \\ |  \\ Old_age \\ |  \\ Always \\ |  \\ - \\ |  \\ 0 \\ |
 +|  \\ 10 \\ |  \\ Spin_Retry_Count \\ |  \\ 0x0032 \\ |  \\ 100 \\ |  \\ 100 \\ |  \\ 000 \\ |  \\ Old_age \\ |  \\ Always \\ |  \\ - \\ |  \\ 0 \\ |
 +|  \\ 11 \\ |  \\ Calibration_Retry_Count \\ |  \\ 0x0032 \\ |  \\ 100 \\ |  \\ 100 \\ |  \\ 000 \\ |  \\ Old_age \\ |  \\ Always \\ |  \\ - \\ |  \\ 0 \\ |
 +|  \\ 196 \\ |  \\ Reallocated_Event_Count \\ |  \\ 0x0032 \\ |  \\ 200 \\ |  \\ 200 \\ |  \\ 000 \\ |  \\ Old_age \\ |  \\ Always \\ |  \\ - \\ |  \\ 0 \\ |
 +|  \\ 197 \\ |  \\ Current_Pending_Sector \\ |  \\ 0x0032 \\ |  \\ 200 \\ |  \\ 200 \\ |  \\ 000 \\ |  \\ Old_age \\ |  \\ Always \\ |  \\ - \\ |  \\ 0 \\ |
 +|  \\ 198 \\ |  \\ Offline_Uncorrectable \\ |  \\ 0x0030 \\ |  \\ 100 \\ |  \\ 253 \\ |  \\ 000 \\ |  \\ Old_age \\ |  \\ Always \\ |  \\ - \\ |  \\ 0 \\ |
 +|  \\ 199 \\ |  \\ UDMA_CRC_Error_Count \\ |  \\ 0x0032 \\ |  \\ 200 \\ |  \\ 200 \\ |  \\ 000 \\ |  \\ Old_age \\ |  \\ Always \\ |  \\ - \\ |  \\ 0 \\ |
 +
 +If you get any value other than __zero__ in the “RAW VALUE” for these entries you should be suspicious of this drive and may need to return the device for testing depending on the manufacturer’s warranty.
 +
 +Another area you should look at is the “SMART Self-test log structure”. Here is an example. It will tell you if the drive passed its tests.  Any result other than "Completed without error" is cause for concern.
 +
 +SMART Self-test log structure revision number 1
 +
 +|  \\ __Num__ \\  |  \\ __Test_Description__ \\  |  \\ __Status__ \\  |  \\ __Remaining LifeTime(hours)__ \\  |  \\ __LifeTime(hours)__ \\  |  \\ __LBA_of_first_error__ \\  |
 +|  \\ # 1 \\ |  \\ Extended offline \\ |  \\ Completed without error \\ |  \\ 00% \\ |  \\ 503 \\ |  \\ - \\ |
 +|  \\ # 2 \\ |  \\ Conveyance offline \\ |  \\ Completed without error \\ |  \\ 00% \\ |  \\ 494 \\ |  \\ - \\ |
 +|  \\ # 3 \\ |  \\ Short offline \\ |  \\ Completed without error \\ |  \\ 00% \\ |  \\ 75 \\ |  \\ - \\ |
 +
 +(If Fester is misinformed about interpreting SMART data or has omitted something important please let me know and I will try to put it in the guide or you could replace this or any section with your own?)
 +
 +That’s the HDD/SDD validation completed. Now it is time to reinstall FreeNAS and create a basic server.
 +
 +\\
  
  
  • fester/hvalid_hdd.txt
  • Last modified: 2017/06/24 17:46
  • by dan