Results 1 to 10 of 10

Thread: Hard disk lockups

  1. #1
    Join Date
    Apr 2007
    Beans
    107

    Hard disk lockups

    ... system has random lockups, while rsyncing from disk 1 to disk 2. Obvious suggestion is that the disk is going bad - but which disk?

    Complicating issues is the fact that the server is headless, and it would be very inconvenient to hook up a monitor.

    Can anyone suggest an easy way to test which disk is locking up?

    Thanks!
    Last edited by awacs; November 15th, 2012 at 05:31 PM.

  2. #2
    Join Date
    Jul 2005
    Beans
    2,047

    Re: Hard disk lockups

    Install smartmontools. Then do

    Code:
    sudo smartctl -a /dev/sda
    to check attributes of /dev/sda (or replace with /dev/sdb, etc.) It's less easy to interpret the attributes if you don't know which ones are important (the "pre-failure" text looks scary but isn't). Just paste the output of the command here for the disk you are worried about.

    You can also run S.M.A.R.T. tests.

  3. #3
    Join Date
    Apr 2007
    Beans
    107

    Re: Hard disk lockups

    Quote Originally Posted by ahallubuntu View Post
    Install smartmontools. Then do

    Code:
    sudo smartctl -a /dev/sda
    to check attributes of /dev/sda (or replace with /dev/sdb, etc.) It's less easy to interpret the attributes if you don't know which ones are important (the "pre-failure" text looks scary but isn't). Just paste the output of the command here for the disk you are worried about.

    You can also run S.M.A.R.T. tests.
    Thanks!

  4. #4
    Join Date
    Apr 2007
    Beans
    107

    Re: Hard disk lockups

    Thanks for anyone who wants to look at this. output of smartctl -a is attached (or I could post it inline if anyone wants.)
    Attached Files Attached Files

  5. #5
    Join Date
    Jul 2005
    Beans
    2,047

    Re: Hard disk lockups

    The 250GB WD drive (/dev/sda) has four reallocated sectors. That means four sectors failed and were marked "do not use" by the drive's firmware and replaced by spare sectors. This is not catastrophic especially on an old drive. You may see a slight reduction in performance as the spares will be non-contiguous with the sectors that failed.

    You should watch this number, though; if more sectors fail more than say a few a year I'd replace the drive. The drive probably has a few hundred spare sectors FYI.

    None of the other drives have glaring problems that jump out at me.

    You can use smartctl to run an extended S.M.A.R.T. test on the drives that are suspect. On the 250GB drive the test will take probably 1-2 hours. Check the man page for smartctl for exact syntax. The drive itself (the firmware) will run the test; smartctl will just monitor its progress each time you check it. You can use the drive while the test is in progress but probably best not to.

  6. #6
    Join Date
    Apr 2007
    Beans
    107

    Re: Hard disk lockups

    Thanks!

  7. #7
    Join Date
    Feb 2011
    Location
    Coquitlam, B.C. Canada
    Beans
    3,521
    Distro
    Ubuntu Development Release

    Re: Hard disk lockups

    When your system locks up, how long does it lock up for? Is it 30 seconds?
    Are there any entries in /var/log/kern.log when the lock up occurs?

  8. #8
    Join Date
    Apr 2007
    Beans
    107

    Re: Hard disk lockups

    Locks up 'forever'. Need to power cycle. No messages at the time of lockup, but at other times I get messages like
    'NMI backtrace for cpu 2'
    in kern.log.

  9. #9
    Join Date
    Feb 2011
    Location
    Coquitlam, B.C. Canada
    Beans
    3,521
    Distro
    Ubuntu Development Release

    Re: Hard disk lockups

    Thanks. Then your issue is different than mine. For my issue I have traced it back to a single line change in udev between launchpad versions 2759 and 2760. I have yet to find someone else experienceing the same issue.
    Hope you get yours figured out.

  10. #10
    Join Date
    Jul 2005
    Beans
    2,047

    Re: Hard disk lockups

    Go ahead and run the extended S.M.A.R.T. test. But there are lots of reasons a system could lock up that have nothing to do with the hard drives. The CPU could be overheating. The power supply could be suspect. The RAM may have an issue. Same with the motherboard.

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •