Hi. My RAID is failing. I'd rather not lose any data, but I'm out of ideas and nothing on Google has helped so far.
I have a RAID5 array of 4 SATA disks on my amd64 box. The array will assemble, but it won't mount. Here's some command output (which may be slightly paraphrased, as I'm typing this on another machine):
Code:
root@box: mount /dev/md0
VFS: Can't find ext3 filesystem on dev md0
mount: wrong fs type, bad option, bad superblock on dev md0, missing codepage or helper program, or other error. In some cases useful info is found in syslog - try dmesg | tail or so
Code:
root@box: fsck /dev/md0
fsck.ext3: Superblock invalid, trying backup blocks...
fsck.ext3: Bad magic number in super-block while trying to open /dev/md0
...
It suggested trying an alternate superblock, so I tried the one it suggested:
Code:
root@box: fsck -b 8193 /dev/md0
fsck.ext3: Device or resource busy while trying to open /dev/md0
Filesystem mounted or opened exclusively by another program?
No idea if that's normal on a RAID device. I ran mke2fs -n to list the other backup superblocks and tried each of them with fsck -b. They all gave the same "Device or resource busy" message.
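For reference, here is a rough sketch of where the backup superblocks should sit, so the numbers from mke2fs -n can be cross-checked. It assumes the ext3 defaults as I understand them (sparse_super enabled, 8 x block size blocks per group, first data block 1 for 1k blocks and 0 otherwise). The function names are my own, and I don't actually know the block size on this filesystem; at this size 4k seems likely, in which case 8193 would not be a backup location at all.

```shell
#!/bin/sh
# is_power_of N BASE: succeeds if N equals BASE^k for some k >= 0.
is_power_of() {
    n=$1
    base=$2
    while [ "$n" -gt 1 ]; do
        [ $((n % base)) -eq 0 ] || return 1
        n=$((n / base))
    done
    [ "$n" -eq 1 ]
}

# backup_superblocks BLOCK_SIZE MAX_GROUP
# Prints the block number of each backup superblock in groups 1..MAX_GROUP.
# Assumes sparse_super: backups live in group 1 and in groups that are
# powers of 3, 5 or 7. Assumes the default 8 * BLOCK_SIZE blocks per group.
backup_superblocks() {
    block_size=$1
    max_group=$2
    blocks_per_group=$((block_size * 8))
    # With 1k blocks the primary superblock occupies block 1, so every
    # group starts one block later than with larger block sizes.
    if [ "$block_size" -eq 1024 ]; then first=1; else first=0; fi
    g=1
    while [ "$g" -le "$max_group" ]; do
        if is_power_of "$g" 3 || is_power_of "$g" 5 || is_power_of "$g" 7; then
            echo $((first + g * blocks_per_group))
        fi
        g=$((g + 1))
    done
}

# Candidates for a 4k-block filesystem, first nine groups:
backup_superblocks 4096 9
```

Each number printed would be a candidate for fsck -b, if the assumptions above hold for this filesystem.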
mdstat looks normal:
Code:
root@box: cat /proc/mdstat
...
md0: active raid5 sda[0] sdb1[3] sdd1[2] sdc1[1]
1172125208 blocks level 5, 64k chunk, algorithm 2 [4/4] [UUUU]
...
I tried stopping the array and running mdadm --create on it again; the internet told me that would rebuild the superblocks. Here's some output taken after that point.
Code:
root@box: mdadm -Q --detail /dev/md0
/dev/md0:
Version: 00.90.03
Creation Time: Sun Aug 31 10:04:10 2008
Raid level: raid5
Array size: 1172126208 (1117GB 1200GB)
Used Dev Size: 390708736 (372GB 400GB)
Raid devices: 4
Total devices: 4
Preferred minor: 0
Persistence: Superblock is persistent
Update time: Sun Aug 31 18:16:07 2008
State: clean
Active devices: 4
Working devices: 4
Failed devices: 0
Spare devices: 0
Layout: left-symmetric
Chunk size: 64k
UUID: I'm not typing all that...
Events: 0.6
Number  Major  Minor  Raiddevices  State
   0      8      0        0        active sync  /dev/sda
   1      8     33        1        active sync  /dev/sdc1
   2      8     49        2        active sync  /dev/sdd1
   3      8     17        3        active sync  /dev/sdb1
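As a quick sanity check on those sizes: RAID5 gives up one device's worth of space to parity, so the usable size should be (devices - 1) x the per-device size. A minimal sketch of that arithmetic (raid5_size is just a name I made up):

```shell
#!/bin/sh
# raid5_size DEVICES DEV_SIZE
# Usable RAID5 capacity: one device's worth of space goes to parity.
raid5_size() {
    devices=$1
    dev_size=$2
    echo $(( (devices - 1) * dev_size ))
}

# 4 devices at the per-device size reported above (in 1k blocks)
raid5_size 4 390708736   # prints 1172126208
```

That matches the Array size line. The block count I typed from mdstat differs by one digit, which may just be a transcription slip on my part.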
Querying individual drives gives similar results. Finally...
Code:
root@box: mdadm --examine /dev/md0
mdadm: No md superblock detected on /dev/md0
In retrospect I should have piped all that into text files and scp'ed it to my other machine. My carpal tunnels would thank me.
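Something like the following is what I have in mind for next time. It is only a sketch: capture is a helper name I made up, and the report filename, user and host in the usage lines are placeholders.

```shell
#!/bin/sh
# capture LOGFILE COMMAND [ARGS...]
# Appends the command line and its combined stdout/stderr to LOGFILE,
# so the exact output can be copied off the box instead of retyped.
capture() {
    log=$1
    shift
    {
        printf '$ %s\n' "$*"
        "$@" 2>&1
    } >> "$log"
}

# Usage on the RAID box (needs root; filename and host are placeholders):
#   capture raid-report.txt cat /proc/mdstat
#   capture raid-report.txt mdadm --detail /dev/md0
#   scp raid-report.txt user@othermachine:
```

Appending rather than overwriting keeps everything in one file, with each command recorded above its output.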
Anyway, part of what I don't understand is whether this is an ext3 error or a RAID error. I'm inclined to think it's ext3, except that --examine tells me there's no md superblock. Is an md superblock the same thing as a regular (filesystem) superblock? Either way, what can I do about it?