New RAID Array.. Bad time :(
[sap story]
Well.. it hasn't even been a full week since I set up my first RAID.. and I'm sick of it.
I love the speeds.. no question about that. I tweaked it a bit and got 118MB/s reads and 92MB/s writes.. great stuff.. it's just that.... I've lost so much. When people say "have a backup! you have 2x the chance of a drive failing and you lose everything!".. I understood that. I took precautions and kept a backup... I never thought stuff like what is happening now.. would happen.
Just out of nowhere.. something's corrupt. Nothing unusual happening at all.. it's just gone! Tonight, after an hour or so of BF1942.. I go to put on a movie and go to sleep.. but my "Video" folder with all my videos (duh) is corrupt.
chkdsk is running now... but I don't have any hopes for it. Two other small folders have gone south within the last 2 days too.. so I fear this one will fall with them.
I'm seriously thinking about just using the drives as normal. It's not that I have ultra sensitive data on the RAID.. it's just that I know soon enough I won't even have a working OS.
[/end story]
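For anyone wondering where the "2x the chance" rule of thumb comes from, here is a rough sketch. It assumes the drives fail independently and that losing any one drive loses the whole RAID-0 array; the 3% per-drive failure rate is a made-up number for illustration only, not a figure for these Deskstars.

```python
def raid0_failure_probability(p_drive, n_drives=2):
    """Chance that at least one of n independent drives fails.

    In RAID-0 a single drive failure takes the whole array with it,
    so this is also the chance of losing the array.
    """
    return 1 - (1 - p_drive) ** n_drives


if __name__ == "__main__":
    p = 0.03  # hypothetical per-drive failure probability, illustration only
    array_p = raid0_failure_probability(p)
    print(f"single drive: {p:.4f}")
    print(f"2-drive RAID-0: {array_p:.4f} (about {array_p / p:.2f}x)")
```

For small per-drive probabilities the result is just under 2x, which is why the rule of thumb holds up.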
Hardware/BIOS/Driver:
+ Hitachi Deskstar 2x 160GB w/ 8MB cache
+ SIL3112A SATA RAID (BIOS 4.2.43)
+ SATA RAID Driver v1.0.0.40
Software
WinXP SP1
I really want to keep the RAID because the speeds are awesome.. and I'm enjoying learning about RAID now that I finally have an array to play with. Does anyone have any suggestions or know anything about what's happening? I'd be willing to do anything needed to help you guys help me.. just let me know.
Thanks for any help..
Comments
I suspect hardware, or cabling.
Here is a log when I tested my C partition over the past couple of days...
One thing I've noticed is that the SATA headers on mobos nowadays are pretty cheap and crappy. I accidentally snapped one right off an Asus board with not much effort just a few weeks ago ... Check the headers themselves to see if they are marginal or maybe even loose from the board itself.
The affected drive had all sorts of I/O errors. Entire 100GB directories would disappear, and/or files within directories (sometimes all of them, other times files with names starting with H on down to Z, while files with names starting with A through G were still there). Furthermore, when new data was written to the drive, it would always read back as corrupt (movie files would have terrible errors, and EXE files would simply not function).
While the disappearing directories and files would often re-appear after running chkdsk, the files that became corrupt when written to the drive would be damaged beyond repair. I was lucky that the drive affected wasn't the system drive (just a file archive) or else the OS would surely have needed to be re-installed again, and again, and again.
After talking with Prime over AIM, I thought of lowering the FSB. This fixed the problem, and the drives have been entirely stable to this day.
Have you run Memtest yet?
@SmJ:
nForce 2 PCI is locked.. isn't it? I feel bad I have to question myself on that one... I'm 99% sure it is. My FSB is only @ 200 anyway.. so I don't think that would be it.. who knows though..
@Geeky1 w/ scary voice:
Yeah.. memtest passes 12hrs @ 200FSB.
--- here goes the Hitachi tool testing...
Geeky AND Park, is the RAID chip embedded in the SB??? If so, RAID can be running at a speed based on a ratio to FSB and not locked.... while this would NOT affect a RAID card, embedded RAID could be affected.
I KNOW, Shouldn't, but COULD.
John D.
So far, the cables being bad or loose is the best we've got to go on. The cables are over a year old.. I got them with an MSI KT4 Ultra-SR (SR = SerialRAID). I don't think their age would matter in itself, but it doesn't rule out the possibility of them just being bad. I'll double check connections ASAP.
Today we're having "the worst winter storm we've had in a decade".. the roads are already covered in ice.. so there's no way of getting new SATA cables anytime soon, and I can't very well swap them out for testing.
John -- It's not embedded, but it is onboard. I don't think it should be running at any different speed than what my PCI bus is running. Anyone know if I could check the clock of the controller somehow?
50% of my compressed files are corrupt now.. ZIP/RAR/GZip.. the whole list. I'm not getting CRC32 mismatches.. I'm just getting a plain "File corrupt" message when I test/extract an archive. This also applies to installers that have compressed files in them.. such as 3DMark03. I've downloaded it ~5 times now (from different locations), and during the install a CAB comes up corrupt.
Last night, I stayed up and backed up anything I needed. If needed, I'm ready to nuke the array/format/whatever to the HDDs. I got SuSE 9.0 LIVE so I can still use the internet (just no Win32 apps) and not have to use the HDDs for a Win install.
Should I try different BIOS versions? I've got the newest NF7-S one.. with the newest SI BIOS... anything wrong with those?
Now I need something that will compress and test archives non-stop.. like a benchmark... anyone know of one?
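In case no ready-made tool turns up, here is a minimal sketch of that kind of compress-and-test loop in Python. The function names (`stress_once`, `run`) and the file name are made up for this example. One caveat: a file read back right after it was written may come out of the OS file cache rather than off the disks, so a clean pass with small sizes does not prove much. Pointing it at the array with a large `size_bytes` and many rounds gives it a better chance of catching real corruption.

```python
import gzip
import hashlib
import os
import tempfile
import zlib


def stress_once(directory, size_bytes=1_000_000):
    """Write random data as a gzip file, read it back, compare hashes.

    Returns True if the archive decompressed to exactly what was written.
    """
    payload = os.urandom(size_bytes)
    expected = hashlib.sha256(payload).hexdigest()
    path = os.path.join(directory, "stress_test.gz")  # hypothetical file name
    with gzip.open(path, "wb") as f:
        f.write(payload)
    try:
        with gzip.open(path, "rb") as f:
            actual = hashlib.sha256(f.read()).hexdigest()
    except (OSError, EOFError, zlib.error):
        # A damaged archive usually fails to decompress at all.
        return False
    finally:
        os.remove(path)
    return actual == expected


def run(directory, rounds=10, size_bytes=1_000_000):
    """Repeat the test and return how many rounds came back corrupt."""
    failures = 0
    for i in range(rounds):
        if not stress_once(directory, size_bytes):
            failures += 1
            print(f"round {i}: archive corrupt")
    return failures


if __name__ == "__main__":
    # Uses a temp directory here; pass a folder on the RAID volume instead.
    with tempfile.TemporaryDirectory() as d:
        print("failures:", run(d, rounds=5))
```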
I made a GZip with like 5,000 small files. It's 40MB. Tested it on C: (where it was made) and no errors. I copied it to E: (just another partition on the RAID).. no errors. I checked the drive for errors with Partition Magic.. 0 returned. I defragged with Diskeeper. Checked the disk again, 0 errors. BUT the GZip is corrupt. :thumbsdown:
I repeated the process, but this time uninstalled Diskeeper and just used the Windows defragger. GZip still works..
Yes, I have defragged the RAID before.. which allows the suspicion that Diskeeper is behind some or all of my corruption. I will continue to do the procedure above on gigs and gigs of data... see if I can get it to choke just using the Windows defrag.
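For running that procedure over gigs of data, hashing everything before and after the defrag is less work than testing archives one by one. The sketch below is only one way to do it, and the helper names (`build_manifest`, `diff_manifests`) are invented for this example: take a manifest of SHA-256 hashes, run the defragmenter, take a second manifest, and list whatever changed or vanished.

```python
import hashlib
import os


def file_sha256(path, chunk_size=1 << 20):
    """Hash a file in chunks so large videos do not need to fit in RAM."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def build_manifest(root):
    """Map each file's path (relative to root) to its SHA-256 hash."""
    manifest = {}
    for dirpath, _dirs, files in os.walk(root):
        for name in files:
            full = os.path.join(dirpath, name)
            manifest[os.path.relpath(full, root)] = file_sha256(full)
    return manifest


def diff_manifests(before, after):
    """Return (changed, missing) lists of relative paths, both sorted."""
    changed = sorted(p for p in before if p in after and before[p] != after[p])
    missing = sorted(p for p in before if p not in after)
    return changed, missing
```

Usage would be roughly: `before = build_manifest("E:\\")`, run the defrag, `after = build_manifest("E:\\")`, then `diff_manifests(before, after)`. A defragmenter should never change file contents, so any entry in either list points at corruption somewhere in the chain, not necessarily at the defragmenter itself.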
Diskeeper = :shakehead
If these errors only happen shortly after messing with Diskeeper, then I'd say it's the culprit for sure.
How so? I didn't know that.. nor do I notice anything (besides the corruption ).
Got a link or anything about it?
Here are some ATTOs:
This is my 2nd time using Diskeeper and then trashing it after it killed files. The last time it turned my 40GB into RAW.. and no return. Maybe I should never use it again...
I'll report back here...
There's a release note for the ABIT NF7-S 2.0 BIOS v14. I've set EXT-P2P Discard Time to 1ms.. and it works like a charm. I've finally been able to install 3DMark03 without a CRC32 mismatch error, and nothing else has died on me.
I believe that was the problem. I'm going to try Diskeeper tomorrow to see if they can clear their name from my blacklist.. we'll see. Thanks everyone for helping out :smiles:
Now, if anyone has problems with SATA RAID-0.. we know to check EXT-P2P Discard Time in the BIOS. The answer wasn't too far away this whole time.
Nope, nothing noticeable. I thought maybe it did, so I benched it. Same scores, almost exact.
Also, Diskeeper works fine again.
"What exactly is Diskeeper?" -- It's a defragmenter with options and information. You can defrag for max performance or max free space. It also tells you everything you could ever want to know about your fragmented hard drive -- volume frags, file frags, directory frags, MFT frags... it's all there. They also claim it does a better job than MS Defrag and does it faster. It also has neat pictures to show performance increases from a defrag.
I don't know about performance wise, but it looks like they're very similar GUI and information wise.