Random freezes
I'm experiencing random lockups with a computer I've been working on for a while. Currently the components are:
Abit KT7 Raid, with the OS drive on the onboard IDE connectors in a mirrored array.
Win2K Server
1 Aha 2940UW SCSI controller
1 Aha 2190? SCSI controller
2 Maxtor 40GB drives
512MB Ram (not sure of the brand)
2 Intel Pro Server+ NICs
1 new HighPoint RocketRAID 133 IDE controller
2 new 200GB WD drives
The power supply seems fine and the motherboard seems fine. The computer locks up in the middle of the night during backups, and the HDD light stays on. I've run stress tests on the memory and it seems fine. The NICs seem fine as well. I can't find any diagnostic apps for the SCSI cards online anywhere, but I suspect one of them might be the problem (I haven't had a chance to pull them one at a time yet).
Any thoughts or ideas? Thanks.
Comments
Another thing to consider is running disk benchmarks on the drives (like Sandra or ATTO) to see if that locks it up. If a benchmark on one particular drive or controller triggers the freeze, that narrows it down.
Also, SCSI drives get very hot, especially during intensive use (such as a backup), so I'd check the temps (usually done by touching the drive :P and if you can't keep your hand on it, you need some cooling). A simple 80mm case fan blowing across the drive works wonders; it keeps my Cheetah (15k rpm SCSI) nice and cool.
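If you can't get one of the benchmark tools onto the box, a rough do-it-yourself version of the same idea is a script that hammers one drive at a time with sustained writes and reads. This is only a sketch, assuming Python 3 is installed on the machine; `stress_drive` and its parameters are made-up names for illustration, not part of any tool mentioned here. Point it at a directory on each array in turn and see which one (if any) takes the machine down.

```python
import hashlib
import os
import sys
import tempfile


def stress_drive(directory, passes=3, size_mb=64, chunk_mb=1):
    """Write a scratch file in `directory`, read it back, and compare checksums.

    Returns the number of passes whose read-back data matched what was written.
    All names and defaults here are illustrative assumptions.
    """
    chunk = os.urandom(chunk_mb * 1024 * 1024)  # random data, reused per write
    chunks = size_mb // chunk_mb
    verified = 0
    for n in range(passes):
        path = os.path.join(directory, "stress_%d.tmp" % n)
        written = hashlib.md5()
        with open(path, "wb") as f:
            for _ in range(chunks):
                f.write(chunk)
                written.update(chunk)
            f.flush()
            os.fsync(f.fileno())  # ask the OS to push the data out to the disk
        read_back = hashlib.md5()
        with open(path, "rb") as f:
            while True:
                block = f.read(len(chunk))
                if not block:
                    break
                read_back.update(block)
        os.remove(path)  # clean up the scratch file after each pass
        if read_back.digest() == written.digest():
            verified += 1
    return verified


if __name__ == "__main__":
    # Target directory comes from the command line; falls back to the temp dir.
    target = sys.argv[1] if len(sys.argv) > 1 else tempfile.gettempdir()
    good = stress_drive(target)
    print("%d of 3 passes verified on %s" % (good, target))
```

Keep in mind the read-back can be served from the OS cache, so for a real test bump `size_mb` well past the installed RAM and raise `passes` so it runs long enough to heat things up the way a backup does.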
I'm not certain, but I believe the BIOS is version 3R (with the accompanying HPT BIOS update), dated 7/5/2001.
The computer itself was running fine for 2 years. This problem has just popped up recently, with no HW changes and no power surges/sags.
Is it possible that the onboard RAID controller has gone bad?
High disk loads put a large, sustained power draw on the system. My first look would be the PSU, then a driver/hardware conflict in the disk subsystem. That's probably because of my own never-ending adventure tracking down a bad PSU.
Mine ran for over a year on that PSU before it died. I said the same things you are: it can't be, it can't be, it checks out fine. Well......
NS
Try this - take the heatsink off the CPU, remove the fan and check for dust buildup in the fins. You'll be surprised (shocked) how much is there.
Flint
The high temps are getting into the realm of instability. They may be acceptable to you, but the CPU might not like them. Component tolerances change over time. Here's a cheap trick: take the side off your machine and see if the temps drop. If so, run it with the side off for a while and see if it still locks up.
Let us know what happens.
Flint - I looked over the board pretty thoroughly and didn't find any physical problems.
Craig