Ups & Downs
profdlp
The Holy City Of Westlake, Ohio
First the rant - without boring you with specifics, my humble Folding farm has been going through some tough times. Besides problems here at home, I've made two 200+ mile round-trips within the last two weeks to get my dad's and my daughter's rigs back in action. The Old Man has a bum HD, so I'll be going back when his RMA arrives...
The Specific Question:
My main computer has been dumping WU's right and left for the past week+. I've tried the following:
Wipe & Reload F@H program
Dumped cores (several times)
Stopped overclocking...
Ran memtest (passed fully)
Here are excerpts from my log:
Any suggestions????????? :banghead: :banghead:
The Good News: Got a few parts in and have had one of those "ripple effect" upgrades. Bottom line is that an Athlon 1200 has been replaced by an XP 2400+. Once I get the rest of the crap sorted out I should do better than ever.
The Specific Question:
My main computer has been dumping WU's right and left for the past week+. I've tried the following:
Wipe & Reload F@H program
Dumped cores (several times)
Stopped overclocking...
Ran memtest (passed fully)
Here are excerpts from my log:
When I was up at my dads on Tuesday (system idle all day) it dumped 3 or 4 WU's...*******************************
[19:54:02] Quit 101 - Fatal error:
[19:54:02] Step 31327, time 62.654 (ps) LINCS WARNING
[19:54:02] relative constraint deviation after LINCS:
[19:54:02] max 0.000000 (between atoms 1 and 2) rms 1.#QNAN0
[19:54:02] Simulation instability has been encountered. The run has entered a…<snip>
[19:54:03] Folding@home Core Shutdown: EARLY_UNIT_END
[19:54:07] CoreStatus = 72 (114)
[19:54:07] Sending work to server
*******************************
[21:32:02] Gromacs exception handled
[21:32:02] Folding@home Core Shutdown: SPECIAL_EXIT
[21:32:05] CoreStatus = 65 (101)
[21:32:05] Core internal error: SPECIAL_EXIT
*******************************
[23:45:18] Quit 101 - Fatal error:
[23:45:18] Step 12387, time 24.774 (ps) LINCS WARNING
[23:45:18] relative constraint deviation after LINCS:
[23:45:18] max 0.000000 (between atoms 1 and 2) rms 1.#QNAN0
[23:45:18] Simulation instability has been encountered. The run has entered a…<snip>
[23:45:18] Folding@home Core Shutdown: EARLY_UNIT_END
[23:45:22] CoreStatus = 72 (114)
[23:45:22] Sending work to server
******************************
[14:43:27] Completed 85000 out of 500000 steps (17)
[14:45:13] Quit 101 - Fatal error:
[14:45:13] Step 85141, time 170.282 (ps) LINCS WARNING
[14:45:13] relative constraint deviation after LINCS:
[14:45:13] max 0.000000 (between atoms 1 and 2) rms 1.#QNAN0
[14:45:13] Simulation instability has been encountered. The run has entered a…<snip>
[14:45:16] Folding@home Core Shutdown: EARLY_UNIT_END
[14:45:20] CoreStatus = 72 (114)
[14:45:20] Sending work to server
******************************
Any suggestions????????? :banghead: :banghead:
The Good News: Got a few parts in and have had one of those "ripple effect" upgrades. Bottom line is that an Athlon 1200 has been replaced by an XP 2400+. Once I get the rest of the crap sorted out I should do better than ever.
0
Comments
Tried the 4.00, 3.25, and 3.24.
System is an Athlon 1200 on an Abit Kt7A-Raid (not running raid) with 512MB Crucial Cas2.
Not sure what you mean by the first part, how would I run two simultaneously? Haven't tried underclocking , my has it come to that? I'll mess with the voltages and see what happens. Also, I may have created some confusion by mentioning my other upgrades. Those are on other computers, this one has not changed a bit. The other comps are cranking out the WU's just fine.
This is what the comp did overnight:
System has been totally cleaned. I'll try and keep an eye on MBM5 and see if I can spot anything.
Worth 1,000 words?:
but seriously my suggestions are just to see what happens so we can compare results and not posed as solutions - sorta process of elimination. I've only run into LINC problems when ram or the ram bus/subsystem was unstable. Like I ran 128mb pc100 stick on a KT7A-R at 133 for a while but it started to have errors and brought it down to 124 and it was okay. A damaged or unstable cpu (overclocked) usually results in consistant errors when it's dumping work units - always dumping at a certain frame or errors during initial decompress and start of the first frame.