Severe Folding instability.
drasnor
Starship Operator · Hawthorne, CA · Icrontian
Ok, what would make Folding really unstable on a dual Opteron system? I mean, it's more stable under SSE than 3DNow!. It isn't overclocked or anything, and I'm stumped.
-drasnor

Here's 3DNow!:
--- Opening Log file [January 13 02:56:16]
# Windows Console Edition #####################################################
###############################################################################
Folding@home Client Version 4.00
http://folding.stanford.edu
###############################################################################
###############################################################################
Arguments: -local -forceasm -advmethods -service
Warning:
By using the -forceasm flag, you are overriding
safeguards in the program. If you did not intend to
do this, please restart the program without -forceasm.
If work units are not completing fully (and particularly
if your machine is overclocked), then please discontinue
use of the flag.
[02:56:16] - Ask before connecting: No
[02:56:16] - Use IE connection settings: Yes
[02:56:16] - User name: drasnor (Team 93)
[02:56:16] - User ID = 414862F622BE5569
[02:56:16] - Machine ID: 2
[02:56:16]
[02:56:16] Work directory not found. Creating...
[02:56:16] Could not open work queue, generating new queue...
[02:56:16] + Benchmarking ...
[02:56:18] - Preparing to get new work unit...
[02:56:18] + Attempting to get work packet
[02:56:18] - Connecting to assignment server
[02:56:19] - Successful: assigned to (171.67.89.151).
[02:56:19] + News From Folding@Home: v.4 client available
[02:56:19] Loaded queue successfully.
[02:57:26] + Closed connections
[02:57:26]
[02:57:26] + Processing work unit
[02:57:26] Core required: FahCore_78.exe
[02:57:26] Core found.
[02:57:26] Working on Unit 01 [January 13 02:57:26]
[02:57:26] + Working ...
[02:57:26]
[02:57:26] *------------------------------*
[02:57:26] Folding@home Gromacs Core
[02:57:26] Version 1.55 (December 22, 2003)
[02:57:26]
[02:57:26] Preparing to commence simulation
[02:57:26] - Assembly optimizations manually forced on.
[02:57:26] - Not checking prior termination.
[02:57:27] - Expanded 509979 -> 2524925 (decompressed 495.1 percent)
[02:57:27] - Starting from initial work packet
[02:57:27]
[02:57:27] Project: 920 (Run 51, Clone 24, Gen 19)
[02:57:27]
[02:57:27] Assembly optimizations on if available.
[02:57:27] Entering M.D.
[02:57:33] Protein: p920_vpf910
[02:57:33]
[02:57:33] Writing local files
[02:57:34] Extra 3DNow boost OK.
[02:57:34] Writing local files
[02:57:36] Completed 0 out of 250000 steps (0)
[02:57:39] Quit 101 - Fatal error:
[02:57:39] Step 1, time 0.002 (ps) LINCS WARNING
[02:57:39] relative constraint deviation after LINCS:
[02:57:39] max 0.000000 (between atoms 1 and 2) rms 1.#QNAN0
[02:57:39]
[02:57:39] Simulation instability has been encountered. The run has entered a
[02:57:39] state from which no further progress can be made.
[02:57:39] If you often see other project units terminating early like this
[02:57:39] too, you may wish to check the stability of your computer (issues
[02:57:39] such as high temperature, overclocking, etc.).
[02:57:39] Going to send back what have done.
[02:57:39] logfile size: 6854
[02:57:39] - Writing 7525 bytes of core data to disk...
[02:57:39] ... Done.
[02:57:39]
[02:57:39] Folding@home Core Shutdown: EARLY_UNIT_END
[02:57:42] CoreStatus = 72 (114)
[02:57:42] Sending work to server
[02:57:42] + Attempting to send results
[02:57:47] + Results successfully sent
[02:57:47] Thank you for your contribution to Folding@home.
[02:57:51] - Preparing to get new work unit...
[02:57:51] + Attempting to get work packet
[02:57:51] - Connecting to assignment server
[02:57:53] - Successful: assigned to (171.67.89.151).
[02:57:53] + News From Folding@Home: v.4 client available
[02:57:53] Loaded queue successfully.
[02:58:48] + Closed connections
[02:58:53]
[02:58:53] + Processing work unit
[02:58:53] Core required: FahCore_78.exe
[02:58:53] Core found.
[02:58:53] Working on Unit 02 [January 13 02:58:53]
[02:58:53] + Working ...
[02:58:53]
[02:58:53] *------------------------------*
[02:58:53] Folding@home Gromacs Core
[02:58:53] Version 1.55 (December 22, 2003)
[02:58:53]
[02:58:53] Preparing to commence simulation
[02:58:53] - Assembly optimizations manually forced on.
[02:58:53] - Not checking prior termination.
[02:58:54] - Expanded 512103 -> 2549549 (decompressed 497.8 percent)
[02:58:54] - Starting from initial work packet
[02:58:54]
[02:58:54] Project: 921 (Run 11, Clone 22, Gen 13)
[02:58:54]
[02:58:54] Assembly optimizations on if available.
[02:58:54] Entering M.D.
[02:59:00] Protein: p921_vpf912
[02:59:00]
[02:59:00] Writing local files
[02:59:01] Extra 3DNow boost OK.
[02:59:01] Writing local files
[02:59:03] Completed 0 out of 250000 steps (0)
[02:59:05] Quit 101 - Fatal error:
[02:59:05] Step 1, time 0.002 (ps) LINCS WARNING
[02:59:05] relative constraint deviation after LINCS:
[02:59:05] max 0.000000 (between atoms 1 and 2) rms 1.#QNAN0
[02:59:05]
[02:59:05] Simulation instability has been encountered. The run has entered a
[02:59:05] state from which no further progress can be made.
[02:59:05] If you often see other project units terminating early like this
[02:59:05] too, you may wish to check the stability of your computer (issues
[02:59:05] such as high temperature, overclocking, etc.).
[02:59:05] Going to send back what have done.
[02:59:05] logfile size: 7232
[02:59:05] - Writing 7903 bytes of core data to disk...
[02:59:05] ... Done.
[02:59:05]
[02:59:05] Folding@home Core Shutdown: EARLY_UNIT_END
[02:59:09] CoreStatus = 72 (114)
[02:59:09] Sending work to server
[02:59:09] + Attempting to send results
[02:59:13] + Results successfully sent
[02:59:13] Thank you for your contribution to Folding@home.
[02:59:17] - Preparing to get new work unit...
[02:59:17] + Attempting to get work packet
[02:59:17] - Connecting to assignment server
[02:59:18] - Successful: assigned to (171.67.89.151).
[02:59:18] + News From Folding@Home: v.4 client available
[02:59:18] Loaded queue successfully.
[03:00:08] + Closed connections
[03:00:13]
[03:00:13] + Processing work unit
[03:00:13] Core required: FahCore_78.exe
[03:00:13] Core found.
[03:00:13] Working on Unit 03 [January 13 03:00:13]
[03:00:13] + Working ...
[03:00:13]
[03:00:13] *------------------------------*
[03:00:13] Folding@home Gromacs Core
[03:00:13] Version 1.55 (December 22, 2003)
[03:00:13]
[03:00:13] Preparing to commence simulation
[03:00:13] - Assembly optimizations manually forced on.
[03:00:13] - Not checking prior termination.
[03:00:13] - Expanded 513033 -> 2549549 (decompressed 496.9 percent)
[03:00:13] - Starting from initial work packet
[03:00:13]
[03:00:13] Project: 921 (Run 11, Clone 23, Gen 31)
[03:00:13]
[03:00:13] Assembly optimizations on if available.
[03:00:13] Entering M.D.
[03:00:19] Protein: p921_vpf912
[03:00:19]
[03:00:19] Writing local files
[03:00:21] Extra 3DNow boost OK.
[03:00:21] Writing local files
[03:00:23] Completed 0 out of 250000 steps (0)
[03:00:25] Quit 101 - Fatal error:
[03:00:25] Step 1, time 0.002 (ps) LINCS WARNING
[03:00:25] relative constraint deviation after LINCS:
[03:00:25] max 0.000000 (between atoms 1 and 2) rms 1.#QNAN0
[03:00:25]
[03:00:25] Simulation instability has been encountered. The run has entered a
[03:00:25] state from which no further progress can be made.
[03:00:25] If you often see other project units terminating early like this
[03:00:25] too, you may wish to check the stability of your computer (issues
[03:00:25] such as high temperature, overclocking, etc.).
[03:00:25] Going to send back what have done.
[03:00:25] logfile size: 7232
[03:00:25] - Writing 7903 bytes of core data to disk...
[03:00:25] ... Done.
[03:00:25]
[03:00:25] Folding@home Core Shutdown: EARLY_UNIT_END
[03:00:29] CoreStatus = 72 (114)
[03:00:29] Sending work to server
[03:00:29] + Attempting to send results
[03:00:31] + Results successfully sent
[03:00:31] Thank you for your contribution to Folding@home.
[03:00:35] - Preparing to get new work unit...
[03:00:35] + Attempting to get work packet
[03:00:35] - Connecting to assignment server
[03:00:37] - Successful: assigned to (171.67.89.151).
[03:00:37] + News From Folding@Home: v.4 client available
[03:00:37] Loaded queue successfully.
I dumped the SSE log, but I'll have one later. It looks pretty much the same though.
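A note on the `rms 1.#QNAN0` in the failure lines above: that is how older Microsoft C runtimes print a quiet NaN, so the core's arithmetic has already gone bad by step 1. A reported max of 0.000000 next to a NaN rms is what you would expect if every deviation is NaN, since comparisons against NaN are always false while sums absorb it. The sketch below is in Python and only illustrates that floating-point behavior; the function name and the loop structure are assumptions, not the actual Gromacs code.

```python
import math


def lincs_style_report(deviations):
    """Track max and rms the way a naive loop would.

    Hypothetical structure for illustration only; not taken from Gromacs.
    """
    max_dev = 0.0
    sum_sq = 0.0
    for d in deviations:
        # Any comparison with NaN is False, so max_dev is never updated.
        if d > max_dev:
            max_dev = d
        # NaN poisons the running sum from the first bad value onward.
        sum_sq += d * d
    rms = math.sqrt(sum_sq / len(deviations))
    return max_dev, rms


max_dev, rms = lincs_style_report([float("nan")] * 3)
print(f"max {max_dev:.6f} rms {rms}")
```

If that reading is right, the "between atoms 1 and 2" part of the message is just the default pair and says nothing about where the bad value came from.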
Comments
For starters try removing the -forceasm flag.
This appears suspicious.
--- Opening Log file [January 13 03:05:54]
# Windows Console Edition #####################################################
###############################################################################
Folding@home Client Version 4.00
http://folding.stanford.edu
###############################################################################
###############################################################################
Arguments: -local -forceSSE -advmethods -service
Warning:
By using the -forceSSE flag, you are overriding
program safeguards that monitor the stability of SSE
instructions on your system. If you did not intend to
do this, please restart the program without -forceSSE.
If work units are not completing fully, then please
discontinue use of the flag.
[03:05:54] - Ask before connecting: No
[03:05:54] - Use IE connection settings: Yes
[03:05:54] - User name: drasnor (Team 93)
[03:05:54] - User ID = 414862F622BE5569
[03:05:54] - Machine ID: 1
[03:05:54]
[03:05:54] Work directory not found. Creating...
[03:05:54] Could not open work queue, generating new queue...
[03:05:54] + Benchmarking ...
[03:05:56] - Preparing to get new work unit...
[03:05:56] + Attempting to get work packet
[03:05:56] - Connecting to assignment server
[03:05:56] - Successful: assigned to (171.67.89.151).
[03:05:56] + News From Folding@Home: v.4 client available
[03:05:57] Loaded queue successfully.
[03:06:29] + Closed connections
[03:06:29]
[03:06:29] + Processing work unit
[03:06:29] Core required: FahCore_78.exe
[03:06:29] Core found.
[03:06:29] Working on Unit 01 [January 13 03:06:29]
[03:06:29] + Working ...
[03:06:29]
[03:06:29] *------------------------------*
[03:06:29] Folding@home Gromacs Core
[03:06:29] Version 1.55 (December 22, 2003)
[03:06:29]
[03:06:29] Preparing to commence simulation
[03:06:29] - Assembly optimizations manually forced on.
[03:06:29] - Not checking prior termination.
[03:06:30] - Expanded 508867 -> 2524925 (decompressed 496.1 percent)
[03:06:30] - Starting from initial work packet
[03:06:30]
[03:06:30] Project: 920 (Run 4, Clone 23, Gen 25)
[03:06:30]
[03:06:30] Assembly optimizations on if available.
[03:06:30] Entering M.D.
[03:06:36] Protein: p920_vpf910
[03:06:36]
[03:06:36] Writing local files
[03:06:37] Extra SSE boost OK.
[03:06:37] Writing local files
[03:06:39] Completed 0 out of 250000 steps (0)
[03:10:55] Writing local files
[03:10:57] Completed 2500 out of 250000 steps (1)
[03:13:37] Quit 101 - Fatal error:
[03:13:37] Step 4099, time 8.198 (ps) LINCS WARNING
[03:13:37] relative constraint deviation after LINCS:
[03:13:37] max 0.000000 (between atoms 1 and 2) rms 1.#QNAN0
[03:13:37]
[03:13:37] Simulation instability has been encountered. The run has entered a
[03:13:37] state from which no further progress can be made.
[03:13:37] If you often see other project units terminating early like this
[03:13:37] too, you may wish to check the stability of your computer (issues
[03:13:37] such as high temperature, overclocking, etc.).
[03:13:37] Going to send back what have done.
[03:13:37] logfile size: 8648
[03:13:37] - Writing 9322 bytes of core data to disk...
[03:13:37] ... Done.
[03:13:37]
[03:13:37] Folding@home Core Shutdown: EARLY_UNIT_END
[03:13:39] CoreStatus = 72 (114)
[03:13:39] Sending work to server
[03:13:39] + Attempting to send results
[03:13:41] + Results successfully sent
[03:13:41] Thank you for your contribution to Folding@home.
[03:13:45] - Preparing to get new work unit...
[03:13:45] + Attempting to get work packet
[03:13:45] - Connecting to assignment server
[03:13:45] - Successful: assigned to (171.64.122.111).
[03:13:45] + News From Folding@Home: v.4 client available
[03:13:45] Loaded queue successfully.
[03:13:45] Couldn't send HTTP request to server (wininet)
[03:13:45] + Could not connect to Work Server
[03:13:45] - Error: Attempt #1 to get work failed, and no other work to do.
Waiting before retry.
[03:13:58] + Attempting to get work packet
[03:13:58] - Connecting to assignment server
[03:13:58] - Successful: assigned to (171.64.122.111).
[03:13:58] + News From Folding@Home: v.4 client available
[03:13:58] Loaded queue successfully.
[03:13:59] Couldn't send HTTP request to server (wininet)
[03:13:59] + Could not connect to Work Server
[03:13:59] - Error: Attempt #2 to get work failed, and no other work to do.
Waiting before retry.
[03:14:18] + Attempting to get work packet
[03:14:18] - Connecting to assignment server
[03:14:18] - Successful: assigned to (171.64.122.111).
[03:14:18] + News From Folding@Home: v.4 client available
[03:14:18] Loaded queue successfully.
[03:14:19] Couldn't send HTTP request to server (wininet)
[03:14:19] + Could not connect to Work Server
[03:14:19] - Error: Attempt #3 to get work failed, and no other work to do.
Waiting before retry.
[03:14:47] + Attempting to get work packet
[03:14:47] - Connecting to assignment server
[03:14:47] - Successful: assigned to (171.64.122.111).
[03:14:47] + News From Folding@Home: v.4 client available
[03:14:47] Loaded queue successfully.
[03:15:01] + Closed connections
[03:15:06]
[03:15:06] + Processing work unit
[03:15:06] Core required: FahCore_78.exe
[03:15:06] Core found.
[03:15:06] Working on Unit 02 [January 13 03:15:06]
[03:15:06] + Working ...
[03:15:06]
[03:15:06] *------------------------------*
[03:15:06] Folding@home Gromacs Core
[03:15:06] Version 1.55 (December 22, 2003)
[03:15:06]
[03:15:06] Preparing to commence simulation
[03:15:06] - Assembly optimizations manually forced on.
[03:15:06] - Not checking prior termination.
[03:15:06] - Expanded 195216 -> 947693 (decompressed 485.4 percent)
[03:15:06] - Starting from initial work packet
[03:15:06]
[03:15:06] Project: 1033 (Run 0, Clone 33, Gen 16)
[03:15:06]
[03:15:06] Assembly optimizations on if available.
[03:15:06] Entering M.D.
[03:15:12] Protein: p1033_A21unf_337_94
[03:15:12]
[03:15:12] Writing local files
[03:15:12] Extra SSE boost OK.
[03:15:12] Writing local files
[03:15:14] Completed 0 out of 2500000 steps (0)

I'll try deleting cores before I pull assembly optimizations. Production = suck without those anyway.
-drasnor
Deleting cores does nothing.
-drasnor