Severe Folding instability.

drasnor Starship Operator Hawthorne, CA Icrontian
edited January 2004 in Folding@Home
OK, what would make Folding really unstable on a dual Opteron system? It's more stable under SSE than under 3DNow!, but it fails either way. It isn't overclocked or anything, and I'm stumped.

-drasnor :fold:

Here's 3DNow!:
--- Opening Log file [January 13 02:56:16] 


# Windows Console Edition #####################################################
###############################################################################

                       Folding@home Client Version 4.00

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Arguments: -local -forceasm -advmethods -service 

Warning:
 By using the -forceasm flag, you are overriding
 safeguards in the program. If you did not intend to
 do this, please restart the program without -forceasm.
 If work units are not completing fully (and particularly
 if your machine is overclocked), then please discontinue
 use of the flag.

[02:56:16] - Ask before connecting: No
[02:56:16] - Use IE connection settings: Yes
[02:56:16] - User name: drasnor (Team 93)
[02:56:16] - User ID = 414862F622BE5569
[02:56:16] - Machine ID: 2
[02:56:16] 
[02:56:16] Work directory not found. Creating...
[02:56:16] Could not open work queue, generating new queue...
[02:56:16] + Benchmarking ...
[02:56:18] - Preparing to get new work unit...
[02:56:18] + Attempting to get work packet
[02:56:18] - Connecting to assignment server
[02:56:19] - Successful: assigned to (171.67.89.151).
[02:56:19] + News From Folding@Home: v.4 client available
[02:56:19] Loaded queue successfully.
[02:57:26] + Closed connections
[02:57:26] 
[02:57:26] + Processing work unit
[02:57:26] Core required: FahCore_78.exe
[02:57:26] Core found.
[02:57:26] Working on Unit 01 [January 13 02:57:26]
[02:57:26] + Working ...
[02:57:26] 
[02:57:26] *------------------------------*
[02:57:26] Folding@home Gromacs Core
[02:57:26] Version 1.55 (December 22, 2003)
[02:57:26] 
[02:57:26] Preparing to commence simulation
[02:57:26] - Assembly optimizations manually forced on.
[02:57:26] - Not checking prior termination.
[02:57:27] - Expanded 509979 -> 2524925 (decompressed 495.1 percent)
[02:57:27] - Starting from initial work packet
[02:57:27] 
[02:57:27] Project: 920 (Run 51, Clone 24, Gen 19)
[02:57:27] 
[02:57:27] Assembly optimizations on if available.
[02:57:27] Entering M.D.
[02:57:33] Protein: p920_vpf910
[02:57:33] 
[02:57:33] Writing local files
[02:57:34] Extra 3DNow boost OK.
[02:57:34] Writing local files
[02:57:36] Completed 0 out of 250000 steps  (0)
[02:57:39] Quit 101 - Fatal error: 
[02:57:39] Step 1, time 0.002 (ps)  LINCS WARNING
[02:57:39] relative constraint deviation after LINCS:
[02:57:39] max 0.000000 (between atoms 1 and 2) rms 1.#QNAN0
[02:57:39] 
[02:57:39] Simulation instability has been encountered. The run has entered a
[02:57:39]   state from which no further progress can be made.
[02:57:39] If you often see other project units terminating early like this
[02:57:39]   too, you may wish to check the stability of your computer (issues
[02:57:39]   such as high temperature, overclocking, etc.).
[02:57:39] Going to send back what have done.
[02:57:39] logfile size: 6854
[02:57:39] - Writing 7525 bytes of core data to disk...
[02:57:39]   ... Done.
[02:57:39] 
[02:57:39] Folding@home Core Shutdown: EARLY_UNIT_END
[02:57:42] CoreStatus = 72 (114)
[02:57:42] Sending work to server


[02:57:42] + Attempting to send results
[02:57:47] + Results successfully sent
[02:57:47] Thank you for your contribution to Folding@home.
[02:57:51] - Preparing to get new work unit...
[02:57:51] + Attempting to get work packet
[02:57:51] - Connecting to assignment server
[02:57:53] - Successful: assigned to (171.67.89.151).
[02:57:53] + News From Folding@Home: v.4 client available
[02:57:53] Loaded queue successfully.
[02:58:48] + Closed connections
[02:58:53] 
[02:58:53] + Processing work unit
[02:58:53] Core required: FahCore_78.exe
[02:58:53] Core found.
[02:58:53] Working on Unit 02 [January 13 02:58:53]
[02:58:53] + Working ...
[02:58:53] 
[02:58:53] *------------------------------*
[02:58:53] Folding@home Gromacs Core
[02:58:53] Version 1.55 (December 22, 2003)
[02:58:53] 
[02:58:53] Preparing to commence simulation
[02:58:53] - Assembly optimizations manually forced on.
[02:58:53] - Not checking prior termination.
[02:58:54] - Expanded 512103 -> 2549549 (decompressed 497.8 percent)
[02:58:54] - Starting from initial work packet
[02:58:54] 
[02:58:54] Project: 921 (Run 11, Clone 22, Gen 13)
[02:58:54] 
[02:58:54] Assembly optimizations on if available.
[02:58:54] Entering M.D.
[02:59:00] Protein: p921_vpf912
[02:59:00] 
[02:59:00] Writing local files
[02:59:01] Extra 3DNow boost OK.
[02:59:01] Writing local files
[02:59:03] Completed 0 out of 250000 steps  (0)
[02:59:05] Quit 101 - Fatal error: 
[02:59:05] Step 1, time 0.002 (ps)  LINCS WARNING
[02:59:05] relative constraint deviation after LINCS:
[02:59:05] max 0.000000 (between atoms 1 and 2) rms 1.#QNAN0
[02:59:05] 
[02:59:05] Simulation instability has been encountered. The run has entered a
[02:59:05]   state from which no further progress can be made.
[02:59:05] If you often see other project units terminating early like this
[02:59:05]   too, you may wish to check the stability of your computer (issues
[02:59:05]   such as high temperature, overclocking, etc.).
[02:59:05] Going to send back what have done.
[02:59:05] logfile size: 7232
[02:59:05] - Writing 7903 bytes of core data to disk...
[02:59:05]   ... Done.
[02:59:05] 
[02:59:05] Folding@home Core Shutdown: EARLY_UNIT_END
[02:59:09] CoreStatus = 72 (114)
[02:59:09] Sending work to server


[02:59:09] + Attempting to send results
[02:59:13] + Results successfully sent
[02:59:13] Thank you for your contribution to Folding@home.
[02:59:17] - Preparing to get new work unit...
[02:59:17] + Attempting to get work packet
[02:59:17] - Connecting to assignment server
[02:59:18] - Successful: assigned to (171.67.89.151).
[02:59:18] + News From Folding@Home: v.4 client available
[02:59:18] Loaded queue successfully.
[03:00:08] + Closed connections
[03:00:13] 
[03:00:13] + Processing work unit
[03:00:13] Core required: FahCore_78.exe
[03:00:13] Core found.
[03:00:13] Working on Unit 03 [January 13 03:00:13]
[03:00:13] + Working ...
[03:00:13] 
[03:00:13] *------------------------------*
[03:00:13] Folding@home Gromacs Core
[03:00:13] Version 1.55 (December 22, 2003)
[03:00:13] 
[03:00:13] Preparing to commence simulation
[03:00:13] - Assembly optimizations manually forced on.
[03:00:13] - Not checking prior termination.
[03:00:13] - Expanded 513033 -> 2549549 (decompressed 496.9 percent)
[03:00:13] - Starting from initial work packet
[03:00:13] 
[03:00:13] Project: 921 (Run 11, Clone 23, Gen 31)
[03:00:13] 
[03:00:13] Assembly optimizations on if available.
[03:00:13] Entering M.D.
[03:00:19] Protein: p921_vpf912
[03:00:19] 
[03:00:19] Writing local files
[03:00:21] Extra 3DNow boost OK.
[03:00:21] Writing local files
[03:00:23] Completed 0 out of 250000 steps  (0)
[03:00:25] Quit 101 - Fatal error: 
[03:00:25] Step 1, time 0.002 (ps)  LINCS WARNING
[03:00:25] relative constraint deviation after LINCS:
[03:00:25] max 0.000000 (between atoms 1 and 2) rms 1.#QNAN0
[03:00:25] 
[03:00:25] Simulation instability has been encountered. The run has entered a
[03:00:25]   state from which no further progress can be made.
[03:00:25] If you often see other project units terminating early like this
[03:00:25]   too, you may wish to check the stability of your computer (issues
[03:00:25]   such as high temperature, overclocking, etc.).
[03:00:25] Going to send back what have done.
[03:00:25] logfile size: 7232
[03:00:25] - Writing 7903 bytes of core data to disk...
[03:00:25]   ... Done.
[03:00:25] 
[03:00:25] Folding@home Core Shutdown: EARLY_UNIT_END
[03:00:29] CoreStatus = 72 (114)
[03:00:29] Sending work to server


[03:00:29] + Attempting to send results
[03:00:31] + Results successfully sent
[03:00:31] Thank you for your contribution to Folding@home.
[03:00:35] - Preparing to get new work unit...
[03:00:35] + Attempting to get work packet
[03:00:35] - Connecting to assignment server
[03:00:37] - Successful: assigned to (171.67.89.151).
[03:00:37] + News From Folding@Home: v.4 client available
[03:00:37] Loaded queue successfully.

I deleted the SSE log, but I'll post a new one later. It looks pretty much the same, though.

Comments

  • mmonnin Centreville, VA
    edited January 2004
    Have you tried deleting the core or client?
  • csimon Acadiana Icrontian
    edited January 2004
    Arguments: -local -forceasm -advmethods -service

    Warning:
    By using the -forceasm flag, you are overriding
    safeguards in the program. If you did not intend to
    do this, please restart the program without -forceasm.
    If work units are not completing fully (and particularly
    if your machine is overclocked), then please discontinue
    use of the flag.

    For starters, try removing the -forceasm flag.
    [02:59:05] Quit 101 - Fatal error:
    [02:59:05] Step 1, time 0.002 (ps) LINCS WARNING
    [02:59:05] relative constraint deviation after LINCS:
    [02:59:05] max 0.000000 (between atoms 1 and 2) rms 1.#QNAN0

    This appears suspicious.
  • drasnor Starship Operator Hawthorne, CA Icrontian
    edited January 2004
    ForceSSE does this:
    --- Opening Log file [January 13 03:05:54] 
    
    
    # Windows Console Edition #####################################################
    ###############################################################################
    
                           Folding@home Client Version 4.00
    
                              http://folding.stanford.edu
    
    ###############################################################################
    ###############################################################################
    
    Arguments: -local -forceSSE -advmethods -service 
    
    Warning:
     By using the -forceSSE flag, you are overriding program
     safeguards that monitor the stability of SSE
     instructions on your system. If you did not intend
     to do this, please restart the program without
     -forceSSE. If work units are not completing fully,
     then please discontinue use of the flag.
    
    [03:05:54] - Ask before connecting: No
    [03:05:54] - Use IE connection settings: Yes
    [03:05:54] - User name: drasnor (Team 93)
    [03:05:54] - User ID = 414862F622BE5569
    [03:05:54] - Machine ID: 1
    [03:05:54] 
    [03:05:54] Work directory not found. Creating...
    [03:05:54] Could not open work queue, generating new queue...
    [03:05:54] + Benchmarking ...
    [03:05:56] - Preparing to get new work unit...
    [03:05:56] + Attempting to get work packet
    [03:05:56] - Connecting to assignment server
    [03:05:56] - Successful: assigned to (171.67.89.151).
    [03:05:56] + News From Folding@Home: v.4 client available
    [03:05:57] Loaded queue successfully.
    [03:06:29] + Closed connections
    [03:06:29] 
    [03:06:29] + Processing work unit
    [03:06:29] Core required: FahCore_78.exe
    [03:06:29] Core found.
    [03:06:29] Working on Unit 01 [January 13 03:06:29]
    [03:06:29] + Working ...
    [03:06:29] 
    [03:06:29] *------------------------------*
    [03:06:29] Folding@home Gromacs Core
    [03:06:29] Version 1.55 (December 22, 2003)
    [03:06:29] 
    [03:06:29] Preparing to commence simulation
    [03:06:29] - Assembly optimizations manually forced on.
    [03:06:29] - Not checking prior termination.
    [03:06:30] - Expanded 508867 -> 2524925 (decompressed 496.1 percent)
    [03:06:30] - Starting from initial work packet
    [03:06:30] 
    [03:06:30] Project: 920 (Run 4, Clone 23, Gen 25)
    [03:06:30] 
    [03:06:30] Assembly optimizations on if available.
    [03:06:30] Entering M.D.
    [03:06:36] Protein: p920_vpf910
    [03:06:36] 
    [03:06:36] Writing local files
    [03:06:37] Extra SSE boost OK.
    [03:06:37] Writing local files
    [03:06:39] Completed 0 out of 250000 steps  (0)
    [03:10:55] Writing local files
    [03:10:57] Completed 2500 out of 250000 steps  (1)
    [03:13:37] Quit 101 - Fatal error: 
    [03:13:37] Step 4099, time 8.198 (ps)  LINCS WARNING
    [03:13:37] relative constraint deviation after LINCS:
    [03:13:37] max 0.000000 (between atoms 1 and 2) rms 1.#QNAN0
    [03:13:37] 
    [03:13:37] Simulation instability has been encountered. The run has entered a
    [03:13:37]   state from which no further progress can be made.
    [03:13:37] If you often see other project units terminating early like this
    [03:13:37]   too, you may wish to check the stability of your computer (issues
    [03:13:37]   such as high temperature, overclocking, etc.).
    [03:13:37] Going to send back what have done.
    [03:13:37] logfile size: 8648
    [03:13:37] - Writing 9322 bytes of core data to disk...
    [03:13:37]   ... Done.
    [03:13:37] 
    [03:13:37] Folding@home Core Shutdown: EARLY_UNIT_END
    [03:13:39] CoreStatus = 72 (114)
    [03:13:39] Sending work to server
    
    
    [03:13:39] + Attempting to send results
    [03:13:41] + Results successfully sent
    [03:13:41] Thank you for your contribution to Folding@home.
    [03:13:45] - Preparing to get new work unit...
    [03:13:45] + Attempting to get work packet
    [03:13:45] - Connecting to assignment server
    [03:13:45] - Successful: assigned to (171.64.122.111).
    [03:13:45] + News From Folding@Home: v.4 client available
    [03:13:45] Loaded queue successfully.
    [03:13:45] Couldn't send HTTP request to server (wininet)
    [03:13:45] + Could not connect to Work Server
    [03:13:45] - Error: Attempt #1  to get work failed, and no other work to do.
                 Waiting before retry.
    [03:13:58] + Attempting to get work packet
    [03:13:58] - Connecting to assignment server
    [03:13:58] - Successful: assigned to (171.64.122.111).
    [03:13:58] + News From Folding@Home: v.4 client available
    [03:13:58] Loaded queue successfully.
    [03:13:59] Couldn't send HTTP request to server (wininet)
    [03:13:59] + Could not connect to Work Server
    [03:13:59] - Error: Attempt #2  to get work failed, and no other work to do.
                 Waiting before retry.
    [03:14:18] + Attempting to get work packet
    [03:14:18] - Connecting to assignment server
    [03:14:18] - Successful: assigned to (171.64.122.111).
    [03:14:18] + News From Folding@Home: v.4 client available
    [03:14:18] Loaded queue successfully.
    [03:14:19] Couldn't send HTTP request to server (wininet)
    [03:14:19] + Could not connect to Work Server
    [03:14:19] - Error: Attempt #3  to get work failed, and no other work to do.
                 Waiting before retry.
    [03:14:47] + Attempting to get work packet
    [03:14:47] - Connecting to assignment server
    [03:14:47] - Successful: assigned to (171.64.122.111).
    [03:14:47] + News From Folding@Home: v.4 client available
    [03:14:47] Loaded queue successfully.
    [03:15:01] + Closed connections
    [03:15:06] 
    [03:15:06] + Processing work unit
    [03:15:06] Core required: FahCore_78.exe
    [03:15:06] Core found.
    [03:15:06] Working on Unit 02 [January 13 03:15:06]
    [03:15:06] + Working ...
    [03:15:06] 
    [03:15:06] *------------------------------*
    [03:15:06] Folding@home Gromacs Core
    [03:15:06] Version 1.55 (December 22, 2003)
    [03:15:06] 
    [03:15:06] Preparing to commence simulation
    [03:15:06] - Assembly optimizations manually forced on.
    [03:15:06] - Not checking prior termination.
    [03:15:06] - Expanded 195216 -> 947693 (decompressed 485.4 percent)
    [03:15:06] - Starting from initial work packet
    [03:15:06] 
    [03:15:06] Project: 1033 (Run 0, Clone 33, Gen 16)
    [03:15:06] 
    [03:15:06] Assembly optimizations on if available.
    [03:15:06] Entering M.D.
    [03:15:12] Protein: p1033_A21unf_337_94
    [03:15:12] 
    [03:15:12] Writing local files
    [03:15:12] Extra SSE boost OK.
    [03:15:12] Writing local files
    [03:15:14] Completed 0 out of 2500000 steps  (0)
    

    I'll try deleting cores before I pull assembly optimizations. Production = suck without those anyway.

    -drasnor :fold:
  • csimon Acadiana Icrontian
    edited January 2004
    Did this just start suddenly, or what?
  • drasnor Starship Operator Hawthorne, CA Icrontian
    edited January 2004
    Well, I started checking it a bunch today, since I had a thermal shutdown this afternoon. Stupid MSI stock sinks, I can't wait for my 2 Swiftech MCX478+ to get here. Core temp on the primary is 53 C, secondary is 57 C.

    Deleting cores does nothing.

    -drasnor :fold:
  • mmonnin Centreville, VA
    edited January 2004
    It's too hot.