new Folding@Home 4.0 final
csimon
Acadiana Icrontian
You can get the new Fah4 client here!
This is the final release non-beta or prerelease.
Thanks to Guha!!!
# Windows Console Edition ########################
##########################################
Folding@home Client Version 4.00
http://folding.stanford.edu
##########################################
##########################################
This is the final release non-beta or prerelease.
Thanks to Guha!!!
# Windows Console Edition ########################
##########################################
Folding@home Client Version 4.00
http://folding.stanford.edu
##########################################
##########################################
0
Comments
The one thing that this final client has over the older prerelease is that the flag -forceSSE is no longer case sensitive!
so your amd will now accept -forcesse.
And does it use the SSE2 extensions or not?
(Same WU, I just shut down and installed the 4.0 instead of 3.25 then ran it through a few frames before checking frame time again)
//edit: Thrax, it still says "Extra SSE boost OK." when SSE is turned on on mine.
Second, I ended up using -forcesse -forceasm -advmethods on both the Barton and the Intel P4 boxes for best overall performance with 4.0.
GHoosdum, your Barton box is a better candidate for 4.0 than the XP2100+. Yes, 4.0 can use some of the SSE2 extensions, though I doubt it uses all by default as if it DID, the forceSSE would hav more bang on the P4 speed than it did. BUT, it checks also to see if it needs to send more often than the version 3 client did, and it does indeed truely try to send if there are completed WUs in work queue-- Vijay said it was set to basicly do a send attempt about every 1\3 WU if there were completed WUs in queue. When my router gets here, will try again and see if can send better, and just let the client handle resends if it logs them for me in final. I guess Monday at latest will be cycling WUs again, possibly late Monday.
Note that forcesse and forceasm are off by default, also, in the graphical client, and this is another clue to me that some SSE2 is being used as the processors that were having issues that I know of do not have a full SSE2 set builtin.
Overall, I was doing WUs of 1000 series in 3\4 the time of older Gromacs, and getting more of the newer ones than the old ones. Expect the new client\core to show percentage on new WU projects, not frames. They are 2.5 Million step workunits, so they do 25,000 steps per percentage point (1/100th of a WU instead of 1/400th for a 400 frame WU where reports are in frames) and they also do more per percentage than what was spoken of as frames in same time frame. I was turning in new Gromacs projects FASTER also, and the new Core_78 subversion picked up the pace some on these WUs when used with the new Pres for 4..0 even, maybe 5-10% improvement, think the CORE is using more SSE2 and SSE if available from client overall config and benchmarking and SSE validation.
So, to get a real speed test, do maybe 15-20% of a WU, take overall time versus similar project on older client from past logs. BUT, would test on Barton first, and expect less improvement, though some, on the XP2100+ than on the Barton. Figure on newer WUs that first percent will probably take LONGER(maybe 2 min longer or 18% longer, looks like the thing is self-testing some as it calcs, I had 12-13 min for first percentage, more like under 10 about 1/4 way into new WU series of 1000 numbering, and last percentage was very close to the one 1/4 of the way in) than one 25% of the way in, and that percent 100 should be about same time to 15 seconds variance from the one at 1/4 done WU.
Does that help explain some???
John.
[edit]yeah...[/edit]
The box I was checking it on at first is the P4 box at work (hence the SSE2 question) - I haven't installed the 4.0 client on either my Barton or my TBred boxes at home yet, but I will.
On a sidenote, I'll get the new 92MM 60 CFM fan for my Primary rig tomorrow, so I can OC it again! Woohoo!
on my p4 & xeons I use -advmethods
that way on all I get the extra SSE boost ok@