PDA

View Full Version : a little help needed


PcCI2iminal
04-12-2007, 03:33 PM
hello
i new on F@H ,i joined the Team 20 days ago and i`m running a QX6700 @ 3200mhz with 1 client per core ,3 works still running without any troubles and with the last work i got some errors

here is the log

[23:21:19] Writing local files
[23:21:19] Completed 1455000 out of 1500000 steps (97)
[23:37:55] Writing local files
[23:37:55] Completed 1470000 out of 1500000 steps (98)
[23:54:20] Writing local files
[23:54:20] Completed 1485000 out of 1500000 steps (99)
[00:10:45] Writing local files
[00:10:45] Completed 1500000 out of 1500000 steps (100)
[00:10:45] Writing final coordinates.
[00:10:45] Past main M.D. loop
[00:11:45]
[00:11:45] Finished Work Unit:
[00:11:45] - Reading up to 261288 from "work/wudata_05.arc": Read 261288
[00:11:45] - Reading up to 38799740 from "work/wudata_05.xtc": Read 38799740
[00:11:46] goefile size: 0
[00:11:46] logfile size: 50882
[00:11:46] Leaving Run
[00:11:48] - Writing 40389338 bytes of core data to disk...
[00:12:10] Done: 40388826 -> 39286074 (compressed to 97.2 percent)
[00:12:11] ... Done.
[00:12:12] - Shutting down core
[00:12:12]
[00:12:12] Folding@home Core Shutdown: FINISHED_UNIT
[00:12:14] CoreStatus = 64 (100)
[00:12:14] Sending work to server


[00:12:14] + Attempting to send results
[00:12:16] Couldn't send HTTP request to server (wininet)
[00:12:16] + Could not connect to Work Server (results)
[00:12:16] (171.65.103.106:8080)
[00:12:16] - Error: Could not transmit unit 05 (completed April 12) to work server.
[00:12:16] Keeping unit 05 in queue.


[00:12:16] + Attempting to send results
[00:12:29] Couldn't send HTTP request to server (wininet)
[00:12:29] + Could not connect to Work Server (results)
[00:12:29] (171.65.103.106:8080)
[00:12:29] - Error: Could not transmit unit 05 (completed April 12) to work server.


[00:12:29] + Attempting to send results
[00:12:30] Couldn't send HTTP request to server (wininet)
[00:12:30] + Could not connect to Work Server (results)
[00:12:30] (171.65.103.100:8080)
[00:12:30] Could not transmit unit 05 to Collection server; keeping in queue.
[00:12:30] - Preparing to get new work unit...
[00:12:30] + Attempting to get work packet
[00:12:30] - Connecting to assignment server
[00:12:30] - Successful: assigned to (171.65.103.160).
[00:12:30] + News From Folding@Home: Welcome to Folding@Home
[00:12:31] Loaded queue successfully.


[00:12:33] + Attempting to send results
[00:12:40] Couldn't send HTTP request to server (wininet)
[00:12:40] + Could not connect to Work Server (results)
[00:12:40] (171.65.103.106:8080)
[00:12:40] - Error: Could not transmit unit 05 (completed April 12) to work server.


[00:12:40] + Attempting to send results
[00:12:41] Couldn't send HTTP request to server (wininet)
[00:12:41] + Could not connect to Work Server (results)
[00:12:41] (171.65.103.100:8080)
[00:12:41] Could not transmit unit 05 to Collection server; keeping in queue.
[00:12:41] + Closed connections
[00:12:41]
[00:12:41] + Processing work unit
[00:12:41] Core required: FahCore_78.exe
[00:12:41] Core found.
[00:12:41] Working on Unit 06 [April 12 00:12:41]
[00:12:41] + Working ...
[00:12:41]
[00:12:41] *------------------------------*
[00:12:41] Folding@Home Gromacs Core
[00:12:41] Version 1.90 (March 8, 2006)
[00:12:41]
[00:12:41] Preparing to commence simulation
[00:12:41] - Looking at optimizations...
[00:12:41] - Created dyn
[00:12:41] - Files status OK
[00:12:42] - Expanded 292157 -> 1461493 (decompressed 500.2 percent)
[00:12:42] - Starting from initial work packet
[00:12:42]
[00:12:42] Project: 3038 (Run 0, Clone 158, Gen 14)
[00:12:42]
[00:12:42] Assembly optimizations on if available.
[00:12:42] Entering M.D.
[00:12:48] Protein: p3038_supervillin-03
[00:12:48]
[00:12:48] Writing local files
[00:12:48] Extra SSE boost OK.
[00:12:48] Writing local files
[00:12:48] Completed 0 out of 5000000 steps (0)
[00:25:34] Writing local files
[00:25:34] Completed 50000 out of 5000000 steps (1)

-----------------------------------------------------------------------

[02:58:52] Completed 650000 out of 5000000 steps (13)


[03:05:14] + Attempting to send results
[03:05:27] Couldn't send HTTP request to server (wininet)
[03:05:27] + Could not connect to Work Server (results)
[03:05:27] (171.65.103.106:8080)
[03:05:27] - Error: Could not transmit unit 05 (completed April 12) to work server.


[03:05:27] + Attempting to send results
[03:07:13] - Server does not have record of this unit. Will try again later.
[03:07:13] Could not transmit unit 05 to Collection server; keeping in queue.
[03:11:39] Writing local files
[03:11:39] Completed 700000 out of 5000000 steps (14)
[03:24:27] Writing local files


------------------------------------------------------------------



[08:56:43] Completed 2050000 out of 5000000 steps (41)


[09:07:13] + Attempting to send results
[09:07:15] Couldn't send HTTP request to server (wininet)
[09:07:15] + Could not connect to Work Server (results)
[09:07:15] (171.65.103.106:8080)
[09:07:15] - Error: Could not transmit unit 05 (completed April 12) to work server.


[09:07:15] + Attempting to send results
[09:09:00] - Server does not have record of this unit. Will try again later.
[09:09:00] Could not transmit unit 05 to Collection server; keeping in queue.
[09:09:32] Writing local files
[09:09:32] Completed 2100000 out of 5000000 steps (42)
[09:22:22] Writing local files
[09:22:22] Completed 2150000 out of 5000000 steps (43)

----------------------------------------------------------------

[15:04:44] Completed 3450000 out of 5000000 steps (69)


[15:09:00] + Attempting to send results
[15:09:03] Couldn't send HTTP request to server (wininet)
[15:09:03] + Could not connect to Work Server (results)
[15:09:03] (171.65.103.106:8080)
[15:09:03] - Error: Could not transmit unit 05 (completed April 12) to work server.


[15:09:03] + Attempting to send results
[15:10:39] - Unknown packet returned from server, expected ACK for results
[15:10:39] Could not transmit unit 05 to Collection server; keeping in queue.
[15:17:46] Writing local files
[15:17:46] Completed 3500000 out of 5000000 steps (70)

--------------------------------------------------------------------------

[21:41:53] Completed 5000000 out of 5000000 steps (100)
[21:41:53] Writing final coordinates.
[21:41:53] Past main M.D. loop
[21:42:53]
[21:42:53] Finished Work Unit:
[21:42:53] - Reading up to 232536 from "work/wudata_06.arc": Read 232536
[21:42:53] - Reading up to 453396 from "work/wudata_06.xtc": Read 453396
[21:42:53] goefile size: 0
[21:42:53] logfile size: 249302
[21:42:53] Leaving Run
[21:42:56] - Writing 1132958 bytes of core data to disk...
[21:42:56] ... Done.
[21:42:56] - Shutting down core
[21:42:56]
[21:42:56] Folding@home Core Shutdown: FINISHED_UNIT
[21:42:59] CoreStatus = 64 (100)
[21:42:59] Sending work to server


[21:42:59] + Attempting to send results
[21:42:59] Error: Got status code 503 from server
[21:42:59] + Could not connect to Work Server (results)
[21:42:59] (171.65.103.160:8080)
[21:42:59] - Error: Could not transmit unit 06 (completed April 12) to work server.
[21:42:59] Keeping unit 06 in queue.

what it seems to be?

Chas_The_Man
04-12-2007, 04:19 PM
I have had that before. It eventualy went away for me though. I just started and stopped the client about 100 times before they accepted the units. I think its an issue on there end, not yours. Error 503 means they are busy normally.

embeejay
04-12-2007, 04:26 PM
you should really switch to the smp client - your ppd will probably double or more if you do :)

Chas_The_Man
04-12-2007, 06:48 PM
Yeah, well, I, Like countless others, try to download it and get a CRC error extracting it. But every time someone complains about it, 500 people who didnt get the CRC error say that it can't possibly be the fault of the file on Stanford's site. Well, I dunno, I get the error on my two machines. Its possible it is network or ISP related. Whatever the case, I cant run it as it won't install.

I asked, three days ago for someone to email it to me and see if that works but noone responded.

On another note, oner of my cores is getting 503's. OP: U R NOT ALONE!

jimwah
04-13-2007, 12:50 AM
Hmm it's just having a problem getting at the servers, a quick look at the server status page here (http://fah-web.stanford.edu/serverstat.html) shows that at least some of the IP's it tryed are up & accepting units right now.

Try starting the offending client with the -send all argument added if you can, which should just attempt to send all stored units & then exit. Otherwise it will keep trying as it folds the next unit (which it's obviously downloaded ok).

Bear in mind if you consider going SMP, that it has big units, and therefore quite a long upload time (depending on your connection) I think Sparky mentioned they were ~20mb for WinSMP which is pretty painful on my 256k up speed, and completely saturates my bandwidth for almost 15mins sometimes. Linux SMP seems a lot less demanding in terms of uploads :shrug:

> Chas ygpm :)

PcCI2iminal
04-13-2007, 04:52 AM
thx for all replys :)

I just started and stopped the client about 100 times before they accepted the units. I think its an issue on there end, not yours. Error 503 means they are busy normally.

good to know that issue is from the server :)
well i will try that stop/start

Hmm it's just having a problem getting at the servers, a quick look at the server status page here shows that at least some of the IP's it tryed are up & accepting units right now

thx for that info

PcCI2iminal
04-13-2007, 04:57 AM
you should really switch to the smp client - your ppd will probably double or more if you do :)

i will do it ;)

thanks & keep folding

PcCI2iminal
04-13-2007, 05:06 AM
Yeah, well, I, Like countless others, try to download it and get a CRC error extracting it. But every time someone complains about it, 500 people who didnt get the CRC error say that it can't possibly be the fault of the file on Stanford's site. Well, I dunno, I get the error on my two machines. Its possible it is network or ISP related. Whatever the case, I cant run it as it won't install.

I asked, three days ago for someone to email it to me and see if that works but noone responded.

On another note, oner of my cores is getting 503's. OP: U R NOT ALONE!


give me your email and i will send it to you

;)

Chas_The_Man
04-13-2007, 09:41 AM
I have received the SMP file via email. I cant wait to get home and give it a go. Thanks for the offer.

I would have emailed it to myself but I dont like to communicate with my personal email at work if at all possible.

PcCI2iminal
04-13-2007, 01:49 PM
I would have emailed it to myself but I dont like to communicate with my personal email at work if at all possible.

hehehe
;)


i have tried many times to close/open the client and got the same Error 503



i quit those four consoles and now i`m in SMP

http://img217.imageshack.us/img217/4599/smpfheb2.jpg

embeejay
04-13-2007, 03:15 PM
nice :)

Chas_The_Man
04-13-2007, 08:12 PM
I got SMP up now too. Although I have 4 units in the other FAH app. Ill do a couple with smp and then go back and finish them. One of these must be worth about twice what four of the others are worth??? Is that true? That would be like 2400 points or something.

jimwah
04-14-2007, 03:26 AM
Check out any units labelled GRO-SMP here : http://fah-web.stanford.edu/psummary.html They are worth big points, mainly due to the big upload & possible instability as it's still early days for this client, and they have a relatively tight deadlines too. But nice ppd :)

PcCI2iminal what's the CPU utilisation like on the QX6700, does it hit 100% on all 4 threads OK?

PcCI2iminal
04-14-2007, 05:30 AM
a nice boost on my points
yesterday->17900
today->19664


PcCI2iminal what's the CPU utilisation like on the QX6700, does it hit 100% on all 4 threads OK?

yup ,4 cores @ full load ;)

keep folding

Chas_The_Man
04-14-2007, 10:19 AM
Looks like I will find out in about 2 more hours :-)