climateprediction.net home page
HadSM3MH - no final trickle and zip file upload

HadSM3MH - no final trickle and zip file upload

Message boards : Number crunching : HadSM3MH - no final trickle and zip file upload
Message board moderation

To post messages, you must log in.

AuthorMessage
old_user596405

Send message
Joined: 4 Oct 09
Posts: 73
Credit: 7,242,427
RAC: 0
Message 38772 - Posted: 27 Jan 2010, 9:20:50 UTC

Something not quite right with this model
http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=10497540
which finished at 14:41 on 26th. The final trickle has not been received (recorded in results) and the final zip file cannot get uploaded.

Here are the messages recorded when it finished. The failed messages are repeated as it keeps attempting to upload.
26/01/2010 14:41:10	climateprediction.net	Sending scheduler request: To send trickle-up message.
26/01/2010 14:41:10	climateprediction.net	Not reporting or requesting tasks
26/01/2010 14:41:11	climateprediction.net	Started upload of hadsm3mh_ks7e_006485486_8_4.zip
26/01/2010 14:41:13	climateprediction.net	Computation for task hadsm3mh_ks7e_006485486_8 finished
26/01/2010 14:41:32		Project communication failed: attempting access to reference site
26/01/2010 14:41:33		Internet access OK - project servers may be temporarily down.
26/01/2010 14:41:33	climateprediction.net	Temporarily failed upload of hadsm3mh_ks7e_006485486_8_4.zip: connect() failed
26/01/2010 14:41:33	climateprediction.net	Backing off 1 min 0 sec on upload of hadsm3mh_ks7e_006485486_8_4.zip
26/01/2010 14:41:35	climateprediction.net	Scheduler request failed: Couldn\'t connect to server
26/01/2010 14:42:33	climateprediction.net	Started upload of hadsm3mh_ks7e_006485486_8_4.zip
26/01/2010 14:42:35	climateprediction.net	Sending scheduler request: To send trickle-up message.
26/01/2010 14:42:35	climateprediction.net	Not reporting or requesting tasks
26/01/2010 14:42:55		Project communication failed: attempting access to reference site
26/01/2010 14:42:55	climateprediction.net	Temporarily failed upload of hadsm3mh_ks7e_006485486_8_4.zip: connect() failed
26/01/2010 14:42:55	climateprediction.net	Backing off 1 min 0 sec on upload of hadsm3mh_ks7e_006485486_8_4.zip
26/01/2010 14:42:56		Internet access OK - project servers may be temporarily down.
26/01/2010 14:42:57		Project communication failed: attempting access to reference site
26/01/2010 14:42:58		Internet access OK - project servers may be temporarily down.


Server status is ok. Several other HadSM3 and SM3MH models are tricklng up AND zip file uploading on other machines.
This suggests an issue with the specific PC but only CPDN related as it is successfully uploading completed WUs in WCG.

Ideas welcome!
ID: 38772 · Report as offensive     Reply Quote
Lockleys

Send message
Joined: 13 Jan 07
Posts: 195
Credit: 10,581,566
RAC: 0
Message 38773 - Posted: 27 Jan 2010, 10:09:21 UTC - in response to Message 38772.  

Something not quite right with this model
26/01/2010 14:42:55 Project communication failed: attempting access to reference site
26/01/2010 14:42:55 climateprediction.net Temporarily failed upload of hadsm3mh_ks7e_006485486_8_4.zip: connect() failed
26/01/2010 14:42:55 climateprediction.net Backing off 1 min 0 sec on upload of hadsm3mh_ks7e_006485486_8_4.zip
26/01/2010 14:42:56 Internet access OK - project servers may be temporarily down.
26/01/2010 14:42:57 Project communication failed: attempting access to reference site
26/01/2010 14:42:58 Internet access OK - project servers may be temporarily down.


I\'ve had several similar issues recently, including a reluctant 10 year zip with a 160 year CM this morning, but they have all cleared in the end. Sometimes in an hour and sometimes next day. I have noticed a correlation with other use of the Internet connection, almost as though it doesn\'t like sharing the bandwidth, even though I have 6meg connection. So I usually close down email client, web browser, etc if it happens before giving it another try.

Plus, cross fingers
ID: 38773 · Report as offensive     Reply Quote
old_user596405

Send message
Joined: 4 Oct 09
Posts: 73
Credit: 7,242,427
RAC: 0
Message 38774 - Posted: 27 Jan 2010, 10:24:46 UTC
Last modified: 27 Jan 2010, 10:29:28 UTC

EDIT (missed the 60 minute cutoff for editing a post) !

Posted the above having never experienced this before (and not finding same problem in forum).
Resolved by exiting BOINC. Did usual backup after every exit (note - always copy BOINC data folder to a different drive,
then copy back again to working drive). Restarted, hit \"retry now\" in transfer tab and, voila, it worked and final trickle duly recorded.

Very odd one.

EDIT No. 2 (after seeing Lockleys post).

Ok, but I thought 20 hours was a bit long to wait, considering no other connections (overnight). Perhaps some contention sharing BOINC with WCG?
Not to worry, another one bites the dust - phew!
ID: 38774 · Report as offensive     Reply Quote

Message boards : Number crunching : HadSM3MH - no final trickle and zip file upload

©2024 cpdn.org