Questions and Answers :
Windows :
data transfer (upload) failure & re-tries
Message board moderation
Author | Message |
---|---|
Send message Joined: 29 Sep 10 Posts: 3 Credit: 203,683 RAC: 0 |
Hi. My computer finished a task titled hadam3p_eu_wa5c_1959_1_006808504_1. This was right around the time when the servers were down for relocation or maintenance or something over the weekend. So when the completed work could not be uploaded and kept retrying I just assumed it was because the servers were offline. Now that the servers are online again, the data is still not uploading. I also do Einstein@Home and that seems to upload fine. Any help is appreciated. Thanks. |
Send message Joined: 16 Jan 10 Posts: 1081 Credit: 7,068,231 RAC: 5,852 |
I've not been able to upload anything to climateapps1.oerc.ox.ac.uk and have reported that to the project team. A variety of HADAM3P and HADCM3N zip files did upload, but to other upload servers. If there's any news, I'll report back here. More likely, however, the machine will just start working again when someone flicks a metaphorical switch somewhere. |
Send message Joined: 31 Dec 07 Posts: 1152 Credit: 22,112,197 RAC: 2,623 |
I noticed that all through the outage I could upload anything except the 12.zip and 13.zip files from the hadam3p WUs. I still have 4 stuck of these (one 12.zip and three 13.zips) stuck in the transfer tabs of my 2 machines. Hopefully it will get sorted out soon. |
Send message Joined: 29 Sep 10 Posts: 3 Credit: 203,683 RAC: 0 |
Thanks for replying (both of you). Since I had never seen this happen before I was worried that it had something to do with my PC - glad to know this is a known issue. |
Send message Joined: 29 Sep 10 Posts: 3 Credit: 203,683 RAC: 0 |
How do you report this other than post here? |
Send message Joined: 29 Sep 04 Posts: 2363 Credit: 14,611,758 RAC: 0 |
Thanks for the report, Danish. Fortunately we did already know about this problem of some regional model (hadam3P) files. If you look at the News and Announcements thread at the top of the Number Crunching section you'll see a post about it. And there's a link in my signature. It's worth subscribing to that News thread for email notification of new posts (as long as in your account you have email notification enabled). When Jonathan gets the last server problems sorted out your files will upload. Having to wait a while in the Transfers tab won't do them any harm. Cpdn news |
Send message Joined: 31 Dec 07 Posts: 1152 Credit: 22,112,197 RAC: 2,623 |
As you can see the problem is: can't resolve hostname. I don?t know if this helps. 11/16/2011 1:44:24 PM Project communication failed: attempting access to reference site 11/16/2011 1:44:24 PM climateprediction.net Temporarily failed upload of hadam3p_eu_6i74_2007_1_007490853_0_12.zip: can't resolve hostname 11/16/2011 1:44:24 PM climateprediction.net Backing off 24 min 49 sec on upload of hadam3p_eu_6i74_2007_1_007490853_0_12.zip 11/16/2011 1:44:25 PM Internet access OK - project servers may be temporarily down. 11/16/2011 1:48:50 PM climateprediction.net Started upload of hadam3p_eu_6h23_2005_1_007533045_0_13.zip 11/16/2011 1:48:51 PM Project communication failed: attempting access to reference site 11/16/2011 1:48:51 PM climateprediction.net Temporarily failed upload of hadam3p_eu_6h23_2005_1_007533045_0_13.zip: can't resolve hostname 11/16/2011 1:48:51 PM climateprediction.net Backing off 3 hr 43 min 10 sec on upload of hadam3p_eu_6h23_2005_1_007533045_0_13.zip 11/16/2011 1:48:52 PM Internet access OK - project servers may be temporarily down. |
Send message Joined: 16 Jan 10 Posts: 1081 Credit: 7,068,231 RAC: 5,852 |
As you can see the problem is: can't resolve hostname. I don?t know if this helps. ... Looks like some changes are taking place: Wed Nov 16 06:37:54 2011 | climateprediction.net | Temporarily failed upload of hadam3p_eu_6523_2009_1_007468568_2_11.zip: connect() failed ... has become ... Wed Nov 16 15:14:46 2011 | climateprediction.net | Temporarily failed upload of hadam3p_eu_6523_2009_1_007468568_2_11.zip: can't resolve hostname |
Send message Joined: 31 Dec 07 Posts: 1152 Credit: 22,112,197 RAC: 2,623 |
Any progress with the: Can?t resolve host name problem? I still have four Hadam3p zip filed from finished WUs that I can?t upload. There is no point in running the WUs if we can?t send the results back. In fact the project will grind to a halt without the 13.zip files needed to generate the next segment of each model. |
Send message Joined: 16 Jan 10 Posts: 1081 Credit: 7,068,231 RAC: 5,852 |
Any progress with the: Can?t resolve host name problem? ... The project staff are waiting for the university's IT people to update the relevant DNS records. As I understand it, this will solve some of the upload problems but not all, as two old machines (climateapps1 and climateapps3) are to be replaced by new or better existing hardware. It's not clear to me whether there will be a window in which the old hardware can collect some uploads before being re-configured or replaced. |
Send message Joined: 19 Apr 08 Posts: 179 Credit: 4,306,992 RAC: 0 |
Any workaround for this? Such as an edit to client_state.xml. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Some editing may work. Or it may not. Both of these applied to the 16 zips that I had waiting. Backups: Here |
Send message Joined: 19 Apr 08 Posts: 179 Credit: 4,306,992 RAC: 0 |
Alrighty then! It works. I will post details in the number crunching section. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
When I said it may work, I meant changing oerc to oucs, NOT changing it to use a different upload server. In my case, this allowed all except 2 files to upload. All were zips 10-13. And for the last 2, both zip10, I had to change the url back to oerc. But the best advice is JUST BE PATIENT!!!! Backups: Here |
Send message Joined: 1 Sep 04 Posts: 3 Credit: 5,837,226 RAC: 0 |
But the best advice is JUST BE PATIENT!!!! [/quote] I have an additional problem with the 4 finished WUs that are stuck in my queue. Every time an upload is attempted, my network connection stops working - I can't can't get onto the internet at all until I disable/enable the connection. Is there any way I can stop BOINC Manager from attempting the upload? I tried the workaround listed in this thread and it worked for 6 or 8 other files that had been stuck, but these 4 refuse to budge. |
Send message Joined: 15 May 09 Posts: 4352 Credit: 16,576,710 RAC: 5,724 |
Different files go to different servers, there is one server not running @ the moment. If they are due to go there it would explain it. |
©2024 climateprediction.net