21)
Message boards :
Number crunching :
Project communication failed: attempting access to reference site
(Message 54803)
Posted 20 Sep 2016 by Thyme Lawn Post: Thanks Vitalii, that's definitely appears to be a server problem (there are no [http_xfer] messages when the file content should be being transmitted to the server). I'll notify the project team. |
22)
Message boards :
Number crunching :
Project communication failed: attempting access to reference site
(Message 54799)
Posted 20 Sep 2016 by Thyme Lawn Post: That contains this line: That's not where the problem lies Les. The 302 is the response from google.com to BOINC's internet connectivity check telling it to redirect the request to google.com.ua. The relevant lines from Vitali's output are: 19.09.2016 22:49:38 | climateprediction.net | [http] [ID#9] Info: Connected to upload6.cpdn.org (158.97.9.11) port 80 (#35) 19.09.2016 22:49:38 | climateprediction.net | [http] [ID#9] Sent header to server: Content-Length: 4540690 19.09.2016 22:49:38 | climateprediction.net | [http] [ID#9] Sent header to server: Expect: 100-continue 19.09.2016 22:49:39 | climateprediction.net | [http] [ID#9] Received header from server: HTTP/1.1 100 Continue 19.09.2016 22:50:05 | climateprediction.net | [http] [ID#9] Info: Recv failure: Connection was reset 19.09.2016 22:50:05 | climateprediction.net | [http] [ID#9] Info: Closing connection 35 19.09.2016 22:50:05 | climateprediction.net | [http] HTTP error: Failure when receiving data from the peer The "HTTP/1.1 100 Continue" response indicates that an HTTP/1.1 connection has been successfully made to the upload server, the file offset to start uploading from has been determined and BOINC has started sending 4,540,690 bytes of data (if the file being sent is larger than that it's indicating that some data has successfully been transferred to upload6). The next line is indicating that the connection was reset 26 seconds later due to a receive failure. My guess is that it's indicating a problem on that upload server (upload6.cpdn.org (158.97.9.11)). Enabling "http_xfer_debug" will directly indicate whether any data has being transferred. I didn't suggest it earlier because it generates a message for each section of data sent to the server which very quickly mounts up (in my case it's about 180 messages per second for an upload). The messages don't identify the project and the ID# is the only way to tie them to the original request (i.e. enabling http_xfer_debug with http_debug disabled is meaningless). The following messages are selected from my previous post with a couple of http_xfer_debug messages included and lots of lines removed. The first POST sequence determines how many bytes the server has already received and the second is transferring the file from that offset: 19/09/2016 11:11:32 | climateprediction.net | Started upload of wah2_sas50_ky74_200912_13_448_010718036_0_r921421034_1.zip 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Info: Connected to upload2.cpdn.org (129.67.195.140) port 80 (#627) 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Sent header to server: POST /cgi-bin/file_upload_handler HTTP/1.1 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Sent header to server: Content-Length: 311 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Info: We are completely uploaded and fine 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Received header from server: HTTP/1.1 200 OK 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Info: Connection #627 to host upload2.cpdn.org left intact 19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Info: Re-using existing connection! (#627) with host upload2.cpdn.org 19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Info: Connected to upload2.cpdn.org (129.67.195.140) port 80 (#627) 19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: POST /cgi-bin/file_upload_handler HTTP/1.1 19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: Content-Length: 23198108 19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: Expect: 100-continue 19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Received header from server: HTTP/1.1 100 Continue 19/09/2016 11:11:35 | | [http_xfer] [ID#9825] HTTP: wrote 2956 bytes 19/09/2016 11:11:35 | | [http_xfer] [ID#9825] HTTP: wrote 2441 bytes 19/09/2016 11:18:04 | climateprediction.net | [http] [ID#9825] Info: We are completely uploaded and fine 19/09/2016 11:18:06 | climateprediction.net | [http] [ID#9825] Received header from server: HTTP/1.1 200 OK 19/09/2016 11:18:07 | climateprediction.net | Finished upload of wah2_sas50_ky74_200912_13_448_010718036_0_r921421034_1.zip |
23)
Message boards :
Number crunching :
Project communication failed: attempting access to reference site
(Message 54794)
Posted 19 Sep 2016 by Thyme Lawn Post: In BOINC Manager you can enable debugging of HTTP conversations by typing <Ctrl>+<Shift>+F, clicking on the "http_debug" checkbox to make it ticked and clicking OK (to disable the debug repeat that process, this time removing the "http_debug" checkbox tick). A successful file transfer will result in an event log sequence similar to the following: 19/09/2016 11:11:32 | climateprediction.net | Started upload of wah2_sas50_ky74_200912_13_448_010718036_0_r921421034_1.zip 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Info: Trying 129.67.195.140... 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Info: Connected to upload2.cpdn.org (129.67.195.140) port 80 (#627) 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Sent header to server: POST /cgi-bin/file_upload_handler HTTP/1.1 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Sent header to server: Host: upload2.cpdn.org 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Sent header to server: User-Agent: BOINC client (windows_x86_64 7.6.22) 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Sent header to server: Accept: */* 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Sent header to server: Accept-Encoding: deflate, gzip 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Sent header to server: Content-Type: application/x-www-form-urlencoded 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Sent header to server: Accept-Language: en_GB 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Sent header to server: Content-Length: 311 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Sent header to server: 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Info: We are completely uploaded and fine 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Received header from server: HTTP/1.1 200 OK 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Received header from server: Date: Mon, 19 Sep 2016 10:10:26 GMT 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Received header from server: Server: Apache/2.4.6 (CentOS) 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Received header from server: Content-Length: 93 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Received header from server: Content-Type: text/plain; charset=UTF-8 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Received header from server: 19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Info: Connection #627 to host upload2.cpdn.org left intact 19/09/2016 11:11:34 | climateprediction.net | [http] HTTP_OP::libcurl_exec(): ca-bundle set 19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Info: Found bundle for host upload2.cpdn.org: 0x1c365e04cb0 19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Info: Re-using existing connection! (#627) with host upload2.cpdn.org 19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Info: Connected to upload2.cpdn.org (129.67.195.140) port 80 (#627) 19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: POST /cgi-bin/file_upload_handler HTTP/1.1 19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: Host: upload2.cpdn.org 19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: User-Agent: BOINC client (windows_x86_64 7.6.22) 19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: Accept: */* 19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: Accept-Encoding: deflate, gzip 19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: Content-Type: application/x-www-form-urlencoded 19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: Accept-Language: en_GB 19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: Content-Length: 23198108 19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: Expect: 100-continue 19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: 19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Received header from server: HTTP/1.1 100 Continue 19/09/2016 11:18:04 | climateprediction.net | [http] [ID#9825] Info: We are completely uploaded and fine 19/09/2016 11:18:06 | climateprediction.net | [http] [ID#9825] Received header from server: HTTP/1.1 200 OK 19/09/2016 11:18:06 | climateprediction.net | [http] [ID#9825] Received header from server: Date: Mon, 19 Sep 2016 10:10:28 GMT 19/09/2016 11:18:06 | climateprediction.net | [http] [ID#9825] Received header from server: Server: Apache/2.4.6 (CentOS) 19/09/2016 11:18:06 | climateprediction.net | [http] [ID#9825] Received header from server: Transfer-Encoding: chunked 19/09/2016 11:18:06 | climateprediction.net | [http] [ID#9825] Received header from server: Content-Type: text/plain; charset=UTF-8 19/09/2016 11:18:06 | climateprediction.net | [http] [ID#9825] Received header from server: 19/09/2016 11:18:06 | climateprediction.net | [http] [ID#9825] Info: Connection #627 to host upload2.cpdn.org left intact 19/09/2016 11:18:07 | climateprediction.net | Finished upload of wah2_sas50_ky74_200912_13_448_010718036_0_r921421034_1.zip The upload messages will be preceded with those for the associated trickle: 19/09/2016 11:11:30 | climateprediction.net | Sending scheduler request: To send trickle-up message. 19/09/2016 11:11:30 | climateprediction.net | Not requesting tasks: don't need (not highest priority project) 19/09/2016 11:11:30 | climateprediction.net | [http] HTTP_OP::init_post(): http://climateapps2.oerc.ox.ac.uk/cpdnboinc_cgi/cgi 19/09/2016 11:11:30 | climateprediction.net | [http] HTTP_OP::libcurl_exec(): ca-bundle set 19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Info: Trying 129.67.195.185... 19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Info: Connected to climateapps2.oerc.ox.ac.uk (129.67.195.185) port 80 (#626) 19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Sent header to server: POST /cpdnboinc_cgi/cgi HTTP/1.1 19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Sent header to server: Host: climateapps2.oerc.ox.ac.uk 19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Sent header to server: User-Agent: BOINC client (windows_x86_64 7.6.22) 19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Sent header to server: Accept: */* 19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Sent header to server: Accept-Encoding: deflate, gzip 19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Sent header to server: Content-Type: application/x-www-form-urlencoded 19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Sent header to server: Accept-Language: en_GB 19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Sent header to server: Content-Length: 37921 19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Sent header to server: Expect: 100-continue 19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Sent header to server: 19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Received header from server: HTTP/1.1 100 Continue 19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Info: We are completely uploaded and fine 19/09/2016 11:11:32 | climateprediction.net | [http] [ID#1] Received header from server: HTTP/1.1 200 OK 19/09/2016 11:11:32 | climateprediction.net | [http] [ID#1] Received header from server: Date: Mon, 19 Sep 2016 10:11:33 GMT 19/09/2016 11:11:32 | climateprediction.net | [http] [ID#1] Received header from server: Server: Apache 19/09/2016 11:11:32 | climateprediction.net | [http] [ID#1] Received header from server: Connection: close 19/09/2016 11:11:32 | climateprediction.net | [http] [ID#1] Received header from server: Transfer-Encoding: chunked 19/09/2016 11:11:32 | climateprediction.net | [http] [ID#1] Received header from server: Content-Type: text/xml 19/09/2016 11:11:32 | climateprediction.net | [http] [ID#1] Received header from server: 19/09/2016 11:11:32 | climateprediction.net | [http] [ID#1] Info: Closing connection 626 19/09/2016 11:11:32 | climateprediction.net | [http] HTTP_OP::libcurl_exec(): ca-bundle 'C:\Program Files\BOINC\ca-bundle.crt' 19/09/2016 11:11:32 | climateprediction.net | [http] HTTP_OP::libcurl_exec(): ca-bundle set 19/09/2016 11:11:32 | climateprediction.net | Scheduler request completed |
24)
Message boards :
Number crunching :
News and Announcements 2
(Message 53885)
Posted 4 Apr 2016 by Thyme Lawn Post: Archived credits have been restored for all users. These aren't included in the total at the moment but should be on completion of the credit script Andy has started. |
25)
Message boards :
Number crunching :
Total Credit
(Message 53884)
Posted 4 Apr 2016 by Thyme Lawn Post: Not sure if I should be posting this as I don't want to raise false hopes but credits are currently being re-calculated and should include archived credit. Archived credits now appear on user pages but won't be included in the total until the credit script finishes running. My account page currently shows that I'll be getting back more than 5.5 million when it's done: Computing and credit |
26)
Message boards :
Number crunching :
Total Credit
(Message 53866)
Posted 1 Apr 2016 by Thyme Lawn Post: The outstanding missing credit is from tasks which were archived in 2010. The account page for all users has "Archived credit (included in total)" set to 0. The project team are aware that archived credits haven't been carried forward to the new server and are investigating. |
27)
Message boards :
Number crunching :
News and Announcements 2
(Message 53052)
Posted 8 Dec 2015 by Thyme Lawn Post: Project will be offline from 1000 UTC on Wednesday 9th December for at least 24 hours News from the project team: We will be taking the project offline tomorrow (Wednesday 9th December) from 10am (UK time) in order to take a snapshot of the database. This is part of the process of the re-configuration of a slave database machine. Once this snapshot process has completed we will bring the project back online again, we anticipate that this process will take a minimum of 24 hours to complete. We apologise in advance for any inconvenience. |
28)
Message boards :
Number crunching :
CPDN process is not running??
(Message 53049)
Posted 7 Dec 2015 by Thyme Lawn Post: Windows Task Manager is not very detailed, so maybe I should look for a better tool to find out how much my memory load really is. I use Sysinternals Process Explorer. The Sysinternals suite started off as a third party freeware product. Microsoft bought it up 9 years ago and at least one of the original developers is still involved in its development. |
29)
Message boards :
Number crunching :
Late November batch of Windows work
(Message 53042)
Posted 7 Dec 2015 by Thyme Lawn Post: Two of my WAH2 models from the 29 November batch have completed. Some others have failed early on: at least one of those has made some progress on another computer, which makes we wonder whether they don't like being run with too many in parallel (my habit is to run 25% CPUs, except when getting new work when I put CPUs back to 100% - the crashes all occurred during the 100% period). The memory load for WAH2 seems to be much higher than was the case for previous applications. My wah2_eu25 tasks have a total working set size of around 460MB and I've changed the project resource shares on my Q6600 (which only has 2GB of RAM) to prevent it from running more than one of these tasks. |
30)
Message boards :
Number crunching :
Download failed
(Message 53014)
Posted 3 Dec 2015 by Thyme Lawn Post: The problem has been traced to a now corrected permission denied error on transferring the affected workunits to the download server. |
31)
Message boards :
Number crunching :
Download failed
(Message 53011)
Posted 2 Dec 2015 by Thyme Lawn Post: It's definitely not your problem ChrisD. Clicking the '+' on the Stderr line of one of your failed tasks (e.g. hadam3p_pnw_xg5r_200712_12_010210835_1) shows that there was a permanent HTTP error on one of the input files (i.e. it wasn't available at the URL the server told BOINC to fetch it from). I've passed that up to the project team. |
32)
Message boards :
Number crunching :
Late November batch of Windows work
(Message 53010)
Posted 2 Dec 2015 by Thyme Lawn Post: How often do these tasks checkpoint? Looking at the task running now, it seems it's been over 50 minutes of CPU time since the last checkpoint. All CPDN models checkpoint at fixed points in the calculation. For these models it's at the end of each model day, with trickles and uploads being made every 30 model days. My 15 has a checkpoint interval of just under 50 minutes and for the Q6600 it's around 70 minutes. |
33)
Message boards :
Number crunching :
Total Credit
(Message 52998)
Posted 30 Nov 2015 by Thyme Lawn Post: Credits are only granted when the daily script is run and appears to be working fine to me. The last of the 5 HadAM3P EU tasks downloaded to my i5 a couple of weeks ago started running a couple of days ago and has been receiving credits for its trickles (hadam3p_eu_fb17_2014_0_010389523_0). Looking at the task list for your computer it completed 2 tasks on 23rd November and has had no CPDN work (other than a couple of probable ghost tasks from June and July) until 4 WAH2 tasks were downloaded at 19:07:08 UTC yesterday. The first trickle from those tasks was returned at 15:33:40 UTC today and should be credited overnight. |
34)
Message boards :
Number crunching :
No work for Windows?
(Message 52990)
Posted 30 Nov 2015 by Thyme Lawn Post: The application "WAH2 Region Independent (Weather At Home 2)" has now been added to the project preferences page. |
35)
Message boards :
Number crunching :
News and Announcements 2
(Message 52989)
Posted 30 Nov 2015 by Thyme Lawn Post: Although there is a new batch of WAH2 work for Windows systems you'll only be able to receive a task if you have the "If no work for selected applications is available, accept work from other applications?" option enabled in your project preferences. The application "WAH2 Region Independent (Weather At Home 2)" has now been added to the project preferences page. |
36)
Message boards :
Number crunching :
No work for Windows?
(Message 52974)
Posted 29 Nov 2015 by Thyme Lawn Post: I also run NOINC as a Windows service. I've always accepted work from non-selected applications and haven't been sent any HadCM3S tasks since 27th August despite being starved of work from the project for much of that time. The server status page says there are currently 10,513 unsent WAH2 tasks, so it's extremely unlikely that you'll be sent the occasional HadCM3S resend rather than one of the WAH2 ones. Given the odds Les's suggestion of temporarily enabling the work from other applications option seems reasonable to me. |
37)
Message boards :
Number crunching :
News and Announcements 2
(Message 52969)
Posted 29 Nov 2015 by Thyme Lawn Post: Although there is a new batch of WAH2 work for Windows systems you'll only be able to receive a task if you have the "If no work for selected applications is available, accept work from other applications?" option enabled in your project preferences. I've posted a technical explanation here for those who are interested. |
38)
Message boards :
Number crunching :
No work for Windows?
(Message 52968)
Posted 29 Nov 2015 by Thyme Lawn Post: Server Status: 6,592 WAH2 tasks ready to send. You definitely need to be accepting work from other applications for now because the new work is running short application name "wah2_ri" instead of "wah2". They both have the long application name "WAH2 (Weather At Home 2)" which only appears once in the list of selectable applications. Selecting that only allows work to be fetched for "wah2" (which doesn't have any work available). Confused? Join the club ... |
39)
Message boards :
Number crunching :
News and Announcements 2
(Message 52767)
Posted 30 Oct 2015 by Thyme Lawn Post: The CPDN project will be taken offline for a scheduled downtime starting at 1000 UTC on Monday the 2nd November. This is in order to perform firmware updates to the controllers of the storage array on which the CPDN servers reside. The firmware updates will correct an issue with the controllers that was a cause of the recent unscheduled downtime. This work should not affect the climateprediction.net pages of the website, however the climateapps2.oerc portion of the website will be taken offline. Will in OeRC support will be performing this work on the controllers of the storage array. At present we don't have an expected length of time that this firmware update will take, however we will endeavour to restore the project as soon as the updates have completed. |
40)
Message boards :
Number crunching :
CPDN SITE STILL UNRESPONIVE
(Message 52760)
Posted 29 Oct 2015 by Thyme Lawn Post: WAH2 PNW task uploads for beta are sent to http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler which is working. All of the other beta WAH2 region uploads are sent to http://cpdnbeta.oerc.ox.ac.uk/beta_cgi/file_upload_handler and are failing. BOINC treats uploads on the project level rather that the upload URL level, so pending WAH2 PNW uploads might never be attempted if you have uploads for other regions. You can force the PNW uploads to be done by selecting the first one and clicking the "Retry Now" button, but you'll probably have to repeat this a few times because BOINC starts with the first file which isn't backed off, tries 3 files and goes into a project backoff if they all fail. |
©2024 climateprediction.net