climateprediction.net home page
Task 16622109

Task 16622109

Name hadam3p_anz_r6wn_2012_1_008738565_0
Workunit 8884543
Created 8 May 2014, 19:12:32 UTC
Sent 15 May 2014, 17:01:31 UTC
Report deadline 27 Apr 2015, 22:21:31 UTC
Received 15 Jun 2014, 10:40:55 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1316091
Run time 2 days 0 hours 38 min 24 sec
CPU time 1 days 22 hours 29 min 29 sec
Validate state Invalid
Credit 1,503.36
Device peak FLOPS 3.64 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10
windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3392, selfPID=6524, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
19:51:16 (6984): No heartbeat from core client for 30 sec - exiting
19:51:17 (6984): No heartbeat from core client for 30 sec - exiting
19:51:18 (6984): No heartbeat from core client for 30 sec - exiting
19:51:19 (6984): No heartbeat from core client for 30 sec - exiting
19:51:20 (6984): No heartbeat from core client for 30 sec - exiting
19:51:21 (6984): No heartbeat from core client for 30 sec - exiting
19:51:22 (6984): No heartbeat from core client for 30 sec - exiting
19:51:23 (6984): No heartbeat from core client for 30 sec - exiting
19:51:24 (6984): No heartbeat from core client for 30 sec - exiting
19:51:25 (6984): No heartbeat from core client for 30 sec - exiting
19:51:26 (6984): No heartbeat from core client for 30 sec - exiting
19:51:27 (6984): No heartbeat from core client for 30 sec - exiting
19:51:28 (6984): No heartbeat from core client for 30 sec - exiting
19:51:29 (6984): No heartbeat from core client for 30 sec - exiting
19:51:30 (6984): No heartbeat from core client for 30 sec - exiting
19:51:31 (6984): No heartbeat from core client for 30 sec - exiting
19:51:32 (6984): No heartbeat from core client for 30 sec - exiting
19:51:33 (6984): No heartbeat from core client for 30 sec - exiting
19:51:34 (6984): No heartbeat from core client for 30 sec - exiting
19:51:35 (6984): No heartbeat from core client for 30 sec - exiting
19:51:36 (6984): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
21:28:29 (6920): No heartbeat from core client for 30 sec - exiting
21:28:30 (6920): No heartbeat from core client for 30 sec - exiting
21:28:31 (6920): No heartbeat from core client for 30 sec - exiting
21:28:32 (6920): No heartbeat from core client for 30 sec - exiting
21:28:33 (6920): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:30:16 (4116): No heartbeat from core client for 30 sec - exiting
09:30:17 (4116): No heartbeat from core client for 30 sec - exiting
09:30:18 (4116): No heartbeat from core client for 30 sec - exiting
09:30:19 (4116): No heartbeat from core client for 30 sec - exiting
09:30:20 (4116): No heartbeat from core client for 30 sec - exiting
09:30:21 (4116): No heartbeat from core client for 30 sec - exiting
09:30:22 (4116): No heartbeat from core client for 30 sec - exiting
09:30:23 (4116): No heartbeat from core client for 30 sec - exiting
09:30:24 (4116): No heartbeat from core client for 30 sec - exiting
09:30:25 (4116): No heartbeat from core client for 30 sec - exiting
09:30:26 (4116): No heartbeat from core client for 30 sec - exiting
09:30:27 (4116): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7452, selfPID=7452, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6116, selfPID=6116, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6848, selfPID=6848, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3644, selfPID=2152, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1468, selfPID=5532, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4360, selfPID=6380, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_r6wn_2012_1_008738565_0_4.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6wn_2012_1_008738565_0_5.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6wn_2012_1_008738565_0_6.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6wn_2012_1_008738565_0_7.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6wn_2012_1_008738565_0_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6wn_2012_1_008738565_0_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6wn_2012_1_008738565_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6wn_2012_1_008738565_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6wn_2012_1_008738565_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
10 Jun 2014 09:03:51 1316091 16622109 hadam3p_anz_r6wn_2012_1_008738565_0 34,859 131,874 3.7831
01 Jun 2014 12:38:48 1316091 16622109 hadam3p_anz_r6wn_2012_1_008738565_0 23,339 88,419 3.7885
25 May 2014 10:41:54 1316091 16622109 hadam3p_anz_r6wn_2012_1_008738565_0 11,819 42,289 3.5781


©2024 climateprediction.net