climateprediction.net home page
Task 16622110

Task 16622110

Name hadam3p_anz_r6wo_2012_1_008738566_0
Workunit 8884544
Created 8 May 2014, 19:12:37 UTC
Sent 15 May 2014, 17:01:31 UTC
Report deadline 27 Apr 2015, 22:21:31 UTC
Received 15 Jun 2014, 14:37:04 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1316091
Run time 1 days 13 hours 27 min 8 sec
CPU time 1 days 11 hours 40 min 14 sec
Validate state Invalid
Credit 1,006.54
Device peak FLOPS 3.64 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10
windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
19:51:16 (6952): No heartbeat from core client for 30 sec - exiting
19:51:17 (6952): No heartbeat from core client for 30 sec - exiting
19:51:18 (6952): No heartbeat from core client for 30 sec - exiting
19:51:19 (6952): No heartbeat from core client for 30 sec - exiting
19:51:20 (6952): No heartbeat from core client for 30 sec - exiting
19:51:21 (6952): No heartbeat from core client for 30 sec - exiting
19:51:22 (6952): No heartbeat from core client for 30 sec - exiting
19:51:23 (6952): No heartbeat from core client for 30 sec - exiting
19:51:24 (6952): No heartbeat from core client for 30 sec - exiting
19:51:25 (6952): No heartbeat from core client for 30 sec - exiting
19:51:26 (6952): No heartbeat from core client for 30 sec - exiting
19:51:27 (6952): No heartbeat from core client for 30 sec - exiting
19:51:28 (6952): No heartbeat from core client for 30 sec - exiting
19:51:29 (6952): No heartbeat from core client for 30 sec - exiting
19:51:30 (6952): No heartbeat from core client for 30 sec - exiting
19:51:31 (6952): No heartbeat from core client for 30 sec - exiting
19:51:32 (6952): No heartbeat from core client for 30 sec - exiting
19:51:33 (6952): No heartbeat from core client for 30 sec - exiting
19:51:34 (6952): No heartbeat from core client for 30 sec - exiting
19:51:35 (6952): No heartbeat from core client for 30 sec - exiting
19:51:36 (6952): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2524, selfPID=2524, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
21:28:29 (6928): No heartbeat from core client for 30 sec - exiting
21:28:30 (6928): No heartbeat from core client for 30 sec - exiting
21:28:31 (6928): No heartbeat from core client for 30 sec - exiting
21:28:32 (6928): No heartbeat from core client for 30 sec - exiting
21:28:33 (6928): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:30:16 (4152): No heartbeat from core client for 30 sec - exiting
09:30:17 (4152): No heartbeat from core client for 30 sec - exiting
09:30:18 (4152): No heartbeat from core client for 30 sec - exiting
09:30:19 (4152): No heartbeat from core client for 30 sec - exiting
09:30:20 (4152): No heartbeat from core client for 30 sec - exiting
09:30:21 (4152): No heartbeat from core client for 30 sec - exiting
09:30:22 (4152): No heartbeat from core client for 30 sec - exiting
09:30:23 (4152): No heartbeat from core client for 30 sec - exiting
09:30:24 (4152): No heartbeat from core client for 30 sec - exiting
09:30:25 (4152): No heartbeat from core client for 30 sec - exiting
09:30:26 (4152): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=664, selfPID=664, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7464, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7684, selfPID=5772, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2892, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2664, selfPID=5516, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7268, selfPID=6708, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_r6wo_2012_1_008738566_0_3.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6wo_2012_1_008738566_0_4.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6wo_2012_1_008738566_0_5.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6wo_2012_1_008738566_0_6.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6wo_2012_1_008738566_0_7.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6wo_2012_1_008738566_0_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6wo_2012_1_008738566_0_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6wo_2012_1_008738566_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6wo_2012_1_008738566_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6wo_2012_1_008738566_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
10 Jun 2014 09:03:13 1316091 16622110 hadam3p_anz_r6wo_2012_1_008738566_0 23,339 87,946 3.7682
25 May 2014 12:42:23 1316091 16622110 hadam3p_anz_r6wo_2012_1_008738566_0 11,819 42,579 3.6026


©2024 cpdn.org