climateprediction.net home page
Task 18289846

Task 18289846

Name hadam3p_anz_l1yj_2013_1_009735682_0
Workunit 9807527
Created 9 Apr 2015, 9:40:30 UTC
Sent 12 Apr 2015, 13:09:49 UTC
Report deadline 24 Mar 2016, 18:29:49 UTC
Received 28 Apr 2015, 19:21:52 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1212841
Run time 3 days 12 hours 44 min 23 sec
CPU time 3 days 11 hours 17 min 22 sec
Validate state Invalid
Credit 2,993.82
Device peak FLOPS 3.26 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10
windows_intelx86
Stderr
<core_client_version>7.4.42</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7804, selfPID=7852, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=432, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1924, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6792, selfPID=5952, iMonCtr=1
Model crash detected, will try to restart...
07:41:22 (2100): No heartbeat from core client for 30 sec - exiting
07:41:23 (2100): No heartbeat from core client for 30 sec - exiting
07:41:24 (2100): No heartbeat from core client for 30 sec - exiting
07:41:25 (2100): No heartbeat from core client for 30 sec - exiting
07:41:26 (2100): No heartbeat from core client for 30 sec - exiting
07:41:27 (2100): No heartbeat from core client for 30 sec - exiting
07:41:28 (2100): No heartbeat from core client for 30 sec - exiting
07:41:29 (2100): No heartbeat from core client for 30 sec - exiting
07:41:30 (2100): No heartbeat from core client for 30 sec - exiting
07:41:31 (2100): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:13:13 (7504): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:13:14 (7504): No heartbeat from core client for 30 sec - exiting
19:10:48 (3916): No heartbeat from core client for 30 sec - exiting
19:10:49 (3916): No heartbeat from core client for 30 sec - exiting
19:10:50 (3916): No heartbeat from core client for 30 sec - exiting
19:10:51 (3916): No heartbeat from core client for 30 sec - exiting
19:10:52 (3916): No heartbeat from core client for 30 sec - exiting
19:10:53 (3916): No heartbeat from core client for 30 sec - exiting
19:10:54 (3916): No heartbeat from core client for 30 sec - exiting
19:10:55 (3916): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=8176, iMonCtr=1
CPDN Monitor - No 'heartbeat' from BOINC...
18:53:03 (2416): No heartbeat from core client for 30 sec - exiting
18:53:04 (2416): No heartbeat from core client for 30 sec - exiting
18:53:05 (2416): No heartbeat from core client for 30 sec - exiting
18:53:06 (2416): No heartbeat from core client for 30 sec - exiting
18:53:07 (2416): No heartbeat from core client for 30 sec - exiting
18:53:08 (2416): No heartbeat from core client for 30 sec - exiting
18:53:09 (2416): No heartbeat from core client for 30 sec - exiting
18:53:10 (2416): No heartbeat from core client for 30 sec - exiting
18:53:11 (2416): No heartbeat from core client for 30 sec - exiting
18:53:12 (2416): No heartbeat from core client for 30 sec - exiting
18:53:13 (2416): No heartbeat from core client for 30 sec - exiting
18:53:14 (2416): No heartbeat from core client for 30 sec - exiting
18:53:15 (2416): No heartbeat from core client for 30 sec - exiting
18:53:16 (2416): No heartbeat from core client for 30 sec - exiting
18:53:17 (2416): No heartbeat from core client for 30 sec - exiting
18:53:18 (2416): No heartbeat from core client for 30 sec - exiting
18:53:19 (2416): No heartbeat from core client for 30 sec - exiting
18:53:20 (2416): No heartbeat from core client for 30 sec - exiting
18:53:21 (2416): No heartbeat from core client for 30 sec - exiting
18:53:22 (2416): No heartbeat from core client for 30 sec - exiting
18:53:23 (2416): No heartbeat from core client for 30 sec - exiting
18:53:24 (2416): No heartbeat from core client for 30 sec - exiting
18:53:25 (2416): No heartbeat from core client for 30 sec - exiting
18:53:26 (2416): No heartbeat from core client for 30 sec - exiting
18:53:27 (2416): No heartbeat from core client for 30 sec - exiting
18:53:28 (2416): No heartbeat from core client for 30 sec - exiting
18:53:29 (2416): No heartbeat from core client for 30 sec - exiting
18:53:30 (2416): No heartbeat from core client for 30 sec - exiting
18:53:31 (2416): No heartbeat from core client for 30 sec - exiting
18:53:32 (2416): No heartbeat from core client for 30 sec - exiting
18:53:33 (2416): No heartbeat from core client for 30 sec - exiting
18:53:34 (2416): No heartbeat from core client for 30 sec - exiting
18:53:35 (2416): No heartbeat from core client for 30 sec - exiting
18:53:36 (2416): No heartbeat from core client for 30 sec - exiting
18:53:37 (2416): No heartbeat from core client for 30 sec - exiting
18:53:38 (2416): No heartbeat from core client for 30 sec - exiting
18:53:39 (2416): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:33:28 (5580): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:33:29 (5580): No heartbeat from core client for 30 sec - exiting
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8404, selfPID=7184, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=8020, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1428, selfPID=780, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_l1yj_2013_1_009735682_0_7.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_l1yj_2013_1_009735682_0_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_l1yj_2013_1_009735682_0_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_l1yj_2013_1_009735682_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_l1yj_2013_1_009735682_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_l1yj_2013_1_009735682_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
26 Apr 2015 10:08:38 1212841 18289846 hadam3p_anz_l1yj_2013_1_009735682_0 69,419 258,226 3.7198
25 Apr 2015 12:15:42 1212841 18289846 hadam3p_anz_l1yj_2013_1_009735682_0 57,899 215,543 3.7227
22 Apr 2015 19:27:04 1212841 18289846 hadam3p_anz_l1yj_2013_1_009735682_0 46,379 172,995 3.7300
19 Apr 2015 14:18:22 1212841 18289846 hadam3p_anz_l1yj_2013_1_009735682_0 34,859 129,028 3.7014
18 Apr 2015 17:03:07 1212841 18289846 hadam3p_anz_l1yj_2013_1_009735682_0 23,339 88,057 3.7730
17 Apr 2015 18:37:18 1212841 18289846 hadam3p_anz_l1yj_2013_1_009735682_0 11,819 45,478 3.8479


©2024 climateprediction.net