climateprediction.net home page
Task 18807581

Task 18807581

Name hadam3p_anz_a1i5_2008_1_010097278_0
Workunit 10076868
Created 12 Aug 2015, 16:52:16 UTC
Sent 14 Aug 2015, 13:28:22 UTC
Report deadline 26 Jul 2016, 18:48:22 UTC
Received 25 Aug 2015, 1:51:45 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1322479
Run time 2 days 3 hours 1 min 37 sec
CPU time 1 days 23 hours 1 min 2 sec
Validate state Invalid
Credit 3,490.64
Device peak FLOPS 3.35 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10
windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
16:46:08 (7968): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
02:21:56 (6796): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
17:42:16 (6604): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:07:06 (7744): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:32:15 (2732): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:58:21 (6420): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:58:22 (6420): No heartbeat from core client for 30 sec - exiting
22:17:18 (4932): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5592, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7820, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7848, selfPID=5528, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
20:18:01 (7556): No heartbeat from core client for 30 sec - exiting
20:18:02 (7556): No heartbeat from core client for 30 sec - exiting
20:18:03 (7556): No heartbeat from core client for 30 sec - exiting
20:18:04 (7556): No heartbeat from core client for 30 sec - exiting
20:18:05 (7556): No heartbeat from core client for 30 sec - exiting
20:18:06 (7556): No heartbeat from core client for 30 sec - exiting
20:18:07 (7556): No heartbeat from core client for 30 sec - exiting
20:18:08 (7556): No heartbeat from core client for 30 sec - exiting
20:18:09 (7556): No heartbeat from core client for 30 sec - exiting
20:18:10 (7556): No heartbeat from core client for 30 sec - exiting
20:18:11 (7556): No heartbeat from core client for 30 sec - exiting
20:18:12 (7556): No heartbeat from core client for 30 sec - exiting
20:18:13 (7556): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1024, selfPID=1024, iMonCtr=2
22:24:45 (740): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:40:19 (7084): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:29:00 (8124): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7160, selfPID=7160, iMonCtr=2
16:10:32 (6400): No heartbeat from core client for 30 sec - exiting
16:10:33 (6400): No heartbeat from core client for 30 sec - exiting
16:10:34 (6400): No heartbeat from core client for 30 sec - exiting
16:10:35 (6400): No heartbeat from core client for 30 sec - exiting
16:10:36 (6400): No heartbeat from core client for 30 sec - exiting
16:10:37 (6400): No heartbeat from core client for 30 sec - exiting
16:10:38 (6400): No heartbeat from core client for 30 sec - exiting
16:10:39 (6400): No heartbeat from core client for 30 sec - exiting
16:10:40 (6400): No heartbeat from core client for 30 sec - exiting
16:10:42 (6400): No heartbeat from core client for 30 sec - exiting
16:10:43 (6400): No heartbeat from core client for 30 sec - exiting
16:10:44 (6400): No heartbeat from core client for 30 sec - exiting
16:10:45 (6400): No heartbeat from core client for 30 sec - exiting
16:10:46 (6400): No heartbeat from core client for 30 sec - exiting
16:10:47 (6400): No heartbeat from core client for 30 sec - exiting
16:10:48 (6400): No heartbeat from core client for 30 sec - exiting
16:10:49 (6400): No heartbeat from core client for 30 sec - exiting
16:10:50 (6400): No heartbeat from core client for 30 sec - exiting
16:10:51 (6400): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8028, selfPID=8028, iMonCtr=2
16:16:25 (8088): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5992, selfPID=7744, iMonCtr=1
Model crash detected, will try to restart...
10:28:54 (4844): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7748, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1584, iMonCtr=2
Model crash detected, will try to restart...
17:51:25 (6796): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7472, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5480, selfPID=8008, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
11:10:55 (7804): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
GSuspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
16:58:39 (2528): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
22:49:32 (2408): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5728, selfPID=8036, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6396, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5096, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish
20:18:08 (5096): No heartbeat from core client for 30 sec - exiting
20:18:09 (5096): No heartbeat from core client for 30 sec - exiting

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_a1i5_2008_1_010097278_0_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_a1i5_2008_1_010097278_0_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_a1i5_2008_1_010097278_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_a1i5_2008_1_010097278_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_a1i5_2008_1_010097278_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
23 Aug 2015 10:22:11 1322479 18807581 hadam3p_anz_a1i5_2008_1_010097278_0 80,939 156,921 1.9388
22 Aug 2015 07:13:41 1322479 18807581 hadam3p_anz_a1i5_2008_1_010097278_0 69,419 134,884 1.9430
20 Aug 2015 01:31:27 1322479 18807581 hadam3p_anz_a1i5_2008_1_010097278_0 57,899 112,512 1.9432
18 Aug 2015 05:48:42 1322479 18807581 hadam3p_anz_a1i5_2008_1_010097278_0 46,379 90,561 1.9526
17 Aug 2015 13:36:23 1322479 18807581 hadam3p_anz_a1i5_2008_1_010097278_0 34,859 68,560 1.9668
16 Aug 2015 13:53:06 1322479 18807581 hadam3p_anz_a1i5_2008_1_010097278_0 23,339 46,530 1.9937
15 Aug 2015 16:28:04 1322479 18807581 hadam3p_anz_a1i5_2008_1_010097278_0 11,819 24,080 2.0374


©2024 climateprediction.net