climateprediction.net home page
Task 16696987

Task 16696987

Name hadam3p_anz_rapx_2012_1_008743507_1
Workunit 8889485
Created 29 Jun 2014, 15:18:54 UTC
Sent 29 Jun 2014, 15:18:58 UTC
Report deadline 11 Jun 2015, 20:38:58 UTC
Received 27 Jul 2014, 18:39:24 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1332532
Run time 7 days 23 hours 28 min 11 sec
CPU time 7 days 12 hours 47 min 50 sec
Validate state Invalid
Credit 4,484.28
Device peak FLOPS 2.58 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10
windows_intelx86
Stderr
<core_client_version>7.2.33</core_client_version>
<![CDATA[
<stderr_txt>
17:25:02 (6268): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
07:55:17 (6668): No heartbeat from core client for 30 sec - exiting
07:55:18 (6668): No heartbeat from core client for 30 sec - exiting
07:55:19 (6668): No heartbeat from core client for 30 sec - exiting
07:55:20 (6668): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
09:25:36 (7620): No heartbeat from core client for 30 sec - exiting
09:25:37 (7620): No heartbeat from core client for 30 sec - exiting
09:25:38 (7620): No heartbeat from core client for 30 sec - exiting
09:25:39 (7620): No heartbeat from core client for 30 sec - exiting
09:25:40 (7620): No heartbeat from core client for 30 sec - exiting
09:25:41 (7620): No heartbeat from core client for 30 sec - exiting
09:25:42 (7620): No heartbeat from core client for 30 sec - exiting
09:25:43 (7620): No heartbeat from core client for 30 sec - exiting
09:25:44 (7620): No heartbeat from core client for 30 sec - exiting
09:25:45 (7620): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3080, selfPID=4748, iMonCtr=1
Model crash detected, will try to restart...
21:09:56 (5196): No heartbeat from core client for 30 sec - exiting
21:09:57 (5196): No heartbeat from core client for 30 sec - exiting
21:09:58 (5196): No heartbeat from core client for 30 sec - exiting
21:09:59 (5196): No heartbeat from core client for 30 sec - exiting
21:10:00 (5196): No heartbeat from core client for 30 sec - exiting
21:10:01 (5196): No heartbeat from core client for 30 sec - exiting
21:10:02 (5196): No heartbeat from core client for 30 sec - exiting
21:10:03 (5196): No heartbeat from core client for 30 sec - exiting
21:10:04 (5196): No heartbeat from core client for 30 sec - exiting
21:10:05 (5196): No heartbeat from core client for 30 sec - exiting
21:10:06 (5196): No heartbeat from core client for 30 sec - exiting
21:10:07 (5196): No heartbeat from core client for 30 sec - exiting
21:10:08 (5196): No heartbeat from core client for 30 sec - exiting
21:10:39 (5196): No heartbeat from core client for 30 sec - exiting
21:10:40 (5196): No heartbeat from core client for 30 sec - exiting
21:10:41 (5196): No heartbeat from core client for 30 sec - exiting
21:10:42 (5196): No heartbeat from core client for 30 sec - exiting
21:10:43 (5196): No heartbeat from core client for 30 sec - exiting
21:10:44 (5196): No heartbeat from core client for 30 sec - exiting
21:10:45 (5196): No heartbeat from core client for 30 sec - exiting
21:10:46 (5196): No heartbeat from core client for 30 sec - exiting
21:10:47 (5196): No heartbeat from core client for 30 sec - exiting
21:10:48 (5196): No heartbeat from core client for 30 sec - exiting
21:10:49 (5196): No heartbeat from core client for 30 sec - exiting
21:10:50 (5196): No heartbeat from core client for 30 sec - exiting
21:10:51 (5196): No heartbeat from core client for 30 sec - exiting
21:10:52 (5196): No heartbeat from core client for 30 sec - exiting
21:10:53 (5196): No heartbeat from core client for 30 sec - exiting
21:10:54 (5196): No heartbeat from core client for 30 sec - exiting
21:10:55 (5196): No heartbeat from core client for 30 sec - exiting
21:10:56 (5196): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2564, selfPID=2564, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5536, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2524, selfPID=1600, iMonCtr=1
Model crash detected, will try to restart...
07:51:09 (2064): No heartbeat from core client for 30 sec - exiting
07:51:10 (2064): No heartbeat from core client for 30 sec - exiting
07:51:11 (2064): No heartbeat from core client for 30 sec - exiting
07:51:12 (2064): No heartbeat from core client for 30 sec - exiting
07:51:13 (2064): No heartbeat from core client for 30 sec - exiting
07:51:14 (2064): No heartbeat from core client for 30 sec - exiting
07:51:15 (2064): No heartbeat from core client for 30 sec - exiting
07:51:16 (2064): No heartbeat from core client for 30 sec - exiting
07:51:17 (2064): No heartbeat from core client for 30 sec - exiting
07:51:18 (2064): No heartbeat from core client for 30 sec - exiting
07:51:19 (2064): No heartbeat from core client for 30 sec - exiting
07:51:20 (2064): No heartbeat from core client for 30 sec - exiting
07:51:21 (2064): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:51:22 (2064): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14844, selfPID=6540, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4032, selfPID=4984, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5400, selfPID=5400, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5892, iMonCtr=2
Model crash detected, will try to restart...

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_rapx_2012_1_008743507_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rapx_2012_1_008743507_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rapx_2012_1_008743507_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
23 Jul 2014 10:34:34 1332532 16696987 hadam3p_anz_rapx_2012_1_008743507_1 103,979 596,258 5.7344
21 Jul 2014 01:55:58 1332532 16696987 hadam3p_anz_rapx_2012_1_008743507_1 92,459 528,622 5.7174
20 Jul 2014 06:38:41 1332532 16696987 hadam3p_anz_rapx_2012_1_008743507_1 80,939 461,491 5.7017
18 Jul 2014 10:08:37 1332532 16696987 hadam3p_anz_rapx_2012_1_008743507_1 69,419 394,216 5.6788
17 Jul 2014 00:13:21 1332532 16696987 hadam3p_anz_rapx_2012_1_008743507_1 57,899 327,598 5.6581
15 Jul 2014 05:26:30 1332532 16696987 hadam3p_anz_rapx_2012_1_008743507_1 46,379 261,213 5.6321
13 Jul 2014 16:14:50 1332532 16696987 hadam3p_anz_rapx_2012_1_008743507_1 34,859 193,274 5.5445
12 Jul 2014 20:31:21 1332532 16696987 hadam3p_anz_rapx_2012_1_008743507_1 23,339 128,056 5.4868
06 Jul 2014 14:26:51 1332532 16696987 hadam3p_anz_rapx_2012_1_008743507_1 11,819 64,770 5.4802


©2024 cpdn.org