Task 16617518

Name: hadam3p_anz_r3dh_2012_1_008733987_0
Workunit: 8879965
Created: 8 May 2014, 18:35:31 UTC
Sent: 19 May 2014, 8:05:37 UTC
Report deadline: 1 May 2015, 13:25:37 UTC
Received: 29 May 2014, 22:02:27 UTC
Server state: Over
Outcome: Computation error
Client state: Compute error
Exit status: 0 (0x00000000)
Computer ID: 1268275
Run time: 9 days 15 hours 18 min 6 sec
CPU time: 8 days 5 hours 43 min 9 sec
Validate state: Invalid
Credit: 4,981.10
Device peak FLOPS: 2.93 GFLOPS
Application version: UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
13:19:15 (10232): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10372, selfPID=3732, iMonCtr=1
Model crash detected, will try to restart...
20:11:34 (8144): No heartbeat from core client for 30 sec - exiting
20:11:35 (8144): No heartbeat from core client for 30 sec - exiting
20:11:36 (8144): No heartbeat from core client for 30 sec - exiting
20:11:37 (8144): No heartbeat from core client for 30 sec - exiting
20:11:38 (8144): No heartbeat from core client for 30 sec - exiting
20:11:39 (8144): No heartbeat from core client for 30 sec - exiting
20:11:40 (8144): No heartbeat from core client for 30 sec - exiting
20:11:41 (8144): No heartbeat from core client for 30 sec - exiting
20:11:42 (8144): No heartbeat from core client for 30 sec - exiting
20:11:43 (8144): No heartbeat from core client for 30 sec - exiting
20:11:44 (8144): No heartbeat from core client for 30 sec - exiting
20:11:45 (8144): No heartbeat from core client for 30 sec - exiting
20:11:46 (8144): No heartbeat from core client for 30 sec - exiting
20:11:47 (8144): No heartbeat from core client for 30 sec - exiting
20:11:48 (8144): No heartbeat from core client for 30 sec - exiting
20:11:49 (8144): No heartbeat from core client for 30 sec - exiting
20:11:50 (8144): No heartbeat from core client for 30 sec - exiting
20:11:51 (8144): No heartbeat from core client for 30 sec - exiting
20:11:52 (8144): No heartbeat from core client for 30 sec - exiting
20:11:53 (8144): No heartbeat from core client for 30 sec - exiting
20:11:54 (8144): No heartbeat from core client for 30 sec - exiting
20:11:55 (8144): No heartbeat from core client for 30 sec - exiting
20:11:56 (8144): No heartbeat from core client for 30 sec - exiting
20:11:57 (8144): No heartbeat from core client for 30 sec - exiting
20:11:58 (8144): No heartbeat from core client for 30 sec - exiting
20:11:59 (8144): No heartbeat from core client for 30 sec - exiting
20:12:00 (8144): No heartbeat from core client for 30 sec - exiting
20:12:01 (8144): No heartbeat from core client for 30 sec - exiting
20:12:02 (8144): No heartbeat from core client for 30 sec - exiting
20:12:03 (8144): No heartbeat from core client for 30 sec - exiting
20:12:04 (8144): No heartbeat from core client for 30 sec - exiting
20:12:05 (8144): No heartbeat from core client for 30 sec - exiting
20:12:06 (8144): No heartbeat from core client for 30 sec - exiting
20:12:07 (8144): No heartbeat from core client for 30 sec - exiting
20:12:08 (8144): No heartbeat from core client for 30 sec - exiting
20:12:09 (8144): No heartbeat from core client for 30 sec - exiting
20:12:10 (8144): No heartbeat from core client for 30 sec - exiting
20:12:11 (8144): No heartbeat from core client for 30 sec - exiting
20:12:12 (8144): No heartbeat from core client for 30 sec - exiting
20:12:13 (8144): No heartbeat from core client for 30 sec - exiting
20:12:14 (8144): No heartbeat from core client for 30 sec - exiting
20:12:15 (8144): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7872, selfPID=7872, iMonCtr=2
12:43:44 (1664): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:08:13 (11220): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:23:19 (11676): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:25:06 (12556): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:42:49 (9360): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
11:24:52 (9492): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:17:06 (7584): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1276, selfPID=1276, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
14:29:47 (3908): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
14:35:50 (9820): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:12:35 (12888): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:27:29 (12364): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:52:14 (10852): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:53:19 (8356): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:29:49 (12996): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:41:03 (13280): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:03:12 (9424): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:38:10 (11752): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2300, selfPID=2300, iMonCtr=2
13:50:56 (11792): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:00:06 (2236): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:24:18 (13428): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:58:09 (5860): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13416, selfPID=13416, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
16:13:15 (12940): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:35:08 (10144): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process isSuspended CPDN Monitor - Suspend request from BOINC...
22:00:48 (13916): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14192, selfPID=7876, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_r3dh_2012_1_008733987_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r3dh_2012_1_008733987_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
29 May 2014 18:39:00 | 1268275 | 16617518 | hadam3p_anz_r3dh_2012_1_008733987_0 | 115,499 | 704,510 | 6.0997
28 May 2014 15:02:02 | 1268275 | 16617518 | hadam3p_anz_r3dh_2012_1_008733987_0 | 103,979 | 632,816 | 6.0860
27 May 2014 14:25:29 | 1268275 | 16617518 | hadam3p_anz_r3dh_2012_1_008733987_0 | 92,459 | 561,265 | 6.0704
26 May 2014 15:16:47 | 1268275 | 16617518 | hadam3p_anz_r3dh_2012_1_008733987_0 | 80,939 | 489,546 | 6.0483
25 May 2014 16:07:58 | 1268275 | 16617518 | hadam3p_anz_r3dh_2012_1_008733987_0 | 69,419 | 418,576 | 6.0297
24 May 2014 18:18:26 | 1268275 | 16617518 | hadam3p_anz_r3dh_2012_1_008733987_0 | 57,899 | 347,767 | 6.0064
23 May 2014 10:35:51 | 1268275 | 16617518 | hadam3p_anz_r3dh_2012_1_008733987_0 | 46,379 | 279,517 | 6.0268
22 May 2014 09:10:12 | 1268275 | 16617518 | hadam3p_anz_r3dh_2012_1_008733987_0 | 34,859 | 210,921 | 6.0507
21 May 2014 09:06:09 | 1268275 | 16617518 | hadam3p_anz_r3dh_2012_1_008733987_0 | 23,339 | 142,037 | 6.0858
20 May 2014 08:51:54 | 1268275 | 16617518 | hadam3p_anz_r3dh_2012_1_008733987_0 | 11,819 | 71,483 | 6.0481
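
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. The page does not state this; it is inferred from the figures in the table. The sketch below checks that assumption against three of the rows. The function name seconds_per_timestep is hypothetical and not part of BOINC or CPDN.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
trickles = [
    (115_499, 704_510, 6.0997),
    (57_899, 347_767, 6.0064),
    (11_819, 71_483, 6.0481),
]


def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Assumed definition: cumulative CPU seconds per cumulative timestep."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in trickles:
    computed = seconds_per_timestep(cpu_time_sec, timestep)
    # Show the computed value next to the one reported on the page.
    print(f"timestep {timestep:>7}: computed {computed:.4f}, reported {reported:.4f}")
```

For these rows the computed values agree with the reported ones to four decimal places, which is consistent with the assumed definition but does not confirm how the server actually derives the column.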


©2024 climateprediction.net