Task 14814187

Name hadam3p_pnw_zz48_1967_1_007028832_1
Workunit 7232148
Created 21 Jun 2012, 22:51:41 UTC
Sent 21 Jun 2012, 22:51:58 UTC
Report deadline 4 Jun 2013, 4:11:58 UTC
Received 29 Jun 2012, 6:23:04 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1222868
Run time 2 days 15 hours 56 min 35 sec
CPU time 2 days 14 hours 26 min 40 sec
Validate state Invalid
Credit 1,754.30
Device peak FLOPS 2.65 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 (windows_intelx86)
Stderr
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4212, selfPID=6088, iMonCtr=1
Model crash detected, will try to restart...
10:58:33 (4400): No heartbeat from core client for 30 sec - exiting
10:58:34 (4400): No heartbeat from core client for 30 sec - exiting
10:58:35 (4400): No heartbeat from core client for 30 sec - exiting
10:58:36 (4400): No heartbeat from core client for 30 sec - exiting
10:58:37 (4400): No heartbeat from core client for 30 sec - exiting
10:58:38 (4400): No heartbeat from core client for 30 sec - exiting
10:58:39 (4400): No heartbeat from core client for 30 sec - exiting
10:58:40 (4400): No heartbeat from core client for 30 sec - exiting
10:58:41 (4400): No heartbeat from core client for 30 sec - exiting
10:58:42 (4400): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2952, selfPID=2952, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
G14:36:05 (4104): No heartbeat from core client for 30 sec - exiting
14:36:06 (4104): No heartbeat from core client for 30 sec - exiting
14:36:07 (4104): No heartbeat from core client for 30 sec - exiting
14:36:08 (4104): No heartbeat from core client for 30 sec - exiting
14:36:09 (4104): No heartbeat from core client for 30 sec - exiting
14:36:10 (4104): No heartbeat from core client for 30 sec - exiting
14:36:11 (4104): No heartbeat from core client for 30 sec - exiting
14:36:12 (4104): No heartbeat from core client for 30 sec - exiting
14:36:13 (4104): No heartbeat from core client for 30 sec - exiting
14:36:14 (4104): No heartbeat from core client for 30 sec - exiting
14:36:15 (4104): No heartbeat from core client for 30 sec - exiting
14:36:16 (4104): No heartbeat from core client for 30 sec - exiting
14:36:17 (4104): No heartbeat from core client for 30 sec - exiting
14:36:18 (4104): No heartbeat from core client for 30 sec - exiting
14:36:20 (4104): No heartbeat from core client for 30 sec - exiting
14:36:21 (4104): No heartbeat from core client for 30 sec - exiting
14:36:22 (4104): No heartbeat from core client for 30 sec - exiting
14:36:23 (4104): No heartbeat from core client for 30 sec - exiting
14:36:24 (4104): No heartbeat from core client for 30 sec - exiting
14:36:25 (4104): No heartbeat from core client for 30 sec - exiting
14:36:26 (4104): No heartbeat from core client for 30 sec - exiting
14:36:27 (4104): No heartbeat from core client for 30 sec - exiting
14:36:28 (4104): No heartbeat from core client for 30 sec - exiting
14:36:29 (4104): No heartbeat from core client for 30 sec - exiting
14:36:30 (4104): No heartbeat from core client for 30 sec - exiting
14:36:31 (4104): No heartbeat from core client for 30 sec - exiting
14:36:32 (4104): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
01:21:07 (3516): No heartbeat from core client for 30 sec - exiting
01:21:09 (3516): No heartbeat from core client for 30 sec - exiting
01:21:10 (3516): No heartbeat from core client for 30 sec - exiting
01:21:11 (3516): No heartbeat from core client for 30 sec - exiting
01:21:12 (3516): No heartbeat from core client for 30 sec - exiting
01:21:13 (3516): No heartbeat from core client for 30 sec - exiting
01:21:14 (3516): No heartbeat from core client for 30 sec - exiting
01:21:15 (3516): No heartbeat from core client for 30 sec - exiting
01:21:16 (3516): No heartbeat from core client for 30 sec - exiting
01:21:17 (3516): No heartbeat from core client for 30 sec - exiting
01:21:18 (3516): No heartbeat from core client for 30 sec - exiting
01:21:19 (3516): No heartbeat from core client for 30 sec - exiting
01:21:20 (3516): No heartbeat from core client for 30 sec - exiting
01:21:21 (3516): No heartbeat from core client for 30 sec - exiting
01:21:22 (3516): No heartbeat from core client for 30 sec - exiting
01:21:23 (3516): No heartbeat from core client for 30 sec - exiting
01:21:24 (3516): No heartbeat from core client for 30 sec - exiting
01:21:25 (3516): No heartbeat from core client for 30 sec - exiting
01:21:26 (3516): No heartbeat from core client for 30 sec - exiting
01:21:27 (3516): No heartbeat from core client for 30 sec - exiting
01:21:28 (3516): No heartbeat from core client for 30 sec - exiting
01:21:29 (3516): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:35:55 (3632): No heartbeat from core client for 30 sec - exiting
17:35:56 (3632): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=4392, iMonCtr=1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4928, selfPID=3372, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 7
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_zz48_1967_1_007028832_1_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zz48_1967_1_007028832_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zz48_1967_1_007028832_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zz48_1967_1_007028832_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zz48_1967_1_007028832_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
27 Jun 2012 12:11:11 | 1222868 | 14814187 | hadam3p_pnw_zz48_1967_1_007028832_1 | 80,736 | 212,508 | 2.6321
27 Jun 2012 03:37:54 | 1222868 | 14814187 | hadam3p_pnw_zz48_1967_1_007028832_1 | 69,216 | 183,232 | 2.6472
26 Jun 2012 18:37:14 | 1222868 | 14814187 | hadam3p_pnw_zz48_1967_1_007028832_1 | 57,698 | 152,568 | 2.6443
26 Jun 2012 17:35:44 | 1222868 | 14814187 | hadam3p_pnw_zz48_1967_1_007028832_1 | 57,696 | 152,261 | 2.6390
26 Jun 2012 09:05:58 | 1222868 | 14814187 | hadam3p_pnw_zz48_1967_1_007028832_1 | 46,176 | 122,071 | 2.6436
25 Jun 2012 06:22:18 | 1222868 | 14814187 | hadam3p_pnw_zz48_1967_1_007028832_1 | 34,656 | 91,087 | 2.6283
23 Jun 2012 06:40:57 | 1222868 | 14814187 | hadam3p_pnw_zz48_1967_1_007028832_1 | 23,136 | 60,866 | 2.6308
22 Jun 2012 21:50:14 | 1222868 | 14814187 | hadam3p_pnw_zz48_1967_1_007028832_1 | 11,616 | 30,358 | 2.6135
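
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. This is an inference from the numbers in the trickle table, not something the page states. A minimal Python sketch to check that reading against the rows; the names `TRICKLES` and `average_sec_per_timestep` are illustrative and are not part of any BOINC or CPDN tool:

```python
# Trickle rows copied from the table: (timestep, cpu_time_sec, reported_average).
TRICKLES = [
    (80736, 212508, 2.6321),
    (69216, 183232, 2.6472),
    (57698, 152568, 2.6443),
    (57696, 152261, 2.6390),
    (46176, 122071, 2.6436),
    (34656, 91087, 2.6283),
    (23136, 60866, 2.6308),
    (11616, 30358, 2.6135),
]


def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep


def max_deviation(rows) -> float:
    """Largest gap between the recomputed and the reported average."""
    return max(
        abs(average_sec_per_timestep(cpu, ts) - reported)
        for ts, cpu, reported in rows
    )


if __name__ == "__main__":
    for ts, cpu, reported in TRICKLES:
        computed = average_sec_per_timestep(cpu, ts)
        print(f"timestep {ts:>6}: computed {computed:.4f}, reported {reported:.4f}")
```

Under that assumption, every recomputed value agrees with the reported one to within four-decimal rounding, so the host held steady at roughly 2.6 CPU seconds per timestep through the last trickle.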