climateprediction.net home page
Task 15336185

Task 15336185

Name hadam3p_pnw_301d_1981_1_008213267_2
Workunit 8368391
Created 5 Oct 2012, 15:07:55 UTC
Sent 5 Oct 2012, 18:07:11 UTC
Report deadline 17 Sep 2013, 23:27:11 UTC
Received 17 Oct 2012, 12:48:06 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1229783
Run time 3 days 9 hours 34 min 29 sec
CPU time 12 hours 46 min 13 sec
Validate state Invalid
Credit 1,503.98
Device peak FLOPS 2.94 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5228, selfPID=3344, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6932, selfPID=788, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6012, selfPID=1120, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2408, selfPID=1564, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 4
Suspended CPDN Monitor - Suspend request from BOINC...
21:27:32 (3612): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4896, selfPID=1712, iMonCtr=1
Model crash detected, will try to restart...
19:36:40 (3648): No heartbeat from core client for 30 sec - exiting
19:36:42 (3648): No heartbeat from core client for 30 sec - exiting
19:36:43 (3648): No heartbeat from core client for 30 sec - exiting
19:36:44 (3648): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2296, selfPID=3644, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 6
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4700, selfPID=2068, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5496, selfPID=2304, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5988, selfPID=3552, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6096, selfPID=3852, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_301d_1981_1_008213267_2_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_301d_1981_1_008213267_2_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_301d_1981_1_008213267_2_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_301d_1981_1_008213267_2_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_301d_1981_1_008213267_2_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_301d_1981_1_008213267_2_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
14 Oct 2012 14:52:13 1229783 15336185 hadam3p_pnw_301d_1981_1_008213267_2 69,216 153,326 2.2152
12 Oct 2012 18:42:49 1229783 15336185 hadam3p_pnw_301d_1981_1_008213267_2 57,696 128,148 2.2211
11 Oct 2012 13:31:35 1229783 15336185 hadam3p_pnw_301d_1981_1_008213267_2 46,176 102,397 2.2175
10 Oct 2012 14:07:12 1229783 15336185 hadam3p_pnw_301d_1981_1_008213267_2 34,656 76,670 2.2123
09 Oct 2012 18:05:25 1229783 15336185 hadam3p_pnw_301d_1981_1_008213267_2 23,136 51,351 2.2195
06 Oct 2012 16:39:53 1229783 15336185 hadam3p_pnw_301d_1981_1_008213267_2 11,616 26,114 2.2481


©2024 climateprediction.net