climateprediction.net home page
Task 15012582

Task 15012582

Name hadam3p_pnw_b2o8_1985_1_008095531_1
Workunit 8250645
Created 27 Jul 2012, 8:27:19 UTC
Sent 27 Jul 2012, 8:46:51 UTC
Report deadline 9 Jul 2013, 14:06:51 UTC
Received 29 Jul 2012, 7:33:58 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1229481
Run time 14 hours 30 min 46 sec
CPU time 12 hours 53 min 15 sec
Validate state Invalid
Credit 252.40
Device peak FLOPS 2.94 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=440, selfPID=6724, iMonCtr=1
Model crash detected, will try to restart...
14:03:28 (4052): No heartbeat from core client for 30 sec - exiting
14:03:30 (4052): No heartbeat from core client for 30 sec - exiting
14:03:31 (4052): No heartbeat from core client for 30 sec - exiting
14:03:32 (4052): No heartbeat from core client for 30 sec - exiting
14:03:33 (4052): No heartbeat from core client for 30 sec - exiting
14:03:34 (4052): No heartbeat from core client for 30 sec - exiting
14:03:35 (4052): No heartbeat from core client for 30 sec - exiting
14:03:36 (4052): No heartbeat from core client for 30 sec - exiting
14:03:37 (4052): No heartbeat from core client for 30 sec - exiting
14:03:38 (4052): No heartbeat from core client for 30 sec - exiting
14:03:39 (4052): No heartbeat from core client for 30 sec - exiting
14:03:41 (4052): No heartbeat from core client for 30 sec - exiting
14:03:42 (4052): No heartbeat from core client for 30 sec - exiting
14:03:43 (4052): No heartbeat from core client for 30 sec - exiting
14:03:44 (4052): No heartbeat from core client for 30 sec - exiting
14:03:45 (4052): No heartbeat from core client for 30 sec - exiting
14:03:46 (4052): No heartbeat from core client for 30 sec - exiting
14:03:47 (4052): No heartbeat from core client for 30 sec - exiting
14:03:48 (4052): No heartbeat from core client for 30 sec - exiting
14:03:49 (4052): No heartbeat from core client for 30 sec - exiting
14:03:50 (4052): No heartbeat from core client for 30 sec - exiting
14:03:51 (4052): No heartbeat from core client for 30 sec - exiting
14:03:53 (4052): No heartbeat from core client for 30 sec - exiting
14:03:54 (4052): No heartbeat from core client for 30 sec - exiting
14:03:55 (4052): No heartbeat from core client for 30 sec - exiting
14:03:56 (4052): No heartbeat from core client for 30 sec - exiting
14:03:57 (4052): No heartbeat from core client for 30 sec - exiting
14:03:58 (4052): No heartbeat from core client for 30 sec - exiting
14:03:59 (4052): No heartbeat from core client for 30 sec - exiting
14:04:00 (4052): No heartbeat from core client for 30 sec - exiting
14:04:01 (4052): No heartbeat from core client for 30 sec - exiting
14:04:02 (4052): No heartbeat from core client for 30 sec - exiting
14:04:03 (4052): No heartbeat from core client for 30 sec - exiting
14:04:05 (4052): No heartbeat from core client for 30 sec - exiting
14:04:06 (4052): No heartbeat from core client for 30 sec - exiting
14:04:07 (4052): No heartbeat from core client for 30 sec - exiting
14:04:08 (4052): No heartbeat from core client for 30 sec - exiting
14:04:09 (4052): No heartbeat from core client for 30 sec - exiting
14:04:10 (4052): No heartbeat from core client for 30 sec - exiting
14:04:11 (4052): No heartbeat from core client for 30 sec - exiting
14:04:12 (4052): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5984, selfPID=5984, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3516, selfPID=4568, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_pnw_b2o8_1985_1_008095531/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_pnw_b2o8_1985_1_008095531/dataout/region_restart.day after 11 attempts

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                        
M                              f file in READ from history file for n                                                                                                    tmp/xaakg.pipe_dummy                                                                2048    
                                       tmp/xaakm.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 0
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_b2o8_1985_1_008095531_1_2.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b2o8_1985_1_008095531_1_3.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b2o8_1985_1_008095531_1_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b2o8_1985_1_008095531_1_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b2o8_1985_1_008095531_1_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b2o8_1985_1_008095531_1_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b2o8_1985_1_008095531_1_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b2o8_1985_1_008095531_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b2o8_1985_1_008095531_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b2o8_1985_1_008095531_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b2o8_1985_1_008095531_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
28 Jul 2012 00:32:31 1229481 15012582 hadam3p_pnw_b2o8_1985_1_008095531_1 11,616 32,578 2.8046


©2024 climateprediction.net