climateprediction.net home page
Task 14421611

Task 14421611

Name hadam3p_pnw_b0sz_1962_1_007886774_0
Workunit 8041886
Created 16 Apr 2012, 19:30:50 UTC
Sent 21 May 2012, 5:21:38 UTC
Report deadline 3 May 2013, 10:41:38 UTC
Received 28 May 2012, 8:17:52 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1204123
Run time 4 days 18 hours 46 min 54 sec
CPU time 3 days 8 hours 35 min 47 sec
Validate state Invalid
Credit 1,253.67
Device peak FLOPS 1.38 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09
windows_intelx86
Stderr
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5288, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3584, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3644, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3588, selfPID=1716, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
Called boinc_finish
01:58:16 (3480): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4372, selfPID=4372, iMonCtr=2
13:05:24 (4992): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:05:25 (4992): No heartbeat from core client for 30 sec - exiting
13:05:26 (4992): No heartbeat from core client for 30 sec - exiting
13:05:27 (4992): No heartbeat from core client for 30 sec - exiting
13:05:28 (4992): No heartbeat from core client for 30 sec - exiting
13:05:29 (4992): No heartbeat from core client for 30 sec - exiting
13:05:30 (4992): No heartbeat from core client for 30 sec - exiting
13:05:31 (4992): No heartbeat from core client for 30 sec - exiting
13:05:32 (4992): No heartbeat from core client for 30 sec - exiting
22:32:21 (1584): No heartbeat from core client for 30 sec - exiting
22:32:22 (1584): No heartbeat from core client for 30 sec - exiting
22:32:23 (1584): No heartbeat from core client for 30 sec - exiting
22:32:24 (1584): No heartbeat from core client for 30 sec - exiting
22:32:25 (1584): No heartbeat from core client for 30 sec - exiting
22:32:26 (1584): No heartbeat from core client for 30 sec - exiting
22:32:27 (1584): No heartbeat from core client for 30 sec - exiting
22:32:28 (1584): No heartbeat from core client for 30 sec - exiting
22:32:29 (1584): No heartbeat from core client for 30 sec - exiting
22:32:30 (1584): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2124, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
Colobal ntroll:::: CPDN process isot rt running, exiting, bRetl =  = 1, checkPID=0, selfPID=3236, iMotr=r=2

Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
GClontroller::bal Workere:: CPDN process is not running, exiting,, checkaID=0, selheckPID=0, selfPID=2244, iMInCtr=2
, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2208, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5364, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5376, selfPID=2812, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 5
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_b0sz_1962_1_007886774_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b0sz_1962_1_007886774_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b0sz_1962_1_007886774_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b0sz_1962_1_007886774_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b0sz_1962_1_007886774_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b0sz_1962_1_007886774_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b0sz_1962_1_007886774_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
27 May 2012 17:11:44 1204123 14421611 hadam3p_pnw_b0sz_1962_1_007886774_0 57,696 274,164 4.7519
27 May 2012 01:49:53 1204123 14421611 hadam3p_pnw_b0sz_1962_1_007886774_0 46,176 219,779 4.7596
26 May 2012 10:17:47 1204123 14421611 hadam3p_pnw_b0sz_1962_1_007886774_0 34,656 164,945 4.7595
25 May 2012 18:48:30 1204123 14421611 hadam3p_pnw_b0sz_1962_1_007886774_0 23,136 110,163 4.7615
21 May 2012 22:54:59 1204123 14421611 hadam3p_pnw_b0sz_1962_1_007886774_0 11,616 55,599 4.7864


©2024 climateprediction.net