climateprediction.net home page
Task 18666060

Task 18666060

Name hadam3p_pnw_pn2w_2013_1_009977595_1
Workunit 9983953
Created 5 Jul 2015, 8:39:21 UTC
Sent 5 Jul 2015, 9:03:41 UTC
Report deadline 16 Jun 2016, 14:23:41 UTC
Received 12 Aug 2015, 10:37:58 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1268401
Run time 1 days 9 hours 39 min 9 sec
CPU time 1 days 1 hours 24 min 29 sec
Validate state Invalid
Credit 1,508.39
Device peak FLOPS 3.58 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v7.27
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8464, selfPID=6356, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2792, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8452, selfPID=5860, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4724, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10208, selfPID=6152, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
13:16:01 (6152): called boinc_finish(0)
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8364, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8396, selfPID=6540, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8728, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8764, selfPID=6672, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8680, selfPID=7028, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7932, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3264, selfPID=6552, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9040, selfPID=8736, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7188, selfPID=10008, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9120, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
11:34:16 (7180): No heartbeat from client for 30 sec - exiting
11:34:16 (7180): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:19:47 (2460): No heartbeat from client for 30 sec - exiting
11:19:47 (2460): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8364, selfPID=6724, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10220, selfPID=6268, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=5876, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6544, selfPID=9132, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
18:40:47 (9132): called boinc_finish(0)

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_pn2w_2013_1_009977595_1_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pn2w_2013_1_009977595_1_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pn2w_2013_1_009977595_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pn2w_2013_1_009977595_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pn2w_2013_1_009977595_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pn2w_2013_1_009977595_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pn2w_2013_1_009977595_1_13.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pn2w_2013_1_009977595_1_14.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pn2w_2013_1_009977595_1_15.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pn2w_2013_1_009977595_1_16.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pn2w_2013_1_009977595_1_17.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pn2w_2013_1_009977595_1_18.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
12 Aug 2015 09:29:05 1268401 18666060 hadam3p_pnw_pn2w_2013_1_009977595_1 69,419 86,824 1.2507
12 Aug 2015 05:31:56 1268401 18666060 hadam3p_pnw_pn2w_2013_1_009977595_1 57,899 72,424 1.2509
24 Jul 2015 02:41:36 1268401 18666060 hadam3p_pnw_pn2w_2013_1_009977595_1 46,379 57,604 1.2420
15 Jul 2015 00:39:29 1268401 18666060 hadam3p_pnw_pn2w_2013_1_009977595_1 34,859 42,860 1.2295
12 Jul 2015 05:00:49 1268401 18666060 hadam3p_pnw_pn2w_2013_1_009977595_1 23,339 27,875 1.1944
11 Jul 2015 08:08:18 1268401 18666060 hadam3p_pnw_pn2w_2013_1_009977595_1 11,819 13,696 1.1588


©2024 cpdn.org