climateprediction.net home page
Task 16735191

Task 16735191

Name hadam3p_eu_l44t_2013_1_008816582_0
Workunit 8962511
Created 8 Jul 2014, 9:18:23 UTC
Sent 30 Jul 2014, 16:34:41 UTC
Report deadline 12 Jul 2015, 21:54:41 UTC
Received 14 Aug 2014, 16:46:13 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1168062
Run time 1 days 6 hours 29 min 21 sec
CPU time 58 min 11 sec
Validate state Invalid
Credit 399.11
Device peak FLOPS 2.66 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9280, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2232, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1940, selfPID=4080, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10200, selfPID=5628, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9820, selfPID=5364, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3676, selfPID=2300, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3816, selfPID=2692, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3588, selfPID=1240, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3756, selfPID=2272, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish
18:45:56 (2456): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3708, selfPID=1924, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3772, selfPID=1296, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt><message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_l44t_2013_1_008816582_0_3.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_l44t_2013_1_008816582_0_4.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_l44t_2013_1_008816582_0_5.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_l44t_2013_1_008816582_0_6.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_l44t_2013_1_008816582_0_7.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_l44t_2013_1_008816582_0_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_l44t_2013_1_008816582_0_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_l44t_2013_1_008816582_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_l44t_2013_1_008816582_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_l44t_2013_1_008816582_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
14 Aug 2014 16:47:06 1168062 16735191 hadam3p_eu_l44t_2013_1_008816582_0 23,136 45,955 1.9863
06 Aug 2014 17:14:40 1168062 16735191 hadam3p_eu_l44t_2013_1_008816582_0 11,653 25,163 2.1594
04 Aug 2014 10:19:44 1168062 16735191 hadam3p_eu_l44t_2013_1_008816582_0 11,640 24,755 2.1267
04 Aug 2014 09:48:40 1168062 16735191 hadam3p_eu_l44t_2013_1_008816582_0 11,616 24,383 2.0991


©2024 cpdn.org