climateprediction.net home page
Task 16739121

Task 16739121

Name hadam3p_eu_l753_2013_1_008820480_0
Workunit 8966409
Created 8 Jul 2014, 10:01:21 UTC
Sent 29 Jul 2014, 12:10:53 UTC
Report deadline 11 Jul 2015, 17:30:53 UTC
Received 19 Aug 2014, 18:23:58 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1258506
Run time 1 days 4 hours 55 min
CPU time 1 days 3 hours 59 min 8 sec
Validate state Invalid
Credit 1,790.21
Device peak FLOPS 4.49 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3948, selfPID=7116, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6976, selfPID=6040, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6016, selfPID=6088, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5508, selfPID=5908, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5640, selfPID=5976, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5328, selfPID=5944, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7052, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6036, selfPID=6232, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6832, selfPID=5932, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1136, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6216, selfPID=5552, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4504, selfPID=5644, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7196, selfPID=5932, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5172, selfPID=5976, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6544, selfPID=5944, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7500, selfPID=6072, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7428, selfPID=5848, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7076, selfPID=5936, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
22:06:34 (6232): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=8008, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4404, selfPID=4404, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4404, selfPID=1520, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_l753_2013_1_008820480_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_l753_2013_1_008820480_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_l753_2013_1_008820480_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
16 Aug 2014 20:03:08 1258506 16739121 hadam3p_eu_l753_2013_1_008820480_0 103,776 91,756 0.8842
16 Aug 2014 20:03:08 1258506 16739121 hadam3p_eu_l753_2013_1_008820480_0 92,256 81,716 0.8858
16 Aug 2014 20:03:08 1258506 16739121 hadam3p_eu_l753_2013_1_008820480_0 80,736 71,224 0.8822
16 Aug 2014 20:03:08 1258506 16739121 hadam3p_eu_l753_2013_1_008820480_0 69,216 60,440 0.8732
06 Aug 2014 18:16:45 1258506 16739121 hadam3p_eu_l753_2013_1_008820480_0 57,696 50,334 0.8724
06 Aug 2014 18:16:45 1258506 16739121 hadam3p_eu_l753_2013_1_008820480_0 46,176 40,214 0.8709
04 Aug 2014 18:47:57 1258506 16739121 hadam3p_eu_l753_2013_1_008820480_0 34,656 30,163 0.8704
03 Aug 2014 16:35:04 1258506 16739121 hadam3p_eu_l753_2013_1_008820480_0 23,136 19,799 0.8558
02 Aug 2014 12:05:27 1258506 16739121 hadam3p_eu_l753_2013_1_008820480_0 11,616 9,616 0.8278


©2024 cpdn.org