Task 19241612

Name hadam3p_anz_k2b1_201212_12_306_010265838_1
Workunit 10265838
Created 29 Jan 2016, 22:43:58 UTC
Sent 30 Jan 2016, 11:27:48 UTC
Report deadline 11 Jan 2017, 16:47:48 UTC
Received 26 Feb 2016, 11:41:10 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1353778
Run time 5 days 19 hours 26 min 58 sec
CPU time 4 days 23 hours 48 min 24 sec
Validate state Invalid
Credit 4,981.10
Device peak FLOPS 3.52 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 (windows_intelx86)
Stderr
<core_client_version>7.6.22</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3696, selfPID=208, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4616, selfPID=2676, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2364, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2172, selfPID=3508, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4976, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4996, selfPID=4012, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=608, selfPID=3024, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5544, selfPID=1172, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
19:28:32 (3252): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2660, selfPID=1524, iMonCtr=1
Model crash detected, will try to restart...
15:09:24 (3624): No heartbeat from core client for 30 sec - exiting
15:09:25 (3624): No heartbeat from core client for 30 sec - exiting
15:09:26 (3624): No heartbeat from core client for 30 sec - exiting
15:09:27 (3624): No heartbeat from core client for 30 sec - exiting
15:09:28 (3624): No heartbeat from core client for 30 sec - exiting
15:09:30 (3624): No heartbeat from core client for 30 sec - exiting
15:09:31 (3624): No heartbeat from core client for 30 sec - exiting
15:09:32 (3624): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3624, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4148, selfPID=3484, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4040, selfPID=2556, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4432, selfPID=2148, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4712, selfPID=1048, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4220, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1588, selfPID=3540, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1740, selfPID=3172, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1136, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3504, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2204, selfPID=1024, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2080, selfPID=3152, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_anz_k2b1_201212_12_306_010265838/dataout/atmos_restart.day after 11 attempts
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3972, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_k2b1_201212_12_306_010265838_1_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_k2b1_201212_12_306_010265838_1_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
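The stderr output above mixes a few recurring line types: monitor exits, crash/restart notices, and heartbeat timeouts. A minimal sketch of how such a log could be tallied follows. The function name `summarize_stderr`, the category labels, and the regular expressions are illustrative assumptions based only on the line shapes seen in this log, not part of any CPDN or BOINC tool.

```python
import re
from collections import Counter

# Patterns inferred from the lines in this particular log (assumption, not a spec).
EXIT_RE = re.compile(
    r"^(Controller|Global Worker):: CPDN process is not running, exiting, "
    r"bRetVal = (\d+), checkPID=(\d+), selfPID=(\d+), iMonCtr=(\d+)$"
)
HEARTBEAT_RE = re.compile(r"^\d{2}:\d{2}:\d{2} \(\d+\): No heartbeat from core client")

def summarize_stderr(text):
    """Count the recurring event types in a CPDN stderr dump."""
    counts = Counter()
    for line in text.splitlines():
        line = line.strip()
        match = EXIT_RE.match(line)
        if match:
            counts[match.group(1) + " exit"] += 1
        elif line.startswith("Model crash detected"):
            counts["crash restart"] += 1
        elif HEARTBEAT_RE.match(line):
            counts["heartbeat loss"] += 1
    return counts

# A few lines copied verbatim from the log above.
SAMPLE = """\
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3696, selfPID=208, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2364, iMonCtr=2
15:09:24 (3624): No heartbeat from core client for 30 sec - exiting
"""

if __name__ == "__main__":
    for event, count in summarize_stderr(SAMPLE).items():
        print(event, count)
```

Lines that do not fit these shapes (for example the `cpdnmonitor: cannot open input file` message) are simply ignored by this sketch.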
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                                 Timestep  CPU Time (sec)  Average (sec/TS)
24 Feb 2016 17:13:46  1353778  19241612   hadam3p_anz_k2b1_201212_12_306_010265838_1  115,499   395,611         3.4252
23 Feb 2016 10:50:18  1353778  19241612   hadam3p_anz_k2b1_201212_12_306_010265838_1  103,979   359,004         3.4527
21 Feb 2016 20:05:11  1353778  19241612   hadam3p_anz_k2b1_201212_12_306_010265838_1  92,459    321,712         3.4795
21 Feb 2016 08:15:20  1353778  19241612   hadam3p_anz_k2b1_201212_12_306_010265838_1  80,939    285,618         3.5288
19 Feb 2016 15:47:02  1353778  19241612   hadam3p_anz_k2b1_201212_12_306_010265838_1  69,419    248,737         3.5831
16 Feb 2016 19:34:20  1353778  19241612   hadam3p_anz_k2b1_201212_12_306_010265838_1  57,899    212,161         3.6643
14 Feb 2016 17:15:30  1353778  19241612   hadam3p_anz_k2b1_201212_12_306_010265838_1  46,379    174,133         3.7546
13 Feb 2016 14:39:56  1353778  19241612   hadam3p_anz_k2b1_201212_12_306_010265838_1  34,859    132,749         3.8082
12 Feb 2016 10:45:42  1353778  19241612   hadam3p_anz_k2b1_201212_12_306_010265838_1  23,339    88,550          3.7941
11 Feb 2016 11:06:48  1353778  19241612   hadam3p_anz_k2b1_201212_12_306_010265838_1  11,819    42,708          3.6135
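
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against the ten rows listed. The names `TRICKLES`, `average_sec_per_timestep`, and `find_mismatches` are hypothetical, and the formula is an inference from the numbers, not something confirmed by the project.

```python
# (timestep, cumulative CPU seconds, reported average) copied from the
# trickle table, oldest first.
TRICKLES = [
    (11_819, 42_708, 3.6135),
    (23_339, 88_550, 3.7941),
    (34_859, 132_749, 3.8082),
    (46_379, 174_133, 3.7546),
    (57_899, 212_161, 3.6643),
    (69_419, 248_737, 3.5831),
    (80_939, 285_618, 3.5288),
    (92_459, 321_712, 3.4795),
    (103_979, 359_004, 3.4527),
    (115_499, 395_611, 3.4252),
]

def average_sec_per_timestep(timestep, cpu_seconds):
    """Assumed formula: cumulative CPU seconds per model timestep."""
    return cpu_seconds / timestep

def find_mismatches(rows, tolerance=1e-4):
    """Return rows whose reported average differs from the assumed formula."""
    return [
        row for row in rows
        if abs(average_sec_per_timestep(row[0], row[1]) - row[2]) > tolerance
    ]

if __name__ == "__main__":
    for timestep, cpu_seconds, reported in TRICKLES:
        computed = average_sec_per_timestep(timestep, cpu_seconds)
        print(f"{timestep:>7}  computed={computed:.4f}  reported={reported:.4f}")
    print("mismatches:", find_mismatches(TRICKLES))
```

An empty mismatch list means every reported average agrees with CPU time divided by timestep to four decimal places, which is consistent with the assumed formula for these rows.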

