Task 16797156

Name hadam3p_eu_p4av_2013_1_008877120_0
Workunit 9023049
Created 9 Jul 2014, 16:51:03 UTC
Sent 11 Jul 2014, 15:40:21 UTC
Report deadline 23 Jun 2015, 21:00:21 UTC
Received 19 Aug 2014, 0:28:34 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 1329805
Run time 13 days 2 hours 41 min 37 sec
CPU time 9 days 22 hours 20 min 56 sec
Validate state Workunit error - check skipped
Credit 2,386.39
Device peak FLOPS 1.53 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3132, selfPID=6412, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
14:42:06 (5840): No heartbeat from core client for 30 sec - exiting
14:42:07 (5840): No heartbeat from core client for 30 sec - exiting
14:42:08 (5840): No heartbeat from core client for 30 sec - exiting
14:42:09 (5840): No heartbeat from core client for 30 sec - exiting
14:42:10 (5840): No heartbeat from core client for 30 sec - exiting
14:42:11 (5840): No heartbeat from core client for 30 sec - exiting
14:42:12 (5840): No heartbeat from core client for 30 sec - exiting
14:42:13 (5840): No heartbeat from core client for 30 sec - exiting
14:42:14 (5840): No heartbeat from core client for 30 sec - exiting
14:42:15 (5840): No heartbeat from core client for 30 sec - exiting
14:42:16 (5840): No heartbeat from core client for 30 sec - exiting
14:42:17 (5840): No heartbeat from core client for 30 sec - exiting
14:42:18 (5840): No heartbeat from core client for 30 sec - exiting
14:42:19 (5840): No heartbeat from core client for 30 sec - exiting
14:42:20 (5840): No heartbeat from core client for 30 sec - exiting
14:42:21 (5840): No heartbeat from core client for 30 sec - exiting
14:42:22 (5840): No heartbeat from core client for 30 sec - exiting
14:42:23 (5840): No heartbeat from core client for 30 sec - exiting
14:42:24 (5840): No heartbeat from core client for 30 sec - exiting
14:42:25 (5840): No heartbeat from core client for 30 sec - exiting
14:42:27 (5840): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6656, selfPID=5968, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5636, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8084, iMonCtr=2
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3284, selfPID=5860, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6800, selfPID=7824, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7964, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7972, selfPID=4412, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7624, selfPID=6120, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7436, selfPID=4108, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6468, selfPID=3032, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
11:55:29 (3668): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8016, selfPID=8016, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2852, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5944, selfPID=5400, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7336, selfPID=5392, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Colobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6324, iMonCtr=2
ntroller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2460, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7932, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7940, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6184, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3344, selfPID=5780, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
16:19:44 (5660): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3228, selfPID=7964, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6992, selfPID=6540, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3820, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3172, selfPID=3888, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
10:36:17 (5732): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7572, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2412, selfPID=7136, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5208, selfPID=3772, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6324, selfPID=4284, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5736, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5712, selfPID=5480, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6620, selfPID=1184, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8012, selfPID=2176, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6852, selfPID=6672, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1244, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3596, selfPID=6120, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7348, selfPID=5756, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7624, selfPID=2320, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7172, selfPID=5428, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4664, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4044, selfPID=5036, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
]]>
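A stderr dump like the one above is easier to read once the repeated messages are tallied. The following is a minimal sketch, assuming the log is available as plain text; the category names (`restart_attempt`, `client_heartbeat_lost`, and so on) are illustrative labels chosen here, not BOINC or CPDN terminology, and the sample lines are copied from this task's output.

```python
import re
from collections import Counter

# A few representative lines copied from the stderr output of this task.
SAMPLE = """\
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3132, selfPID=6412, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
14:42:06 (5840): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Called boinc_finish
"""

# Hypothetical category labels mapped to the message text seen in the log.
PATTERNS = {
    "process_not_running": re.compile(r"CPDN process is not running"),
    "restart_attempt": re.compile(r"Model crash detected"),
    "client_heartbeat_lost": re.compile(r"No heartbeat from core client"),
    "monitor_heartbeat_lost": re.compile(r"No 'heartbeat' from BOINC"),
    "suspend_request": re.compile(r"Suspend request from BOINC"),
    "quit_request": re.compile(r"Quit request from BOINC"),
    "monitor_exit": re.compile(r"Leaving CPDN_Main::Monitor"),
    "finished": re.compile(r"Called boinc_finish"),
}


def classify(line):
    """Return the first matching category label, or None if nothing matches."""
    for label, pattern in PATTERNS.items():
        if pattern.search(line):
            return label
    return None


def summarize(text):
    """Count how many lines fall into each category."""
    counts = Counter()
    for line in text.splitlines():
        label = classify(line)
        if label is not None:
            counts[label] += 1
    return counts


if __name__ == "__main__":
    for label, n in summarize(SAMPLE).most_common():
        print(f"{label}: {n}")
```

Running `summarize` over the full text between `<stderr_txt>` and `</stderr_txt>` would give a per-message count for the whole run. Lines where output from two processes is interleaved (for example `GController::`) still match on the message body, because the patterns do not anchor on the process prefix.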
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                         Timestep  CPU Time (sec)  Average (sec/TS)
14 Aug 2014 17:49:11  1329805  16797156   hadam3p_eu_p4av_2013_1_008877120_0  138,336   856,720         6.1930
14 Aug 2014 17:49:11  1329805  16797156   hadam3p_eu_p4av_2013_1_008877120_0  126,816   788,607         6.2185
14 Aug 2014 17:49:11  1329805  16797156   hadam3p_eu_p4av_2013_1_008877120_0  115,296   719,953         6.2444
07 Aug 2014 18:04:44  1329805  16797156   hadam3p_eu_p4av_2013_1_008877120_0  103,776   652,835         6.2908
04 Aug 2014 19:59:52  1329805  16797156   hadam3p_eu_p4av_2013_1_008877120_0  92,256    578,111         6.2664
02 Aug 2014 01:02:44  1329805  16797156   hadam3p_eu_p4av_2013_1_008877120_0  80,736    502,930         6.2293
28 Jul 2014 15:17:00  1329805  16797156   hadam3p_eu_p4av_2013_1_008877120_0  69,219    428,074         6.1843
28 Jul 2014 06:57:22  1329805  16797156   hadam3p_eu_p4av_2013_1_008877120_0  69,216    427,123         6.1709
23 Jul 2014 20:58:44  1329805  16797156   hadam3p_eu_p4av_2013_1_008877120_0  57,696    356,298         6.1754
20 Jul 2014 20:09:28  1329805  16797156   hadam3p_eu_p4av_2013_1_008877120_0  46,176    284,426         6.1596
18 Jul 2014 02:50:22  1329805  16797156   hadam3p_eu_p4av_2013_1_008877120_0  34,656    212,550         6.1331
16 Jul 2014 00:13:48  1329805  16797156   hadam3p_eu_p4av_2013_1_008877120_0  23,136    142,347         6.1526
13 Jul 2014 20:07:12  1329805  16797156   hadam3p_eu_p4av_2013_1_008877120_0  11,616    72,441          6.2363
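The "Average (sec/TS)" column in the trickle table appears to be the cumulative CPU time divided by the timestep count. The short sketch below checks that reading against three rows copied from the table; the tuple layout and the function name `sec_per_timestep` are assumptions made for illustration, not part of any CPDN tool.

```python
# (timestep, cpu_time_sec, reported_average) copied from three trickle rows.
TRICKLES = [
    (138_336, 856_720, 6.1930),
    (69_219, 428_074, 6.1843),
    (11_616, 72_441, 6.2363),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds spent per model timestep."""
    return cpu_time_sec / timestep


def check(rows, tolerance=1e-4):
    """Return True if every computed average matches the reported one."""
    return all(
        abs(sec_per_timestep(cpu, ts) - reported) < tolerance
        for ts, cpu, reported in rows
    )


if __name__ == "__main__":
    for ts, cpu, reported in TRICKLES:
        computed = sec_per_timestep(cpu, ts)
        print(f"timestep {ts}: computed {computed:.4f}, reported {reported:.4f}")
    print("all rows consistent:", check(TRICKLES))
```

For the sampled rows the computed value agrees with the reported one to about four decimal places, which supports the interpretation that the column is a running average over the whole task rather than a per-trickle figure.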

