Task 16797156

Name	hadam3p_eu_p4av_2013_1_008877120_0
Workunit	9023049
Created	9 Jul 2014, 16:51:03 UTC
Sent	11 Jul 2014, 15:40:21 UTC
Report deadline	23 Jun 2015, 21:00:21 UTC
Received	19 Aug 2014, 0:28:34 UTC
Server state	Over
Outcome	Success
Client state	Done
Exit status	0 (0x00000000)
Computer ID	1329805
Run time	13 days 2 hours 41 min 37 sec
CPU time	9 days 22 hours 20 min 56 sec
Validate state	Workunit error - check skipped
Credit	2,386.39
Device peak FLOPS	1.53 GFLOPS
Application version	UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86
Stderr	<core_client_version>7.2.42</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3132, selfPID=6412, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 14:42:06 (5840): No heartbeat from core client for 30 sec - exiting 14:42:07 (5840): No heartbeat from core client for 30 sec - exiting 14:42:08 (5840): No heartbeat from core client for 30 sec - exiting 14:42:09 (5840): No heartbeat from core client for 30 sec - exiting 14:42:10 (5840): No heartbeat from core client for 30 sec - exiting 14:42:11 (5840): No heartbeat from core client for 30 sec - exiting 14:42:12 (5840): No heartbeat from core client for 30 sec - exiting 14:42:13 (5840): No heartbeat from core client for 30 sec - exiting 14:42:14 (5840): No heartbeat from core client for 30 sec - exiting 14:42:15 (5840): No heartbeat from core client for 30 sec - exiting 14:42:16 (5840): No heartbeat from core client for 30 sec - exiting 14:42:17 (5840): No heartbeat from core client for 30 sec - exiting 14:42:18 (5840): No heartbeat from core client for 30 sec - exiting 14:42:19 (5840): No heartbeat from core client for 30 sec - exiting 14:42:20 (5840): No heartbeat from core client for 30 sec - exiting 14:42:21 (5840): No heartbeat from core client for 30 sec - exiting 14:42:22 (5840): No heartbeat from core client for 30 sec - exiting 14:42:23 (5840): No heartbeat from core client for 30 sec - exiting 14:42:24 (5840): No heartbeat from core client for 30 sec - exiting 14:42:25 (5840): No heartbeat from core client for 30 sec - exiting 14:42:27 (5840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6656, selfPID=5968, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5636, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8084, iMonCtr=2 Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3284, selfPID=5860, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6800, selfPID=7824, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7964, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7972, selfPID=4412, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7624, selfPID=6120, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7436, selfPID=4108, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6468, selfPID=3032, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 11:55:29 (3668): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8016, selfPID=8016, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2852, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5944, selfPID=5400, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7336, selfPID=5392, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Colobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6324, iMonCtr=2 ntroller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2460, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7932, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7940, selfPID=5224, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6184, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3344, selfPID=5780, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 16:19:44 (5660): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3228, selfPID=7964, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6992, selfPID=6540, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3820, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3172, selfPID=3888, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 10:36:17 (5732): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7572, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2412, selfPID=7136, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5208, selfPID=3772, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6324, selfPID=4284, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5736, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5712, selfPID=5480, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6620, selfPID=1184, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8012, selfPID=2176, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6852, selfPID=6672, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1244, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3596, selfPID=6120, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7348, selfPID=5756, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7624, selfPID=2320, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7172, selfPID=5428, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4664, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4044, selfPID=5036, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
14 Aug 2014 17:49:11	1329805	16797156	hadam3p_eu_p4av_2013_1_008877120_0	138,336	856,720	6.1930
14 Aug 2014 17:49:11	1329805	16797156	hadam3p_eu_p4av_2013_1_008877120_0	126,816	788,607	6.2185
14 Aug 2014 17:49:11	1329805	16797156	hadam3p_eu_p4av_2013_1_008877120_0	115,296	719,953	6.2444
07 Aug 2014 18:04:44	1329805	16797156	hadam3p_eu_p4av_2013_1_008877120_0	103,776	652,835	6.2908
04 Aug 2014 19:59:52	1329805	16797156	hadam3p_eu_p4av_2013_1_008877120_0	92,256	578,111	6.2664
02 Aug 2014 01:02:44	1329805	16797156	hadam3p_eu_p4av_2013_1_008877120_0	80,736	502,930	6.2293
28 Jul 2014 15:17:00	1329805	16797156	hadam3p_eu_p4av_2013_1_008877120_0	69,219	428,074	6.1843
28 Jul 2014 06:57:22	1329805	16797156	hadam3p_eu_p4av_2013_1_008877120_0	69,216	427,123	6.1709
23 Jul 2014 20:58:44	1329805	16797156	hadam3p_eu_p4av_2013_1_008877120_0	57,696	356,298	6.1754
20 Jul 2014 20:09:28	1329805	16797156	hadam3p_eu_p4av_2013_1_008877120_0	46,176	284,426	6.1596
18 Jul 2014 02:50:22	1329805	16797156	hadam3p_eu_p4av_2013_1_008877120_0	34,656	212,550	6.1331
16 Jul 2014 00:13:48	1329805	16797156	hadam3p_eu_p4av_2013_1_008877120_0	23,136	142,347	6.1526
13 Jul 2014 20:07:12	1329805	16797156	hadam3p_eu_p4av_2013_1_008877120_0	11,616	72,441	6.2363