Task 16433015

Name	hadam3p_eu_aa8l_2013_1_008604619_0
Workunit	8751131
Created	1 Apr 2014, 15:30:02 UTC
Sent	6 Apr 2014, 11:31:46 UTC
Report deadline	19 Mar 2015, 16:51:46 UTC
Received	3 Jul 2014, 16:31:42 UTC
Server state	Over
Outcome	Success
Client state	Done
Exit status	0 (0x00000000)
Computer ID	1284092
Run time	12 days 9 hours 30 min 42 sec
CPU time	9 days 3 hours 49 min 29 sec
Validate state	Workunit error - check skipped
Credit	2,386.39
Device peak FLOPS	1.74 GFLOPS
Application version	UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86
Stderr	<core_client_version>7.2.28</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4428, selfPID=2960, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3820, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3596, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3996, selfPID=3916, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1800, selfPID=2532, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 11:23:41 (3316): No heartbeat from core client for 30 sec - exiting 11:23:43 (3316): No heartbeat from core client for 30 sec - exiting 11:23:44 (3316): No heartbeat from core client for 30 sec - exiting 11:23:45 (3316): No heartbeat from core client for 30 sec - exiting 11:23:46 (3316): No heartbeat from core client for 30 sec - exiting 11:23:47 (3316): No heartbeat from core client for 30 sec - exiting 11:23:48 (3316): No heartbeat from core client for 30 sec - exiting 11:23:49 (3316): No heartbeat from core client for 30 sec - exiting 11:23:50 (3316): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4504, selfPID=3292, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=776, selfPID=3396, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4032, selfPID=3812, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4480, selfPID=3340, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4012, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2460, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1932, selfPID=3464, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3496, selfPID=3084, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3280, iMonCtr=2 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1028, selfPID=3200, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3300, selfPID=2300, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2520, selfPID=3124, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4060, selfPID=4060, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5860, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1724, iMonCtr=2 05:51:53 (3832): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2344, selfPID=3964, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3636, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2500, iMonCtr=2 Model crash detected, will try to restart... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=800, selfPID=3144, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3636, selfPID=656, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1796, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2912, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4528, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3612, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2688, selfPID=3304, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3468, selfPID=3064, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2252, selfPID=3084, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4424, selfPID=2192, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3856, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1284, selfPID=1568, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4856, selfPID=4856, iMonCtr=2 Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
02 Jul 2014 16:18:00	1284092	16433015	hadam3p_eu_aa8l_2013_1_008604619_0	138,336	790,232	5.7124
11 Jun 2014 00:16:10	1284092	16433015	hadam3p_eu_aa8l_2013_1_008604619_0	126,823	723,461	5.7045
10 Jun 2014 09:04:15	1284092	16433015	hadam3p_eu_aa8l_2013_1_008604619_0	126,816	722,483	5.6971
10 Jun 2014 05:55:06	1284092	16433015	hadam3p_eu_aa8l_2013_1_008604619_0	115,296	656,479	5.6939
05 Jun 2014 12:59:21	1284092	16433015	hadam3p_eu_aa8l_2013_1_008604619_0	103,776	592,693	5.7113
04 Jun 2014 08:21:00	1284092	16433015	hadam3p_eu_aa8l_2013_1_008604619_0	92,256	529,965	5.7445
31 May 2014 06:56:04	1284092	16433015	hadam3p_eu_aa8l_2013_1_008604619_0	80,736	466,247	5.7750
29 May 2014 19:04:07	1284092	16433015	hadam3p_eu_aa8l_2013_1_008604619_0	69,216	402,529	5.8155
28 May 2014 08:36:18	1284092	16433015	hadam3p_eu_aa8l_2013_1_008604619_0	57,696	339,599	5.8860
24 May 2014 11:01:58	1284092	16433015	hadam3p_eu_aa8l_2013_1_008604619_0	46,176	271,514	5.8800
21 May 2014 14:27:57	1284092	16433015	hadam3p_eu_aa8l_2013_1_008604619_0	34,656	205,103	5.9183
09 May 2014 09:58:33	1284092	16433015	hadam3p_eu_aa8l_2013_1_008604619_0	23,136	137,374	5.9377
24 Apr 2014 08:14:48	1284092	16433015	hadam3p_eu_aa8l_2013_1_008604619_0	11,616	69,178	5.9554