Task 13687465

Name	hadam3p_saf_7c0r_2002_1_007572516_0
Workunit	7750646
Created	2 Dec 2011, 15:36:35 UTC
Sent	17 Dec 2011, 18:26:13 UTC
Report deadline	28 Nov 2012, 23:46:13 UTC
Received	27 Jan 2012, 20:56:13 UTC
Server state	Over
Outcome	Success
Client state	Done
Exit status	0 (0x00000000)
Computer ID	870485
Run time	3 days 23 hours 41 min 40 sec
CPU time	3 days 23 hours 41 min 40 sec
Validate state	Workunit error - check skipped
Credit	2,244.09
Device peak FLOPS	2.30 GFLOPS
Application version	UK Met Office HadAM3P-HadRM3P Southern Africa v6.09 windows_intelx86
Stderr	<core_client_version>5.10.45</core_client_version> <![CDATA[ <stderr_txt> Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5732, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6108, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5760, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4756, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5296, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4860, selfPID=1056, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5952, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5812, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5324, selfPID=5392, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1360, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4908, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4956, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=2 13:19:11 (944): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
CGntroller:: CPDN process is not running, exiting, bRetVal ng,= 1, checkPID=0, selfPID==0, selfPID=4100 ,odiMonCtr=2 detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5528, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5748, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5076, selfPID=4864, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5908, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5960, selfPID=3476, iMonCtr=1 Model crash detected, will try to restart... 15:34:24 (4276): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5748, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5664, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5592, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4572, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4296, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5072, selfPID=6084, iMonCtr=1 Model crash detected, will try to restart... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5004, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5672, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6132, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... GGlobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4304, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6120, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4328, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
25 Jan 2012 22:21:26	870485	13687465	hadam3p_saf_7c0r_2002_1_007572516_0	138,336	343,956	2.4864
08 Jan 2012 21:19:59	870485	13687465	hadam3p_saf_7c0r_2002_1_007572516_0	126,816	315,135	2.4850
07 Jan 2012 17:49:18	870485	13687465	hadam3p_saf_7c0r_2002_1_007572516_0	115,296	286,760	2.4872
04 Jan 2012 20:25:42	870485	13687465	hadam3p_saf_7c0r_2002_1_007572516_0	103,776	258,410	2.4901
01 Jan 2012 22:32:20	870485	13687465	hadam3p_saf_7c0r_2002_1_007572516_0	92,256	230,219	2.4954
31 Dec 2011 12:02:11	870485	13687465	hadam3p_saf_7c0r_2002_1_007572516_0	80,748	202,156	2.5035
30 Dec 2011 15:33:02	870485	13687465	hadam3p_saf_7c0r_2002_1_007572516_0	80,736	201,728	2.4986
29 Dec 2011 19:13:54	870485	13687465	hadam3p_saf_7c0r_2002_1_007572516_0	69,216	173,385	2.5050
27 Dec 2011 19:45:50	870485	13687465	hadam3p_saf_7c0r_2002_1_007572516_0	57,696	144,655	2.5072
26 Dec 2011 16:11:32	870485	13687465	hadam3p_saf_7c0r_2002_1_007572516_0	46,176	115,764	2.5070
22 Dec 2011 22:02:56	870485	13687465	hadam3p_saf_7c0r_2002_1_007572516_0	34,656	86,886	2.5071
21 Dec 2011 11:38:37	870485	13687465	hadam3p_saf_7c0r_2002_1_007572516_0	23,136	58,072	2.5100
19 Dec 2011 15:04:12	870485	13687465	hadam3p_saf_7c0r_2002_1_007572516_0	11,616	29,388	2.5300
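
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. That relationship is an inference from the figures above, not something the page states. A minimal Python sketch that checks it against a few rows copied from the table (the names `TRICKLES` and `seconds_per_timestep` are illustrative only):

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
# Assumption: Average (sec/TS) = CPU Time (sec) / Timestep, both cumulative.
TRICKLES = [
    (138336, 343956, 2.4864),
    (80748, 202156, 2.5035),
    (80736, 201728, 2.4986),
    (11616, 29388, 2.5300),
]


def seconds_per_timestep(timestep, cpu_time_sec):
    """Return cumulative CPU seconds spent per model timestep."""
    return cpu_time_sec / timestep


if __name__ == "__main__":
    for timestep, cpu_time_sec, reported in TRICKLES:
        computed = seconds_per_timestep(timestep, cpu_time_sec)
        print(f"{timestep:>7} {cpu_time_sec:>7} "
              f"computed={computed:.4f} reported={reported:.4f}")
```

If the assumption holds, the computed and reported values agree to four decimal places for every row checked.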