Task 12117794

Name	hadam3p_saf_0om9_1992_1_006845881_1
Workunit	7049197
Created	18 Nov 2010, 17:53:39 UTC
Sent	18 Nov 2010, 22:30:05 UTC
Report deadline	1 Nov 2011, 3:50:05 UTC
Received	8 Dec 2010, 0:50:42 UTC
Server state	Over
Outcome	Didn't need
Client state	Done
Exit status	0 (0x00000000)
Computer ID	1116757
Run time	4 days 18 hours 27 min 37 sec
CPU time	4 days 11 hours 5 min 16 sec
Validate state	Initial
Credit	2,244.09
Device peak FLOPS	2.34 GFLOPS
Application version	UK Met Office HadAM3P-HadRM3P Southern Africa v6.08 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... 19:35:52 (4792): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5044, selfPID=5044, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3516, iMonCtr=2 11:57:24 (3516): No heartbeat from core client for 30 sec - exiting 11:57:25 (3516): No heartbeat from core client for 30 sec - exiting 11:57:26 (3516): No heartbeat from core client for 30 sec - exiting 11:57:27 (3516): No heartbeat from core client for 30 sec - exiting 11:57:28 (3516): No heartbeat from core client for 30 sec - exiting 11:58:06 (3516): No heartbeat from core client for 30 sec - exiting 11:58:07 (3516): No heartbeat from core client for 30 sec - exiting 11:58:08 (3516): No heartbeat from core client for 30 sec - exiting 11:58:09 (3516): No heartbeat from core client for 30 sec - exiting 11:58:10 (3516): No heartbeat from core client for 30 sec - exiting 11:58:11 (3516): No heartbeat from core client for 30 sec - exiting 11:58:12 (3516): No heartbeat from core client for 30 sec - exiting 11:58:13 (3516): No heartbeat from core client for 30 sec - exiting 11:58:14 (3516): No heartbeat from core client for 30 sec - exiting 11:58:15 (3516): No heartbeat from core client for 30 sec - exiting 11:58:16 (3516): No heartbeat from core client for 30 sec - exiting 11:58:17 (3516): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4464, selfPID=2164, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2140, selfPID=3952, iMonCtr=1 Model crash detected, will try to restart... 10:22:58 (2256): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5260, selfPID=5260, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4648, selfPID=3668, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4980, selfPID=4620, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4420, selfPID=4420, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4212, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2492, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4800, selfPID=4404, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4916, selfPID=4748, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4948, selfPID=4688, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4984, selfPID=4976, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4524, selfPID=4524, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5332, selfPID=4672, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3532, selfPID=3944, iMonCtr=1 Model crash detected, will try to restart... 18:06:39 (3640): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4056, selfPID=4056, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3040, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2384, selfPID=2612, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4392, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4496, selfPID=4212, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 18:53:59 (2316): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3700, selfPID=2272, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5272, selfPID=4660, iMonCtr=1 Model crash detected, will try to restart... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1332, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2712, selfPID=2864, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1532, selfPID=2088, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2744, selfPID=2744, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3544, selfPID=5580, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2464, selfPID=2772, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5956, selfPID=4420, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 01:04:11 (2360): called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
06 Dec 2010 23:54:42	1116757	12117794	hadam3p_saf_0om9_1992_1_006845881_1	138,336	384,685	2.7808
05 Dec 2010 13:01:28	1116757	12117794	hadam3p_saf_0om9_1992_1_006845881_1	126,816	346,316	2.7309
03 Dec 2010 19:05:13	1116757	12117794	hadam3p_saf_0om9_1992_1_006845881_1	115,296	310,608	2.6940
01 Dec 2010 21:08:41	1116757	12117794	hadam3p_saf_0om9_1992_1_006845881_1	103,776	276,594	2.6653
30 Nov 2010 22:11:35	1116757	12117794	hadam3p_saf_0om9_1992_1_006845881_1	92,256	247,975	2.6879
30 Nov 2010 16:23:10	1116757	12117794	hadam3p_saf_0om9_1992_1_006845881_1	80,736	220,927	2.7364
29 Nov 2010 14:26:04	1116757	12117794	hadam3p_saf_0om9_1992_1_006845881_1	69,216	193,301	2.7927
28 Nov 2010 13:48:23	1116757	12117794	hadam3p_saf_0om9_1992_1_006845881_1	57,696	162,107	2.8097
26 Nov 2010 16:26:02	1116757	12117794	hadam3p_saf_0om9_1992_1_006845881_1	46,176	130,065	2.8167
24 Nov 2010 18:22:41	1116757	12117794	hadam3p_saf_0om9_1992_1_006845881_1	34,664	97,670	2.8176
24 Nov 2010 18:07:05	1116757	12117794	hadam3p_saf_0om9_1992_1_006845881_1	34,656	97,221	2.8053
22 Nov 2010 20:56:33	1116757	12117794	hadam3p_saf_0om9_1992_1_006845881_1	23,136	64,631	2.7935
21 Nov 2010 11:51:02	1116757	12117794	hadam3p_saf_0om9_1992_1_006845881_1	11,616	32,673	2.8128