Task 12147159

Name	hadam3p_saf_0w6a_1993_1_006874874_0
Workunit	7078190
Created	19 Nov 2010, 13:37:03 UTC
Sent	13 Apr 2011, 11:50:01 UTC
Report deadline	25 Mar 2012, 17:10:01 UTC
Received	24 May 2011, 13:41:54 UTC
Server state	Over
Outcome	Success
Client state	Done
Exit status	0 (0x00000000)
Computer ID	1012620
Run time	6 days 4 hours 28 min 23 sec
CPU time	5 days 3 hours 38 min 51 sec
Validate state	Workunit error - check skipped
Credit	2,244.09
Device peak FLOPS	2.17 GFLOPS
Application version	UK Met Office HadAM3P-HadRM3P Southern Africa v6.09 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1032, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4056, iMonCtr=2 Model crash detected, will try to restart... 18:23:33 (4324): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6004, selfPID=6004, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2068, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4552, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5228, selfPID=3500, iMonCtr=1 Model crash detected, will try to restart... 20:09:57 (3508): No heartbeat from core client for 30 sec - exiting 20:09:58 (3508): No heartbeat from core client for 30 sec - exiting 20:09:59 (3508): No heartbeat from core client for 30 sec - exiting 20:10:00 (3508): No heartbeat from core client for 30 sec - exiting 20:10:01 (3508): No heartbeat from core client for 30 sec - exiting 20:10:02 (3508): No heartbeat from core client for 30 sec - exiting 20:10:03 (3508): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5348, selfPID=5280, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3912, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3052, selfPID=2312, iMonCtr=1 Model crash detected, will try to restart... 15:36:18 (4436): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:48:27 (4792): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:23:11 (4296): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4320, iMonCtr=2 Model crash detected, will try to restart... 19:39:58 (4424): No heartbeat from core client for 30 sec - exiting 19:39:59 (4424): No heartbeat from core client for 30 sec - exiting 19:40:01 (4424): No heartbeat from core client for 30 sec - exiting 19:40:02 (4424): No heartbeat from core client for 30 sec - exiting 19:40:03 (4424): No heartbeat from core client for 30 sec - exiting 19:40:04 (4424): No heartbeat from core client for 30 sec - exiting 19:40:05 (4424): No heartbeat from core client for 30 sec - exiting 19:40:06 (4424): No heartbeat from core client for 30 sec - exiting 19:40:07 (4424): No heartbeat from core client for 30 sec - exiting 19:40:08 (4424): No heartbeat from core client for 30 sec - exiting 19:40:09 (4424): No heartbeat from core client for 30 sec - exiting 19:40:10 (4424): No heartbeat from core client for 30 sec - exiting 19:40:12 (4424): No heartbeat from core client for 30 sec - exiting 19:40:13 (4424): No heartbeat from core client for 30 sec - exiting 19:40:14 (4424): No heartbeat from core client for 30 sec - exiting 19:40:15 (4424): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5860, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5836, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5664, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3484, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4684, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5836, iMonCtr=2 CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5016, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4860, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=212, selfPID=4228, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2364, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5544, iMonCtr=2 Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
24 May 2011 13:25:34	1012620	12147159	hadam3p_saf_0w6a_1993_1_006874874_0	138,336	444,534	3.2134
21 May 2011 17:13:36	1012620	12147159	hadam3p_saf_0w6a_1993_1_006874874_0	126,816	412,782	3.2550
21 May 2011 06:24:23	1012620	12147159	hadam3p_saf_0w6a_1993_1_006874874_0	115,296	378,385	3.2819
17 May 2011 13:07:30	1012620	12147159	hadam3p_saf_0w6a_1993_1_006874874_0	103,776	341,037	3.2863
09 May 2011 19:41:50	1012620	12147159	hadam3p_saf_0w6a_1993_1_006874874_0	92,256	305,356	3.3099
07 May 2011 20:34:53	1012620	12147159	hadam3p_saf_0w6a_1993_1_006874874_0	80,736	268,222	3.3222
07 May 2011 10:43:05	1012620	12147159	hadam3p_saf_0w6a_1993_1_006874874_0	69,216	231,636	3.3466
07 May 2011 10:43:05	1012620	12147159	hadam3p_saf_0w6a_1993_1_006874874_0	57,696	194,203	3.3660
28 Apr 2011 14:21:46	1012620	12147159	hadam3p_saf_0w6a_1993_1_006874874_0	46,176	157,031	3.4007
27 Apr 2011 10:35:09	1012620	12147159	hadam3p_saf_0w6a_1993_1_006874874_0	34,656	119,593	3.4509
22 Apr 2011 17:27:45	1012620	12147159	hadam3p_saf_0w6a_1993_1_006874874_0	23,139	81,744	3.5327
22 Apr 2011 17:27:45	1012620	12147159	hadam3p_saf_0w6a_1993_1_006874874_0	23,136	81,219	3.5105
22 Apr 2011 17:27:45	1012620	12147159	hadam3p_saf_0w6a_1993_1_006874874_0	11,616	40,906	3.5215