Task 13030504

Name	hadam3p_eu_2tvr_1960_1_007305769_1
Workunit	7503193
Created	30 Jun 2011, 15:26:08 UTC
Sent	30 Jun 2011, 15:26:17 UTC
Report deadline	11 Jun 2012, 20:46:17 UTC
Received	28 Feb 2012, 12:44:17 UTC
Server state	Over
Outcome	Success
Client state	Done
Exit status	0 (0x00000000)
Computer ID	953840
Run time	7 days 21 hours 3 min 15 sec
CPU time	7 days 21 hours 3 min 15 sec
Validate state	Workunit error - check skipped
Credit	2,386.50
Device peak FLOPS	1.98 GFLOPS
Application version	UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86
Stderr	<core_client_version>6.4.5</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4216, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5968, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2268, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4964, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5328, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6132, selfPID=820, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5252, selfPID=5252, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2040, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4960, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5888, selfPID=4992, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... ContrController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4496, selfPID=5880, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5792, selfPID=4884, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5488, selfPID=4356, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6056, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2316, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5352, selfPID=4024, iMonCtr=1 Model crash detected, will try to restart... 18:40:14 (1120): No heartbeat from core client for 30 sec - exiting 18:40:15 (1120): No heartbeat from core client for 30 sec - exiting 18:40:16 (1120): No heartbeat from core client for 30 sec - exiting 18:40:17 (1120): No heartbeat from core client for 30 sec - exiting 18:40:18 (1120): No heartbeat from core client for 30 sec - exiting 18:40:19 (1120): No heartbeat from core client for 30 sec - exiting 18:40:20 (1120): No heartbeat from core client for 30 sec - exiting 18:40:21 (1120): No heartbeat from core client for 30 sec - exiting 18:40:22 (1120): No heartbeat from core client for 30 sec - exiting 18:40:23 (1120): No heartbeat from core client for 30 sec - exiting 18:40:24 (1120): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2152, selfPID=4872, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2816, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5616, iMonCtr=2 CPDN Monitor - Quit request fRegional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4212, selfPID=4460, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Atmos Restart file copy failed on atmos_restart.day Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2316, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3544, selfPID=5848, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4840, selfPID=4424, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3112, iMonCtr=2 Model crash detected, will try to restart... 18:13:11 (4116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4048, selfPID=2440, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4424, iMonCtr=2 Model crash detected, will try to restart... CCPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2148, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4692, selfPID=2652, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3836, iMonCtr=2 Model crash detected, will try to restart... CCPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5472, selfPID=5472, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4612, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5660, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5940, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5960, selfPID=4924, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4128, selfPID=2340, iMonCtr=1 Model crash detected, will try to restart... 20:11:16 (156): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3812, iMonCtr=2 Model crash detected, will try to restart... 20:19:02 (4224): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:37:04 (4408): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5652, selfPID=5652, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4704, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4712, selfPID=940, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4576, selfPID=4268, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
28 Feb 2012 08:26:13	953840	13030504	hadam3p_eu_2tvr_1960_1_007305769_1	138,342	679,604	4.9125
27 Feb 2012 13:07:25	953840	13030504	hadam3p_eu_2tvr_1960_1_007305769_1	138,336	678,796	4.9069
02 Jan 2012 13:20:57	953840	13030504	hadam3p_eu_2tvr_1960_1_007305769_1	126,816	623,161	4.9139
26 Dec 2011 19:43:16	953840	13030504	hadam3p_eu_2tvr_1960_1_007305769_1	115,296	564,854	4.8992
22 Dec 2011 10:24:02	953840	13030504	hadam3p_eu_2tvr_1960_1_007305769_1	103,776	506,469	4.8804
18 Dec 2011 14:13:36	953840	13030504	hadam3p_eu_2tvr_1960_1_007305769_1	92,256	452,158	4.9011
05 Dec 2011 19:09:08	953840	13030504	hadam3p_eu_2tvr_1960_1_007305769_1	80,736	395,245	4.8955
08 Nov 2011 18:52:55	953840	13030504	hadam3p_eu_2tvr_1960_1_007305769_1	69,216	339,051	4.8984
31 Oct 2011 17:32:47	953840	13030504	hadam3p_eu_2tvr_1960_1_007305769_1	57,703	283,397	4.9113
31 Oct 2011 17:15:32	953840	13030504	hadam3p_eu_2tvr_1960_1_007305769_1	57,696	282,547	4.8972
19 Sep 2011 15:47:57	953840	13030504	hadam3p_eu_2tvr_1960_1_007305769_1	46,176	226,369	4.9023
01 Sep 2011 15:51:31	953840	13030504	hadam3p_eu_2tvr_1960_1_007305769_1	34,656	170,472	4.9190
22 Aug 2011 17:11:49	953840	13030504	hadam3p_eu_2tvr_1960_1_007305769_1	23,136	114,433	4.9461
25 Jul 2011 19:43:04	953840	13030504	hadam3p_eu_2tvr_1960_1_007305769_1	11,624	58,168	5.0041
25 Jul 2011 19:39:52	953840	13030504	hadam3p_eu_2tvr_1960_1_007305769_1	11,616	57,381	4.9398