Task 14931831

Name	hadam3p_eu_972c_1965_1_008058270_0
Workunit	8213384
Created	17 Jul 2012, 23:34:07 UTC
Sent	17 Jul 2012, 23:36:08 UTC
Report deadline	30 Jun 2013, 4:56:08 UTC
Received	29 Jul 2012, 13:17:52 UTC
Server state	Over
Outcome	Success
Client state	Done
Exit status	0 (0x00000000)
Computer ID	1176562
Run time	5 days 22 hours 18 min 30 sec
CPU time	5 days 3 hours 3 min 54 sec
Validate state	Workunit error - check skipped
Credit	2,386.39
Device peak FLOPS	2.67 GFLOPS
Application version	UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86
Stderr	<core_client_version>7.0.28</core_client_version> <![CDATA[ <stderr_txt> Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8144, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7580, selfPID=8180, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4396, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5788, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3636, selfPID=4272, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4748, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4360, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1888, selfPID=6092, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5676, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2672, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1896, selfPID=2740, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4756, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4984, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6000, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=396, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4900, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4960, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5644, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4564, selfPID=4512, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4700, iMonCtr=2 Model crash detected, will try to restart... 14:36:24 (3240): No heartbeat from core client for 30 sec - exiting 14:36:25 (3240): No heartbeat from core client for 30 sec - exiting 14:36:26 (3240): No heartbeat from core client for 30 sec - exiting 14:36:27 (3240): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... GSuspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6488, selfPID=4724, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3364, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=520, selfPID=5712, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4724, selfPID=5992, iMonCtr=1 Model crash detected, will try to restart... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5764, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 19:05:45 (5688): No heartbeat from core client for 30 sec - exiting 19:05:46 (5688): No heartbeat from core client for 30 sec - exiting 19:05:47 (5688): No heartbeat from core client for 30 sec - exiting 19:05:48 (5688): No heartbeat from core client for 30 sec - exiting 19:05:49 (5688): No heartbeat from core client for 30 sec - exiting 19:05:50 (5688): No heartbeat from core client for 30 sec - exiting 19:05:51 (5688): No heartbeat from core client for 30 sec - exiting 19:05:52 (5688): No heartbeat from core client for 30 sec - exiting 19:05:53 (5688): No heartbeat from core client for 30 sec - exiting 19:05:54 (5688): No heartbeat from core client for 30 sec - exiting 19:06:26 (5688): No heartbeat from core client for 30 sec - exiting 19:06:28 (5688): No heartbeat from core client for 30 sec - exiting 19:06:29 (5688): No heartbeat from core client for 30 sec - exiting 19:06:30 (5688): No heartbeat from core client for 30 sec - exiting 19:06:31 (5688): No heartbeat from core client for 30 sec - exiting 19:06:32 (5688): No heartbeat from core client for 30 sec - exiting 19:06:33 (5688): No heartbeat from core client for 30 sec - exiting 19:06:34 (5688): No heartbeat from core client for 30 sec - exiting 19:06:35 (5688): No heartbeat from core client for 30 sec - exiting 19:06:36 (5688): No heartbeat from core client for 30 sec - exiting 19:06:37 (5688): No heartbeat from core client for 30 sec - exiting 19:06:38 (5688): No heartbeat from core client for 30 sec - exiting 19:06:39 (5688): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:07:33 (4432): Can't acquire lockfile (32) - waiting 35s Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5292, selfPID=4432, iMonCtr=1 Model crash detected, will try to restart... Glontroller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4200, iMonCtr=2 Model crash detected, will try to restart... obal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4764, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2996, iMonCtr=2 Mode l crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4544, iMonCtr=2 Model crash detected, will try to restart... 04:20:30 (5276): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
29 Jul 2012 12:19:55	1176562	14931831	hadam3p_eu_972c_1965_1_008058270_0	138,336	442,252	3.1969
28 Jul 2012 12:54:51	1176562	14931831	hadam3p_eu_972c_1965_1_008058270_0	126,816	402,788	3.1762
27 Jul 2012 16:40:47	1176562	14931831	hadam3p_eu_972c_1965_1_008058270_0	115,296	365,375	3.1690
26 Jul 2012 17:38:24	1176562	14931831	hadam3p_eu_972c_1965_1_008058270_0	103,776	328,270	3.1633
25 Jul 2012 20:47:45	1176562	14931831	hadam3p_eu_972c_1965_1_008058270_0	92,256	291,039	3.1547
25 Jul 2012 08:12:06	1176562	14931831	hadam3p_eu_972c_1965_1_008058270_0	80,740	255,081	3.1593
25 Jul 2012 00:15:26	1176562	14931831	hadam3p_eu_972c_1965_1_008058270_0	80,736	254,636	3.1539
24 Jul 2012 12:04:50	1176562	14931831	hadam3p_eu_972c_1965_1_008058270_0	69,216	217,931	3.1486
22 Jul 2012 12:02:35	1176562	14931831	hadam3p_eu_972c_1965_1_008058270_0	57,696	180,643	3.1309
21 Jul 2012 09:16:24	1176562	14931831	hadam3p_eu_972c_1965_1_008058270_0	46,176	145,498	3.1509
20 Jul 2012 14:53:10	1176562	14931831	hadam3p_eu_972c_1965_1_008058270_0	34,656	109,415	3.1572
19 Jul 2012 15:42:33	1176562	14931831	hadam3p_eu_972c_1965_1_008058270_0	23,136	73,474	3.1757
18 Jul 2012 19:11:28	1176562	14931831	hadam3p_eu_972c_1965_1_008058270_0	11,616	37,988	3.2703