Task 14878974

Name	hadam3p_pnw_bdbr_1991_1_008033091_1
Workunit	8188205
Created	8 Jul 2012, 20:22:27 UTC
Sent	8 Jul 2012, 20:22:37 UTC
Report deadline	21 Jun 2013, 1:42:37 UTC
Received	8 Sep 2012, 15:27:50 UTC
Server state	Over
Outcome	Success
Client state	Done
Exit status	0 (0x00000000)
Computer ID	1183189
Run time	5 days 12 hours 22 min 29 sec
CPU time	5 days 4 hours 9 min 29 sec
Validate state	Workunit error - check skipped
Credit	3,005.88
Device peak FLOPS	2.31 GFLOPS
Application version	UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86
Stderr	<core_client_version>7.0.28</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:20:01 (6808): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 03:18:53 (10148): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5272, selfPID=5156, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4664, selfPID=6080, iMonCtr=1 Model crash detected, will try to restart... 15:55:24 (5452): No heartbeat from core client for 30 sec - exiting 15:55:25 (5452): No heartbeat from core client for 30 sec - exiting 15:55:26 (5452): No heartbeat from core client for 30 sec - exiting 15:55:27 (5452): No heartbeat from core client for 30 sec - exiting 15:55:28 (5452): No heartbeat from core client for 30 sec - exiting 15:55:29 (5452): No heartbeat from core client for 30 sec - exiting 15:55:30 (5452): No heartbeat from core client for 30 sec - exiting 15:55:31 (5452): No heartbeat from core client for 30 sec - exiting 15:55:32 (5452): No heartbeat from core client for 30 sec - exiting 15:55:34 (5452): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:55:35 (5452): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5432, selfPID=4784, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5668, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5392, selfPID=5048, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 1 22:26:23 (13352): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:26:26 (13352): No heartbeat from core client for 30 sec - exiting 22:26:27 (13352): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14452, selfPID=17236, iMonCtr=1 Model crash detected, will try to restart... 17:43:13 (8784): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4260, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 17:10:04 (4016): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=956, selfPID=956, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9176, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8552, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2168, selfPID=5584, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7088, selfPID=3236, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5476, iMonCtr=2 Model crash detected, will try to restart... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3544, iMonCtr=2 07:46:52 (5716): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6136, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=868, selfPID=5028, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4928, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4184, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5796, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4768, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... 21:05:27 (5004): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11968, selfPID=10400, iMonCtr=1 Model crash detected, will try to restart... 
20:26:00 (6064): No heartbeat from core client for 30 sec - exiting 20:26:01 (6064): No heartbeat from core client for 30 sec - exiting 20:26:02 (6064): No heartbeat from core client for 30 sec - exiting 20:26:03 (6064): No heartbeat from core client for 30 sec - exiting 20:26:04 (6064): No heartbeat from core client for 30 sec - exiting 20:26:05 (6064): No heartbeat from core client for 30 sec - exiting 20:26:07 (6064): No heartbeat from core client for 30 sec - exiting 20:26:08 (6064): No heartbeat from core client for 30 sec - exiting 20:26:09 (6064): No heartbeat from core client for 30 sec - exiting 20:26:10 (6064): No heartbeat from core client for 30 sec - exiting 20:26:11 (6064): No heartbeat from core client for 30 sec - exiting 20:26:12 (6064): No heartbeat from core client for 30 sec - exiting 20:26:13 (6064): No heartbeat from core client for 30 sec - exiting 20:26:14 (6064): No heartbeat from core client for 30 sec - exiting 20:26:15 (6064): No heartbeat from core client for 30 sec - exiting 20:26:16 (6064): No heartbeat from core client for 30 sec - exiting 20:26:17 (6064): No heartbeat from core client for 30 sec - exiting 20:26:19 (6064): No heartbeat from core client for 30 sec - exiting 20:26:20 (6064): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:26:21 (6064): No heartbeat from core client for 30 sec - exiting 23:42:39 (2704): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:41:37 (8432): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12636, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
Regional yearly means requires 12 input files got 8 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4784, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3936, selfPID=5148, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7792, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12256, iMonCtr=2 10:59:05 (5352): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:17:22 (3740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1060, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8596, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4896, iMonCtr=2 Model crash detected, will try to restart... 15:28:55 (5860): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:27:47 (3100): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:13:26 (7620): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... GLeaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
08 Sep 2012 15:28:18	1183189	14878974	hadam3p_pnw_bdbr_1991_1_008033091_1	138,336	446,152	3.2251
06 Sep 2012 17:30:40	1183189	14878974	hadam3p_pnw_bdbr_1991_1_008033091_1	126,816	410,793	3.2393
03 Sep 2012 00:24:19	1183189	14878974	hadam3p_pnw_bdbr_1991_1_008033091_1	115,296	374,443	3.2477
30 Aug 2012 01:04:07	1183189	14878974	hadam3p_pnw_bdbr_1991_1_008033091_1	103,776	337,382	3.2511
29 Aug 2012 00:48:27	1183189	14878974	hadam3p_pnw_bdbr_1991_1_008033091_1	92,256	300,826	3.2608
26 Aug 2012 00:03:30	1183189	14878974	hadam3p_pnw_bdbr_1991_1_008033091_1	80,736	263,928	3.2690
22 Aug 2012 00:34:23	1183189	14878974	hadam3p_pnw_bdbr_1991_1_008033091_1	69,219	225,969	3.2646
21 Aug 2012 01:53:18	1183189	14878974	hadam3p_pnw_bdbr_1991_1_008033091_1	69,216	225,381	3.2562
17 Aug 2012 14:00:55	1183189	14878974	hadam3p_pnw_bdbr_1991_1_008033091_1	57,696	188,421	3.2658
12 Aug 2012 19:29:51	1183189	14878974	hadam3p_pnw_bdbr_1991_1_008033091_1	46,176	150,730	3.2642
09 Aug 2012 02:10:35	1183189	14878974	hadam3p_pnw_bdbr_1991_1_008033091_1	34,656	111,895	3.2287
06 Aug 2012 23:58:10	1183189	14878974	hadam3p_pnw_bdbr_1991_1_008033091_1	23,136	74,922	3.2383
14 Jul 2012 21:29:40	1183189	14878974	hadam3p_pnw_bdbr_1991_1_008033091_1	11,616	37,751	3.2499
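
Note on the Average (sec/TS) column: the values in the table are consistent with cumulative CPU time divided by the timestep count, rounded to four decimals. The page itself does not state this formula, so it is an assumption. A minimal sketch that checks it against two rows copied from the table (the function name `seconds_per_timestep` is hypothetical):

```python
def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Average CPU seconds per model timestep (assumed formula: CPU time / timesteps)."""
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return cpu_time_sec / timestep


# (timestep, CPU time in seconds, reported average) taken from the trickle table above
rows = [
    (138336, 446152, 3.2251),  # 08 Sep 2012 trickle
    (11616, 37751, 3.2499),    # 14 Jul 2012 trickle
]

for timestep, cpu_time, reported in rows:
    computed = seconds_per_timestep(cpu_time, timestep)
    # Compare with the reported value, allowing for four-decimal rounding
    matches = abs(computed - reported) < 5e-5
    print(f"timestep={timestep} computed={computed:.4f} reported={reported:.4f} match={matches}")
```

Both rows agree with the reported averages to within rounding, which supports reading the column as a running average over the whole task rather than a per-interval rate.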