Task 12660446

Name	hadam3p_pnw_2z0i_1959_1_007178874_1
Workunit	7377156
Created	11 Mar 2011, 12:18:07 UTC
Sent	11 Mar 2011, 15:37:34 UTC
Report deadline	21 Feb 2012, 20:57:34 UTC
Received	25 Apr 2011, 18:25:53 UTC
Server state	Over
Outcome	Success
Client state	Done
Exit status	0 (0x00000000)
Computer ID	989453
Run time	6 days 12 hours 23 min 43 sec
CPU time	5 days 22 hours 1 min 7 sec
Validate state	Workunit error - check skipped
Credit	3,005.88
Device peak FLOPS	1.80 GFLOPS
Application version	UK Met Office HadAM3P-HadRM3P Pacific North West v6.08 windows_intelx86
Stderr	<core_client_version>6.6.36</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4124, selfPID=2024, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4364, selfPID=2464, iMonCtr=1 Model crash detected, will try to restart... 10:35:41 (3488): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1184, selfPID=4396, iMonCtr=1 Model crash detected, will try to restart... 13:12:02 (4176): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1616, selfPID=1616, iMonCtr=2 11:14:28 (1192): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:36:16 (3384): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3468, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1980, selfPID=3228, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1240, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1164, selfPID=4252, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12, selfPID=3668, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4208, selfPID=1824, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 4 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3808, iMonCtr=2 Model crash detected, will try to restart... 09:57:02 (3596): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4992, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3540, selfPID=684, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3492, selfPID=1888, iMonCtr=1 Model crash detected, will try to restart... 13:18:52 (4616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4216, selfPID=2296, iMonCtr=1 Model crash detected, will try to restart... 10:11:23 (4008): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4120, selfPID=1384, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2624, selfPID=1200, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3576, selfPID=1236, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3744, selfPID=3116, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 9 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4300, iMonCtr=2 Model crash detected, will try to restart... 10:11:08 (3560): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4724, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4672, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3608, selfPID=1348, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4484, selfPID=3480, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4368, selfPID=4008, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 11 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4216, selfPID=748, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3516, selfPID=3896, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3536, iMonCtr=2 Leaving CPDN_Main::Monitor... 13:38:13 (3756): called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
25 Apr 2011 17:32:14	989453	12660446	hadam3p_pnw_2z0i_1959_1_007178874_1	138,336	510,340	3.6891
20 Apr 2011 21:32:50	989453	12660446	hadam3p_pnw_2z0i_1959_1_007178874_1	126,816	467,696	3.6880
20 Apr 2011 18:11:54	989453	12660446	hadam3p_pnw_2z0i_1959_1_007178874_1	115,296	425,096	3.6870
11 Apr 2011 20:53:44	989453	12660446	hadam3p_pnw_2z0i_1959_1_007178874_1	103,776	382,965	3.6903
08 Apr 2011 17:29:53	989453	12660446	hadam3p_pnw_2z0i_1959_1_007178874_1	92,256	341,107	3.6974
06 Apr 2011 17:55:10	989453	12660446	hadam3p_pnw_2z0i_1959_1_007178874_1	80,736	299,158	3.7054
04 Apr 2011 20:42:35	989453	12660446	hadam3p_pnw_2z0i_1959_1_007178874_1	69,216	257,304	3.7174
30 Mar 2011 20:42:33	989453	12660446	hadam3p_pnw_2z0i_1959_1_007178874_1	57,696	214,376	3.7156
28 Mar 2011 14:38:31	989453	12660446	hadam3p_pnw_2z0i_1959_1_007178874_1	46,176	172,154	3.7282
22 Mar 2011 18:41:32	989453	12660446	hadam3p_pnw_2z0i_1959_1_007178874_1	34,656	128,451	3.7065
20 Mar 2011 01:29:15	989453	12660446	hadam3p_pnw_2z0i_1959_1_007178874_1	23,136	85,606	3.7001
14 Mar 2011 19:17:43	989453	12660446	hadam3p_pnw_2z0i_1959_1_007178874_1	11,616	43,300	3.7276