Task 13391147

Name	hadcm3n_o58p_1900_40_007440287_3
Workunit	7637790
Created	16 Sep 2011, 5:13:20 UTC
Sent	20 Sep 2011, 3:22:22 UTC
Report deadline	20 Dec 2011, 10:49:33 UTC
Received	3 Oct 2011, 7:40:57 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1169698
Run time	6 days 23 hours 52 min
CPU time	6 days 20 hours 21 min 40 sec
Validate state	Invalid
Credit	4,976.64
Device peak FLOPS	2.89 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
02:14:54 (3868): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
23:11:33 (5872): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:10:32 (6216): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:09:28 (3876): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:08:24 (6692): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
07:07:22 (8100): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:06:22 (5936): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
23:04:15 (7088): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3448, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3448, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3448, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3448, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3448, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3448, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
02 Oct 2011 18:07:46	1169698	13391147	hadcm3n_o58p_1900_40_007440287_3	414,720	578,654	1.3953
02 Oct 2011 08:05:10	1169698	13391147	hadcm3n_o58p_1900_40_007440287_3	388,800	543,108	1.3969
01 Oct 2011 21:51:20	1169698	13391147	hadcm3n_o58p_1900_40_007440287_3	362,880	507,011	1.3972
01 Oct 2011 11:46:02	1169698	13391147	hadcm3n_o58p_1900_40_007440287_3	336,960	471,198	1.3984
01 Oct 2011 01:41:51	1169698	13391147	hadcm3n_o58p_1900_40_007440287_3	311,040	435,443	1.4000
30 Sep 2011 01:03:35	1169698	13391147	hadcm3n_o58p_1900_40_007440287_3	285,120	398,315	1.3970
29 Sep 2011 14:46:07	1169698	13391147	hadcm3n_o58p_1900_40_007440287_3	259,200	362,128	1.3971
29 Sep 2011 04:39:35	1169698	13391147	hadcm3n_o58p_1900_40_007440287_3	233,280	326,495	1.3996
28 Sep 2011 17:59:29	1169698	13391147	hadcm3n_o58p_1900_40_007440287_3	207,360	290,481	1.4009
28 Sep 2011 08:04:55	1169698	13391147	hadcm3n_o58p_1900_40_007440287_3	181,440	255,083	1.4059
27 Sep 2011 22:06:21	1169698	13391147	hadcm3n_o58p_1900_40_007440287_3	155,520	219,723	1.4128
27 Sep 2011 12:15:24	1169698	13391147	hadcm3n_o58p_1900_40_007440287_3	129,600	184,550	1.4240
27 Sep 2011 02:28:48	1169698	13391147	hadcm3n_o58p_1900_40_007440287_3	103,680	149,627	1.4432
25 Sep 2011 04:07:31	1169698	13391147	hadcm3n_o58p_1900_40_007440287_3	77,760	114,278	1.4696
24 Sep 2011 09:47:08	1169698	13391147	hadcm3n_o58p_1900_40_007440287_3	51,840	73,987	1.4272
23 Sep 2011 17:49:34	1169698	13391147	hadcm3n_o58p_1900_40_007440287_3	25,920	37,231	1.4364
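
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep count. The page does not state this; it is inferred from the values in the table. The sketch below checks that assumption against a few rows copied from the table. The names `TRICKLES` and `average_sec_per_timestep` are hypothetical and are not part of BOINC or CPDN.

```python
# Rows copied from the trickle table above:
# (timestep, cpu_time_sec, reported_average_sec_per_ts)
TRICKLES = [
    (414_720, 578_654, 1.3953),
    (388_800, 543_108, 1.3969),
    (51_840, 73_987, 1.4272),
    (25_920, 37_231, 1.4364),
]


def average_sec_per_timestep(cpu_time_sec, timestep):
    """Assumed formula: cumulative CPU seconds per model timestep, to 4 decimals."""
    return round(cpu_time_sec / timestep, 4)


if __name__ == "__main__":
    for timestep, cpu_time_sec, reported in TRICKLES:
        computed = average_sec_per_timestep(cpu_time_sec, timestep)
        # Print computed vs. reported so any mismatch is easy to spot.
        print(f"{timestep:>8} {computed:.4f} {reported:.4f}")
```

For the rows sampled here, the computed value agrees with the reported one to four decimal places, which is consistent with the assumed formula but does not confirm how the server actually derives the column.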