Task 13605935

Name	hadcm3n_o3p6_1980_40_007538224_1
Workunit	7735456
Created	5 Nov 2011, 19:43:50 UTC
Sent	5 Nov 2011, 19:46:05 UTC
Report deadline	5 Feb 2012, 3:13:16 UTC
Received	7 Dec 2011, 1:55:11 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1169252
Run time	13 days 9 hours 32 min 57 sec
CPU time	13 days 4 hours 57 min 35 sec
Validate state	Invalid
Credit	9,642.24
Device peak FLOPS	3.31 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.12.34</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4528, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:18:33 (4404): No heartbeat from core client for 30 sec - exiting 17:18:35 (4404): No heartbeat from core client for 30 sec - exiting 17:18:36 (4404): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3352, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4380, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5368, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4344, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... 
Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4344, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4344, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4344, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4344, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4344, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
06 Dec 2011 19:04:47	1169252	13605935	hadcm3n_o3p6_1980_40_007538224_1	803,520	1,124,424	1.3994
06 Dec 2011 08:54:59	1169252	13605935	hadcm3n_o3p6_1980_40_007538224_1	777,600	1,088,191	1.3994
05 Dec 2011 21:40:50	1169252	13605935	hadcm3n_o3p6_1980_40_007538224_1	751,680	1,051,999	1.3995
05 Dec 2011 11:16:36	1169252	13605935	hadcm3n_o3p6_1980_40_007538224_1	725,760	1,015,097	1.3987
05 Dec 2011 00:59:04	1169252	13605935	hadcm3n_o3p6_1980_40_007538224_1	699,840	978,347	1.3980
04 Dec 2011 14:36:22	1169252	13605935	hadcm3n_o3p6_1980_40_007538224_1	673,920	941,533	1.3971
04 Dec 2011 04:18:53	1169252	13605935	hadcm3n_o3p6_1980_40_007538224_1	648,000	904,788	1.3963
03 Dec 2011 16:28:12	1169252	13605935	hadcm3n_o3p6_1980_40_007538224_1	622,080	868,049	1.3954
28 Nov 2011 01:48:30	1169252	13605935	hadcm3n_o3p6_1980_40_007538224_1	596,160	831,721	1.3951
25 Nov 2011 15:37:41	1169252	13605935	hadcm3n_o3p6_1980_40_007538224_1	570,240	795,138	1.3944
22 Nov 2011 13:55:15	1169252	13605935	hadcm3n_o3p6_1980_40_007538224_1	544,320	758,815	1.3941
22 Nov 2011 03:47:37	1169252	13605935	hadcm3n_o3p6_1980_40_007538224_1	518,400	722,485	1.3937
21 Nov 2011 14:45:05	1169252	13605935	hadcm3n_o3p6_1980_40_007538224_1	492,480	686,279	1.3935
21 Nov 2011 04:37:27	1169252	13605935	hadcm3n_o3p6_1980_40_007538224_1	466,560	650,021	1.3932
18 Nov 2011 14:40:48	1169252	13605935	hadcm3n_o3p6_1980_40_007538224_1	440,640	615,054	1.3958
18 Nov 2011 04:23:00	1169252	13605935	hadcm3n_o3p6_1980_40_007538224_1	414,720	580,442	1.3996
17 Nov 2011 12:29:21	1169252	13605935	hadcm3n_o3p6_1980_40_007538224_1	388,800	543,945	1.3990
17 Nov 2011 02:21:43	1169252	13605935	hadcm3n_o3p6_1980_40_007538224_1	362,880	507,602	1.3988
15 Nov 2011 20:23:07	1169252	13605935	hadcm3n_o3p6_1980_40_007538224_1	336,960	471,322	1.3987
15 Nov 2011 20:23:07	1169252	13605935	hadcm3n_o3p6_1980_40_007538224_1	311,040	434,981	1.3985
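
Note on the Average (sec/TS) column: the listed values appear to match the cumulative CPU Time (sec) divided by the Timestep, rounded to four decimal places. The sketch below checks that reading against three rows copied from the table above. The function name seconds_per_timestep is illustrative only and is not part of BOINC or the CPDN software.

```python
def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals.

    Assumption: this mirrors how the 'Average (sec/TS)' column is derived;
    the server-side formula is not stated on the page itself.
    """
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_time_sec / timestep, 4)


# (timestep, cpu_time_sec, listed_average) taken from rows of the trickle table
rows = [
    (803_520, 1_124_424, 1.3994),
    (777_600, 1_088_191, 1.3994),
    (311_040, 434_981, 1.3985),
]

for timestep, cpu_time_sec, listed in rows:
    computed = seconds_per_timestep(cpu_time_sec, timestep)
    print(f"{timestep:>9,}  {cpu_time_sec:>11,}  computed={computed:.4f}  listed={listed:.4f}")
```

Because the average is taken over the whole run so far, it changes only slowly from one trickle to the next, which is consistent with the narrow 1.3932 to 1.3996 range in the table.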