Task 16010092

Name	hadcm3n_o212_1940_40_008382402_2
Workunit	8533261
Created	10 Sep 2013, 0:50:49 UTC
Sent	10 Sep 2013, 1:08:41 UTC
Report deadline	10 Dec 2013, 8:35:52 UTC
Received	9 Dec 2013, 17:14:07 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1218458
Run time	4 days 9 hours 31 min 31 sec
CPU time	4 days 8 hours 41 min 28 sec
Validate state	Invalid
Credit	5,598.72
Device peak FLOPS	3.80 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.2.31</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
23:20:36 (10136): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14140, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14140, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14140, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14140, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14140, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14140, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
24 Nov 2013 20:05:00	1218458	16010092	hadcm3n_o212_1940_40_008382402_2	466,560	364,039	0.7803
24 Nov 2013 14:32:58	1218458	16010092	hadcm3n_o212_1940_40_008382402_2	440,640	344,212	0.7812
24 Nov 2013 09:01:43	1218458	16010092	hadcm3n_o212_1940_40_008382402_2	414,720	324,334	0.7821
24 Nov 2013 03:15:29	1218458	16010092	hadcm3n_o212_1940_40_008382402_2	388,800	303,623	0.7809
23 Nov 2013 20:43:52	1218458	16010092	hadcm3n_o212_1940_40_008382402_2	362,880	281,319	0.7752
23 Nov 2013 15:20:35	1218458	16010092	hadcm3n_o212_1940_40_008382402_2	336,960	261,874	0.7772
23 Nov 2013 09:32:13	1218458	16010092	hadcm3n_o212_1940_40_008382402_2	311,040	241,183	0.7754
23 Nov 2013 02:00:45	1218458	16010092	hadcm3n_o212_1940_40_008382402_2	285,120	219,522	0.7699
11 Oct 2013 16:19:53	1218458	16010092	hadcm3n_o212_1940_40_008382402_2	259,200	200,117	0.7721
11 Oct 2013 11:08:21	1218458	16010092	hadcm3n_o212_1940_40_008382402_2	233,280	181,493	0.7780
11 Oct 2013 01:01:04	1218458	16010092	hadcm3n_o212_1940_40_008382402_2	207,360	162,800	0.7851
10 Oct 2013 19:48:28	1218458	16010092	hadcm3n_o212_1940_40_008382402_2	181,440	144,046	0.7939
13 Sep 2013 17:58:07	1218458	16010092	hadcm3n_o212_1940_40_008382402_2	155,520	125,298	0.8057
13 Sep 2013 11:21:39	1218458	16010092	hadcm3n_o212_1940_40_008382402_2	129,600	106,521	0.8219
13 Sep 2013 06:05:25	1218458	16010092	hadcm3n_o212_1940_40_008382402_2	103,680	87,698	0.8459
10 Sep 2013 21:22:45	1218458	16010092	hadcm3n_o212_1940_40_008382402_2	77,760	67,602	0.8694
10 Sep 2013 14:56:10	1218458	16010092	hadcm3n_o212_1940_40_008382402_2	51,840	45,104	0.8701
10 Sep 2013 08:39:20	1218458	16010092	hadcm3n_o212_1940_40_008382402_2	25,920	22,651	0.8739