Task 17377451

Name	hadcm3n_x14o_1940_40_009148731_1
Workunit	9279067
Created	8 Nov 2014, 6:58:57 UTC
Sent	8 Nov 2014, 7:18:26 UTC
Report deadline	7 Feb 2015, 14:45:37 UTC
Received	13 Nov 2014, 22:17:02 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1432463
Run time	4 days 10 hours 13 min 9 sec
CPU time	4 days 7 hours 27 min 33 sec
Validate state	Invalid
Credit	5,287.68
Device peak FLOPS	4.18 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.2.42</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 01:07:37 (4876): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4860, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4860, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4860, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4860, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4860, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4860, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
13 Nov 2014 10:38:26	1343956	17377451	hadcm3n_x14o_1940_40_009148731_1	440,640	366,651	0.8321
13 Nov 2014 04:48:47	1343956	17377451	hadcm3n_x14o_1940_40_009148731_1	414,720	343,861	0.8291
12 Nov 2014 21:31:02	1343956	17377451	hadcm3n_x14o_1940_40_009148731_1	388,800	321,525	0.8270
12 Nov 2014 16:02:36	1343956	17377451	hadcm3n_x14o_1940_40_009148731_1	362,880	300,497	0.8281
12 Nov 2014 08:52:38	1343956	17377451	hadcm3n_x14o_1940_40_009148731_1	336,960	277,625	0.8239
12 Nov 2014 01:58:25	1343956	17377451	hadcm3n_x14o_1940_40_009148731_1	311,040	253,562	0.8152
11 Nov 2014 19:14:17	1343956	17377451	hadcm3n_x14o_1940_40_009148731_1	285,120	230,339	0.8079
11 Nov 2014 13:21:44	1343956	17377451	hadcm3n_x14o_1940_40_009148731_1	259,200	209,695	0.8090
11 Nov 2014 07:04:18	1343956	17377451	hadcm3n_x14o_1940_40_009148731_1	233,280	189,124	0.8107
11 Nov 2014 01:11:47	1343956	17377451	hadcm3n_x14o_1940_40_009148731_1	207,360	168,307	0.8117
10 Nov 2014 19:18:06	1343956	17377451	hadcm3n_x14o_1940_40_009148731_1	181,440	147,972	0.8155
10 Nov 2014 12:49:46	1343956	17377451	hadcm3n_x14o_1940_40_009148731_1	155,520	127,986	0.8230
10 Nov 2014 06:00:40	1343956	17377451	hadcm3n_x14o_1940_40_009148731_1	129,600	107,576	0.8301
10 Nov 2014 00:03:03	1343956	17377451	hadcm3n_x14o_1940_40_009148731_1	103,680	86,701	0.8362
09 Nov 2014 17:50:12	1343956	17377451	hadcm3n_x14o_1940_40_009148731_1	77,760	65,176	0.8382
08 Nov 2014 19:55:25	1343956	17377451	hadcm3n_x14o_1940_40_009148731_1	51,840	43,565	0.8404
08 Nov 2014 13:47:53	1343956	17377451	hadcm3n_x14o_1940_40_009148731_1	25,920	22,135	0.8540