Task 13366311

Name	hadcm3n_yk17_1940_40_007450247_0
Workunit	7647750
Created	10 Sep 2011, 5:06:05 UTC
Sent	10 Sep 2011, 5:16:18 UTC
Report deadline	10 Dec 2011, 12:43:29 UTC
Received	4 Oct 2011, 19:15:46 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1167073
Run time	14 days 5 hours 45 min 40 sec
CPU time	13 days 9 hours 26 min 49 sec
Validate state	Invalid
Credit	9,020.16
Device peak FLOPS	2.90 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.12.33</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5080, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2744, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4836, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4540, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4540, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4868, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4868, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4868, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4868, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4868, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4868, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
04 Oct 2011 00:23:50	1167073	13366311	hadcm3n_yk17_1940_40_007450247_0	751,680	1,146,408	1.5251
03 Oct 2011 05:08:41	1167073	13366311	hadcm3n_yk17_1940_40_007450247_0	725,760	1,108,201	1.5270
02 Oct 2011 18:17:58	1167073	13366311	hadcm3n_yk17_1940_40_007450247_0	699,840	1,070,139	1.5291
02 Oct 2011 07:24:39	1167073	13366311	hadcm3n_yk17_1940_40_007450247_0	673,920	1,032,152	1.5316
01 Oct 2011 20:25:02	1167073	13366311	hadcm3n_yk17_1940_40_007450247_0	648,000	993,661	1.5334
01 Oct 2011 09:08:53	1167073	13366311	hadcm3n_yk17_1940_40_007450247_0	622,080	954,669	1.5346
30 Sep 2011 16:07:46	1167073	13366311	hadcm3n_yk17_1940_40_007450247_0	596,160	914,935	1.5347
30 Sep 2011 04:32:12	1167073	13366311	hadcm3n_yk17_1940_40_007450247_0	570,240	875,075	1.5346
29 Sep 2011 16:54:57	1167073	13366311	hadcm3n_yk17_1940_40_007450247_0	544,320	835,356	1.5347
29 Sep 2011 05:20:02	1167073	13366311	hadcm3n_yk17_1940_40_007450247_0	518,400	795,603	1.5347
28 Sep 2011 17:28:17	1167073	13366311	hadcm3n_yk17_1940_40_007450247_0	492,480	755,666	1.5344
28 Sep 2011 05:47:19	1167073	13366311	hadcm3n_yk17_1940_40_007450247_0	466,560	716,539	1.5358
27 Sep 2011 17:47:57	1167073	13366311	hadcm3n_yk17_1940_40_007450247_0	440,640	676,529	1.5353
23 Sep 2011 23:35:28	1167073	13366311	hadcm3n_yk17_1940_40_007450247_0	414,720	634,373	1.5296
23 Sep 2011 11:06:21	1167073	13366311	hadcm3n_yk17_1940_40_007450247_0	388,800	592,763	1.5246
22 Sep 2011 22:47:31	1167073	13366311	hadcm3n_yk17_1940_40_007450247_0	362,880	552,252	1.5219
22 Sep 2011 10:29:08	1167073	13366311	hadcm3n_yk17_1940_40_007450247_0	336,960	511,452	1.5178
21 Sep 2011 22:36:12	1167073	13366311	hadcm3n_yk17_1940_40_007450247_0	311,040	470,919	1.5140
21 Sep 2011 10:42:48	1167073	13366311	hadcm3n_yk17_1940_40_007450247_0	285,120	431,276	1.5126
20 Sep 2011 18:30:28	1167073	13366311	hadcm3n_yk17_1940_40_007450247_0	259,200	393,098	1.5166
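
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count. The following is a minimal sketch of that arithmetic, assuming this interpretation holds; the helper names `parse_int` and `seconds_per_timestep` are hypothetical and not part of BOINC or CPDN, and the three sample rows are copied from the table above.

```python
def parse_int(field: str) -> int:
    """Convert a table field such as '1,146,408' to an integer."""
    return int(field.replace(",", ""))


def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds divided by cumulative timesteps (assumed definition)."""
    return cpu_time_sec / timestep


# (Timestep, CPU Time (sec)) pairs taken from the first, second, and last table rows.
rows = [
    ("751,680", "1,146,408"),
    ("725,760", "1,108,201"),
    ("259,200", "393,098"),
]

averages = [
    round(seconds_per_timestep(parse_int(cpu), parse_int(ts)), 4)
    for ts, cpu in rows
]

for (ts, cpu), avg in zip(rows, averages):
    print(f"timestep={ts:>8}  cpu_sec={cpu:>10}  sec/TS={avg:.4f}")
```

For these rows the computed values agree with the Average (sec/TS) column to four decimal places (1.5251, 1.5270, 1.5166), which supports the assumed definition without confirming it for every row.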