Task 16278102

Name	hadcm3n_83lq_1980_40_008462609_3
Workunit	8613465
Created	3 Feb 2014, 13:17:22 UTC
Sent	3 Feb 2014, 13:17:31 UTC
Report deadline	5 May 2014, 20:44:42 UTC
Received	14 Aug 2014, 15:43:50 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1137771
Run time	14 days 16 hours 13 min 59 sec
CPU time	14 days 7 hours 43 min 13 sec
Validate state	Invalid
Credit	11,508.48
Device peak FLOPS	3.20 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3628, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5152, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5152, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5152, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5152, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5836, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5836, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4952, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4952, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5420, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5420, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5940, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
18 Feb 2014 12:02:56	1137771	16278102	hadcm3n_83lq_1980_40_008462609_3	959,040	1,217,194	1.2692
18 Feb 2014 12:02:09	1137771	16278102	hadcm3n_83lq_1980_40_008462609_3	933,120	1,183,514	1.2683
18 Feb 2014 12:01:44	1137771	16278102	hadcm3n_83lq_1980_40_008462609_3	907,200	1,148,935	1.2665
18 Feb 2014 12:01:44	1137771	16278102	hadcm3n_83lq_1980_40_008462609_3	881,280	1,116,722	1.2672
16 Feb 2014 10:43:54	1137771	16278102	hadcm3n_83lq_1980_40_008462609_3	855,360	1,084,576	1.2680
16 Feb 2014 01:27:33	1137771	16278102	hadcm3n_83lq_1980_40_008462609_3	829,440	1,052,092	1.2684
15 Feb 2014 16:29:45	1137771	16278102	hadcm3n_83lq_1980_40_008462609_3	803,520	1,020,466	1.2700
15 Feb 2014 06:51:29	1137771	16278102	hadcm3n_83lq_1980_40_008462609_3	777,600	986,666	1.2689
14 Feb 2014 21:14:56	1137771	16278102	hadcm3n_83lq_1980_40_008462609_3	751,680	952,986	1.2678
14 Feb 2014 11:46:24	1137771	16278102	hadcm3n_83lq_1980_40_008462609_3	725,760	919,931	1.2675
14 Feb 2014 02:49:53	1137771	16278102	hadcm3n_83lq_1980_40_008462609_3	699,840	888,443	1.2695
13 Feb 2014 17:32:31	1137771	16278102	hadcm3n_83lq_1980_40_008462609_3	673,920	855,848	1.2700
13 Feb 2014 08:12:22	1137771	16278102	hadcm3n_83lq_1980_40_008462609_3	648,000	822,940	1.2700
12 Feb 2014 23:15:58	1137771	16278102	hadcm3n_83lq_1980_40_008462609_3	622,080	791,446	1.2723
12 Feb 2014 14:17:39	1137771	16278102	hadcm3n_83lq_1980_40_008462609_3	596,160	759,801	1.2745
12 Feb 2014 04:45:54	1137771	16278102	hadcm3n_83lq_1980_40_008462609_3	570,240	726,404	1.2739
11 Feb 2014 18:23:31	1137771	16278102	hadcm3n_83lq_1980_40_008462609_3	544,320	690,858	1.2692
11 Feb 2014 08:10:52	1137771	16278102	hadcm3n_83lq_1980_40_008462609_3	518,400	656,103	1.2656
10 Feb 2014 22:44:09	1137771	16278102	hadcm3n_83lq_1980_40_008462609_3	492,480	622,986	1.2650
10 Feb 2014 13:38:11	1137771	16278102	hadcm3n_83lq_1980_40_008462609_3	466,560	590,759	1.2662