Task 15637359

Name	hadcm3n_n02h_1920_40_008321957_0
Workunit	8473092
Created	24 Feb 2013, 21:54:50 UTC
Sent	24 Feb 2013, 22:00:13 UTC
Report deadline	27 May 2013, 5:27:24 UTC
Received	7 Mar 2013, 9:03:57 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1183146
Run time	7 days 5 hours 37 min 16 sec
CPU time	7 days 4 hours 7 min 21 sec
Validate state	Invalid
Credit	8,087.04
Device peak FLOPS	3.57 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:24:11 (6112): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8920, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7632, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7632, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7016, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7016, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7016, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7016, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
06 Mar 2013 08:20:43	1183146	15637359	hadcm3n_n02h_1920_40_008321957_0	673,920	616,158	0.9143
06 Mar 2013 01:04:01	1183146	15637359	hadcm3n_n02h_1920_40_008321957_0	648,000	593,390	0.9157
05 Mar 2013 19:40:23	1183146	15637359	hadcm3n_n02h_1920_40_008321957_0	622,080	570,762	0.9175
05 Mar 2013 12:36:53	1183146	15637359	hadcm3n_n02h_1920_40_008321957_0	596,160	548,256	0.9196
03 Mar 2013 01:54:13	1183146	15637359	hadcm3n_n02h_1920_40_008321957_0	570,240	525,258	0.9211
02 Mar 2013 18:35:50	1183146	15637359	hadcm3n_n02h_1920_40_008321957_0	544,320	499,635	0.9179
02 Mar 2013 11:14:14	1183146	15637359	hadcm3n_n02h_1920_40_008321957_0	518,400	473,489	0.9134
02 Mar 2013 04:16:15	1183146	15637359	hadcm3n_n02h_1920_40_008321957_0	492,480	449,232	0.9122
01 Mar 2013 22:48:51	1183146	15637359	hadcm3n_n02h_1920_40_008321957_0	466,560	427,917	0.9172
01 Mar 2013 16:01:01	1183146	15637359	hadcm3n_n02h_1920_40_008321957_0	440,640	406,189	0.9218
01 Mar 2013 09:27:38	1183146	15637359	hadcm3n_n02h_1920_40_008321957_0	414,720	382,755	0.9229
01 Mar 2013 02:51:04	1183146	15637359	hadcm3n_n02h_1920_40_008321957_0	388,800	359,318	0.9242
28 Feb 2013 20:22:49	1183146	15637359	hadcm3n_n02h_1920_40_008321957_0	362,880	335,895	0.9256
28 Feb 2013 13:40:06	1183146	15637359	hadcm3n_n02h_1920_40_008321957_0	336,960	312,058	0.9261
28 Feb 2013 07:02:18	1183146	15637359	hadcm3n_n02h_1920_40_008321957_0	311,040	288,251	0.9267
28 Feb 2013 00:25:30	1183146	15637359	hadcm3n_n02h_1920_40_008321957_0	285,120	264,739	0.9285
27 Feb 2013 17:46:25	1183146	15637359	hadcm3n_n02h_1920_40_008321957_0	259,200	241,064	0.9300
27 Feb 2013 11:12:22	1183146	15637359	hadcm3n_n02h_1920_40_008321957_0	233,280	217,767	0.9335
27 Feb 2013 04:35:53	1183146	15637359	hadcm3n_n02h_1920_40_008321957_0	207,360	193,978	0.9355
26 Feb 2013 21:48:41	1183146	15637359	hadcm3n_n02h_1920_40_008321957_0	181,440	169,659	0.9351