Task 15992164

Name	hadcm3n_809m_1980_40_008458285_0
Workunit	8609141
Created	30 Aug 2013, 20:10:19 UTC
Sent	8 Sep 2013, 14:30:55 UTC
Report deadline	8 Dec 2013, 21:58:06 UTC
Received	16 Sep 2013, 6:45:13 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1289668
Run time	6 days 17 hours 15 min 9 sec
CPU time	5 days 21 hours 13 min 1 sec
Validate state	Invalid
Credit	6,842.88
Device peak FLOPS	3.16 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 02:06:30 (5896): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4832, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4832, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4832, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4832, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4832, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4832, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
15 Sep 2013 14:40:54	1289668	15992164	hadcm3n_809m_1980_40_008458285_0	570,240	499,645	0.8762
15 Sep 2013 08:13:40	1289668	15992164	hadcm3n_809m_1980_40_008458285_0	544,320	477,765	0.8777
15 Sep 2013 01:32:08	1289668	15992164	hadcm3n_809m_1980_40_008458285_0	518,400	455,773	0.8792
14 Sep 2013 18:05:18	1289668	15992164	hadcm3n_809m_1980_40_008458285_0	492,480	433,389	0.8800
14 Sep 2013 10:03:23	1289668	15992164	hadcm3n_809m_1980_40_008458285_0	466,560	410,581	0.8800
14 Sep 2013 03:10:43	1289668	15992164	hadcm3n_809m_1980_40_008458285_0	440,640	388,128	0.8808
13 Sep 2013 17:43:04	1289668	15992164	hadcm3n_809m_1980_40_008458285_0	414,720	365,304	0.8808
13 Sep 2013 10:26:27	1289668	15992164	hadcm3n_809m_1980_40_008458285_0	388,800	342,978	0.8821
12 Sep 2013 21:13:21	1289668	15992164	hadcm3n_809m_1980_40_008458285_0	362,880	320,355	0.8828
12 Sep 2013 12:35:41	1289668	15992164	hadcm3n_809m_1980_40_008458285_0	336,960	296,566	0.8801
12 Sep 2013 03:37:23	1289668	15992164	hadcm3n_809m_1980_40_008458285_0	311,040	273,473	0.8792
11 Sep 2013 18:38:50	1289668	15992164	hadcm3n_809m_1980_40_008458285_0	285,120	250,289	0.8778
11 Sep 2013 10:31:34	1289668	15992164	hadcm3n_809m_1980_40_008458285_0	259,200	227,085	0.8761
11 Sep 2013 03:34:49	1289668	15992164	hadcm3n_809m_1980_40_008458285_0	233,280	204,356	0.8760
10 Sep 2013 22:18:06	1289668	15992164	hadcm3n_809m_1980_40_008458285_0	207,360	182,025	0.8778
10 Sep 2013 14:46:07	1289668	15992164	hadcm3n_809m_1980_40_008458285_0	181,440	158,912	0.8758
10 Sep 2013 07:39:00	1289668	15992164	hadcm3n_809m_1980_40_008458285_0	155,520	135,724	0.8727
10 Sep 2013 00:15:31	1289668	15992164	hadcm3n_809m_1980_40_008458285_0	129,600	113,112	0.8728
09 Sep 2013 18:18:20	1289668	15992164	hadcm3n_809m_1980_40_008458285_0	103,680	90,140	0.8694
09 Sep 2013 11:19:42	1289668	15992164	hadcm3n_809m_1980_40_008458285_0	77,760	67,014	0.8618