Task 16273125

Name	hadcm3n_o77m_2060_40_008399404_4
Workunit	8550260
Created	17 Jan 2014, 4:19:39 UTC
Sent	17 Jan 2014, 4:19:53 UTC
Report deadline	18 Apr 2014, 11:47:04 UTC
Received	14 Mar 2014, 1:02:42 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1300825
Run time	11 days 16 hours 6 min 31 sec
CPU time	11 days 6 hours 28 min 41 sec
Validate state	Invalid
Credit	5,909.76
Device peak FLOPS	2.90 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6052, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3796, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6052, iMonCtr=1
Model crash detected, will try to restart...
16:11:05 (3392): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3540, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3540, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3540, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3540, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3344, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3344, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3344, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
05 Mar 2014 06:07:34	1300825	16273125	hadcm3n_o77m_2060_40_008399404_4	492,480	942,218	1.9132
02 Mar 2014 14:38:44	1300825	16273125	hadcm3n_o77m_2060_40_008399404_4	466,560	892,646	1.9133
01 Mar 2014 11:44:37	1300825	16273125	hadcm3n_o77m_2060_40_008399404_4	440,640	842,193	1.9113
27 Feb 2014 11:05:17	1300825	16273125	hadcm3n_o77m_2060_40_008399404_4	414,720	792,944	1.9120
25 Feb 2014 05:58:59	1300825	16273125	hadcm3n_o77m_2060_40_008399404_4	388,800	742,052	1.9086
22 Feb 2014 09:57:39	1300825	16273125	hadcm3n_o77m_2060_40_008399404_4	362,880	692,968	1.9096
21 Feb 2014 22:31:10	1300825	16273125	hadcm3n_o77m_2060_40_008399404_4	336,960	643,277	1.9091
15 Feb 2014 10:37:03	1300825	16273125	hadcm3n_o77m_2060_40_008399404_4	311,040	595,088	1.9132
14 Feb 2014 20:34:48	1300825	16273125	hadcm3n_o77m_2060_40_008399404_4	285,120	545,608	1.9136
13 Feb 2014 21:13:53	1300825	16273125	hadcm3n_o77m_2060_40_008399404_4	259,200	495,737	1.9126
09 Feb 2014 09:10:43	1300825	16273125	hadcm3n_o77m_2060_40_008399404_4	233,280	447,083	1.9165
08 Feb 2014 07:55:39	1300825	16273125	hadcm3n_o77m_2060_40_008399404_4	207,360	397,809	1.9184
07 Feb 2014 06:14:48	1300825	16273125	hadcm3n_o77m_2060_40_008399404_4	181,440	348,241	1.9193
06 Feb 2014 02:34:15	1300825	16273125	hadcm3n_o77m_2060_40_008399404_4	155,520	299,879	1.9282
31 Jan 2014 02:21:51	1300825	16273125	hadcm3n_o77m_2060_40_008399404_4	129,600	251,172	1.9381
26 Jan 2014 04:41:22	1300825	16273125	hadcm3n_o77m_2060_40_008399404_4	103,680	202,340	1.9516
22 Jan 2014 08:48:26	1300825	16273125	hadcm3n_o77m_2060_40_008399404_4	77,760	152,309	1.9587
20 Jan 2014 06:54:54	1300825	16273125	hadcm3n_o77m_2060_40_008399404_4	51,840	100,958	1.9475
18 Jan 2014 06:31:33	1300825	16273125	hadcm3n_o77m_2060_40_008399404_4	25,920	50,882	1.9630
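
The Average (sec/TS) column above appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. This is an inference from the figures in the table, not something the page states. A minimal sketch of that check, using three rows copied from the table (the function name `sec_per_timestep` is hypothetical and chosen only for illustration):

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average)
rows = [
    (492480, 942218, 1.9132),  # 05 Mar 2014
    (259200, 495737, 1.9126),  # 13 Feb 2014
    (25920, 50882, 1.9630),    # 18 Jan 2014
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Assumed derivation: cumulative CPU seconds per model timestep, 4 d.p."""
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_time_sec, reported in rows:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    # Print computed next to reported so any mismatch is easy to spot.
    print(f"{timestep:>7}  computed={computed:.4f}  reported={reported:.4f}")
```

If the assumption holds, the computed and reported values agree on every row. The same division can be applied to the remaining rows of the table.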