Task 13734852

Name	hadcm3n_001a_1940_40_007546589_2
Workunit	7743821
Created	6 Dec 2011, 4:23:08 UTC
Sent	6 Dec 2011, 12:36:40 UTC
Report deadline	6 Mar 2012, 20:03:51 UTC
Received	28 Dec 2011, 21:11:09 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1132123
Run time	12 days 8 hours 10 min 49 sec
CPU time	12 days 5 hours 31 min 2 sec
Validate state	Invalid
Credit	10,575.36
Device peak FLOPS	3.16 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:00:18 (1516): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:06:15 (2908): No heartbeat from core client for 30 sec - exiting 20:06:16 (2908): No heartbeat from core client for 30 sec - exiting 20:06:17 (2908): No heartbeat from core client for 30 sec - exiting 20:06:18 (2908): No heartbeat from core client for 30 sec - exiting 20:06:19 (2908): No heartbeat from core client for 30 sec - exiting 20:06:20 (2908): No heartbeat from core client for 30 sec - exiting 20:06:21 (2908): No heartbeat from core client for 30 sec - exiting 20:06:22 (2908): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2784, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2784, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2784, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2784, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2784, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2784, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
28 Dec 2011 11:40:27	1132123	13734852	hadcm3n_001a_1940_40_007546589_2	881,280	1,030,959	1.1698
28 Dec 2011 03:13:38	1132123	13734852	hadcm3n_001a_1940_40_007546589_2	855,360	1,000,879	1.1701
27 Dec 2011 19:04:50	1132123	13734852	hadcm3n_001a_1940_40_007546589_2	829,440	970,894	1.1705
27 Dec 2011 10:27:58	1132123	13734852	hadcm3n_001a_1940_40_007546589_2	803,520	940,798	1.1708
27 Dec 2011 02:00:52	1132123	13734852	hadcm3n_001a_1940_40_007546589_2	777,600	910,716	1.1712
26 Dec 2011 17:37:19	1132123	13734852	hadcm3n_001a_1940_40_007546589_2	751,680	880,761	1.1717
26 Dec 2011 09:10:12	1132123	13734852	hadcm3n_001a_1940_40_007546589_2	725,760	850,675	1.1721
26 Dec 2011 00:48:36	1132123	13734852	hadcm3n_001a_1940_40_007546589_2	699,840	820,746	1.1728
25 Dec 2011 16:25:28	1132123	13734852	hadcm3n_001a_1940_40_007546589_2	673,920	790,891	1.1736
25 Dec 2011 07:58:46	1132123	13734852	hadcm3n_001a_1940_40_007546589_2	648,000	760,822	1.1741
24 Dec 2011 23:46:34	1132123	13734852	hadcm3n_001a_1940_40_007546589_2	622,080	730,592	1.1744
24 Dec 2011 15:51:17	1132123	13734852	hadcm3n_001a_1940_40_007546589_2	596,160	700,689	1.1753
24 Dec 2011 05:19:19	1132123	13734852	hadcm3n_001a_1940_40_007546589_2	570,240	670,636	1.1761
23 Dec 2011 20:57:22	1132123	13734852	hadcm3n_001a_1940_40_007546589_2	544,320	640,669	1.1770
23 Dec 2011 11:41:52	1132123	13734852	hadcm3n_001a_1940_40_007546589_2	518,400	610,843	1.1783
23 Dec 2011 03:19:49	1132123	13734852	hadcm3n_001a_1940_40_007546589_2	492,480	580,973	1.1797
22 Dec 2011 19:09:51	1132123	13734852	hadcm3n_001a_1940_40_007546589_2	466,560	551,084	1.1812
22 Dec 2011 10:34:04	1132123	13734852	hadcm3n_001a_1940_40_007546589_2	440,640	521,208	1.1828
22 Dec 2011 02:11:27	1132123	13734852	hadcm3n_001a_1940_40_007546589_2	414,720	491,259	1.1846
21 Dec 2011 19:06:56	1132123	13734852	hadcm3n_001a_1940_40_007546589_2	388,800	461,243	1.1863