Task 16046337

Name	hadcm3n_oftg_1900_40_008475623_0
Workunit	8626462
Created	27 Sep 2013, 10:38:29 UTC
Sent	27 Sep 2013, 12:58:05 UTC
Report deadline	27 Dec 2013, 20:25:16 UTC
Received	18 Oct 2013, 0:25:05 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1167410
Run time	18 days 5 hours 3 min 24 sec
CPU time	17 days 11 hours 4 min 13 sec
Validate state	Invalid
Credit	9,642.24
Device peak FLOPS	2.69 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
10:13:47 (552): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>
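
The stderr text repeats a small set of messages, so a tally makes the failure sequence easier to read. The following is a minimal sketch, not part of BOINC or the CPDN application: the marker strings are copied from the log above, while the labels (`quit_request`, `restart_attempt`, and so on), the function name `tally_events`, and the abbreviated `sample` string are hypothetical names chosen for illustration.

```python
from collections import Counter

# Marker substrings copied from the stderr text; the dictionary keys are
# hypothetical labels, not identifiers used by BOINC or CPDN.
MARKERS = {
    "quit_request": "CPDN Monitor - Quit request from BOINC...",
    "suspend_request": "CPDN Monitor - Suspend request from BOINC...",
    "no_heartbeat": "CPDN Monitor - No 'heartbeat' from BOINC...",
    "signal_22": "Signal 22 received, exiting...",
    "restart_attempt": "Model crash detected, will try to restart...",
    "gave_up": "Sorry, too many model crashes!",
}


def tally_events(stderr_text):
    """Count how often each known message occurs in a stderr string."""
    return Counter(
        {label: stderr_text.count(marker) for label, marker in MARKERS.items()}
    )


# Abbreviated excerpt for demonstration only; paste the full stderr text
# in its place to tally the real log.
sample = (
    "CPDN Monitor - Quit request from BOINC... "
    "Suspended CPDN Monitor - Suspend request from BOINC... "
    "Signal 22 received, exiting... Called boinc_finish "
    "Model crash detected, will try to restart... "
    "Sorry, too many model crashes! :-( Called boinc_finish"
)

counts = tally_events(sample)
for label, count in counts.items():
    print(f"{label}: {count}")
```

Substring counting is used rather than line splitting so the sketch works whether or not the log has been broken into lines. In the full log above, the restart message appears six times before the final "too many model crashes" message.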

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
17 Oct 2013 14:58:08	1167410	16046337	hadcm3n_oftg_1900_40_008475623_0	803,520	1,506,279	1.8746
16 Oct 2013 23:53:50	1167410	16046337	hadcm3n_oftg_1900_40_008475623_0	777,600	1,456,832	1.8735
16 Oct 2013 09:19:36	1167410	16046337	hadcm3n_oftg_1900_40_008475623_0	751,680	1,407,248	1.8721
15 Oct 2013 17:48:56	1167410	16046337	hadcm3n_oftg_1900_40_008475623_0	725,760	1,358,038	1.8712
15 Oct 2013 03:07:53	1167410	16046337	hadcm3n_oftg_1900_40_008475623_0	699,840	1,308,943	1.8703
14 Oct 2013 12:29:08	1167410	16046337	hadcm3n_oftg_1900_40_008475623_0	673,920	1,259,295	1.8686
13 Oct 2013 23:12:13	1167410	16046337	hadcm3n_oftg_1900_40_008475623_0	648,000	1,211,519	1.8696
13 Oct 2013 09:54:38	1167410	16046337	hadcm3n_oftg_1900_40_008475623_0	622,080	1,163,690	1.8706
12 Oct 2013 20:38:57	1167410	16046337	hadcm3n_oftg_1900_40_008475623_0	596,160	1,115,862	1.8717
12 Oct 2013 07:24:16	1167410	16046337	hadcm3n_oftg_1900_40_008475623_0	570,240	1,068,017	1.8729
11 Oct 2013 18:06:48	1167410	16046337	hadcm3n_oftg_1900_40_008475623_0	544,320	1,020,224	1.8743
10 Oct 2013 21:24:14	1167410	16046337	hadcm3n_oftg_1900_40_008475623_0	518,400	971,312	1.8737
10 Oct 2013 05:24:17	1167410	16046337	hadcm3n_oftg_1900_40_008475623_0	492,480	923,426	1.8751
09 Oct 2013 16:09:52	1167410	16046337	hadcm3n_oftg_1900_40_008475623_0	466,560	875,801	1.8771
09 Oct 2013 02:50:14	1167410	16046337	hadcm3n_oftg_1900_40_008475623_0	440,640	828,151	1.8794
08 Oct 2013 13:35:16	1167410	16046337	hadcm3n_oftg_1900_40_008475623_0	414,720	780,567	1.8822
08 Oct 2013 00:13:45	1167410	16046337	hadcm3n_oftg_1900_40_008475623_0	388,800	732,958	1.8852
07 Oct 2013 10:41:39	1167410	16046337	hadcm3n_oftg_1900_40_008475623_0	362,880	684,664	1.8868
06 Oct 2013 21:27:29	1167410	16046337	hadcm3n_oftg_1900_40_008475623_0	336,960	637,270	1.8912
06 Oct 2013 07:08:59	1167410	16046337	hadcm3n_oftg_1900_40_008475623_0	311,040	588,555	1.8922
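
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that assumption against the first and last rows of the table; the helper names `parse_int` and `seconds_per_timestep` are hypothetical, and the field positions assume the tab-separated column order shown in the header row.

```python
def parse_int(field):
    """Convert a comma-grouped integer such as '803,520' to an int."""
    return int(field.replace(",", ""))


def seconds_per_timestep(row):
    """Return (recomputed, reported) sec/TS for one tab-separated trickle row.

    Assumed column order: time sent, host ID, result ID, result name,
    timestep, CPU time (sec), average (sec/TS).
    """
    fields = row.split("\t")
    timestep = parse_int(fields[4])
    cpu_seconds = parse_int(fields[5])
    reported = float(fields[6])
    return cpu_seconds / timestep, reported


# First and last rows of the trickle table, copied verbatim.
rows = [
    "17 Oct 2013 14:58:08\t1167410\t16046337\t"
    "hadcm3n_oftg_1900_40_008475623_0\t803,520\t1,506,279\t1.8746",
    "06 Oct 2013 07:08:59\t1167410\t16046337\t"
    "hadcm3n_oftg_1900_40_008475623_0\t311,040\t588,555\t1.8922",
]

for row in rows:
    recomputed, reported = seconds_per_timestep(row)
    print(f"recomputed {recomputed:.4f}  reported {reported:.4f}")
```

Both rows agree with the reported value to four decimal places, which supports reading the column as a running average rather than a per-interval rate.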