Task 15995740

Name	hadcm3n_82xq_1980_40_008461745_0
Workunit	8612601
Created	30 Aug 2013, 22:19:00 UTC
Sent	30 Aug 2013, 22:28:00 UTC
Report deadline	30 Nov 2013, 5:55:11 UTC
Received	27 Sep 2013, 9:20:00 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1212204
Run time	24 days 15 hours 18 min 35 sec
CPU time	16 days 16 hours 55 min 12 sec
Validate state	Invalid
Credit	7,464.96
Device peak FLOPS	2.42 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
09:21:02 (3524): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:42:50 (1856): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:53:45 (4636): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
02:19:04 (5220): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:18:41 (14696): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:19:06 (14696): No heartbeat from core client for 30 sec - exiting
17:19:49 (14924): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:18:05 (16016): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5568, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5568, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5568, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5568, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5568, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5568, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>
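
The Run time and CPU time fields above are given as day/hour/minute/second strings, which makes them awkward to compare with the CPU seconds in the trickle table. The following is a minimal Python sketch of the conversion, not part of BOINC or the CPDN tooling. The helper name `duration_to_seconds` and the unit table are hypothetical, and the code assumes the "N days N hours N min N sec" layout shown on this page.

```python
import re

# Seconds per unit, keyed by the unit prefix used on the task page.
# Assumption: only days, hours, min and sec appear in these fields.
UNIT_SECONDS = {"day": 86400, "hour": 3600, "min": 60, "sec": 1}


def duration_to_seconds(text):
    """Convert e.g. '24 days 15 hours 18 min 35 sec' to total seconds."""
    total = 0
    # Match a number followed by a unit prefix, so 'day' also covers 'days'.
    for value, unit in re.findall(r"(\d+)\s*(day|hour|min|sec)", text):
        total += int(value) * UNIT_SECONDS[unit]
    return total


run_time = duration_to_seconds("24 days 15 hours 18 min 35 sec")
cpu_time = duration_to_seconds("16 days 16 hours 55 min 12 sec")

# Fraction of wall-clock run time that was spent on the CPU.
cpu_fraction = cpu_time / run_time

print(run_time, cpu_time, round(cpu_fraction, 3))
```

With the values from this task, the CPU time comes to 1,443,312 seconds, slightly more than the 1,439,456 seconds reported by the last trickle, and roughly two thirds of the run time was CPU time.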

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
20 Sep 2013 13:51:27	1212204	15995740	hadcm3n_82xq_1980_40_008461745_0	622,080	1,439,456	2.3139
19 Sep 2013 20:24:20	1212204	15995740	hadcm3n_82xq_1980_40_008461745_0	596,160	1,380,031	2.3149
19 Sep 2013 03:21:00	1212204	15995740	hadcm3n_82xq_1980_40_008461745_0	570,240	1,320,808	2.3162
18 Sep 2013 10:08:44	1212204	15995740	hadcm3n_82xq_1980_40_008461745_0	544,320	1,261,591	2.3177
17 Sep 2013 14:12:19	1212204	15995740	hadcm3n_82xq_1980_40_008461745_0	518,400	1,201,386	2.3175
16 Sep 2013 19:04:39	1212204	15995740	hadcm3n_82xq_1980_40_008461745_0	492,480	1,142,211	2.3193
15 Sep 2013 22:42:59	1212204	15995740	hadcm3n_82xq_1980_40_008461745_0	466,560	1,081,886	2.3189
15 Sep 2013 02:57:26	1212204	15995740	hadcm3n_82xq_1980_40_008461745_0	440,640	1,021,505	2.3182
13 Sep 2013 05:10:12	1212204	15995740	hadcm3n_82xq_1980_40_008461745_0	414,720	958,611	2.3115
12 Sep 2013 08:24:23	1212204	15995740	hadcm3n_82xq_1980_40_008461745_0	388,800	896,431	2.3056
11 Sep 2013 12:16:58	1212204	15995740	hadcm3n_82xq_1980_40_008461745_0	362,880	834,623	2.3000
10 Sep 2013 15:26:19	1212204	15995740	hadcm3n_82xq_1980_40_008461745_0	336,960	772,358	2.2921
09 Sep 2013 18:28:22	1212204	15995740	hadcm3n_82xq_1980_40_008461745_0	311,040	710,677	2.2848
08 Sep 2013 22:00:40	1212204	15995740	hadcm3n_82xq_1980_40_008461745_0	285,120	649,271	2.2772
08 Sep 2013 03:25:03	1212204	15995740	hadcm3n_82xq_1980_40_008461745_0	259,200	590,856	2.2795
07 Sep 2013 09:04:09	1212204	15995740	hadcm3n_82xq_1980_40_008461745_0	233,280	531,113	2.2767
06 Sep 2013 16:09:28	1212204	15995740	hadcm3n_82xq_1980_40_008461745_0	207,360	472,774	2.2800
05 Sep 2013 15:25:50	1212204	15995740	hadcm3n_82xq_1980_40_008461745_0	181,440	413,470	2.2788
04 Sep 2013 19:34:58	1212204	15995740	hadcm3n_82xq_1980_40_008461745_0	155,520	353,769	2.2747
04 Sep 2013 00:12:54	1212204	15995740	hadcm3n_82xq_1980_40_008461745_0	129,600	295,267	2.2783
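
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. That reading is an assumption drawn from the figures themselves, not from any CPDN documentation. A minimal Python sketch checking it against three rows copied from the table (the function name `average_sec_per_timestep` is hypothetical):

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table:
# the latest row, the one before it, and the earliest row listed.
rows = [
    (622080, 1439456, 2.3139),
    (596160, 1380031, 2.3149),
    (129600, 295267, 2.2783),
]


def average_sec_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals."""
    return round(cpu_time_sec / timestep, 4)


# Compare the recomputed value with the reported one, allowing for rounding.
results = []
for timestep, cpu_time_sec, reported in rows:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    results.append(abs(computed - reported) < 5e-5)
    print(timestep, computed, reported)
```

All three sampled rows agree with the reported averages to four decimal places, which supports the reading of the column as a running average over the whole task so far, not a per-trickle rate.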