Task 13358659

Name	hadcm3n_o22p_1940_40_007447192_1
Workunit	7644695
Created	9 Sep 2011, 17:26:21 UTC
Sent	16 Sep 2011, 11:36:49 UTC
Report deadline	16 Dec 2011, 19:04:00 UTC
Received	23 Nov 2011, 18:50:09 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1169293
Run time	28 days 6 hours 32 min 15 sec
CPU time	19 days 7 hours 53 min 42 sec
Validate state	Invalid
Credit	11,819.52
Device peak FLOPS	2.81 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 14:43:51 (912): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:28:36 (4244): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 01:59:50 (5052): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:54:30 (4948): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:59:51 (5280): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 12:20:48 (4828): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 19:28:48 (2684): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 06:30:44 (1380): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:34:14 (3164): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 21:40:34 (5152): No heartbeat from core client for 30 sec - exiting 21:40:35 (5152): No heartbeat from core client for 30 sec - exiting 21:40:36 (5152): No heartbeat from core client for 30 sec - exiting 21:40:37 (5152): No heartbeat from core client for 30 sec - exiting 21:40:38 (5152): No heartbeat from core client for 30 sec - exiting 21:40:39 (5152): No heartbeat from core client for 30 sec - exiting 21:40:40 (5152): No heartbeat from core client for 30 sec - exiting 21:40:41 (5152): No heartbeat from core client for 30 sec - exiting 21:40:42 (5152): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:40:44 (5152): No heartbeat from core client for 30 sec - exiting 21:40:45 (5152): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 07:43:59 (2292): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8476, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8476, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... 
Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8476, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8476, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8476, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8476, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
21 Nov 2011 09:33:41	1169293	13358659	hadcm3n_o22p_1940_40_007447192_1	984,960	1,659,003	1.6843
19 Nov 2011 10:34:03	1169293	13358659	hadcm3n_o22p_1940_40_007447192_1	959,040	1,614,289	1.6832
17 Nov 2011 08:58:25	1169293	13358659	hadcm3n_o22p_1940_40_007447192_1	933,120	1,572,211	1.6849
15 Nov 2011 23:10:26	1169293	13358659	hadcm3n_o22p_1940_40_007447192_1	907,200	1,527,884	1.6842
09 Nov 2011 05:10:03	1169293	13358659	hadcm3n_o22p_1940_40_007447192_1	881,280	1,484,184	1.6841
08 Nov 2011 15:20:19	1169293	13358659	hadcm3n_o22p_1940_40_007447192_1	855,360	1,441,308	1.6850
08 Nov 2011 00:07:49	1169293	13358659	hadcm3n_o22p_1940_40_007447192_1	829,440	1,398,016	1.6855
07 Nov 2011 01:51:27	1169293	13358659	hadcm3n_o22p_1940_40_007447192_1	803,520	1,349,642	1.6797
06 Nov 2011 05:47:00	1169293	13358659	hadcm3n_o22p_1940_40_007447192_1	777,600	1,302,925	1.6756
05 Nov 2011 09:54:41	1169293	13358659	hadcm3n_o22p_1940_40_007447192_1	751,680	1,256,782	1.6720
04 Nov 2011 11:58:32	1169293	13358659	hadcm3n_o22p_1940_40_007447192_1	725,760	1,212,120	1.6701
31 Oct 2011 18:38:49	1169293	13358659	hadcm3n_o22p_1940_40_007447192_1	699,840	1,168,637	1.6699
31 Oct 2011 18:15:57	1169293	13358659	hadcm3n_o22p_1940_40_007447192_1	673,920	1,123,404	1.6670
31 Oct 2011 17:24:05	1169293	13358659	hadcm3n_o22p_1940_40_007447192_1	648,000	1,078,183	1.6639
31 Oct 2011 17:07:52	1169293	13358659	hadcm3n_o22p_1940_40_007447192_1	622,080	1,033,095	1.6607
31 Oct 2011 16:38:19	1169293	13358659	hadcm3n_o22p_1940_40_007447192_1	596,160	988,171	1.6576
31 Oct 2011 15:36:08	1169293	13358659	hadcm3n_o22p_1940_40_007447192_1	570,240	944,066	1.6556
31 Oct 2011 14:12:18	1169293	13358659	hadcm3n_o22p_1940_40_007447192_1	544,320	900,265	1.6539
31 Oct 2011 14:12:18	1169293	13358659	hadcm3n_o22p_1940_40_007447192_1	518,400	857,072	1.6533
31 Oct 2011 14:12:18	1169293	13358659	hadcm3n_o22p_1940_40_007447192_1	492,480	812,605	1.6500
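
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. The page does not state this, so it is an assumption, but the listed values are consistent with it. A minimal Python sketch under that assumption, using three rows copied from the table above (the function name is hypothetical, chosen for illustration):

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds divided by timesteps completed, rounded to 4 places.

    Assumption: this is how the Average (sec/TS) column is derived.
    """
    return round(cpu_time_sec / timestep, 4)


# (timestep, cpu_time_sec, listed average) copied from the trickle table
rows = [
    (984_960, 1_659_003, 1.6843),
    (959_040, 1_614_289, 1.6832),
    (492_480, 812_605, 1.6500),
]

for timestep, cpu_time_sec, listed in rows:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    print(f"{timestep:>9,}  computed {computed:.4f}  listed {listed:.4f}")
```

Because the average is cumulative, the slow rise from 1.6500 to 1.6843 over the run suggests that later timesteps took slightly longer than earlier ones, rather than indicating a sudden slowdown.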