Task 14642256

Name	hadcm3n_106j_1940_40_007955653_0
Workunit	8110765
Created	8 May 2012, 21:13:43 UTC
Sent	11 May 2012, 16:21:59 UTC
Report deadline	10 Aug 2012, 23:49:10 UTC
Received	7 Jun 2012, 21:55:23 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1114073
Run time	14 days 7 hours 18 min 48 sec
CPU time	14 days 0 hours 4 min 10 sec
Validate state	Invalid
Credit	7,153.92
Device peak FLOPS	2.65 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10884, iMonCtr=1 Model crash detected, will try to restart... 13:58:16 (5484): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5056, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5056, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5056, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5056, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5056, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5056, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
04 Jun 2012 18:44:26	1114073	14642256	hadcm3n_106j_1940_40_007955653_0	596,160	1,196,467	2.0070
04 Jun 2012 03:37:16	1114073	14642256	hadcm3n_106j_1940_40_007955653_0	570,240	1,144,189	2.0065
03 Jun 2012 12:37:48	1114073	14642256	hadcm3n_106j_1940_40_007955653_0	544,320	1,092,280	2.0067
02 Jun 2012 22:28:41	1114073	14642256	hadcm3n_106j_1940_40_007955653_0	518,400	1,042,087	2.0102
02 Jun 2012 09:23:59	1114073	14642256	hadcm3n_106j_1940_40_007955653_0	492,480	992,310	2.0149
01 Jun 2012 19:01:33	1114073	14642256	hadcm3n_106j_1940_40_007955653_0	466,560	942,919	2.0210
01 Jun 2012 04:57:01	1114073	14642256	hadcm3n_106j_1940_40_007955653_0	440,640	893,254	2.0272
31 May 2012 13:48:50	1114073	14642256	hadcm3n_106j_1940_40_007955653_0	414,720	838,885	2.0228
30 May 2012 22:30:20	1114073	14642256	hadcm3n_106j_1940_40_007955653_0	388,800	784,390	2.0175
30 May 2012 06:09:56	1114073	14642256	hadcm3n_106j_1940_40_007955653_0	362,880	730,446	2.0129
29 May 2012 14:58:20	1114073	14642256	hadcm3n_106j_1940_40_007955653_0	336,960	677,201	2.0097
28 May 2012 23:48:11	1114073	14642256	hadcm3n_106j_1940_40_007955653_0	311,040	623,946	2.0060
28 May 2012 08:55:07	1114073	14642256	hadcm3n_106j_1940_40_007955653_0	285,120	571,969	2.0061
27 May 2012 18:18:48	1114073	14642256	hadcm3n_106j_1940_40_007955653_0	259,200	519,594	2.0046
27 May 2012 03:21:34	1114073	14642256	hadcm3n_106j_1940_40_007955653_0	233,280	468,557	2.0086
26 May 2012 12:15:07	1114073	14642256	hadcm3n_106j_1940_40_007955653_0	207,360	415,669	2.0046
25 May 2012 21:34:59	1114073	14642256	hadcm3n_106j_1940_40_007955653_0	181,440	364,460	2.0087
25 May 2012 06:50:32	1114073	14642256	hadcm3n_106j_1940_40_007955653_0	155,520	312,610	2.0101
24 May 2012 15:44:13	1114073	14642256	hadcm3n_106j_1940_40_007955653_0	129,600	260,163	2.0074
24 May 2012 01:14:25	1114073	14642256	hadcm3n_106j_1940_40_007955653_0	103,680	209,175	2.0175