Task 15794630

Name	hadcm3n_4e7x_1940_40_008311469_2
Workunit	8462604
Created	23 May 2013, 19:46:59 UTC
Sent	23 May 2013, 19:47:06 UTC
Report deadline	23 Aug 2013, 3:14:17 UTC
Received	19 Jun 2013, 21:42:03 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1227663
Run time	9 days 23 hours 4 min 13 sec
CPU time	9 days 9 hours 11 min 41 sec
Validate state	Invalid
Credit	5,909.76
Device peak FLOPS	2.70 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=912, iMonCtr=1 Model crash detected, will try to restart... 11:42:17 (1708): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3844, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3844, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3844, iMonCtr=1 Model crash detected, will try to restart... 12:01:03 (928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:37:21 (5036): No heartbeat from core client for 30 sec - exiting 16:37:22 (5036): No heartbeat from core client for 30 sec - exiting 16:37:23 (5036): No heartbeat from core client for 30 sec - exiting 16:37:24 (5036): No heartbeat from core client for 30 sec - exiting 16:37:25 (5036): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1384, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1384, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1384, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1384, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1384, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1384, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
19 Jun 2013 04:49:38	1227663	15794630	hadcm3n_4e7x_1940_40_008311469_2	492,480	830,985	1.6873
17 Jun 2013 20:41:06	1227663	15794630	hadcm3n_4e7x_1940_40_008311469_2	466,560	785,317	1.6832
10 Jun 2013 22:33:21	1227663	15794630	hadcm3n_4e7x_1940_40_008311469_2	440,640	740,137	1.6797
09 Jun 2013 15:42:20	1227663	15794630	hadcm3n_4e7x_1940_40_008311469_2	414,720	698,387	1.6840
09 Jun 2013 03:47:45	1227663	15794630	hadcm3n_4e7x_1940_40_008311469_2	388,800	657,082	1.6900
08 Jun 2013 16:12:10	1227663	15794630	hadcm3n_4e7x_1940_40_008311469_2	362,880	615,657	1.6966
08 Jun 2013 04:36:55	1227663	15794630	hadcm3n_4e7x_1940_40_008311469_2	336,960	574,243	1.7042
07 Jun 2013 15:46:03	1227663	15794630	hadcm3n_4e7x_1940_40_008311469_2	311,040	529,032	1.7008
07 Jun 2013 03:18:07	1227663	15794630	hadcm3n_4e7x_1940_40_008311469_2	285,120	484,806	1.7004
05 Jun 2013 23:42:12	1227663	15794630	hadcm3n_4e7x_1940_40_008311469_2	259,200	439,768	1.6966
05 Jun 2013 11:06:20	1227663	15794630	hadcm3n_4e7x_1940_40_008311469_2	233,280	394,841	1.6926
04 Jun 2013 22:05:12	1227663	15794630	hadcm3n_4e7x_1940_40_008311469_2	207,360	348,429	1.6803
04 Jun 2013 14:17:18	1227663	15794630	hadcm3n_4e7x_1940_40_008311469_2	181,440	302,504	1.6672
03 Jun 2013 19:35:27	1227663	15794630	hadcm3n_4e7x_1940_40_008311469_2	155,520	254,802	1.6384
28 May 2013 23:44:38	1227663	15794630	hadcm3n_4e7x_1940_40_008311469_2	129,600	208,651	1.6100
28 May 2013 11:54:53	1227663	15794630	hadcm3n_4e7x_1940_40_008311469_2	103,680	166,452	1.6054
28 May 2013 00:00:26	1227663	15794630	hadcm3n_4e7x_1940_40_008311469_2	77,760	124,104	1.5960
24 May 2013 18:54:13	1227663	15794630	hadcm3n_4e7x_1940_40_008311469_2	51,840	82,474	1.5909
24 May 2013 07:22:24	1227663	15794630	hadcm3n_4e7x_1940_40_008311469_2	25,920	41,164	1.5881