Task 13347322

Name	hadcm3n_t0tj_1940_40_007442780_1
Workunit	7640283
Created	8 Sep 2011, 22:13:23 UTC
Sent	8 Sep 2011, 22:18:55 UTC
Report deadline	9 Dec 2011, 5:46:06 UTC
Received	19 Sep 2011, 14:37:27 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1122356
Run time	4 days 8 hours 58 min 39 sec
CPU time	4 days 5 hours 24 min 30 sec
Validate state	Invalid
Credit	3,110.40
Device peak FLOPS	2.82 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
01:00:13 (3660): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3448, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3448, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2028, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2028, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4956, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2544, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
15 Sep 2011 07:23:05	1122356	13347322	hadcm3n_t0tj_1940_40_007442780_1	259,200	345,541	1.3331
14 Sep 2011 21:25:44	1122356	13347322	hadcm3n_t0tj_1940_40_007442780_1	233,280	310,092	1.3293
14 Sep 2011 10:32:02	1122356	13347322	hadcm3n_t0tj_1940_40_007442780_1	207,360	275,049	1.3264
14 Sep 2011 00:49:05	1122356	13347322	hadcm3n_t0tj_1940_40_007442780_1	181,440	241,228	1.3295
13 Sep 2011 14:59:22	1122356	13347322	hadcm3n_t0tj_1940_40_007442780_1	155,520	206,785	1.3296
13 Sep 2011 05:51:22	1122356	13347322	hadcm3n_t0tj_1940_40_007442780_1	129,600	172,491	1.3309
12 Sep 2011 19:23:10	1122356	13347322	hadcm3n_t0tj_1940_40_007442780_1	103,680	138,129	1.3323
12 Sep 2011 07:13:09	1122356	13347322	hadcm3n_t0tj_1940_40_007442780_1	77,760	107,502	1.3825
11 Sep 2011 14:03:02	1122356	13347322	hadcm3n_t0tj_1940_40_007442780_1	51,840	71,451	1.3783
10 Sep 2011 21:27:48	1122356	13347322	hadcm3n_t0tj_1940_40_007442780_1	25,920	35,659	1.3757
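
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. That relationship is an inference from the rows above, not something the page states. A minimal Python sketch, using three rows copied from the table and a hypothetical helper name `average_sec_per_timestep`, recomputes the value for comparison:

```python
# Rows copied from the trickle table above:
# (timestep, cpu_time_sec, reported_average_sec_per_ts)
trickles = [
    (259_200, 345_541, 1.3331),
    (77_760, 107_502, 1.3825),
    (25_920, 35_659, 1.3757),
]


def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Assumed definition: cumulative CPU seconds per cumulative timestep."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in trickles:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    # Print the recomputed value next to the value shown in the table.
    print(f"{timestep:>8,}  computed={computed:.4f}  reported={reported:.4f}")
```

Under that assumption, the recomputed values agree with the reported column to four decimal places for these rows.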