Task 13293677

Name	hadcm3n_p378_1940_40_007422489_1
Workunit	7620124
Created	25 Aug 2011, 5:59:08 UTC
Sent	25 Aug 2011, 6:06:29 UTC
Report deadline	24 Nov 2011, 13:33:40 UTC
Received	19 Sep 2011, 13:48:31 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1106224
Run time	11 days 0 hours 54 min 50 sec
CPU time	10 days 6 hours 30 min
Validate state	Invalid
Credit	4,665.60
Device peak FLOPS	2.40 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.12.33</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3340, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3444, iMonCtr=1 Model crash detected, will try to restart... 09:05:52 (3444): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3788, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3788, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3488, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3488, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3488, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
18 Sep 2011 22:09:41	1106224	13293677	hadcm3n_p378_1940_40_007422489_1	388,800	834,412	2.1461
18 Sep 2011 05:07:48	1106224	13293677	hadcm3n_p378_1940_40_007422489_1	362,880	778,857	2.1463
17 Sep 2011 13:02:30	1106224	13293677	hadcm3n_p378_1940_40_007422489_1	336,960	722,025	2.1428
16 Sep 2011 19:16:53	1106224	13293677	hadcm3n_p378_1940_40_007422489_1	311,040	665,302	2.1390
16 Sep 2011 02:47:11	1106224	13293677	hadcm3n_p378_1940_40_007422489_1	285,120	608,406	2.1339
15 Sep 2011 07:18:00	1106224	13293677	hadcm3n_p378_1940_40_007422489_1	259,200	551,892	2.1292
14 Sep 2011 11:33:05	1106224	13293677	hadcm3n_p378_1940_40_007422489_1	233,280	496,151	2.1268
13 Sep 2011 18:54:41	1106224	13293677	hadcm3n_p378_1940_40_007422489_1	207,360	441,724	2.1302
13 Sep 2011 02:41:22	1106224	13293677	hadcm3n_p378_1940_40_007422489_1	181,440	385,141	2.1227
09 Sep 2011 18:10:34	1106224	13293677	hadcm3n_p378_1940_40_007422489_1	155,520	329,724	2.1201
09 Sep 2011 02:26:17	1106224	13293677	hadcm3n_p378_1940_40_007422489_1	129,600	275,434	2.1253
08 Sep 2011 10:42:29	1106224	13293677	hadcm3n_p378_1940_40_007422489_1	103,680	220,907	2.1307
07 Sep 2011 16:54:33	1106224	13293677	hadcm3n_p378_1940_40_007422489_1	77,760	166,152	2.1367
07 Sep 2011 00:01:10	1106224	13293677	hadcm3n_p378_1940_40_007422489_1	51,840	110,870	2.1387
06 Sep 2011 07:39:21	1106224	13293677	hadcm3n_p378_1940_40_007422489_1	25,920	55,644	2.1468