Task 15492188

Name	hadcm3n_3c6z_1940_40_008263435_0
Workunit	8418559
Created	21 Dec 2012, 5:17:54 UTC
Sent	21 Dec 2012, 5:22:51 UTC
Report deadline	22 Mar 2013, 12:50:02 UTC
Received	9 Jan 2013, 0:23:09 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1223919
Run time	11 days 10 hours 9 min 23 sec
CPU time	11 days 2 hours 22 min 3 sec
Validate state	Invalid
Credit	5,598.72
Device peak FLOPS	1.81 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3884, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 17:58:41 (3276): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:58:42 (3276): No heartbeat from core client for 30 sec - exiting 17:58:43 (3276): No heartbeat from core client for 30 sec - exiting 17:58:44 (3276): No heartbeat from core client for 30 sec - exiting 17:58:45 (3276): No heartbeat from core client for 30 sec - exiting 17:58:46 (3276): No heartbeat from core client for 30 sec - exiting 17:58:47 (3276): No heartbeat from core client for 30 sec - exiting 17:58:48 (3276): No heartbeat from core client for 30 sec - exiting 17:58:49 (3276): No heartbeat from core client for 30 sec - exiting 17:58:50 (3276): No heartbeat from core client for 30 sec - exiting 17:58:51 (3276): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2732, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2732, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2732, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2732, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=48148, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=48148, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
05 Jan 2013 08:05:36	1223919	15492188	hadcm3n_3c6z_1940_40_008263435_0	466,560	919,111	1.9700
04 Jan 2013 17:37:50	1223919	15492188	hadcm3n_3c6z_1940_40_008263435_0	440,640	868,535	1.9711
04 Jan 2013 03:29:13	1223919	15492188	hadcm3n_3c6z_1940_40_008263435_0	414,720	817,815	1.9720
03 Jan 2013 13:25:21	1223919	15492188	hadcm3n_3c6z_1940_40_008263435_0	388,800	767,046	1.9729
02 Jan 2013 23:23:49	1223919	15492188	hadcm3n_3c6z_1940_40_008263435_0	362,880	716,713	1.9751
02 Jan 2013 09:10:12	1223919	15492188	hadcm3n_3c6z_1940_40_008263435_0	336,960	665,996	1.9765
01 Jan 2013 19:07:05	1223919	15492188	hadcm3n_3c6z_1940_40_008263435_0	311,040	615,523	1.9789
01 Jan 2013 04:20:23	1223919	15492188	hadcm3n_3c6z_1940_40_008263435_0	285,120	564,059	1.9783
31 Dec 2012 06:38:03	1223919	15492188	hadcm3n_3c6z_1940_40_008263435_0	259,200	514,338	1.9843
30 Dec 2012 16:16:09	1223919	15492188	hadcm3n_3c6z_1940_40_008263435_0	233,280	463,958	1.9888
29 Dec 2012 10:05:39	1223919	15492188	hadcm3n_3c6z_1940_40_008263435_0	207,360	413,803	1.9956
28 Dec 2012 16:36:13	1223919	15492188	hadcm3n_3c6z_1940_40_008263435_0	181,440	361,866	1.9944
28 Dec 2012 00:47:34	1223919	15492188	hadcm3n_3c6z_1940_40_008263435_0	155,520	310,224	1.9948
24 Dec 2012 17:11:25	1223919	15492188	hadcm3n_3c6z_1940_40_008263435_0	129,600	258,407	1.9939
24 Dec 2012 02:34:54	1223919	15492188	hadcm3n_3c6z_1940_40_008263435_0	103,680	206,344	1.9902
23 Dec 2012 12:48:53	1223919	15492188	hadcm3n_3c6z_1940_40_008263435_0	77,760	154,396	1.9855
22 Dec 2012 18:51:47	1223919	15492188	hadcm3n_3c6z_1940_40_008263435_0	51,840	102,520	1.9776
22 Dec 2012 03:58:25	1223919	15492188	hadcm3n_3c6z_1940_40_008263435_0	25,920	50,650	1.9541