Task 15636766

Name	hadcm3n_n01y_1920_40_008321527_0
Workunit	8472662
Created	24 Feb 2013, 19:47:26 UTC
Sent	24 Feb 2013, 19:52:27 UTC
Report deadline	27 May 2013, 3:19:38 UTC
Received	1 May 2013, 19:14:24 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1242229
Run time	6 days 14 hours 32 min 20 sec
CPU time	6 days 10 hours 16 min 27 sec
Validate state	Invalid
Credit	3,110.40
Device peak FLOPS	2.85 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 03:27:07 (6588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:12:18 (3036): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:04:02 (5040): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:04:04 (5040): No heartbeat from core client for 30 sec - exiting 18:04:05 (5040): No heartbeat from core client for 30 sec - exiting 18:04:06 (5040): No heartbeat from core client for 30 sec - exiting 18:04:07 (5040): No heartbeat from core client for 30 sec - exiting 19:26:08 (9444): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6092, iMonCtr=1 Model crash detected, will try to restart... 01:00:15 (6208): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 14:22:41 (6660): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4012, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5916, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5916, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5760, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8656, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6380, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
03 Mar 2013 15:45:40	1242229	15636766	hadcm3n_n01y_1920_40_008321527_0	259,200	512,652	1.9778
03 Mar 2013 00:48:57	1242229	15636766	hadcm3n_n01y_1920_40_008321527_0	233,280	460,505	1.9740
02 Mar 2013 10:18:55	1242229	15636766	hadcm3n_n01y_1920_40_008321527_0	207,360	408,878	1.9718
01 Mar 2013 21:02:18	1242229	15636766	hadcm3n_n01y_1920_40_008321527_0	181,440	361,317	1.9914
01 Mar 2013 08:02:18	1242229	15636766	hadcm3n_n01y_1920_40_008321527_0	155,520	314,836	2.0244
28 Feb 2013 01:15:44	1242229	15636766	hadcm3n_n01y_1920_40_008321527_0	129,600	265,941	2.0520
27 Feb 2013 10:37:14	1242229	15636766	hadcm3n_n01y_1920_40_008321527_0	103,680	214,674	2.0705
26 Feb 2013 17:41:36	1242229	15636766	hadcm3n_n01y_1920_40_008321527_0	77,760	159,743	2.0543
26 Feb 2013 01:22:47	1242229	15636766	hadcm3n_n01y_1920_40_008321527_0	51,840	103,066	1.9882
25 Feb 2013 08:44:44	1242229	15636766	hadcm3n_n01y_1920_40_008321527_0	25,920	45,912	1.7713