Task 12883917

Name	hadcm3n_p51u_1900_40_007224130_2
Workunit	7422370
Created	13 May 2011, 5:55:27 UTC
Sent	13 May 2011, 5:59:54 UTC
Report deadline	12 Aug 2011, 13:27:05 UTC
Received	23 May 2011, 23:47:17 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1141568
Run time	6 days 13 hours 33 min 29 sec
CPU time	6 days 12 hours 44 min 15 sec
Validate state	Invalid
Credit	4,043.52
Device peak FLOPS	2.58 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.60</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 05:44:00 (2712): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4488, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4488, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish 14:24:16 (4488): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4900, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4900, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4900, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4900, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
23 May 2011 18:16:28	1141568	12883917	hadcm3n_p51u_1900_40_007224130_2	336,960	564,297	1.6747
23 May 2011 04:50:42	1141568	12883917	hadcm3n_p51u_1900_40_007224130_2	311,040	520,880	1.6746
22 May 2011 06:30:06	1141568	12883917	hadcm3n_p51u_1900_40_007224130_2	285,120	476,773	1.6722
21 May 2011 12:04:14	1141568	12883917	hadcm3n_p51u_1900_40_007224130_2	259,200	433,642	1.6730
20 May 2011 19:14:37	1141568	12883917	hadcm3n_p51u_1900_40_007224130_2	233,280	391,166	1.6768
20 May 2011 07:27:52	1141568	12883917	hadcm3n_p51u_1900_40_007224130_2	207,360	348,820	1.6822
19 May 2011 17:13:56	1141568	12883917	hadcm3n_p51u_1900_40_007224130_2	181,440	305,371	1.6830
19 May 2011 03:48:43	1141568	12883917	hadcm3n_p51u_1900_40_007224130_2	155,520	262,170	1.6858
18 May 2011 11:10:49	1141568	12883917	hadcm3n_p51u_1900_40_007224130_2	129,600	217,021	1.6745
17 May 2011 18:38:35	1141568	12883917	hadcm3n_p51u_1900_40_007224130_2	103,680	172,315	1.6620
17 May 2011 04:02:17	1141568	12883917	hadcm3n_p51u_1900_40_007224130_2	77,760	129,051	1.6596
16 May 2011 06:15:41	1141568	12883917	hadcm3n_p51u_1900_40_007224130_2	51,840	86,147	1.6618
15 May 2011 04:19:03	1141568	12883917	hadcm3n_p51u_1900_40_007224130_2	25,920	42,760	1.6497