Task 13466493

Name	hadcm3n_u5qe_1980_40_007458857_2
Workunit	7656360
Created	7 Oct 2011, 19:46:43 UTC
Sent	7 Oct 2011, 19:46:51 UTC
Report deadline	7 Jan 2012, 3:14:02 UTC
Received	15 Nov 2011, 18:27:01 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1047669
Run time	16 days 19 hours 41 min 27 sec
CPU time	16 days 9 hours 3 min 52 sec
Validate state	Invalid
Credit	9,331.20
Device peak FLOPS	2.15 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.12.26</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 14:52:45 (4052): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:06:20 (584): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:50:06 (2608): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:52:20 (3276): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:34:26 (3448): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:51:14 (2176): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:26:26 (2780): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:11:59 (2744): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 13:17:25 (3644): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=328, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=328, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=328, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=328, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=328, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=328, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
31 Oct 2011 16:43:26	1047669	13466493	hadcm3n_u5qe_1980_40_007458857_2	777,600	1,381,183	1.7762
31 Oct 2011 15:41:58	1047669	13466493	hadcm3n_u5qe_1980_40_007458857_2	751,680	1,334,880	1.7759
31 Oct 2011 15:01:53	1047669	13466493	hadcm3n_u5qe_1980_40_007458857_2	725,760	1,288,713	1.7757
31 Oct 2011 13:30:15	1047669	13466493	hadcm3n_u5qe_1980_40_007458857_2	699,840	1,242,445	1.7753
31 Oct 2011 13:30:15	1047669	13466493	hadcm3n_u5qe_1980_40_007458857_2	673,920	1,196,147	1.7749
31 Oct 2011 13:30:15	1047669	13466493	hadcm3n_u5qe_1980_40_007458857_2	648,000	1,149,703	1.7742
31 Oct 2011 13:30:15	1047669	13466493	hadcm3n_u5qe_1980_40_007458857_2	622,080	1,103,371	1.7737
31 Oct 2011 13:30:15	1047669	13466493	hadcm3n_u5qe_1980_40_007458857_2	596,160	1,056,961	1.7729
31 Oct 2011 13:30:15	1047669	13466493	hadcm3n_u5qe_1980_40_007458857_2	570,240	1,010,621	1.7723
31 Oct 2011 13:30:15	1047669	13466493	hadcm3n_u5qe_1980_40_007458857_2	544,320	964,264	1.7715
31 Oct 2011 13:30:15	1047669	13466493	hadcm3n_u5qe_1980_40_007458857_2	518,400	917,942	1.7707
31 Oct 2011 13:30:15	1047669	13466493	hadcm3n_u5qe_1980_40_007458857_2	492,480	871,469	1.7696
31 Oct 2011 13:30:15	1047669	13466493	hadcm3n_u5qe_1980_40_007458857_2	466,560	825,177	1.7686
31 Oct 2011 13:30:15	1047669	13466493	hadcm3n_u5qe_1980_40_007458857_2	440,640	778,960	1.7678
19 Oct 2011 02:48:52	1047669	13466493	hadcm3n_u5qe_1980_40_007458857_2	414,720	732,558	1.7664
18 Oct 2011 13:34:02	1047669	13466493	hadcm3n_u5qe_1980_40_007458857_2	388,800	686,423	1.7655
18 Oct 2011 00:37:11	1047669	13466493	hadcm3n_u5qe_1980_40_007458857_2	362,880	640,335	1.7646
17 Oct 2011 11:08:44	1047669	13466493	hadcm3n_u5qe_1980_40_007458857_2	336,960	594,320	1.7638
16 Oct 2011 21:10:05	1047669	13466493	hadcm3n_u5qe_1980_40_007458857_2	311,040	548,315	1.7628
16 Oct 2011 07:25:49	1047669	13466493	hadcm3n_u5qe_1980_40_007458857_2	285,120	502,307	1.7617