Task 14251838

Name	hadcm3n_o4m1_1980_40_007753135_2
Workunit	7908244
Created	12 Mar 2012, 3:07:37 UTC
Sent	12 Mar 2012, 3:07:45 UTC
Report deadline	11 Jun 2012, 10:34:56 UTC
Received	23 Mar 2012, 9:43:59 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1144522
Run time	10 days 9 hours 37 min 33 sec
CPU time	8 days 23 hours 46 min 38 sec
Validate state	Invalid
Credit	5,287.68
Device peak FLOPS	2.99 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.12.34</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 08:37:42 (11740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4756, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4212, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4212, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4212, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4212, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4212, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4212, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
22 Mar 2012 14:27:32	1144522	14251838	hadcm3n_o4m1_1980_40_007753135_2	440,640	763,421	1.7325
21 Mar 2012 18:50:25	1144522	14251838	hadcm3n_o4m1_1980_40_007753135_2	414,720	718,449	1.7324
21 Mar 2012 04:30:10	1144522	14251838	hadcm3n_o4m1_1980_40_007753135_2	388,800	673,167	1.7314
20 Mar 2012 13:59:15	1144522	14251838	hadcm3n_o4m1_1980_40_007753135_2	362,880	627,919	1.7304
19 Mar 2012 23:43:47	1144522	14251838	hadcm3n_o4m1_1980_40_007753135_2	336,960	583,262	1.7310
19 Mar 2012 09:35:59	1144522	14251838	hadcm3n_o4m1_1980_40_007753135_2	311,040	538,016	1.7297
18 Mar 2012 19:14:41	1144522	14251838	hadcm3n_o4m1_1980_40_007753135_2	285,120	493,683	1.7315
18 Mar 2012 04:52:26	1144522	14251838	hadcm3n_o4m1_1980_40_007753135_2	259,200	448,968	1.7321
17 Mar 2012 13:54:48	1144522	14251838	hadcm3n_o4m1_1980_40_007753135_2	233,280	404,174	1.7326
16 Mar 2012 23:39:24	1144522	14251838	hadcm3n_o4m1_1980_40_007753135_2	207,360	360,390	1.7380
16 Mar 2012 09:20:32	1144522	14251838	hadcm3n_o4m1_1980_40_007753135_2	181,440	315,580	1.7393
15 Mar 2012 19:11:33	1144522	14251838	hadcm3n_o4m1_1980_40_007753135_2	155,520	271,002	1.7426
15 Mar 2012 04:26:22	1144522	14251838	hadcm3n_o4m1_1980_40_007753135_2	129,600	226,238	1.7457
14 Mar 2012 13:34:46	1144522	14251838	hadcm3n_o4m1_1980_40_007753135_2	103,680	180,909	1.7449
13 Mar 2012 23:13:45	1144522	14251838	hadcm3n_o4m1_1980_40_007753135_2	77,760	135,867	1.7473
13 Mar 2012 08:33:37	1144522	14251838	hadcm3n_o4m1_1980_40_007753135_2	51,840	90,547	1.7467
12 Mar 2012 18:47:28	1144522	14251838	hadcm3n_o4m1_1980_40_007753135_2	25,920	45,311	1.7481