Task 14360019

Name	hadcm3n_o2o9_2020_40_007857136_1
Workunit	8012248
Created	4 Apr 2012, 21:12:22 UTC
Sent	4 Apr 2012, 23:14:54 UTC
Report deadline	5 Jul 2012, 6:42:05 UTC
Received	18 Apr 2012, 1:11:14 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1244151
Run time	8 days 21 hours 29 min 32 sec
CPU time	8 days 21 hours 11 min 43 sec
Validate state	Invalid
Credit	9,020.16
Device peak FLOPS	3.35 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.25</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2900, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 02:04:47 (3520): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... OPEN: Unable to Open File dataout/o2o9ko.dao77b0 for Read/Write Model crashed: DUMPCTL : Fail to open output dump - may already exist tmp/pipe_dummy 2048 06:39:39 (4436): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4468, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4468, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4468, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4468, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4468, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4468, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
15 Apr 2012 18:18:17	1209916	14360019	hadcm3n_o2o9_2020_40_007857136_1	751,680	762,677	1.0146
15 Apr 2012 12:38:17	1209916	14360019	hadcm3n_o2o9_2020_40_007857136_1	725,760	739,870	1.0194
15 Apr 2012 05:20:40	1209916	14360019	hadcm3n_o2o9_2020_40_007857136_1	699,840	716,973	1.0245
14 Apr 2012 16:46:53	1209916	14360019	hadcm3n_o2o9_2020_40_007857136_1	673,920	694,046	1.0299
14 Apr 2012 10:53:36	1209916	14360019	hadcm3n_o2o9_2020_40_007857136_1	648,000	671,076	1.0356
14 Apr 2012 03:56:27	1209916	14360019	hadcm3n_o2o9_2020_40_007857136_1	622,080	647,793	1.0413
13 Apr 2012 14:17:50	1209916	14360019	hadcm3n_o2o9_2020_40_007857136_1	596,160	624,711	1.0479
13 Apr 2012 07:56:01	1209916	14360019	hadcm3n_o2o9_2020_40_007857136_1	570,240	601,396	1.0546
13 Apr 2012 01:24:06	1209916	14360019	hadcm3n_o2o9_2020_40_007857136_1	544,320	578,194	1.0622
12 Apr 2012 17:09:05	1209916	14360019	hadcm3n_o2o9_2020_40_007857136_1	518,400	555,023	1.0706
12 Apr 2012 03:13:14	1209916	14360019	hadcm3n_o2o9_2020_40_007857136_1	492,480	531,558	1.0793
11 Apr 2012 18:33:07	1209916	14360019	hadcm3n_o2o9_2020_40_007857136_1	466,560	507,802	1.0884
11 Apr 2012 11:21:00	1209916	14360019	hadcm3n_o2o9_2020_40_007857136_1	440,640	483,366	1.0970
11 Apr 2012 05:43:55	1209916	14360019	hadcm3n_o2o9_2020_40_007857136_1	414,720	460,040	1.1093
10 Apr 2012 19:09:05	1209916	14360019	hadcm3n_o2o9_2020_40_007857136_1	388,800	437,163	1.1244
10 Apr 2012 12:42:53	1209916	14360019	hadcm3n_o2o9_2020_40_007857136_1	362,880	413,895	1.1406
10 Apr 2012 04:11:11	1209916	14360019	hadcm3n_o2o9_2020_40_007857136_1	336,960	390,238	1.1581
09 Apr 2012 19:09:05	1209916	14360019	hadcm3n_o2o9_2020_40_007857136_1	311,040	360,214	1.1581
09 Apr 2012 10:51:47	1209916	14360019	hadcm3n_o2o9_2020_40_007857136_1	285,120	330,105	1.1578
09 Apr 2012 02:27:34	1209916	14360019	hadcm3n_o2o9_2020_40_007857136_1	259,200	300,039	1.1576