Task 13362841

Name	hadcm3n_o4ng_1940_40_007449080_1
Workunit	7646583
Created	9 Sep 2011, 23:12:05 UTC
Sent	14 Sep 2011, 12:42:32 UTC
Report deadline	14 Dec 2011, 20:09:43 UTC
Received	12 Oct 2011, 17:49:07 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1042736
Run time	16 days 3 hours 53 min 45 sec
CPU time	13 days 12 hours 27 min 30 sec
Validate state	Invalid
Credit	7,153.92
Device peak FLOPS	2.17 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.18</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8748, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6080, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 07:09:32 (3296): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6604, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6604, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6604, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6604, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3132, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3132, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o4ng_1940_40_007449080/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o4ng_1940_40_007449080/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5344, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o4ng_1940_40_007449080/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o4ng_1940_40_007449080/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5344, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o4ng_1940_40_007449080/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o4ng_1940_40_007449080/dataout/ocean_restart.day after 11 attempts CPDN Monitor - Quit request from BOINC... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o4ng_1940_40_007449080/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o4ng_1940_40_007449080/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4848, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o4ng_1940_40_007449080/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o4ng_1940_40_007449080/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4848, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o4ng_1940_40_007449080/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o4ng_1940_40_007449080/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4848, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o4ng_1940_40_007449080/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o4ng_1940_40_007449080/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4848, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
06 Oct 2011 21:12:26	1042736	13362841	hadcm3n_o4ng_1940_40_007449080_1	596,160	1,143,839	1.9187
05 Oct 2011 16:33:35	1042736	13362841	hadcm3n_o4ng_1940_40_007449080_1	570,240	1,093,082	1.9169
04 Oct 2011 23:10:06	1042736	13362841	hadcm3n_o4ng_1940_40_007449080_1	544,320	1,042,660	1.9155
03 Oct 2011 15:36:32	1042736	13362841	hadcm3n_o4ng_1940_40_007449080_1	518,400	992,495	1.9145
02 Oct 2011 10:17:00	1042736	13362841	hadcm3n_o4ng_1940_40_007449080_1	492,480	943,689	1.9162
01 Oct 2011 18:02:12	1042736	13362841	hadcm3n_o4ng_1940_40_007449080_1	466,560	898,277	1.9253
29 Sep 2011 22:41:38	1042736	13362841	hadcm3n_o4ng_1940_40_007449080_1	440,640	849,393	1.9276
29 Sep 2011 05:50:43	1042736	13362841	hadcm3n_o4ng_1940_40_007449080_1	414,720	798,755	1.9260
28 Sep 2011 00:48:28	1042736	13362841	hadcm3n_o4ng_1940_40_007449080_1	388,800	748,323	1.9247
27 Sep 2011 05:25:49	1042736	13362841	hadcm3n_o4ng_1940_40_007449080_1	362,880	699,559	1.9278
26 Sep 2011 12:07:00	1042736	13362841	hadcm3n_o4ng_1940_40_007449080_1	336,960	652,295	1.9358
25 Sep 2011 17:38:06	1042736	13362841	hadcm3n_o4ng_1940_40_007449080_1	311,040	601,466	1.9337
23 Sep 2011 19:15:52	1042736	13362841	hadcm3n_o4ng_1940_40_007449080_1	285,120	551,113	1.9329
22 Sep 2011 15:44:36	1042736	13362841	hadcm3n_o4ng_1940_40_007449080_1	259,200	501,036	1.9330
21 Sep 2011 19:53:19	1042736	13362841	hadcm3n_o4ng_1940_40_007449080_1	233,280	450,181	1.9298
21 Sep 2011 03:28:24	1042736	13362841	hadcm3n_o4ng_1940_40_007449080_1	207,360	398,745	1.9230
19 Sep 2011 22:00:03	1042736	13362841	hadcm3n_o4ng_1940_40_007449080_1	181,440	346,253	1.9084
19 Sep 2011 06:01:09	1042736	13362841	hadcm3n_o4ng_1940_40_007449080_1	155,520	294,948	1.8965
18 Sep 2011 14:47:38	1042736	13362841	hadcm3n_o4ng_1940_40_007449080_1	129,600	246,254	1.9001
17 Sep 2011 07:59:55	1042736	13362841	hadcm3n_o4ng_1940_40_007449080_1	103,680	196,186	1.8922