Task 13956265

Name	hadcm3n_o1yn_1940_40_007693832_0
Workunit	7848940
Created	23 Jan 2012, 19:11:48 UTC
Sent	23 Jan 2012, 19:14:19 UTC
Report deadline	24 Apr 2012, 2:41:30 UTC
Received	16 Feb 2012, 19:28:21 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	950229
Run time	14 days 15 hours 31 min 27 sec
CPU time	14 days 8 hours 32 min 46 sec
Validate state	Invalid
Credit	5,287.68
Device peak FLOPS	1.98 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.12.34</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3944, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3796, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=136, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4112, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4112, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4112, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4112, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4112, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4112, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o1yn_1940_40_007693832/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o1yn_1940_40_007693832/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6100, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o1yn_1940_40_007693832/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o1yn_1940_40_007693832/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6100, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o1yn_1940_40_007693832/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o1yn_1940_40_007693832/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6100, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o1yn_1940_40_007693832/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o1yn_1940_40_007693832/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6100, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o1yn_1940_40_007693832/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o1yn_1940_40_007693832/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6100, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o1yn_1940_40_007693832/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o1yn_1940_40_007693832/dataout/ocean_restart.day after 11 attempts CPDN Monitor - Quit request from BOINC... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o1yn_1940_40_007693832/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o1yn_1940_40_007693832/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6252, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
16 Feb 2012 04:08:22	950229	13956265	hadcm3n_o1yn_1940_40_007693832_0	440,640	1,186,956	2.6937
15 Feb 2012 08:05:57	950229	13956265	hadcm3n_o1yn_1940_40_007693832_0	414,720	1,115,926	2.6908
14 Feb 2012 12:15:20	950229	13956265	hadcm3n_o1yn_1940_40_007693832_0	388,800	1,045,054	2.6879
13 Feb 2012 16:12:59	950229	13956265	hadcm3n_o1yn_1940_40_007693832_0	362,880	974,759	2.6862
12 Feb 2012 20:21:00	950229	13956265	hadcm3n_o1yn_1940_40_007693832_0	336,960	904,778	2.6851
12 Feb 2012 00:34:27	950229	13956265	hadcm3n_o1yn_1940_40_007693832_0	311,040	834,705	2.6836
11 Feb 2012 05:30:26	950229	13956265	hadcm3n_o1yn_1940_40_007693832_0	285,120	764,829	2.6825
10 Feb 2012 08:48:14	950229	13956265	hadcm3n_o1yn_1940_40_007693832_0	259,200	694,869	2.6808
09 Feb 2012 12:55:16	950229	13956265	hadcm3n_o1yn_1940_40_007693832_0	233,280	624,693	2.6779
08 Feb 2012 17:12:38	950229	13956265	hadcm3n_o1yn_1940_40_007693832_0	207,360	554,875	2.6759
07 Feb 2012 21:13:57	950229	13956265	hadcm3n_o1yn_1940_40_007693832_0	181,440	485,290	2.6747
07 Feb 2012 01:18:26	950229	13956265	hadcm3n_o1yn_1940_40_007693832_0	155,520	415,440	2.6713
06 Feb 2012 05:46:11	950229	13956265	hadcm3n_o1yn_1940_40_007693832_0	129,600	345,985	2.6696
05 Feb 2012 10:11:01	950229	13956265	hadcm3n_o1yn_1940_40_007693832_0	103,680	276,829	2.6700
04 Feb 2012 14:47:08	950229	13956265	hadcm3n_o1yn_1940_40_007693832_0	77,760	207,417	2.6674
03 Feb 2012 19:04:24	950229	13956265	hadcm3n_o1yn_1940_40_007693832_0	51,840	138,307	2.6680
03 Feb 2012 00:03:39	950229	13956265	hadcm3n_o1yn_1940_40_007693832_0	25,920	69,089	2.6655