Task 13110532

Name	hadcm3n_yfsp_1900_40_007353331_1
Workunit	7550761
Created	6 Jul 2011, 14:26:17 UTC
Sent	15 Jul 2011, 17:14:36 UTC
Report deadline	15 Oct 2011, 0:41:47 UTC
Received	26 Jul 2011, 10:44:11 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1393537
Run time	9 days 21 hours 55 min 55 sec
CPU time	9 days 19 hours 2 min 36 sec
Validate state	Invalid
Credit	3,421.44
Device peak FLOPS	1.73 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.12.33</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=616, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1788, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4632, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4632, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4632, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4632, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4148, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yfsp_1900_40_007353331/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yfsp_1900_40_007353331/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5332, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yfsp_1900_40_007353331/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yfsp_1900_40_007353331/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5332, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yfsp_1900_40_007353331/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yfsp_1900_40_007353331/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5332, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yfsp_1900_40_007353331/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yfsp_1900_40_007353331/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5332, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yfsp_1900_40_007353331/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yfsp_1900_40_007353331/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5332, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yfsp_1900_40_007353331/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yfsp_1900_40_007353331/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5332, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
25 Jul 2011 23:00:38	1111761	13110532	hadcm3n_yfsp_1900_40_007353331_1	285,120	802,234	2.8137
25 Jul 2011 22:22:00	1111761	13110532	hadcm3n_yfsp_1900_40_007353331_1	259,200	728,755	2.8116
25 Jul 2011 21:07:57	1111761	13110532	hadcm3n_yfsp_1900_40_007353331_1	233,280	655,462	2.8098
25 Jul 2011 20:30:49	1111761	13110532	hadcm3n_yfsp_1900_40_007353331_1	207,360	582,911	2.8111
25 Jul 2011 19:04:10	1111761	13110532	hadcm3n_yfsp_1900_40_007353331_1	181,440	510,064	2.8112
25 Jul 2011 19:04:10	1111761	13110532	hadcm3n_yfsp_1900_40_007353331_1	155,520	436,816	2.8087
25 Jul 2011 18:56:09	1111761	13110532	hadcm3n_yfsp_1900_40_007353331_1	129,600	364,084	2.8093
25 Jul 2011 18:09:19	1111761	13110532	hadcm3n_yfsp_1900_40_007353331_1	103,680	291,729	2.8137
25 Jul 2011 17:36:36	1111761	13110532	hadcm3n_yfsp_1900_40_007353331_1	77,760	218,132	2.8052
25 Jul 2011 16:34:49	1111761	13110532	hadcm3n_yfsp_1900_40_007353331_1	51,840	145,170	2.8003
25 Jul 2011 15:54:11	1111761	13110532	hadcm3n_yfsp_1900_40_007353331_1	25,920	72,564	2.7995