Task 13272167

Name	hadcm3n_yg0l_1940_40_007414625_0
Workunit	7612255
Created	17 Aug 2011, 14:14:48 UTC
Sent	17 Aug 2011, 14:25:05 UTC
Report deadline	16 Nov 2011, 21:52:16 UTC
Received	26 Aug 2011, 0:30:34 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	775427
Run time	5 days 23 hours 23 min 57 sec
CPU time	5 days 17 hours 32 min 26 sec
Validate state	Invalid
Credit	3,110.40
Device peak FLOPS	2.34 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.12.33</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
10:41:38 (4976): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:41:39 (4976): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:18:26 (5136): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8116, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:01:30 (6320): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:54:28 (7540): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:54:29 (7540): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7144, iMonCtr=1
Model crash detected, will try to restart...
21:31:57 (2968): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:32:00 (2968): No heartbeat from core client for 30 sec - exiting
21:32:01 (2968): No heartbeat from core client for 30 sec - exiting
21:32:02 (2968): No heartbeat from core client for 30 sec - exiting
21:42:16 (4644): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1752, iMonCtr=1
Model crash detected, will try to restart...
18:53:26 (5028): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
21:30:57 (6988): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:39:21 (4476): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2632, iMonCtr=1
Model crash detected, will try to restart...
11:34:09 (3896): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:27:34 (1560): No heartbeat from core client for 30 sec - exiting
17:27:35 (1560): No heartbeat from core client for 30 sec - exiting
17:27:36 (1560): No heartbeat from core client for 30 sec - exiting
17:27:37 (1560): No heartbeat from core client for 30 sec - exiting
Ocean Restart file copy failed on yg0lko.daf0c20
CPDN Monitor - No 'heartbeat' from BOINC...
zip error: Could not create output file (was replacing the original zip file)
cpdnmonitor: cannot open input file C:\BOINC/projects/climateprediction.net/hadcm3n_yg0l_1940_40_007414625/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\BOINC/projects/climateprediction.net/hadcm3n_yg0l_1940_40_007414625/dataout/ocean_restart.day after 11 attempts
Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048
cpdnmonitor: cannot open input file C:\BOINC/projects/climateprediction.net/hadcm3n_yg0l_1940_40_007414625/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\BOINC/projects/climateprediction.net/hadcm3n_yg0l_1940_40_007414625/dataout/ocean_restart.day after 11 attempts
Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048
cpdnmonitor: cannot open input file C:\BOINC/projects/climateprediction.net/hadcm3n_yg0l_1940_40_007414625/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\BOINC/projects/climateprediction.net/hadcm3n_yg0l_1940_40_007414625/dataout/ocean_restart.day after 11 attempts
Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048
cpdnmonitor: cannot open input file C:\BOINC/projects/climateprediction.net/hadcm3n_yg0l_1940_40_007414625/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\BOINC/projects/climateprediction.net/hadcm3n_yg0l_1940_40_007414625/dataout/ocean_restart.day after 11 attempts
Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048
cpdnmonitor: cannot open input file C:\BOINC/projects/climateprediction.net/hadcm3n_yg0l_1940_40_007414625/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\BOINC/projects/climateprediction.net/hadcm3n_yg0l_1940_40_007414625/dataout/ocean_restart.day after 11 attempts
Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048
cpdnmonitor: cannot open input file C:\BOINC/projects/climateprediction.net/hadcm3n_yg0l_1940_40_007414625/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\BOINC/projects/climateprediction.net/hadcm3n_yg0l_1940_40_007414625/dataout/ocean_restart.day after 11 attempts
Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
25 Aug 2011 23:29:14	775427	13272167	hadcm3n_yg0l_1940_40_007414625_0	259,200	495,240	1.9106
25 Aug 2011 09:07:41	775427	13272167	hadcm3n_yg0l_1940_40_007414625_0	233,280	444,429	1.9051
24 Aug 2011 16:38:15	775427	13272167	hadcm3n_yg0l_1940_40_007414625_0	207,360	393,986	1.9000
23 Aug 2011 18:10:51	775427	13272167	hadcm3n_yg0l_1940_40_007414625_0	181,440	347,384	1.9146
22 Aug 2011 20:20:16	775427	13272167	hadcm3n_yg0l_1940_40_007414625_0	155,520	301,667	1.9397
21 Aug 2011 20:01:42	775427	13272167	hadcm3n_yg0l_1940_40_007414625_0	129,600	252,660	1.9495
20 Aug 2011 19:31:10	775427	13272167	hadcm3n_yg0l_1940_40_007414625_0	103,680	202,368	1.9519
20 Aug 2011 04:58:46	775427	13272167	hadcm3n_yg0l_1940_40_007414625_0	77,760	151,177	1.9441
19 Aug 2011 14:32:55	775427	13272167	hadcm3n_yg0l_1940_40_007414625_0	51,840	100,259	1.9340
19 Aug 2011 00:00:12	775427	13272167	hadcm3n_yg0l_1940_40_007414625_0	25,920	49,426	1.9069
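
The Average (sec/TS) column in the table above appears to be the cumulative CPU time divided by the timestep count, rounded to four decimals. The page itself does not state this; it is an assumption that the listed figures happen to satisfy. A minimal Python sketch checking it against three of the rows (the function name `seconds_per_timestep` is hypothetical, chosen only for illustration):

```python
# Rows copied from the trickle table above:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS).
trickles = [
    (259_200, 495_240, 1.9106),
    (233_280, 444_429, 1.9051),
    (25_920, 49_426, 1.9069),
]


def seconds_per_timestep(timestep: int, cpu_seconds: int) -> float:
    """Assumed formula: cumulative CPU seconds per model timestep, 4 decimals."""
    return round(cpu_seconds / timestep, 4)


for timestep, cpu_seconds, reported in trickles:
    computed = seconds_per_timestep(timestep, cpu_seconds)
    # Print the computed value next to the figure reported in the table.
    print(f"{timestep:>7} timesteps: computed {computed:.4f}, reported {reported:.4f}")
```

The timestep column also increases by 25,920 between consecutive rows, so each trickle covers the same number of model timesteps.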