Task 13282255

Name	hadcm3n_yc5z_1940_40_007413372_2
Workunit	7611002
Created	21 Aug 2011, 14:12:17 UTC
Sent	21 Aug 2011, 14:12:32 UTC
Report deadline	20 Nov 2011, 21:39:43 UTC
Received	1 Oct 2013, 9:59:26 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1064436
Run time	11 days 6 hours 29 min 49 sec
CPU time	8 days 20 hours 15 min 4 sec
Validate state	Invalid
Credit	6,220.80
Device peak FLOPS	3.24 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-apple-darwin
Stderr	<core_client_version>6.6.20</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 15:57:55 (309): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:17:51 (18542): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:31:04 (17033): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 16:48:46 (243): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 17:44:43 (1677): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 18:54:03 (266): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:11:30 (393): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yc5z_1940_40_007413372/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yc5z_1940_40_007413372/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yc5z_1940_40_007413372/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yc5z_1940_40_007413372/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yc5z_1940_40_007413372/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yc5z_1940_40_007413372/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yc5z_1940_40_007413372/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yc5z_1940_40_007413372/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yc5z_1940_40_007413372/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yc5z_1940_40_007413372/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yc5z_1940_40_007413372/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yc5z_1940_40_007413372/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
01 Oct 2013 10:01:26	1064436	13282255	hadcm3n_yc5z_1940_40_007413372_2	518,400	764,132	1.4740
01 Oct 2013 10:01:26	1064436	13282255	hadcm3n_yc5z_1940_40_007413372_2	492,480	725,221	1.4726
18 Sep 2011 17:49:27	1064436	13282255	hadcm3n_yc5z_1940_40_007413372_2	466,560	687,498	1.4735
17 Sep 2011 17:02:55	1064436	13282255	hadcm3n_yc5z_1940_40_007413372_2	440,640	649,323	1.4736
16 Sep 2011 13:57:24	1064436	13282255	hadcm3n_yc5z_1940_40_007413372_2	414,720	612,491	1.4769
16 Sep 2011 07:29:57	1064436	13282255	hadcm3n_yc5z_1940_40_007413372_2	388,800	575,196	1.4794
11 Sep 2011 02:26:48	1064436	13282255	hadcm3n_yc5z_1940_40_007413372_2	362,880	536,225	1.4777
10 Sep 2011 14:24:30	1064436	13282255	hadcm3n_yc5z_1940_40_007413372_2	336,960	497,203	1.4756
10 Sep 2011 02:21:10	1064436	13282255	hadcm3n_yc5z_1940_40_007413372_2	311,040	458,103	1.4728
09 Sep 2011 14:37:16	1064436	13282255	hadcm3n_yc5z_1940_40_007413372_2	285,120	419,015	1.4696
08 Sep 2011 22:25:35	1064436	13282255	hadcm3n_yc5z_1940_40_007413372_2	259,200	379,935	1.4658
08 Sep 2011 17:18:04	1064436	13282255	hadcm3n_yc5z_1940_40_007413372_2	233,280	342,658	1.4689
08 Sep 2011 17:18:04	1064436	13282255	hadcm3n_yc5z_1940_40_007413372_2	207,360	307,037	1.4807
08 Sep 2011 17:18:04	1064436	13282255	hadcm3n_yc5z_1940_40_007413372_2	181,440	271,181	1.4946
05 Sep 2011 18:24:55	1064436	13282255	hadcm3n_yc5z_1940_40_007413372_2	155,520	232,638	1.4959
05 Sep 2011 06:53:13	1064436	13282255	hadcm3n_yc5z_1940_40_007413372_2	129,600	194,112	1.4978
02 Sep 2011 01:11:39	1064436	13282255	hadcm3n_yc5z_1940_40_007413372_2	103,680	155,334	1.4982
01 Sep 2011 18:15:45	1064436	13282255	hadcm3n_yc5z_1940_40_007413372_2	77,760	116,609	1.4996
01 Sep 2011 18:15:45	1064436	13282255	hadcm3n_yc5z_1940_40_007413372_2	51,840	77,732	1.4995
27 Aug 2011 12:53:24	1064436	13282255	hadcm3n_yc5z_1940_40_007413372_2	25,920	38,862	1.4993