Task 13613733

Name	hadcm3n_o2ph_1980_40_007539535_2
Workunit	7736767
Created	6 Nov 2011, 20:00:02 UTC
Sent	6 Nov 2011, 20:04:14 UTC
Report deadline	6 Feb 2012, 3:31:25 UTC
Received	22 Feb 2012, 8:14:51 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	1117998
Run time	13 days 22 hours 46 min 13 sec
CPU time	13 days 6 hours 20 min 36 sec
Validate state	Invalid
Credit	6,220.80
Device peak FLOPS	2.58 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.12.34</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/o2phko.pji1c10 Error converting file to netcdf: dataout/o2phko.pii1c10 Error converting file to netcdf: dataout/o2phko.pfi1c10 Error converting file to netcdf: dataout/o2phka.phi1c10 Error converting file to netcdf: dataout/o2phka.pgi1c10 Error converting file to netcdf: dataout/o2phka.pei1c10 Error converting file to netcdf: dataout/o2phka.pdi1c10 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3144, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:53:12 (2528): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2220, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2380, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 02:35:31 (2060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2828, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 23:20:11 (1232): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1180, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2088, iMonCtr=1 Model crash detected, will try to restart... 18:08:44 (2180): No heartbeat from core client for 30 sec - exiting 18:08:45 (2180): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 18:55:38 (2960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2352, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2364, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 03:10:33 (2336): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:18:27 (2200): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1120, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2088, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1076, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2364, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 22:39:49 (320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2272, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
22 Feb 2012 07:16:14	1117998	13613733	hadcm3n_o2ph_1980_40_007539535_2	518,400	1,146,011	2.2107
20 Feb 2012 00:48:53	1117998	13613733	hadcm3n_o2ph_1980_40_007539535_2	492,480	1,096,597	2.2267
15 Feb 2012 12:46:50	1117998	13613733	hadcm3n_o2ph_1980_40_007539535_2	440,640	995,220	2.2586
12 Feb 2012 16:57:30	1117998	13613733	hadcm3n_o2ph_1980_40_007539535_2	414,720	945,973	2.2810
11 Feb 2012 04:29:16	1117998	13613733	hadcm3n_o2ph_1980_40_007539535_2	388,800	898,999	2.3122
09 Feb 2012 16:59:55	1117998	13613733	hadcm3n_o2ph_1980_40_007539535_2	362,880	849,571	2.3412
06 Feb 2012 00:46:01	1117998	13613733	hadcm3n_o2ph_1980_40_007539535_2	336,960	799,980	2.3741
03 Feb 2012 08:17:07	1117998	13613733	hadcm3n_o2ph_1980_40_007539535_2	311,040	753,804	2.4235
30 Jan 2012 23:51:44	1117998	13613733	hadcm3n_o2ph_1980_40_007539535_2	285,120	707,012	2.4797
29 Jan 2012 07:57:19	1117998	13613733	hadcm3n_o2ph_1980_40_007539535_2	259,200	657,442	2.5364
27 Jan 2012 08:06:57	1117998	13613733	hadcm3n_o2ph_1980_40_007539535_2	233,280	605,617	2.5961
23 Jan 2012 08:40:12	1117998	13613733	hadcm3n_o2ph_1980_40_007539535_2	207,360	539,405	2.6013
18 Jan 2012 08:39:08	1117998	13613733	hadcm3n_o2ph_1980_40_007539535_2	181,440	469,086	2.5854
15 Jan 2012 08:45:28	1117998	13613733	hadcm3n_o2ph_1980_40_007539535_2	155,520	403,453	2.5942
11 Jan 2012 02:02:23	1117998	13613733	hadcm3n_o2ph_1980_40_007539535_2	129,600	328,834	2.5373
09 Dec 2011 14:59:09	1117998	13613733	hadcm3n_o2ph_1980_40_007539535_2	103,680	260,220	2.5098
08 Dec 2011 02:11:03	1117998	13613733	hadcm3n_o2ph_1980_40_007539535_2	77,760	196,014	2.5208
04 Dec 2011 23:28:39	1117998	13613733	hadcm3n_o2ph_1980_40_007539535_2	51,840	130,578	2.5189
15 Nov 2011 17:46:34	1117998	13613733	hadcm3n_o2ph_1980_40_007539535_2	25,920	64,849	2.5019