Task 13314713

Name	hadcm3n_ye2f_1900_40_007351089_2
Workunit	7548519
Created	29 Aug 2011, 20:53:50 UTC
Sent	29 Aug 2011, 20:54:30 UTC
Report deadline	29 Nov 2011, 4:21:41 UTC
Received	2 Oct 2011, 16:27:48 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	25 (0x00000019) Unknown error code
Computer ID	1166636
Run time	5 days 12 hours 2 min 53 sec
CPU time	4 days 21 hours 56 min 26 sec
Validate state	Invalid
Credit	4,665.60
Device peak FLOPS	4.38 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.12.33</core_client_version> <![CDATA[ <message> The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19) </message> <stderr_txt> 22:00:38 (1284): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:00:49 (1284): No heartbeat from core client for 30 sec - exiting 22:00:50 (1284): No heartbeat from core client for 30 sec - exiting 22:00:51 (1284): No heartbeat from core client for 30 sec - exiting 22:00:52 (1284): No heartbeat from core client for 30 sec - exiting 22:00:53 (1284): No heartbeat from core client for 30 sec - exiting 22:00:54 (1284): No heartbeat from core client for 30 sec - exiting 22:02:18 (3960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:02:19 (3960): No heartbeat from core client for 30 sec - exiting 22:05:26 (3388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 19:05:10 (5360): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5124, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5380, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2140, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2140, iMonCtr=1 Model crash detected, will try to restart... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
02 Oct 2011 07:29:41	1166636	13314713	hadcm3n_ye2f_1900_40_007351089_2	388,800	434,142	1.1166
01 Oct 2011 23:43:00	1166636	13314713	hadcm3n_ye2f_1900_40_007351089_2	362,880	406,292	1.1196
30 Sep 2011 06:39:09	1166636	13314713	hadcm3n_ye2f_1900_40_007351089_2	336,960	378,629	1.1237
29 Sep 2011 22:36:35	1166636	13314713	hadcm3n_ye2f_1900_40_007351089_2	311,040	351,355	1.1296
29 Sep 2011 04:54:45	1166636	13314713	hadcm3n_ye2f_1900_40_007351089_2	285,120	323,815	1.1357
28 Sep 2011 05:16:44	1166636	13314713	hadcm3n_ye2f_1900_40_007351089_2	259,200	296,398	1.1435
27 Sep 2011 18:46:14	1166636	13314713	hadcm3n_ye2f_1900_40_007351089_2	233,280	269,601	1.1557
27 Sep 2011 01:12:59	1166636	13314713	hadcm3n_ye2f_1900_40_007351089_2	207,360	242,656	1.1702
26 Sep 2011 17:54:55	1166636	13314713	hadcm3n_ye2f_1900_40_007351089_2	181,440	217,259	1.1974
26 Sep 2011 00:03:56	1166636	13314713	hadcm3n_ye2f_1900_40_007351089_2	155,520	193,103	1.2417
25 Sep 2011 16:35:38	1166636	13314713	hadcm3n_ye2f_1900_40_007351089_2	129,600	169,487	1.3078
25 Sep 2011 08:45:38	1166636	13314713	hadcm3n_ye2f_1900_40_007351089_2	103,680	144,061	1.3895
24 Sep 2011 21:36:00	1166636	13314713	hadcm3n_ye2f_1900_40_007351089_2	77,760	107,465	1.3820
24 Sep 2011 09:26:52	1166636	13314713	hadcm3n_ye2f_1900_40_007351089_2	51,840	70,885	1.3674
23 Sep 2011 06:04:45	1166636	13314713	hadcm3n_ye2f_1900_40_007351089_2	25,920	31,125	1.2008