Task 13289126

Name	hadcm3n_p41a_1940_40_007420606_1
Workunit	7618241
Created	24 Aug 2011, 23:25:37 UTC
Sent	24 Aug 2011, 23:26:38 UTC
Report deadline	24 Nov 2011, 6:53:49 UTC
Received	15 Oct 2011, 11:48:46 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	25 (0x00000019) Unknown error code
Computer ID	400932
Run time	20 days 5 hours 33 min 45 sec
CPU time	18 days 10 hours 44 min 28 sec
Validate state	Invalid
Credit	8,709.12
Device peak FLOPS	2.72 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.12.34</core_client_version> <![CDATA[ <message> The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 11:01:03 (6108): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 16:44:25 (4412): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4780, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 20:22:53 (2308): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:22:54 (2308): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5308, iMonCtr=1 Model crash detected, will try to restart... 05:59:02 (1052): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 01:34:36 (2980): Can't acquire lockfile (32) - waiting 35s 01:34:51 (1300): Can't acquire lockfile (32) - waiting 35s 01:35:02 (4624): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:35:26 (1300): Can't acquire lockfile (32) - exiting 01:35:26 (1300): Error: The process cannot access the file because it is being used by another process. (0x20) 01:35:57 (2980): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4796, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 12:25:31 (4128): Can't acquire lockfile (32) - waiting 35s 12:25:56 (3380): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 20:40:51 (3252): Can't acquire lockfile (32) - waiting 35s 20:41:08 (4852): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2196, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4612, iMonCtr=1 Model crash detected, will try to restart... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
08 Oct 2011 00:19:32	400932	13289126	hadcm3n_p41a_1940_40_007420606_1	725,760	1,575,192	2.1704
07 Oct 2011 08:25:29	400932	13289126	hadcm3n_p41a_1940_40_007420606_1	699,840	1,520,225	2.1722
06 Oct 2011 16:33:27	400932	13289126	hadcm3n_p41a_1940_40_007420606_1	673,920	1,465,650	2.1748
06 Oct 2011 00:45:29	400932	13289126	hadcm3n_p41a_1940_40_007420606_1	648,000	1,411,269	2.1779
05 Oct 2011 07:08:18	400932	13289126	hadcm3n_p41a_1940_40_007420606_1	622,080	1,358,382	2.1836
04 Oct 2011 15:12:47	400932	13289126	hadcm3n_p41a_1940_40_007420606_1	596,160	1,302,191	2.1843
03 Oct 2011 23:12:46	400932	13289126	hadcm3n_p41a_1940_40_007420606_1	570,240	1,245,942	2.1849
03 Oct 2011 06:40:28	400932	13289126	hadcm3n_p41a_1940_40_007420606_1	544,320	1,189,935	2.1861
27 Sep 2011 16:12:29	400932	13289126	hadcm3n_p41a_1940_40_007420606_1	518,400	1,134,172	2.1878
26 Sep 2011 23:47:05	400932	13289126	hadcm3n_p41a_1940_40_007420606_1	492,480	1,078,778	2.1905
23 Sep 2011 19:05:48	400932	13289126	hadcm3n_p41a_1940_40_007420606_1	466,560	1,023,250	2.1932
23 Sep 2011 03:21:57	400932	13289126	hadcm3n_p41a_1940_40_007420606_1	440,640	967,641	2.1960
21 Sep 2011 10:32:42	400932	13289126	hadcm3n_p41a_1940_40_007420606_1	414,720	912,421	2.2001
20 Sep 2011 18:30:29	400932	13289126	hadcm3n_p41a_1940_40_007420606_1	388,800	858,357	2.2077
20 Sep 2011 02:53:29	400932	13289126	hadcm3n_p41a_1940_40_007420606_1	362,880	811,952	2.2375
19 Sep 2011 06:11:14	400932	13289126	hadcm3n_p41a_1940_40_007420606_1	336,960	754,674	2.2397
18 Sep 2011 14:32:28	400932	13289126	hadcm3n_p41a_1940_40_007420606_1	311,040	699,451	2.2487
17 Sep 2011 23:03:55	400932	13289126	hadcm3n_p41a_1940_40_007420606_1	285,120	644,513	2.2605
17 Sep 2011 07:34:42	400932	13289126	hadcm3n_p41a_1940_40_007420606_1	259,200	589,427	2.2740
16 Sep 2011 15:54:00	400932	13289126	hadcm3n_p41a_1940_40_007420606_1	233,280	534,291	2.2903