climateprediction.net home page
Task 13289126

Task 13289126

Name hadcm3n_p41a_1940_40_007420606_1
Workunit 7618241
Created 24 Aug 2011, 23:25:37 UTC
Sent 24 Aug 2011, 23:26:38 UTC
Report deadline 24 Nov 2011, 6:53:49 UTC
Received 15 Oct 2011, 11:48:46 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 400932
Run time 20 days 5 hours 33 min 45 sec
CPU time 18 days 10 hours 44 min 28 sec
Validate state Invalid
Credit 8,709.12
Device peak FLOPS 2.72 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
11:01:03 (6108): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
16:44:25 (4412): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4780, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
20:22:53 (2308): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:22:54 (2308): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5308, iMonCtr=1
Model crash detected, will try to restart...
05:59:02 (1052): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
01:34:36 (2980): Can't acquire lockfile (32) - waiting 35s
01:34:51 (1300): Can't acquire lockfile (32) - waiting 35s
01:35:02 (4624): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:35:26 (1300): Can't acquire lockfile (32) - exiting
01:35:26 (1300): Error: The process cannot access the file because it is being used by another process. (0x20)
01:35:57 (2980): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4796, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
12:25:31 (4128): Can't acquire lockfile (32) - waiting 35s
12:25:56 (3380): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
20:40:51 (3252): Can't acquire lockfile (32) - waiting 35s
20:41:08 (4852): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2196, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4612, iMonCtr=1
Model crash detected, will try to restart...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
08 Oct 2011 00:19:32 400932 13289126 hadcm3n_p41a_1940_40_007420606_1 725,760 1,575,192 2.1704
07 Oct 2011 08:25:29 400932 13289126 hadcm3n_p41a_1940_40_007420606_1 699,840 1,520,225 2.1722
06 Oct 2011 16:33:27 400932 13289126 hadcm3n_p41a_1940_40_007420606_1 673,920 1,465,650 2.1748
06 Oct 2011 00:45:29 400932 13289126 hadcm3n_p41a_1940_40_007420606_1 648,000 1,411,269 2.1779
05 Oct 2011 07:08:18 400932 13289126 hadcm3n_p41a_1940_40_007420606_1 622,080 1,358,382 2.1836
04 Oct 2011 15:12:47 400932 13289126 hadcm3n_p41a_1940_40_007420606_1 596,160 1,302,191 2.1843
03 Oct 2011 23:12:46 400932 13289126 hadcm3n_p41a_1940_40_007420606_1 570,240 1,245,942 2.1849
03 Oct 2011 06:40:28 400932 13289126 hadcm3n_p41a_1940_40_007420606_1 544,320 1,189,935 2.1861
27 Sep 2011 16:12:29 400932 13289126 hadcm3n_p41a_1940_40_007420606_1 518,400 1,134,172 2.1878
26 Sep 2011 23:47:05 400932 13289126 hadcm3n_p41a_1940_40_007420606_1 492,480 1,078,778 2.1905
23 Sep 2011 19:05:48 400932 13289126 hadcm3n_p41a_1940_40_007420606_1 466,560 1,023,250 2.1932
23 Sep 2011 03:21:57 400932 13289126 hadcm3n_p41a_1940_40_007420606_1 440,640 967,641 2.1960
21 Sep 2011 10:32:42 400932 13289126 hadcm3n_p41a_1940_40_007420606_1 414,720 912,421 2.2001
20 Sep 2011 18:30:29 400932 13289126 hadcm3n_p41a_1940_40_007420606_1 388,800 858,357 2.2077
20 Sep 2011 02:53:29 400932 13289126 hadcm3n_p41a_1940_40_007420606_1 362,880 811,952 2.2375
19 Sep 2011 06:11:14 400932 13289126 hadcm3n_p41a_1940_40_007420606_1 336,960 754,674 2.2397
18 Sep 2011 14:32:28 400932 13289126 hadcm3n_p41a_1940_40_007420606_1 311,040 699,451 2.2487
17 Sep 2011 23:03:55 400932 13289126 hadcm3n_p41a_1940_40_007420606_1 285,120 644,513 2.2605
17 Sep 2011 07:34:42 400932 13289126 hadcm3n_p41a_1940_40_007420606_1 259,200 589,427 2.2740
16 Sep 2011 15:54:00 400932 13289126 hadcm3n_p41a_1940_40_007420606_1 233,280 534,291 2.2903


©2024 cpdn.org