climateprediction.net home page
Task 12742363

Task 12742363

Name hadcm3n_o41w_1900_40_007200583_0
Workunit 7398863
Created 28 Mar 2011, 14:08:55 UTC
Sent 30 Mar 2011, 20:22:43 UTC
Report deadline 30 Jun 2011, 3:49:54 UTC
Received 11 Aug 2011, 18:20:42 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1045125
Run time 14 days 15 hours 23 min 44 sec
CPU time 13 days 15 hours 39 min 32 sec
Validate state Invalid
Credit 7,776.00
Device peak FLOPS 2.57 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.56</core_client_version>
<![CDATA[
<message>
The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
19:03:12 (1436): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
09:02:56 (3512): No heartbeat from core client for 30 sec - exiting
09:02:57 (3512): No heartbeat from core client for 30 sec - exiting
09:02:58 (3512): No heartbeat from core client for 30 sec - exiting
09:02:59 (3512): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:40:25 (4664): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
20:32:02 (1296): Can't acquire lockfile (32) - waiting 35s
20:32:23 (4412): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4896, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3244, iMonCtr=1
Model crash detected, will try to restart...
21:20:09 (1620): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
09:09:44 (6580): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:09:45 (6580): No heartbeat from core client for 30 sec - exiting
09:09:46 (6580): No heartbeat from core client for 30 sec - exiting
09:09:47 (6580): No heartbeat from core client for 30 sec - exiting
09:09:48 (6580): No heartbeat from core client for 30 sec - exiting
09:09:49 (6580): No heartbeat from core client for 30 sec - exiting
09:09:50 (6580): No heartbeat from core client for 30 sec - exiting
09:09:51 (6580): No heartbeat from core client for 30 sec - exiting
09:09:52 (6580): No heartbeat from core client for 30 sec - exiting
09:09:53 (6580): No heartbeat from core client for 30 sec - exiting
09:09:55 (6580): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5448, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6024, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:02:06 (3564): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1376, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
09:12:24 (1120): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5236, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
C17:15:02 (5024): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:34:44 (5576): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:54:27 (5292): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:33:34 (2772): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:17:38 (5380): No heartbeat from core client for 30 sec - exiting
20:17:39 (5380): No heartbeat from core client for 30 sec - exiting
20:17:40 (5380): No heartbeat from core client for 30 sec - exiting
20:17:41 (5380): No heartbeat from core client for 30 sec - exiting
20:17:42 (5380): No heartbeat from core client for 30 sec - exiting
20:17:43 (5380): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:12:49 (5140): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:55:23 (5436): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4580, iMonCtr=1
Model crash detected, will try to restart...
17:06:51 (5636): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:08:40 (5472): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2740, iMonCtr=1
Model crash detected, will try to restart...
18:33:19 (4384): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
21:21:43 (2764): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:03:07 (5332): No heartbeat from core client for 30 sec - exiting
19:03:08 (5332): No heartbeat from core client for 30 sec - exiting
19:03:09 (5332): No heartbeat from core client for 30 sec - exiting
19:03:10 (5332): No heartbeat from core client for 30 sec - exiting
19:03:11 (5332): No heartbeat from core client for 30 sec - exiting
19:03:12 (5332): No heartbeat from core client for 30 sec - exiting
19:03:13 (5332): No heartbeat from core client for 30 sec - exiting
19:03:14 (5332): No heartbeat from core client for 30 sec - exiting
19:03:16 (5332): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
10 Aug 2011 20:45:58 1045125 12742363 hadcm3n_o41w_1900_40_007200583_0 648,000 1,173,744 1.8113
06 Aug 2011 20:54:37 1045125 12742363 hadcm3n_o41w_1900_40_007200583_0 622,080 1,126,442 1.8108
31 Jul 2011 12:07:11 1045125 12742363 hadcm3n_o41w_1900_40_007200583_0 596,160 1,079,612 1.8109
27 Jul 2011 18:16:03 1045125 12742363 hadcm3n_o41w_1900_40_007200583_0 570,240 1,032,728 1.8110
25 Jul 2011 22:00:36 1045125 12742363 hadcm3n_o41w_1900_40_007200583_0 544,320 985,957 1.8114
25 Jul 2011 18:11:57 1045125 12742363 hadcm3n_o41w_1900_40_007200583_0 518,400 939,201 1.8117
25 Jul 2011 14:38:00 1045125 12742363 hadcm3n_o41w_1900_40_007200583_0 492,480 892,250 1.8117
07 Jul 2011 16:15:06 1045125 12742363 hadcm3n_o41w_1900_40_007200583_0 466,560 845,187 1.8115
04 Jul 2011 17:42:43 1045125 12742363 hadcm3n_o41w_1900_40_007200583_0 440,640 798,134 1.8113
03 Jul 2011 18:07:21 1045125 12742363 hadcm3n_o41w_1900_40_007200583_0 414,720 750,702 1.8101
02 Jul 2011 17:48:45 1045125 12742363 hadcm3n_o41w_1900_40_007200583_0 388,800 703,127 1.8085
28 Jun 2011 08:13:46 1045125 12742363 hadcm3n_o41w_1900_40_007200583_0 362,880 656,616 1.8095
21 Jun 2011 20:05:31 1045125 12742363 hadcm3n_o41w_1900_40_007200583_0 336,960 610,288 1.8112
07 Jun 2011 18:33:20 1045125 12742363 hadcm3n_o41w_1900_40_007200583_0 311,040 563,574 1.8119
30 May 2011 16:11:12 1045125 12742363 hadcm3n_o41w_1900_40_007200583_0 285,120 516,775 1.8125
25 May 2011 19:09:35 1045125 12742363 hadcm3n_o41w_1900_40_007200583_0 259,200 469,571 1.8116
22 May 2011 18:09:53 1045125 12742363 hadcm3n_o41w_1900_40_007200583_0 233,280 422,395 1.8107
14 May 2011 18:38:07 1045125 12742363 hadcm3n_o41w_1900_40_007200583_0 207,360 375,033 1.8086
07 May 2011 20:29:48 1045125 12742363 hadcm3n_o41w_1900_40_007200583_0 181,440 328,110 1.8084
01 May 2011 13:33:39 1045125 12742363 hadcm3n_o41w_1900_40_007200583_0 155,520 281,139 1.8077


©2024 cpdn.org