Task 15993417

Name: hadcm3n_817a_1980_40_008459497_0
Workunit: 8610353
Created: 30 Aug 2013, 20:54:34 UTC
Sent: 5 Sep 2013, 18:26:04 UTC
Report deadline: 6 Dec 2013, 1:53:15 UTC
Received: 17 Oct 2013, 19:16:59 UTC
Server state: Over
Outcome: Computation error
Client state: Compute error
Exit status: 25 (0x00000019) Unknown error code
Computer ID: 1242385
Run time: 22 days 13 hours 28 min 54 sec
CPU time: 21 days 22 hours 24 min 4 sec
Validate state: Invalid
Credit: 10,575.36
Device peak FLOPS: 0.93 GFLOPS
Application version: UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The drive cannot locate a specific area or track on the disk.
 (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
14:01:44 (8184): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
09:29:18 (1608): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:29:19 (1608): No heartbeat from core client for 30 sec - exiting
09:29:20 (1608): No heartbeat from core client for 30 sec - exiting
09:29:21 (1608): No heartbeat from core client for 30 sec - exiting
09:29:22 (1608): No heartbeat from core client for 30 sec - exiting
09:29:23 (1608): No heartbeat from core client for 30 sec - exiting
09:29:24 (1608): No heartbeat from core client for 30 sec - exiting
09:29:25 (1608): No heartbeat from core client for 30 sec - exiting
09:29:26 (1608): No heartbeat from core client for 30 sec - exiting
09:29:27 (1608): No heartbeat from core client for 30 sec - exiting
09:29:28 (1608): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
08:30:33 (4752): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:35:47 (7340): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:58:22 (1076): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
10:37:51 (1108): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:27:20 (5956): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:57:32 (8460): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
11:12:14 (6508): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:12:15 (6508): No heartbeat from core client for 30 sec - exiting
11:12:16 (6508): No heartbeat from core client for 30 sec - exiting
11:12:17 (6508): No heartbeat from core client for 30 sec - exiting
11:12:18 (6508): No heartbeat from core client for 30 sec - exiting
11:12:19 (6508): No heartbeat from core client for 30 sec - exiting
11:12:20 (6508): No heartbeat from core client for 30 sec - exiting
11:12:21 (6508): No heartbeat from core client for 30 sec - exiting
11:12:22 (6508): No heartbeat from core client for 30 sec - exiting
11:12:23 (6508): No heartbeat from core client for 30 sec - exiting
11:12:24 (6508): No heartbeat from core client for 30 sec - exiting
13:00:12 (8880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
22:03:23 (6208): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:29:32 (6976): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:59:56 (6112): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2280, iMonCtr=1
Model crash detected, will try to restart...
13:00:51 (7716): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
14:03:45 (8856): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:03:46 (8856): No heartbeat from core client for 30 sec - exiting
14:03:47 (8856): No heartbeat from core client for 30 sec - exiting
14:03:48 (8856): No heartbeat from core client for 30 sec - exiting
14:03:49 (8856): No heartbeat from core client for 30 sec - exiting
14:03:50 (8856): No heartbeat from core client for 30 sec - exiting
14:03:51 (8856): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2884, iMonCtr=1
Model crash detected, will try to restart...
08:31:27 (6500): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:36:29 (7588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:36:30 (7588): No heartbeat from core client for 30 sec - exiting
09:36:31 (7588): No heartbeat from core client for 30 sec - exiting
09:36:32 (7588): No heartbeat from core client for 30 sec - exiting
09:36:33 (7588): No heartbeat from core client for 30 sec - exiting
09:36:34 (7588): No heartbeat from core client for 30 sec - exiting
09:36:35 (7588): No heartbeat from core client for 30 sec - exiting
09:36:36 (7588): No heartbeat from core client for 30 sec - exiting
09:36:37 (7588): No heartbeat from core client for 30 sec - exiting
09:36:38 (7588): No heartbeat from core client for 30 sec - exiting
09:36:39 (7588): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
08:31:39 (7004): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6516, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
13:01:52 (6848): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:34:25 (1232): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7052, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
08:30:51 (5816): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:28:34 (7072): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
10:35:50 (1720): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:35:51 (1720): No heartbeat from core client for 30 sec - exiting
10:35:52 (1720): No heartbeat from core client for 30 sec - exiting
10:35:53 (1720): No heartbeat from core client for 30 sec - exiting
10:35:54 (1720): No heartbeat from core client for 30 sec - exiting
10:35:55 (1720): No heartbeat from core client for 30 sec - exiting
10:35:56 (1720): No heartbeat from core client for 30 sec - exiting
10:35:57 (1720): No heartbeat from core client for 30 sec - exiting
10:35:58 (1720): No heartbeat from core client for 30 sec - exiting
10:35:59 (1720): No heartbeat from core client for 30 sec - exiting
10:36:00 (1720): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3748, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6800, iMonCtr=1
Model crash detected, will try to restart...
13:02:01 (5924): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:33:08 (2872): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:33:09 (2872): No heartbeat from core client for 30 sec - exiting
09:33:10 (2872): No heartbeat from core client for 30 sec - exiting
09:33:11 (2872): No heartbeat from core client for 30 sec - exiting
09:33:12 (2872): No heartbeat from core client for 30 sec - exiting
09:33:13 (2872): No heartbeat from core client for 30 sec - exiting
09:33:14 (2872): No heartbeat from core client for 30 sec - exiting
09:33:15 (2872): No heartbeat from core client for 30 sec - exiting
09:33:16 (2872): No heartbeat from core client for 30 sec - exiting
09:33:17 (2872): No heartbeat from core client for 30 sec - exiting
09:33:18 (2872): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1604, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2860, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2860, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3488, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3488, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:30:41 (5920): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:03:22 (4824): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5484, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5484, iMonCtr=1
Model crash detected, will try to restart...
14:05:12 (1488): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
08:29:36 (6072): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:31:53 (5100): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
17 Oct 2013 06:11:54  1242385  15993417   hadcm3n_817a_1980_40_008459497_0  881,280   1,858,163       2.1085
16 Oct 2013 14:48:05  1242385  15993417   hadcm3n_817a_1980_40_008459497_0  855,360   1,806,792       2.1123
16 Oct 2013 00:30:39  1242385  15993417   hadcm3n_817a_1980_40_008459497_0  829,440   1,757,943       2.1194
15 Oct 2013 06:44:25  1242385  15993417   hadcm3n_817a_1980_40_008459497_0  803,520   1,704,133       2.1208
06 Oct 2013 04:37:20  1242385  15993417   hadcm3n_817a_1980_40_008459497_0  777,600   1,651,968       2.1244
05 Oct 2013 09:44:11  1242385  15993417   hadcm3n_817a_1980_40_008459497_0  751,680   1,588,504       2.1133
03 Oct 2013 20:04:42  1242385  15993417   hadcm3n_817a_1980_40_008459497_0  725,760   1,533,903       2.1135
02 Oct 2013 21:25:39  1242385  15993417   hadcm3n_817a_1980_40_008459497_0  699,840   1,479,374       2.1139
02 Oct 2013 03:06:25  1242385  15993417   hadcm3n_817a_1980_40_008459497_0  673,920   1,424,260       2.1134
01 Oct 2013 09:00:53  1242385  15993417   hadcm3n_817a_1980_40_008459497_0  648,000   1,367,144       2.1098
30 Sep 2013 15:24:28  1242385  15993417   hadcm3n_817a_1980_40_008459497_0  622,080   1,307,964       2.1026
29 Sep 2013 22:00:16  1242385  15993417   hadcm3n_817a_1980_40_008459497_0  596,160   1,249,971       2.0967
29 Sep 2013 05:42:26  1242385  15993417   hadcm3n_817a_1980_40_008459497_0  570,240   1,192,958       2.0920
28 Sep 2013 12:26:13  1242385  15993417   hadcm3n_817a_1980_40_008459497_0  544,320   1,132,600       2.0808
27 Sep 2013 18:23:46  1242385  15993417   hadcm3n_817a_1980_40_008459497_0  518,400   1,073,189       2.0702
27 Sep 2013 01:39:59  1242385  15993417   hadcm3n_817a_1980_40_008459497_0  492,480   1,017,416       2.0659
26 Sep 2013 07:16:31  1242385  15993417   hadcm3n_817a_1980_40_008459497_0  466,560   960,871         2.0595
25 Sep 2013 15:18:14  1242385  15993417   hadcm3n_817a_1980_40_008459497_0  440,640   906,767         2.0578
24 Sep 2013 08:49:13  1242385  15993417   hadcm3n_817a_1980_40_008459497_0  414,720   854,571         2.0606
23 Sep 2013 16:10:00  1242385  15993417   hadcm3n_817a_1980_40_008459497_0  388,800   806,111         2.0733
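
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. The page itself does not state this, so treat it as an assumption. The short Python sketch below checks that assumption against three rows copied from the table. The function name and the row tuples are illustrative only and are not part of any BOINC or climateprediction.net tool.

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Assumed formula: cumulative CPU seconds per model timestep, to 4 decimals."""
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds, Average as reported) copied from the trickle table.
rows = [
    (881_280, 1_858_163, 2.1085),  # 17 Oct 2013 06:11:54
    (414_720, 854_571, 2.0606),    # 24 Sep 2013 08:49:13
    (388_800, 806_111, 2.0733),    # 23 Sep 2013 16:10:00
]

for timestep, cpu_time_sec, reported in rows:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    # A match within rounding supports the assumed formula for that row.
    matches = abs(computed - reported) < 5e-5
    print(f"timestep={timestep:>7}  computed={computed:.4f}  "
          f"reported={reported:.4f}  match={matches}")
```

If the assumption holds, every sampled row prints a match. This only checks consistency with the listed figures and does not confirm how the server computes the column.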


©2024 climateprediction.net