climateprediction.net home page
Task 17313982

Name hadcm3n_sa8z_1940_40_009107926_1
Workunit 9238262
Created 28 Oct 2014, 5:28:10 UTC
Sent 28 Oct 2014, 5:36:19 UTC
Report deadline 27 Jan 2015, 13:03:30 UTC
Received 5 Dec 2014, 1:42:41 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1338325
Run time 24 days 0 hours 12 min 5 sec
CPU time 8 days 8 hours 20 min 58 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 4.26 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.4.27</core_client_version>
<![CDATA[
<message>
The drive cannot locate a specific area or track on the disk.
 (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
01:23:18 (3564): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:30:56 (5260): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:36:53 (7596): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:37:38 (5080): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:41:01 (7968): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:41:03 (7968): No heartbeat from core client for 30 sec - exiting
17:41:46 (8056): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:42:38 (1504): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:50:12 (4772): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:50:56 (3648): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:39:47 (5548): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:12:20 (9188): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5148, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6484, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
11:22:17 (5168): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6944, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
15:57:23 (6388): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:27:49 (5860): Can't acquire lockfile (32) - waiting 35s
20:27:50 (1824): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:28:24 (5860): Can't set up shared mem: -1. Will run in standalone mode.
20:28:24 (5824): Can't set up shared mem: -1. Will run in standalone mode.
20:29:17 (4952): Can't acquire lockfile (32) - waiting 35s
20:30:00 (4952): Can't acquire lockfile (32) - exiting
20:30:00 (4952): Error: The process cannot access the file because it is being used by another process. (0x20)
20:31:28 (6116): Can't acquire lockfile (32) - waiting 35s
20:32:03 (6116): Can't acquire lockfile (32) - exiting
20:32:03 (6116): Error: The process cannot access the file because it is being used by another process. (0x20)
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:31:08 (6604): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:52:33 (6452): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
23:56:54 (8140): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:56:55 (8140): No heartbeat from core client for 30 sec - exiting
23:57:03 (7752): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
17:35:58 (6556): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7028, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7028, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7028, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7028, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7028, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7028, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
20:01:13 (6964): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
12:48:00 (8176): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:02:06 (6040): Can't acquire lockfile (32) - waiting 35s
14:02:30 (7336): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:37:46 (7516): Can't acquire lockfile (32) - waiting 35s
14:37:54 (3208): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:38:21 (7516): Can't set up shared mem: -1. Will run in standalone mode.
14:38:21 (5480): Can't set up shared mem: -1. Will run in standalone mode.
14:38:40 (4720): Can't acquire lockfile (32) - waiting 35s
14:39:15 (4720): Can't acquire lockfile (32) - exiting
14:39:15 (4720): Error: The process cannot access the file because it is being used by another process. (0x20)
14:44:05 (640): Can't acquire lockfile (32) - waiting 35s
18:16:46 (7572): Can't acquire lockfile (32) - waiting 35s
18:17:21 (7572): Can't acquire lockfile (32) - exiting
18:17:21 (7572): Error: The process cannot access the file because it is being used by another process. (0x20)
19:02:56 (6088): Can't acquire lockfile (32) - waiting 35s
19:03:31 (6088): Can't acquire lockfile (32) - exiting
19:03:31 (6088): Error: The process cannot access the file because it is being used by another process. (0x20)
19:17:59 (6560): Can't acquire lockfile (32) - waiting 35s
19:18:34 (6560): Can't acquire lockfile (32) - exiting
19:18:34 (6560): Error: The process cannot access the file because it is being used by another process. (0x20)
19:28:36 (7596): Can't acquire lockfile (32) - waiting 35s
19:29:11 (7596): Can't acquire lockfile (32) - exiting
19:29:11 (7596): Error: The process cannot access the file because it is being used by another process. (0x20)
19:39:13 (7924): Can't acquire lockfile (32) - waiting 35s
19:39:48 (7924): Can't acquire lockfile (32) - exiting
19:39:48 (7924): Error: The process cannot access the file because it is being used by another process. (0x20)
19:52:12 (1108): Can't acquire lockfile (32) - waiting 35s
19:52:47 (1108): Can't acquire lockfile (32) - exiting
19:52:47 (1108): Error: The process cannot access the file because it is being used by another process. (0x20)
20:15:58 (6572): Can't acquire lockfile (32) - waiting 35s
20:16:33 (6572): Can't acquire lockfile (32) - exiting
20:16:33 (6572): Error: The process cannot access the file because it is being used by another process. (0x20)
20:26:39 (6672): Can't acquire lockfile (32) - waiting 35s
20:27:14 (6672): Can't acquire lockfile (32) - exiting
20:27:14 (6672): Error: The process cannot access the file because it is being used by another process. (0x20)
20:37:22 (4716): Can't acquire lockfile (32) - waiting 35s
20:37:57 (4716): Can't acquire lockfile (32) - exiting
20:37:57 (4716): Error: The process cannot access the file because it is being used by another process. (0x20)
20:47:59 (6360): Can't acquire lockfile (32) - waiting 35s
20:48:34 (6360): Can't acquire lockfile (32) - exiting
20:48:34 (6360): Error: The process cannot access the file because it is being used by another process. (0x20)
20:58:41 (3824): Can't acquire lockfile (32) - waiting 35s
20:59:16 (3824): Can't acquire lockfile (32) - exiting
20:59:16 (3824): Error: The process cannot access the file because it is being used by another process. (0x20)
21:09:18 (5624): Can't acquire lockfile (32) - waiting 35s
21:09:53 (5624): Can't acquire lockfile (32) - exiting
21:09:53 (5624): Error: The process cannot access the file because it is being used by another process. (0x20)
21:19:54 (2608): Can't acquire lockfile (32) - waiting 35s
21:20:29 (2608): Can't acquire lockfile (32) - exiting
21:20:29 (2608): Error: The process cannot access the file because it is being used by another process. (0x20)
21:30:36 (5820): Can't acquire lockfile (32) - waiting 35s
21:31:11 (5820): Can't acquire lockfile (32) - exiting
21:31:11 (5820): Error: The process cannot access the file because it is being used by another process. (0x20)
21:49:31 (316): Can't acquire lockfile (32) - waiting 35s
21:50:06 (316): Can't acquire lockfile (32) - exiting
21:50:06 (316): Error: The process cannot access the file because it is being used by another process. (0x20)
22:11:03 (5476): Can't acquire lockfile (32) - waiting 35s
22:11:38 (5476): Can't acquire lockfile (32) - exiting
22:11:38 (5476): Error: The process cannot access the file because it is being used by another process. (0x20)
22:26:39 (7136): Can't acquire lockfile (32) - waiting 35s
22:27:14 (7136): Can't acquire lockfile (32) - exiting
22:27:14 (7136): Error: The process cannot access the file because it is being used by another process. (0x20)
22:37:18 (5900): Can't acquire lockfile (32) - waiting 35s
22:37:53 (5900): Can't acquire lockfile (32) - exiting
22:37:53 (5900): Error: The process cannot access the file because it is being used by another process. (0x20)
22:47:58 (5340): Can't acquire lockfile (32) - waiting 35s
22:48:33 (5340): Can't acquire lockfile (32) - exiting
22:48:33 (5340): Error: The process cannot access the file because it is being used by another process. (0x20)
22:58:49 (1888): Can't acquire lockfile (32) - waiting 35s
22:59:24 (1888): Can't acquire lockfile (32) - exiting
22:59:24 (1888): Error: The process cannot access the file because it is being used by another process. (0x20)
23:09:41 (5072): Can't acquire lockfile (32) - waiting 35s
23:10:16 (5072): Can't acquire lockfile (32) - exiting
23:10:16 (5072): Error: The process cannot access the file because it is being used by another process. (0x20)
23:20:17 (2444): Can't acquire lockfile (32) - waiting 35s
23:20:52 (2444): Can't acquire lockfile (32) - exiting
23:20:52 (2444): Error: The process cannot access the file because it is being used by another process. (0x20)
23:31:34 (8068): Can't acquire lockfile (32) - waiting 35s
23:32:09 (8068): Can't acquire lockfile (32) - exiting
23:32:09 (8068): Error: The process cannot access the file because it is being used by another process. (0x20)
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
22:12:45 (7916): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
17:23:18 (5172): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
04 Dec 2014 16:28:10 1338325 17313982 hadcm3n_sa8z_1940_40_009107926_1 777,600 713,591 0.9177
03 Dec 2014 20:43:13 1338325 17313982 hadcm3n_sa8z_1940_40_009107926_1 751,680 700,429 0.9318
02 Dec 2014 21:14:29 1338325 17313982 hadcm3n_sa8z_1940_40_009107926_1 725,760 687,250 0.9469
01 Dec 2014 23:55:42 1338325 17313982 hadcm3n_sa8z_1940_40_009107926_1 699,840 674,333 0.9636
01 Dec 2014 00:24:01 1338325 17313982 hadcm3n_sa8z_1940_40_009107926_1 673,920 661,254 0.9812
30 Nov 2014 04:36:49 1338325 17313982 hadcm3n_sa8z_1940_40_009107926_1 648,000 648,056 1.0001
13 Nov 2014 16:53:06 1338325 17313982 hadcm3n_sa8z_1940_40_009107926_1 622,080 313,530 0.5040
12 Nov 2014 18:17:11 1338325 17313982 hadcm3n_sa8z_1940_40_009107926_1 596,160 300,188 0.5035
11 Nov 2014 22:41:22 1338325 17313982 hadcm3n_sa8z_1940_40_009107926_1 570,240 287,500 0.5042
11 Nov 2014 01:06:34 1338325 17313982 hadcm3n_sa8z_1940_40_009107926_1 544,320 274,491 0.5043
10 Nov 2014 12:39:24 1338325 17313982 hadcm3n_sa8z_1940_40_009107926_1 518,400 261,245 0.5039
09 Nov 2014 23:37:06 1338325 17313982 hadcm3n_sa8z_1940_40_009107926_1 492,480 248,211 0.5040
09 Nov 2014 11:01:09 1338325 17313982 hadcm3n_sa8z_1940_40_009107926_1 466,560 235,013 0.5037
09 Nov 2014 00:55:33 1338325 17313982 hadcm3n_sa8z_1940_40_009107926_1 440,640 222,197 0.5043
08 Nov 2014 08:52:51 1338325 17313982 hadcm3n_sa8z_1940_40_009107926_1 414,720 209,528 0.5052
07 Nov 2014 23:23:17 1338325 17313982 hadcm3n_sa8z_1940_40_009107926_1 388,800 196,805 0.5062
07 Nov 2014 12:55:28 1338325 17313982 hadcm3n_sa8z_1940_40_009107926_1 362,880 183,739 0.5063
07 Nov 2014 06:27:12 1338325 17313982 hadcm3n_sa8z_1940_40_009107926_1 336,960 170,696 0.5066
06 Nov 2014 19:44:53 1338325 17313982 hadcm3n_sa8z_1940_40_009107926_1 311,040 157,583 0.5066
06 Nov 2014 00:51:18 1338325 17313982 hadcm3n_sa8z_1940_40_009107926_1 285,120 145,028 0.5087
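
The "Average (sec/TS)" column in the trickle table above appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The following is a minimal sketch that checks this reading against three rows copied from the table. The function name `average_sec_per_timestep` is hypothetical and is not part of BOINC or the CPDN software, and the formula is inferred from the figures rather than documented on this page.

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per cumulative timestep, rounded to 4 places.

    Assumption: this mirrors how the Average (sec/TS) column is derived.
    """
    return round(cpu_time_sec / timestep, 4)


# (timestep, cpu_time_sec, reported_average) copied from rows of the table
rows = [
    (777_600, 713_591, 0.9177),  # 04 Dec 2014 16:28:10
    (648_000, 648_056, 1.0001),  # 30 Nov 2014 04:36:49
    (622_080, 313_530, 0.5040),  # 13 Nov 2014 16:53:06
]

for timestep, cpu_time_sec, reported in rows:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    print(f"timestep={timestep} computed={computed:.4f} reported={reported:.4f}")
```

Under this reading, the jump in the average between the 13 Nov and 30 Nov rows follows from the cumulative CPU time rising from 313,530 to 648,056 seconds while the timestep count advanced by only 25,920.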

