Task 13651290

Name hadcm3n_t0i6_1940_40_007539051_3
Workunit 7736283
Created 21 Nov 2011, 12:54:08 UTC
Sent 21 Nov 2011, 13:01:10 UTC
Report deadline 20 Feb 2012, 20:28:21 UTC
Received 3 Jan 2012, 14:52:04 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1178088
Run time 12 days 12 hours 23 min 54 sec
CPU time 12 days 6 hours 25 min 32 sec
Validate state Invalid
Credit 6,220.80
Device peak FLOPS 2.68 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:57:55 (5208): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
08:37:02 (5872): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:52:50 (1872): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:40:05 (5072): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:40:06 (5072): No heartbeat from core client for 30 sec - exiting
11:40:27 (5784): No heartbeat from core client for 30 sec - exiting
11:40:29 (5784): No heartbeat from core client for 30 sec - exiting
11:40:30 (5784): No heartbeat from core client for 30 sec - exiting
11:40:31 (5784): No heartbeat from core client for 30 sec - exiting
11:40:32 (5784): No heartbeat from core client for 30 sec - exiting
11:40:33 (5784): No heartbeat from core client for 30 sec - exiting
11:40:35 (5784): No heartbeat from core client for 30 sec - exiting
11:40:36 (5784): No heartbeat from core client for 30 sec - exiting
11:40:37 (5784): No heartbeat from core client for 30 sec - exiting
11:40:38 (5784): No heartbeat from core client for 30 sec - exiting
11:40:39 (5784): No heartbeat from core client for 30 sec - exiting
11:40:40 (5784): No heartbeat from core client for 30 sec - exiting
11:40:41 (5784): No heartbeat from core client for 30 sec - exiting
11:40:42 (5784): No heartbeat from core client for 30 sec - exiting
11:40:43 (5784): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:55:40 (5296): No heartbeat from core client for 30 sec - exiting
10:55:41 (5296): No heartbeat from core client for 30 sec - exiting
10:55:43 (5296): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2852, iMonCtr=1
Model crash detected, will try to restart...
13:52:26 (5004): No heartbeat from core client for 30 sec - exiting
13:52:27 (5004): No heartbeat from core client for 30 sec - exiting
13:52:29 (5004): No heartbeat from core client for 30 sec - exiting
13:52:30 (5004): No heartbeat from core client for 30 sec - exiting
13:52:31 (5004): No heartbeat from core client for 30 sec - exiting
13:52:32 (5004): No heartbeat from core client for 30 sec - exiting
13:52:33 (5004): No heartbeat from core client for 30 sec - exiting
13:52:34 (5004): No heartbeat from core client for 30 sec - exiting
13:52:35 (5004): No heartbeat from core client for 30 sec - exiting
13:52:36 (5004): No heartbeat from core client for 30 sec - exiting
13:52:37 (5004): No heartbeat from core client for 30 sec - exiting
13:52:38 (5004): No heartbeat from core client for 30 sec - exiting
13:52:39 (5004): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:22:55 (2976): No heartbeat from core client for 30 sec - exiting
11:22:57 (2976): No heartbeat from core client for 30 sec - exiting
11:22:58 (2976): No heartbeat from core client for 30 sec - exiting
11:22:59 (2976): No heartbeat from core client for 30 sec - exiting
11:23:00 (2976): No heartbeat from core client for 30 sec - exiting
11:23:01 (2976): No heartbeat from core client for 30 sec - exiting
11:23:02 (2976): No heartbeat from core client for 30 sec - exiting
11:23:03 (2976): No heartbeat from core client for 30 sec - exiting
11:23:04 (2976): No heartbeat from core client for 30 sec - exiting
11:23:05 (2976): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:49:33 (6024): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:55:10 (2588): No heartbeat from core client for 30 sec - exiting
17:55:12 (2588): No heartbeat from core client for 30 sec - exiting
17:55:13 (2588): No heartbeat from core client for 30 sec - exiting
17:55:14 (2588): No heartbeat from core client for 30 sec - exiting
17:55:15 (2588): No heartbeat from core client for 30 sec - exiting
17:55:16 (2588): No heartbeat from core client for 30 sec - exiting
17:55:17 (2588): No heartbeat from core client for 30 sec - exiting
17:55:18 (2588): No heartbeat from core client for 30 sec - exiting
17:55:19 (2588): No heartbeat from core client for 30 sec - exiting
17:55:20 (2588): No heartbeat from core client for 30 sec - exiting
17:55:21 (2588): No heartbeat from core client for 30 sec - exiting
17:55:22 (2588): No heartbeat from core client for 30 sec - exiting
17:55:24 (2588): No heartbeat from core client for 30 sec - exiting
17:55:25 (2588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:55:26 (2588): No heartbeat from core client for 30 sec - exiting
17:55:27 (2588): No heartbeat from core client for 30 sec - exiting
14:51:24 (2312): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:54:50 (5584): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5184, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5528, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5528, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5528, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5492, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5492, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
09:44:25 (6048): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:44:26 (6048): No heartbeat from core client for 30 sec - exiting
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5492, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
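
The stderr output above is long and repetitive, so a tally of its event types can be easier to read than the raw log. The following is a minimal sketch, not part of BOINC or the CPDN application: the function name `summarize_stderr`, the category labels, and the short sample string are illustrative assumptions, and the patterns are based only on the line shapes visible in this log.

```python
import re
from collections import Counter

# Matches lines such as:
#   07:57:55 (5208): No heartbeat from core client for 30 sec - exiting
# The number in parentheses is captured as shown in the log.
NO_HEARTBEAT = re.compile(
    r"^\d{2}:\d{2}:\d{2} \((\d+)\): No heartbeat from core client"
)


def summarize_stderr(stderr_text):
    """Count event types in a CPDN stderr dump (hypothetical helper).

    Returns (counts, ids) where ids holds the parenthesised numbers
    seen on the timestamped "No heartbeat" lines.
    """
    counts = Counter()
    ids = set()
    for line in stderr_text.splitlines():
        match = NO_HEARTBEAT.match(line)
        if match:
            counts["no_heartbeat"] += 1
            ids.add(int(match.group(1)))
        elif "Suspend request from BOINC" in line:
            counts["suspend_request"] += 1
        elif "Quit request from BOINC" in line:
            counts["quit_request"] += 1
        elif line.startswith("Model crash detected"):
            counts["model_crash"] += 1
        elif line.startswith("Signal 22 received"):
            counts["signal_22"] += 1
    return counts, ids


# A few lines copied from the log above, used only as a small sample.
SAMPLE = """\
Suspended CPDN Monitor - Suspend request from BOINC...
07:57:55 (5208): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Model crash detected, will try to restart...
Signal 22 received, exiting...
CPDN Monitor - Quit request from BOINC...
"""

if __name__ == "__main__":
    sample_counts, sample_ids = summarize_stderr(SAMPLE)
    for name, count in sorted(sample_counts.items()):
        print(name, count)
    print("ids:", sorted(sample_ids))
```

Running the same function over the full text between `<stderr_txt>` and `</stderr_txt>` would give per-category totals for this task. Lines that match none of the patterns, such as the monitor's own "No 'heartbeat' from BOINC..." notices, are simply skipped.
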
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
28 Dec 2011 20:09:40 | 1178088 | 13651290 | hadcm3n_t0i6_1940_40_007539051_3 | 518,400 | 1,049,369 | 2.0242
21 Dec 2011 00:25:44 | 1178088 | 13651290 | hadcm3n_t0i6_1940_40_007539051_3 | 492,480 | 995,537 | 2.0215
19 Dec 2011 15:09:12 | 1178088 | 13651290 | hadcm3n_t0i6_1940_40_007539051_3 | 466,560 | 941,742 | 2.0185
15 Dec 2011 21:53:35 | 1178088 | 13651290 | hadcm3n_t0i6_1940_40_007539051_3 | 440,640 | 887,838 | 2.0149
14 Dec 2011 06:57:14 | 1178088 | 13651290 | hadcm3n_t0i6_1940_40_007539051_3 | 414,720 | 834,330 | 2.0118
13 Dec 2011 18:41:37 | 1178088 | 13651290 | hadcm3n_t0i6_1940_40_007539051_3 | 388,800 | 780,701 | 2.0080
09 Dec 2011 00:00:31 | 1178088 | 13651290 | hadcm3n_t0i6_1940_40_007539051_3 | 362,880 | 727,331 | 2.0043
06 Dec 2011 18:44:13 | 1178088 | 13651290 | hadcm3n_t0i6_1940_40_007539051_3 | 336,960 | 673,068 | 1.9975
01 Dec 2011 21:43:55 | 1178088 | 13651290 | hadcm3n_t0i6_1940_40_007539051_3 | 311,040 | 622,770 | 2.0022
29 Nov 2011 16:43:18 | 1178088 | 13651290 | hadcm3n_t0i6_1940_40_007539051_3 | 285,120 | 572,236 | 2.0070
28 Nov 2011 12:01:31 | 1178088 | 13651290 | hadcm3n_t0i6_1940_40_007539051_3 | 259,200 | 522,846 | 2.0172
27 Nov 2011 21:06:23 | 1178088 | 13651290 | hadcm3n_t0i6_1940_40_007539051_3 | 233,280 | 469,665 | 2.0133
27 Nov 2011 05:54:49 | 1178088 | 13651290 | hadcm3n_t0i6_1940_40_007539051_3 | 207,360 | 416,787 | 2.0100
26 Nov 2011 15:06:06 | 1178088 | 13651290 | hadcm3n_t0i6_1940_40_007539051_3 | 181,440 | 363,794 | 2.0050
25 Nov 2011 23:47:19 | 1178088 | 13651290 | hadcm3n_t0i6_1940_40_007539051_3 | 155,520 | 310,245 | 1.9949
25 Nov 2011 08:20:52 | 1178088 | 13651290 | hadcm3n_t0i6_1940_40_007539051_3 | 129,600 | 257,099 | 1.9838
24 Nov 2011 17:23:54 | 1178088 | 13651290 | hadcm3n_t0i6_1940_40_007539051_3 | 103,680 | 203,615 | 1.9639
24 Nov 2011 01:54:33 | 1178088 | 13651290 | hadcm3n_t0i6_1940_40_007539051_3 | 77,760 | 151,689 | 1.9507
22 Nov 2011 18:52:32 | 1178088 | 13651290 | hadcm3n_t0i6_1940_40_007539051_3 | 51,840 | 101,200 | 1.9522
22 Nov 2011 05:22:57 | 1178088 | 13651290 | hadcm3n_t0i6_1940_40_007539051_3 | 25,920 | 50,613 | 1.9527
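
The "Average (sec/TS)" column in the trickle table is consistent with cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The sketch below checks that relationship for two rows copied from the table; the function name `average_sec_per_timestep` and the choice of rows are illustrative assumptions, not anything published by the project.

```python
def average_sec_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds per model timestep, rounded to 4 places."""
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds, reported average) from the table:
# the first trickle (22 Nov 2011) and the last one (28 Dec 2011).
ROWS = [
    (25_920, 50_613, 1.9527),
    (518_400, 1_049_369, 2.0242),
]

if __name__ == "__main__":
    for timestep, cpu_time, reported in ROWS:
        computed = average_sec_per_timestep(cpu_time, timestep)
        print(timestep, computed, reported)
```

Each trickle in the table arrives 25,920 timesteps after the previous one, so the same function can be applied to any row to confirm its reported average.
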