climateprediction.net home page
Task 15833141

Task 15833141

Name hadcm3n_u3p3_2020_40_008336531_1
Workunit 8487392
Created 6 Jun 2013, 17:08:08 UTC
Sent 6 Jun 2013, 17:28:12 UTC
Report deadline 6 Sep 2013, 0:55:23 UTC
Received 21 Jun 2013, 1:09:05 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1251428
Run time 12 days 16 hours 8 min 58 sec
CPU time 10 days 7 hours 51 min 40 sec
Validate state Invalid
Credit 7,153.92
Device peak FLOPS 2.82 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
17:50:34 (7552): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:50:35 (7552): No heartbeat from core client for 30 sec - exiting
17:50:36 (7552): No heartbeat from core client for 30 sec - exiting
17:50:38 (7552): No heartbeat from core client for 30 sec - exiting
17:50:39 (7552): No heartbeat from core client for 30 sec - exiting
17:50:40 (7552): No heartbeat from core client for 30 sec - exiting
17:50:41 (7552): No heartbeat from core client for 30 sec - exiting
17:50:42 (7552): No heartbeat from core client for 30 sec - exiting
17:50:43 (7552): No heartbeat from core client for 30 sec - exiting
17:50:44 (7552): No heartbeat from core client for 30 sec - exiting
17:50:45 (7552): No heartbeat from core client for 30 sec - exiting
forrtl: Access is denied.

Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1432, iMonCtr=1
Model crash detected, will try to restart...
18:39:39 (1432): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:39:40 (1432): No heartbeat from core client for 30 sec - exiting
18:39:41 (1432): No heartbeat from core client for 30 sec - exiting
18:39:42 (1432): No heartbeat from core client for 30 sec - exiting
18:39:43 (1432): No heartbeat from core client for 30 sec - exiting
18:39:44 (1432): No heartbeat from core client for 30 sec - exiting
18:39:45 (1432): No heartbeat from core client for 30 sec - exiting
18:39:46 (1432): No heartbeat from core client for 30 sec - exiting
18:39:48 (1432): No heartbeat from core client for 30 sec - exiting
18:39:49 (1432): No heartbeat from core client for 30 sec - exiting
18:39:50 (1432): No heartbeat from core client for 30 sec - exiting
forrtl: Access is denied.

Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5688, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
22:31:56 (5740): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:34:16 (192): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:35:19 (192): No heartbeat from core client for 30 sec - exiting
22:35:20 (192): No heartbeat from core client for 30 sec - exiting
22:35:21 (192): No heartbeat from core client for 30 sec - exiting
22:35:22 (192): No heartbeat from core client for 30 sec - exiting
22:35:23 (192): No heartbeat from core client for 30 sec - exiting
22:35:24 (192): No heartbeat from core client for 30 sec - exiting
22:35:25 (192): No heartbeat from core client for 30 sec - exiting
22:35:26 (192): No heartbeat from core client for 30 sec - exiting
22:35:27 (192): No heartbeat from core client for 30 sec - exiting
22:35:28 (192): No heartbeat from core client for 30 sec - exiting
20:38:50 (5852): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
forrtl: Access is denied.

Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4092, iMonCtr=1
Model crash detected, will try to restart...
23:09:11 (5008): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
forrtl: Access is denied.

23:12:17 (7272): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:02:58 (7860): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:02:59 (7860): No heartbeat from core client for 30 sec - exiting
00:03:00 (7860): No heartbeat from core client for 30 sec - exiting
00:03:01 (7860): No heartbeat from core client for 30 sec - exiting
00:03:02 (7860): No heartbeat from core client for 30 sec - exiting
00:03:03 (7860): No heartbeat from core client for 30 sec - exiting
00:03:04 (7860): No heartbeat from core client for 30 sec - exiting
00:03:05 (7860): No heartbeat from core client for 30 sec - exiting
00:03:06 (7860): No heartbeat from core client for 30 sec - exiting
00:03:07 (7860): No heartbeat from core client for 30 sec - exiting
00:03:08 (7860): No heartbeat from core client for 30 sec - exiting
00:13:46 (7260): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:13:47 (7260): No heartbeat from core client for 30 sec - exiting
00:13:48 (7260): No heartbeat from core client for 30 sec - exiting
00:13:49 (7260): No heartbeat from core client for 30 sec - exiting
00:13:50 (7260): No heartbeat from core client for 30 sec - exiting
00:13:51 (7260): No heartbeat from core client for 30 sec - exiting
00:13:52 (7260): No heartbeat from core client for 30 sec - exiting
00:13:53 (7260): No heartbeat from core client for 30 sec - exiting
00:13:54 (7260): No heartbeat from core client for 30 sec - exiting
00:13:55 (7260): No heartbeat from core client for 30 sec - exiting
00:13:56 (7260): No heartbeat from core client for 30 sec - exiting
00:31:59 (3848): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:32:00 (3848): No heartbeat from core client for 30 sec - exiting
00:32:01 (3848): No heartbeat from core client for 30 sec - exiting
00:32:02 (3848): No heartbeat from core client for 30 sec - exiting
00:32:03 (3848): No heartbeat from core client for 30 sec - exiting
00:32:04 (3848): No heartbeat from core client for 30 sec - exiting
00:32:05 (3848): No heartbeat from core client for 30 sec - exiting
00:32:06 (3848): No heartbeat from core client for 30 sec - exiting
00:32:07 (3848): No heartbeat from core client for 30 sec - exiting
00:32:08 (3848): No heartbeat from core client for 30 sec - exiting
00:32:09 (3848): No heartbeat from core client for 30 sec - exiting
00:36:46 (6148): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:36:47 (6148): No heartbeat from core client for 30 sec - exiting
00:36:48 (6148): No heartbeat from core client for 30 sec - exiting
00:36:49 (6148): No heartbeat from core client for 30 sec - exiting
00:36:50 (6148): No heartbeat from core client for 30 sec - exiting
00:36:51 (6148): No heartbeat from core client for 30 sec - exiting
00:36:52 (6148): No heartbeat from core client for 30 sec - exiting
00:36:53 (6148): No heartbeat from core client for 30 sec - exiting
00:36:54 (6148): No heartbeat from core client for 30 sec - exiting
00:36:55 (6148): No heartbeat from core client for 30 sec - exiting
00:36:56 (6148): No heartbeat from core client for 30 sec - exiting
03:08:57 (7268): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:08:59 (7268): No heartbeat from core client for 30 sec - exiting
03:09:00 (7268): No heartbeat from core client for 30 sec - exiting
03:09:01 (7268): No heartbeat from core client for 30 sec - exiting
03:09:02 (7268): No heartbeat from core client for 30 sec - exiting
03:09:03 (7268): No heartbeat from core client for 30 sec - exiting
03:09:04 (7268): No heartbeat from core client for 30 sec - exiting
03:09:05 (7268): No heartbeat from core client for 30 sec - exiting
03:09:06 (7268): No heartbeat from core client for 30 sec - exiting
03:09:07 (7268): No heartbeat from core client for 30 sec - exiting
03:09:08 (7268): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5760, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5760, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5760, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5760, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5760, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5760, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
20 Jun 2013 17:02:33 1251428 15833141 hadcm3n_u3p3_2020_40_008336531_1 596,160 891,096 1.4947
20 Jun 2013 03:45:50 1251428 15833141 hadcm3n_u3p3_2020_40_008336531_1 570,240 852,510 1.4950
19 Jun 2013 14:59:08 1251428 15833141 hadcm3n_u3p3_2020_40_008336531_1 544,320 812,391 1.4925
19 Jun 2013 01:07:50 1251428 15833141 hadcm3n_u3p3_2020_40_008336531_1 518,400 772,509 1.4902
18 Jun 2013 12:07:14 1251428 15833141 hadcm3n_u3p3_2020_40_008336531_1 492,480 733,751 1.4899
17 Jun 2013 22:58:43 1251428 15833141 hadcm3n_u3p3_2020_40_008336531_1 466,560 695,104 1.4898
17 Jun 2013 10:21:38 1251428 15833141 hadcm3n_u3p3_2020_40_008336531_1 440,640 657,547 1.4923
16 Jun 2013 21:19:51 1251428 15833141 hadcm3n_u3p3_2020_40_008336531_1 414,720 618,916 1.4924
16 Jun 2013 00:12:11 1251428 15833141 hadcm3n_u3p3_2020_40_008336531_1 388,800 579,987 1.4917
16 Jun 2013 00:12:11 1251428 15833141 hadcm3n_u3p3_2020_40_008336531_1 362,880 542,929 1.4962
14 Jun 2013 20:58:02 1251428 15833141 hadcm3n_u3p3_2020_40_008336531_1 336,960 504,584 1.4975
14 Jun 2013 08:37:29 1251428 15833141 hadcm3n_u3p3_2020_40_008336531_1 311,040 466,903 1.5011
13 Jun 2013 19:23:02 1251428 15833141 hadcm3n_u3p3_2020_40_008336531_1 285,120 427,721 1.5001
13 Jun 2013 05:52:44 1251428 15833141 hadcm3n_u3p3_2020_40_008336531_1 259,200 388,123 1.4974
12 Jun 2013 16:23:01 1251428 15833141 hadcm3n_u3p3_2020_40_008336531_1 233,280 348,133 1.4923
11 Jun 2013 23:26:11 1251428 15833141 hadcm3n_u3p3_2020_40_008336531_1 207,360 312,273 1.5059
11 Jun 2013 07:47:49 1251428 15833141 hadcm3n_u3p3_2020_40_008336531_1 181,440 274,713 1.5141
10 Jun 2013 18:53:20 1251428 15833141 hadcm3n_u3p3_2020_40_008336531_1 155,520 236,424 1.5202
10 Jun 2013 06:41:59 1251428 15833141 hadcm3n_u3p3_2020_40_008336531_1 129,600 199,068 1.5360
09 Jun 2013 16:33:03 1251428 15833141 hadcm3n_u3p3_2020_40_008336531_1 103,680 158,921 1.5328


©2024 climateprediction.net