Task 17377892

Name hadcm3n_xahs_1940_40_009149894_1
Workunit 9280230
Created 8 Nov 2014, 10:09:38 UTC
Sent 8 Nov 2014, 10:21:43 UTC
Report deadline 7 Feb 2015, 17:48:54 UTC
Received 12 Jan 2015, 8:44:37 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1341147
Run time 42 days 13 hours 33 min 47 sec
CPU time 22 days 17 hours 56 min 46 sec
Validate state Invalid
Credit 7,464.96
Device peak FLOPS 2.04 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
05:52:32 (2156): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:48:14 (4048): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:47:11 (3548): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:46:14 (344): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1744, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3892, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3892, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3892, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3892, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3892, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
18 Dec 2014 08:09:45 1341147 17377892 hadcm3n_xahs_1940_40_009149894_1 622,080 1,916,217 3.0803
16 Dec 2014 18:17:27 1341147 17377892 hadcm3n_xahs_1940_40_009149894_1 596,160 1,836,409 3.0804
15 Dec 2014 04:08:48 1341147 17377892 hadcm3n_xahs_1940_40_009149894_1 570,240 1,756,523 3.0803
13 Dec 2014 15:14:04 1341147 17377892 hadcm3n_xahs_1940_40_009149894_1 544,320 1,677,681 3.0822
12 Dec 2014 01:26:53 1341147 17377892 hadcm3n_xahs_1940_40_009149894_1 518,400 1,598,317 3.0832
10 Dec 2014 11:18:27 1341147 17377892 hadcm3n_xahs_1940_40_009149894_1 492,480 1,519,044 3.0845
08 Dec 2014 22:34:55 1341147 17377892 hadcm3n_xahs_1940_40_009149894_1 466,560 1,439,792 3.0860
07 Dec 2014 08:37:00 1341147 17377892 hadcm3n_xahs_1940_40_009149894_1 440,640 1,360,058 3.0866
05 Dec 2014 18:29:11 1341147 17377892 hadcm3n_xahs_1940_40_009149894_1 414,720 1,280,310 3.0872
04 Dec 2014 04:54:37 1341147 17377892 hadcm3n_xahs_1940_40_009149894_1 388,800 1,200,592 3.0879
02 Dec 2014 14:49:36 1341147 17377892 hadcm3n_xahs_1940_40_009149894_1 362,880 1,120,974 3.0891
01 Dec 2014 01:42:01 1341147 17377892 hadcm3n_xahs_1940_40_009149894_1 336,960 1,041,982 3.0923
29 Nov 2014 12:10:03 1341147 17377892 hadcm3n_xahs_1940_40_009149894_1 311,040 963,157 3.0966
27 Nov 2014 22:35:26 1341147 17377892 hadcm3n_xahs_1940_40_009149894_1 285,120 884,367 3.1017
26 Nov 2014 06:23:44 1341147 17377892 hadcm3n_xahs_1940_40_009149894_1 259,200 804,242 3.1028
24 Nov 2014 15:16:50 1341147 17377892 hadcm3n_xahs_1940_40_009149894_1 233,280 724,640 3.1063
23 Nov 2014 00:35:21 1341147 17377892 hadcm3n_xahs_1940_40_009149894_1 207,360 644,858 3.1098
21 Nov 2014 09:42:30 1341147 17377892 hadcm3n_xahs_1940_40_009149894_1 181,440 565,006 3.1140
19 Nov 2014 18:55:41 1341147 17377892 hadcm3n_xahs_1940_40_009149894_1 155,520 485,075 3.1191
18 Nov 2014 02:32:52 1341147 17377892 hadcm3n_xahs_1940_40_009149894_1 129,600 403,350 3.1123
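
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep reached, rounded to four decimal places. This is inferred from the figures in the table, not from any CPDN documentation. A minimal sketch that checks this against three rows copied from the table (the function name is hypothetical):

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
ROWS = [
    (622_080, 1_916_217, 3.0803),  # 18 Dec 2014
    (596_160, 1_836_409, 3.0804),  # 16 Dec 2014
    (129_600, 403_350, 3.1123),    # 18 Nov 2014
]


def average_sec_per_timestep(timestep: int, cpu_time_sec: int) -> float:
    """Assumed formula: cumulative CPU seconds per model timestep, to 4 decimals."""
    return round(cpu_time_sec / timestep, 4)


if __name__ == "__main__":
    for timestep, cpu_time_sec, reported in ROWS:
        computed = average_sec_per_timestep(timestep, cpu_time_sec)
        print(f"{timestep:>8,}  computed={computed:.4f}  reported={reported:.4f}")
```

Under that assumption, the slow drift from about 3.11 to 3.08 sec/TS means the host's per-timestep cost stayed nearly constant up to the last trickle on 18 Dec 2014.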

