climateprediction.net home page
Task 17556000

Name: hadcm3n_xbqa_1940_40_009151496_1
Workunit: 9281832
Created: 7 Dec 2014, 9:30:17 UTC
Sent: 7 Dec 2014, 9:45:50 UTC
Report deadline: 8 Mar 2015, 17:13:01 UTC
Received: 16 Dec 2014, 17:07:57 UTC
Server state: Over
Outcome: Computation error
Client state: Compute error
Exit status: 22 (0x00000016) Unknown error code
Computer ID: 1523870
Run time: 7 days 11 hours 29 min 39 sec
CPU time: 7 days 9 hours 36 min 58 sec
Validate state: Invalid
Credit: 4,665.60
Device peak FLOPS: 2.82 GFLOPS
Application version: UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
19:03:35 (2316): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
17:32:28 (6764): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:33:17 (7892): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3948, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3948, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3948, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3948, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3948, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3948, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
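The stderr output above is dominated by a few recurring messages; in the full log the "Model crash detected" restart message appears six times before the monitor gives up. One way to summarise such a log is to count each message type. The following is a minimal sketch run on a short excerpt copied from the log, not on the full output; the labels in `PATTERNS` and the `tally` function are hypothetical names chosen for illustration, not part of BOINC or CPDN.

```python
from collections import Counter

# Short excerpt copied from the stderr output above (not the full log).
excerpt = """\
Suspended CPDN Monitor - Suspend request from BOINC...
19:03:35 (2316): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
"""

# Hypothetical labels mapped to substrings that appear in the log.
PATTERNS = {
    "suspend": "Suspend request from BOINC",
    "quit": "Quit request from BOINC",
    "no_heartbeat": "No heartbeat from core client",
    "signal_22": "Signal 22 received",
    "model_crash": "Model crash detected",
    "gave_up": "too many model crashes",
}

def tally(log_text):
    """Count how many lines contain each known message substring."""
    counts = Counter()
    for line in log_text.splitlines():
        for label, needle in PATTERNS.items():
            if needle in line:
                counts[label] += 1
    return counts

counts = tally(excerpt)
```

Passing the complete `<stderr_txt>` contents to `tally` instead of the excerpt would give the per-message totals for the whole task.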
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
16 Dec 2014 09:33:48  1304271  17556000   hadcm3n_xbqa_1940_40_009151496_1  388,800   639,200         1.6440
15 Dec 2014 21:11:26  1304271  17556000   hadcm3n_xbqa_1940_40_009151496_1  362,880   595,288         1.6405
15 Dec 2014 08:44:31  1304271  17556000   hadcm3n_xbqa_1940_40_009151496_1  336,960   550,877         1.6348
14 Dec 2014 20:51:39  1304271  17556000   hadcm3n_xbqa_1940_40_009151496_1  311,040   507,135         1.6304
14 Dec 2014 07:59:39  1304271  17556000   hadcm3n_xbqa_1940_40_009151496_1  285,120   464,053         1.6276
13 Dec 2014 18:49:36  1304271  17556000   hadcm3n_xbqa_1940_40_009151496_1  259,200   419,952         1.6202
13 Dec 2014 06:17:37  1304271  17556000   hadcm3n_xbqa_1940_40_009151496_1  233,280   376,322         1.6132
12 Dec 2014 18:21:22  1304271  17556000   hadcm3n_xbqa_1940_40_009151496_1  207,360   333,336         1.6075
11 Dec 2014 15:16:00  1304271  17556000   hadcm3n_xbqa_1940_40_009151496_1  181,440   291,150         1.6047
10 Dec 2014 22:42:33  1304271  17556000   hadcm3n_xbqa_1940_40_009151496_1  155,520   250,012         1.6076
10 Dec 2014 05:20:40  1304271  17556000   hadcm3n_xbqa_1940_40_009151496_1  129,600   207,447         1.6007
09 Dec 2014 14:51:17  1304271  17556000   hadcm3n_xbqa_1940_40_009151496_1  103,680   164,922         1.5907
09 Dec 2014 02:41:38  1304271  17556000   hadcm3n_xbqa_1940_40_009151496_1  77,760    124,703         1.6037
08 Dec 2014 14:49:54  1304271  17556000   hadcm3n_xbqa_1940_40_009151496_1  51,840    84,096          1.6222
07 Dec 2014 23:56:29  1304271  17556000   hadcm3n_xbqa_1940_40_009151496_1  25,920    41,760          1.6111
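
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the cumulative timestep count. That definition is an assumption inferred from the numbers, not something the page states. The sketch below checks it against three rows copied from the table; `sec_per_timestep` is a hypothetical helper name.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
rows = [
    (388_800, 639_200, 1.6440),
    (207_360, 333_336, 1.6075),
    (25_920, 41_760, 1.6111),
]

def sec_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds divided by cumulative timesteps (assumed definition)."""
    return cpu_time_sec / timestep

# Round to four decimals to match the precision shown in the table.
computed = [round(sec_per_timestep(cpu, ts), 4) for ts, cpu, _ in rows]

# Largest gap between the computed and the reported averages.
max_diff = max(abs(c - reported) for c, (_, _, reported) in zip(computed, rows))
```

For these three rows the computed values agree with the reported column to the four decimal places shown.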

