climateprediction.net home page
Task 15489961

Name hadcm3n_39nx_1940_40_008261695_0
Workunit 8416819
Created 20 Dec 2012, 23:01:27 UTC
Sent 20 Dec 2012, 23:02:15 UTC
Report deadline 22 Mar 2013, 6:29:26 UTC
Received 26 Dec 2012, 20:24:29 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1253823
Run time 4 days 19 hours 53 min 22 sec
CPU time 4 days 9 hours 3 min 22 sec
Validate state Invalid
Credit 1,555.20
Device peak FLOPS 2.47 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
19:01:38 (4340): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5692, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
02:51:46 (4204): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:51:48 (4204): No heartbeat from core client for 30 sec - exiting
04:10:57 (4952): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:30:56 (5296): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:30:58 (5296): No heartbeat from core client for 30 sec - exiting
04:31:01 (5296): No heartbeat from core client for 30 sec - exiting
04:31:03 (5296): No heartbeat from core client for 30 sec - exiting
04:31:06 (5296): No heartbeat from core client for 30 sec - exiting
04:31:09 (5296): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
04:38:35 (6408): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:38:36 (6408): No heartbeat from core client for 30 sec - exiting
04:38:39 (6408): No heartbeat from core client for 30 sec - exiting
04:38:41 (6408): No heartbeat from core client for 30 sec - exiting
04:41:12 (6432): No heartbeat from core client for 30 sec - exiting
04:41:13 (6432): No heartbeat from core client for 30 sec - exiting
04:41:14 (6432): No heartbeat from core client for 30 sec - exiting
04:41:16 (6432): No heartbeat from core client for 30 sec - exiting
04:41:18 (6432): No heartbeat from core client for 30 sec - exiting
04:41:19 (6432): No heartbeat from core client for 30 sec - exiting
04:41:20 (6432): No heartbeat from core client for 30 sec - exiting
04:41:21 (6432): No heartbeat from core client for 30 sec - exiting
04:41:25 (6432): No heartbeat from core client for 30 sec - exiting
04:41:26 (6432): No heartbeat from core client for 30 sec - exiting
04:41:27 (6432): No heartbeat from core client for 30 sec - exiting
04:41:29 (6432): No heartbeat from core client for 30 sec - exiting
04:41:31 (6432): No heartbeat from core client for 30 sec - exiting
04:41:32 (6432): No heartbeat from core client for 30 sec - exiting
04:41:34 (6432): No heartbeat from core client for 30 sec - exiting
04:41:35 (6432): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:41:36 (6432): No heartbeat from core client for 30 sec - exiting
04:41:37 (6432): No heartbeat from core client for 30 sec - exiting
04:41:38 (6432): No heartbeat from core client for 30 sec - exiting
04:41:40 (6432): No heartbeat from core client for 30 sec - exiting
04:41:42 (6432): No heartbeat from core client for 30 sec - exiting
04:41:44 (6432): No heartbeat from core client for 30 sec - exiting
04:41:46 (6432): No heartbeat from core client for 30 sec - exiting
04:48:20 (2056): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:48:22 (2056): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
16:31:46 (4516): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4736, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4736, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4736, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4736, iMonCtr=1
Model crash detected, will try to restart...
00:27:55 (4736): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4612, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6828, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)        Host ID   Result ID   Result Name                        Timestep   CPU Time (sec)   Average (sec/TS)
25 Dec 2012 04:01:28   1253823   15489961    hadcm3n_39nx_1940_40_008261695_0   129,600    323,333          2.4949
24 Dec 2012 04:16:34   1253823   15489961    hadcm3n_39nx_1940_40_008261695_0   103,680    258,105          2.4894
23 Dec 2012 08:58:08   1253823   15489961    hadcm3n_39nx_1940_40_008261695_0    77,760    192,563          2.4764
22 Dec 2012 12:40:11   1253823   15489961    hadcm3n_39nx_1940_40_008261695_0    51,840    127,439          2.4583
21 Dec 2012 17:51:26   1253823   15489961    hadcm3n_39nx_1940_40_008261695_0    25,920     63,896          2.4651
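
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count reached at each trickle. The page does not state this, so treat it as an assumption. A minimal sketch that checks it against the five rows above; the function name `seconds_per_timestep` is hypothetical, chosen only for illustration:

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle rows above.
trickles = [
    (129_600, 323_333, 2.4949),
    (103_680, 258_105, 2.4894),
    (77_760, 192_563, 2.4764),
    (51_840, 127_439, 2.4583),
    (25_920, 63_896, 2.4651),
]


def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Assumed definition: cumulative CPU seconds divided by timesteps completed."""
    return cpu_time_sec / timestep


for timestep, cpu_sec, reported in trickles:
    computed = round(seconds_per_timestep(cpu_sec, timestep), 4)
    # Print the computed value next to the reported one for comparison.
    print(f"{timestep:>7}  computed={computed:.4f}  reported={reported:.4f}")
```

Under this assumption, the computed values agree with the reported column to four decimal places for all five rows.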

