climateprediction.net home page
Task 15497755

Task 15497755

Name hadcm3n_3lzk_1940_40_008267996_0
Workunit 8423120
Created 21 Dec 2012, 21:44:07 UTC
Sent 21 Dec 2012, 22:05:03 UTC
Report deadline 23 Mar 2013, 5:32:14 UTC
Received 20 Feb 2013, 21:24:43 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1254767
Run time 5 days 16 hours 57 min 45 sec
CPU time 5 days 13 hours 35 min 41 sec
Validate state Invalid
Credit 4,354.56
Device peak FLOPS 3.33 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
20:21:28 (6680): No heartbeat from core client for 30 sec - exiting
20:21:29 (6680): No heartbeat from core client for 30 sec - exiting
20:21:30 (6680): No heartbeat from core client for 30 sec - exiting
20:21:31 (6680): No heartbeat from core client for 30 sec - exiting
20:21:33 (6680): No heartbeat from core client for 30 sec - exiting
20:21:34 (6680): No heartbeat from core client for 30 sec - exiting
20:21:35 (6680): No heartbeat from core client for 30 sec - exiting
20:21:36 (6680): No heartbeat from core client for 30 sec - exiting
20:21:37 (6680): No heartbeat from core client for 30 sec - exiting
20:21:38 (6680): No heartbeat from core client for 30 sec - exiting
20:21:39 (6680): No heartbeat from core client for 30 sec - exiting
20:21:40 (6680): No heartbeat from core client for 30 sec - exiting
20:21:41 (6680): No heartbeat from core client for 30 sec - exiting
20:21:42 (6680): No heartbeat from core client for 30 sec - exiting
20:21:43 (6680): No heartbeat from core client for 30 sec - exiting
20:21:45 (6680): No heartbeat from core client for 30 sec - exiting
20:21:46 (6680): No heartbeat from core client for 30 sec - exiting
20:21:47 (6680): No heartbeat from core client for 30 sec - exiting
20:21:48 (6680): No heartbeat from core client for 30 sec - exiting
20:21:49 (6680): No heartbeat from core client for 30 sec - exiting
20:21:50 (6680): No heartbeat from core client for 30 sec - exiting
20:21:51 (6680): No heartbeat from core client for 30 sec - exiting
20:21:52 (6680): No heartbeat from core client for 30 sec - exiting
20:21:53 (6680): No heartbeat from core client for 30 sec - exiting
20:21:54 (6680): No heartbeat from core client for 30 sec - exiting
20:21:55 (6680): No heartbeat from core client for 30 sec - exiting
20:21:57 (6680): No heartbeat from core client for 30 sec - exiting
20:21:58 (6680): No heartbeat from core client for 30 sec - exiting
20:21:59 (6680): No heartbeat from core client for 30 sec - exiting
20:22:00 (6680): No heartbeat from core client for 30 sec - exiting
20:22:01 (6680): No heartbeat from core client for 30 sec - exiting
20:22:02 (6680): No heartbeat from core client for 30 sec - exiting
20:22:03 (6680): No heartbeat from core client for 30 sec - exiting
20:22:04 (6680): No heartbeat from core client for 30 sec - exiting
20:22:05 (6680): No heartbeat from core client for 30 sec - exiting
20:22:06 (6680): No heartbeat from core client for 30 sec - exiting
20:22:07 (6680): No heartbeat from core client for 30 sec - exiting
20:22:09 (6680): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2716, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2716, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2716, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3624, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3624, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3624, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
20 Feb 2013 17:00:57 1254767 15497755 hadcm3n_3lzk_1940_40_008267996_0 362,880 473,304 1.3043
19 Feb 2013 18:07:19 1254767 15497755 hadcm3n_3lzk_1940_40_008267996_0 336,960 439,096 1.3031
18 Feb 2013 09:53:33 1254767 15497755 hadcm3n_3lzk_1940_40_008267996_0 311,040 404,448 1.3003
16 Feb 2013 00:27:09 1254767 15497755 hadcm3n_3lzk_1940_40_008267996_0 285,120 368,591 1.2928
15 Feb 2013 08:37:06 1254767 15497755 hadcm3n_3lzk_1940_40_008267996_0 259,200 331,867 1.2804
13 Feb 2013 17:31:16 1254767 15497755 hadcm3n_3lzk_1940_40_008267996_0 233,280 297,764 1.2764
06 Jan 2013 18:18:29 1254767 15497755 hadcm3n_3lzk_1940_40_008267996_0 207,360 264,368 1.2749
05 Jan 2013 19:40:39 1254767 15497755 hadcm3n_3lzk_1940_40_008267996_0 181,440 230,697 1.2715
04 Jan 2013 00:23:13 1254767 15497755 hadcm3n_3lzk_1940_40_008267996_0 155,520 198,168 1.2742
02 Jan 2013 21:37:43 1254767 15497755 hadcm3n_3lzk_1940_40_008267996_0 129,600 166,159 1.2821
31 Dec 2012 15:15:52 1254767 15497755 hadcm3n_3lzk_1940_40_008267996_0 103,680 133,901 1.2915
28 Dec 2012 16:46:15 1254767 15497755 hadcm3n_3lzk_1940_40_008267996_0 77,760 101,650 1.3072
25 Dec 2012 15:15:53 1254767 15497755 hadcm3n_3lzk_1940_40_008267996_0 51,840 66,088 1.2748
23 Dec 2012 22:43:37 1254767 15497755 hadcm3n_3lzk_1940_40_008267996_0 25,920 32,215 1.2429


©2024 climateprediction.net