Task 15919694

Name hadcm3n_o6zb_1980_40_008403640_1
Workunit 8554496
Created 14 Aug 2013, 17:30:10 UTC
Sent 14 Aug 2013, 18:53:49 UTC
Report deadline 14 Nov 2013, 2:21:00 UTC
Received 2 Sep 2013, 20:12:15 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1113142
Run time 13 days 23 hours 46 min 17 sec
CPU time 8 days 14 hours 37 min 29 sec
Validate state Invalid
Credit 4,976.64
Device peak FLOPS 2.62 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
15:02:20 (6164): No heartbeat from core client for 30 sec - exiting
15:02:21 (6164): No heartbeat from core client for 30 sec - exiting
15:02:22 (6164): No heartbeat from core client for 30 sec - exiting
15:02:24 (6164): No heartbeat from core client for 30 sec - exiting
15:02:25 (6164): No heartbeat from core client for 30 sec - exiting
15:02:26 (6164): No heartbeat from core client for 30 sec - exiting
15:02:27 (6164): No heartbeat from core client for 30 sec - exiting
15:02:28 (6164): No heartbeat from core client for 30 sec - exiting
15:02:29 (6164): No heartbeat from core client for 30 sec - exiting
15:02:30 (6164): No heartbeat from core client for 30 sec - exiting
15:02:31 (6164): No heartbeat from core client for 30 sec - exiting
15:02:32 (6164): No heartbeat from core client for 30 sec - exiting
15:02:33 (6164): No heartbeat from core client for 30 sec - exiting
15:02:34 (6164): No heartbeat from core client for 30 sec - exiting
15:02:36 (6164): No heartbeat from core client for 30 sec - exiting
15:02:37 (6164): No heartbeat from core client for 30 sec - exiting
15:02:38 (6164): No heartbeat from core client for 30 sec - exiting
15:02:39 (6164): No heartbeat from core client for 30 sec - exiting
15:02:40 (6164): No heartbeat from core client for 30 sec - exiting
15:02:41 (6164): No heartbeat from core client for 30 sec - exiting
15:02:42 (6164): No heartbeat from core client for 30 sec - exiting
15:02:43 (6164): No heartbeat from core client for 30 sec - exiting
15:02:44 (6164): No heartbeat from core client for 30 sec - exiting
15:02:45 (6164): No heartbeat from core client for 30 sec - exiting
15:02:46 (6164): No heartbeat from core client for 30 sec - exiting
15:02:48 (6164): No heartbeat from core client for 30 sec - exiting
15:02:49 (6164): No heartbeat from core client for 30 sec - exiting
15:02:50 (6164): No heartbeat from core client for 30 sec - exiting
15:02:51 (6164): No heartbeat from core client for 30 sec - exiting
15:02:52 (6164): No heartbeat from core client for 30 sec - exiting
15:02:53 (6164): No heartbeat from core client for 30 sec - exiting
15:02:54 (6164): No heartbeat from core client for 30 sec - exiting
15:02:55 (6164): No heartbeat from core client for 30 sec - exiting
15:02:56 (6164): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:54:09 (8464): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:54:10 (8464): No heartbeat from core client for 30 sec - exiting
09:54:11 (8464): No heartbeat from core client for 30 sec - exiting
09:54:12 (8464): No heartbeat from core client for 30 sec - exiting
09:54:13 (8464): No heartbeat from core client for 30 sec - exiting
09:54:14 (8464): No heartbeat from core client for 30 sec - exiting
09:54:15 (8464): No heartbeat from core client for 30 sec - exiting
09:54:16 (8464): No heartbeat from core client for 30 sec - exiting
09:54:17 (8464): No heartbeat from core client for 30 sec - exiting
09:54:18 (8464): No heartbeat from core client for 30 sec - exiting
09:54:19 (8464): No heartbeat from core client for 30 sec - exiting
05:27:49 (8172): No heartbeat from core client for 30 sec - exiting
05:27:51 (8172): No heartbeat from core client for 30 sec - exiting
05:27:52 (8172): No heartbeat from core client for 30 sec - exiting
05:27:53 (8172): No heartbeat from core client for 30 sec - exiting
05:27:54 (8172): No heartbeat from core client for 30 sec - exiting
05:27:55 (8172): No heartbeat from core client for 30 sec - exiting
05:27:56 (8172): No heartbeat from core client for 30 sec - exiting
05:27:57 (8172): No heartbeat from core client for 30 sec - exiting
05:27:58 (8172): No heartbeat from core client for 30 sec - exiting
05:28:00 (8172): No heartbeat from core client for 30 sec - exiting
05:28:01 (8172): No heartbeat from core client for 30 sec - exiting
05:28:02 (8172): No heartbeat from core client for 30 sec - exiting
05:28:03 (8172): No heartbeat from core client for 30 sec - exiting
05:28:04 (8172): No heartbeat from core client for 30 sec - exiting
05:28:05 (8172): No heartbeat from core client for 30 sec - exiting
05:28:06 (8172): No heartbeat from core client for 30 sec - exiting
05:28:07 (8172): No heartbeat from core client for 30 sec - exiting
05:28:08 (8172): No heartbeat from core client for 30 sec - exiting
05:28:09 (8172): No heartbeat from core client for 30 sec - exiting
05:28:11 (8172): No heartbeat from core client for 30 sec - exiting
05:28:12 (8172): No heartbeat from core client for 30 sec - exiting
05:28:13 (8172): No heartbeat from core client for 30 sec - exiting
05:28:14 (8172): No heartbeat from core client for 30 sec - exiting
05:28:15 (8172): No heartbeat from core client for 30 sec - exiting
05:28:16 (8172): No heartbeat from core client for 30 sec - exiting
05:28:17 (8172): No heartbeat from core client for 30 sec - exiting
05:28:18 (8172): No heartbeat from core client for 30 sec - exiting
05:28:19 (8172): No heartbeat from core client for 30 sec - exiting
05:28:20 (8172): No heartbeat from core client for 30 sec - exiting
05:28:22 (8172): No heartbeat from core client for 30 sec - exiting
05:28:23 (8172): No heartbeat from core client for 30 sec - exiting
05:28:24 (8172): No heartbeat from core client for 30 sec - exiting
05:28:25 (8172): No heartbeat from core client for 30 sec - exiting
05:28:26 (8172): No heartbeat from core client for 30 sec - exiting
05:28:27 (8172): No heartbeat from core client for 30 sec - exiting
05:28:28 (8172): No heartbeat from core client for 30 sec - exiting
05:28:29 (8172): No heartbeat from core client for 30 sec - exiting
05:28:30 (8172): No heartbeat from core client for 30 sec - exiting
05:28:31 (8172): No heartbeat from core client for 30 sec - exiting
05:28:32 (8172): No heartbeat from core client for 30 sec - exiting
05:28:34 (8172): No heartbeat from core client for 30 sec - exiting
05:28:35 (8172): No heartbeat from core client for 30 sec - exiting
05:28:36 (8172): No heartbeat from core client for 30 sec - exiting
05:28:37 (8172): No heartbeat from core client for 30 sec - exiting
05:28:38 (8172): No heartbeat from core client for 30 sec - exiting
05:28:39 (8172): No heartbeat from core client for 30 sec - exiting
05:28:40 (8172): No heartbeat from core client for 30 sec - exiting
05:28:41 (8172): No heartbeat from core client for 30 sec - exiting
05:28:42 (8172): No heartbeat from core client for 30 sec - exiting
05:28:43 (8172): No heartbeat from core client for 30 sec - exiting
05:28:44 (8172): No heartbeat from core client for 30 sec - exiting
05:28:46 (8172): No heartbeat from core client for 30 sec - exiting
05:28:47 (8172): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3992, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3992, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3992, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3992, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3992, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3992, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
01 Sep 2013 19:53:58  1113142  15919694   hadcm3n_o6zb_1980_40_008403640_1  414,720   725,800         1.7501
30 Aug 2013 22:09:31  1113142  15919694   hadcm3n_o6zb_1980_40_008403640_1  388,800   680,491         1.7502
29 Aug 2013 23:52:18  1113142  15919694   hadcm3n_o6zb_1980_40_008403640_1  362,880   635,582         1.7515
28 Aug 2013 23:42:18  1113142  15919694   hadcm3n_o6zb_1980_40_008403640_1  336,960   590,159         1.7514
28 Aug 2013 01:20:55  1113142  15919694   hadcm3n_o6zb_1980_40_008403640_1  311,040   544,497         1.7506
27 Aug 2013 06:10:09  1113142  15919694   hadcm3n_o6zb_1980_40_008403640_1  285,120   498,691         1.7491
26 Aug 2013 10:19:44  1113142  15919694   hadcm3n_o6zb_1980_40_008403640_1  259,200   453,032         1.7478
25 Aug 2013 12:18:45  1113142  15919694   hadcm3n_o6zb_1980_40_008403640_1  233,280   407,758         1.7479
24 Aug 2013 16:29:55  1113142  15919694   hadcm3n_o6zb_1980_40_008403640_1  207,360   362,324         1.7473
23 Aug 2013 21:59:51  1113142  15919694   hadcm3n_o6zb_1980_40_008403640_1  181,440   317,335         1.7490
23 Aug 2013 10:11:53  1113142  15919694   hadcm3n_o6zb_1980_40_008403640_1  155,520   272,498         1.7522
22 Aug 2013 15:26:31  1113142  15919694   hadcm3n_o6zb_1980_40_008403640_1  129,600   227,530         1.7556
21 Aug 2013 16:28:27  1113142  15919694   hadcm3n_o6zb_1980_40_008403640_1  103,680   182,634         1.7615
20 Aug 2013 20:36:42  1113142  15919694   hadcm3n_o6zb_1980_40_008403640_1  77,760    137,118         1.7633
20 Aug 2013 05:22:26  1113142  15919694   hadcm3n_o6zb_1980_40_008403640_1  51,840    92,083          1.7763
19 Aug 2013 08:19:39  1113142  15919694   hadcm3n_o6zb_1980_40_008403640_1  25,920    46,175          1.7814
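
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The page does not state this definition, so treat it as an assumption inferred from the figures. A minimal sketch that recomputes the value for two of the rows (the function name `seconds_per_timestep` is hypothetical, not part of BOINC or CPDN):

```python
def seconds_per_timestep(cpu_seconds: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals.

    Assumption: this matches how the "Average (sec/TS)" column is derived;
    the page itself does not document the formula.
    """
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_seconds / timestep, 4)


# (timestep, CPU time in seconds) copied from the last and first trickles.
trickles = [(414_720, 725_800), (25_920, 46_175)]

for timestep, cpu_seconds in trickles:
    print(timestep, seconds_per_timestep(cpu_seconds, timestep))
```

Under that assumption, the two rows give 1.7501 and 1.7814, which agree with the values listed in the table.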