Task 15875263

Name hadcm3n_4mzu_1980_40_008392164_3
Workunit 8543023
Created 1 Jul 2013, 7:20:59 UTC
Sent 1 Jul 2013, 7:35:37 UTC
Report deadline 30 Sep 2013, 15:02:48 UTC
Received 18 Jul 2013, 12:32:04 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1069778
Run time 11 days 8 hours 20 min 40 sec
CPU time 10 days 11 hours 38 min 14 sec
Validate state Invalid
Credit 5,909.76
Device peak FLOPS 2.55 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:41:04 (4904): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:07:10 (4532): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:32:34 (4584): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:35:54 (4404): No heartbeat from core client for 30 sec - exiting
05:35:55 (4404): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4368, iMonCtr=1
Model crash detected, will try to restart...
05:40:20 (4552): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4952, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
16:00:32 (4680): No heartbeat from core client for 30 sec - exiting
16:00:33 (4680): No heartbeat from core client for 30 sec - exiting
16:00:34 (4680): No heartbeat from core client for 30 sec - exiting
16:00:35 (4680): No heartbeat from core client for 30 sec - exiting
16:00:36 (4680): No heartbeat from core client for 30 sec - exiting
16:00:37 (4680): No heartbeat from core client for 30 sec - exiting
16:00:38 (4680): No heartbeat from core client for 30 sec - exiting
16:00:39 (4680): No heartbeat from core client for 30 sec - exiting
16:00:40 (4680): No heartbeat from core client for 30 sec - exiting
16:00:41 (4680): No heartbeat from core client for 30 sec - exiting
16:00:42 (4680): No heartbeat from core client for 30 sec - exiting
16:00:43 (4680): No heartbeat from core client for 30 sec - exiting
16:00:44 (4680): No heartbeat from core client for 30 sec - exiting
16:00:45 (4680): No heartbeat from core client for 30 sec - exiting
16:00:46 (4680): No heartbeat from core client for 30 sec - exiting
16:00:47 (4680): No heartbeat from core client for 30 sec - exiting
16:00:48 (4680): No heartbeat from core client for 30 sec - exiting
16:00:49 (4680): No heartbeat from core client for 30 sec - exiting
16:00:50 (4680): No heartbeat from core client for 30 sec - exiting
16:00:51 (4680): No heartbeat from core client for 30 sec - exiting
16:00:52 (4680): No heartbeat from core client for 30 sec - exiting
16:00:53 (4680): No heartbeat from core client for 30 sec - exiting
16:00:54 (4680): No heartbeat from core client for 30 sec - exiting
16:00:55 (4680): No heartbeat from core client for 30 sec - exiting
16:00:56 (4680): No heartbeat from core client for 30 sec - exiting
16:00:57 (4680): No heartbeat from core client for 30 sec - exiting
16:00:58 (4680): No heartbeat from core client for 30 sec - exiting
16:00:59 (4680): No heartbeat from core client for 30 sec - exiting
16:01:00 (4680): No heartbeat from core client for 30 sec - exiting
16:01:01 (4680): No heartbeat from core client for 30 sec - exiting
16:01:02 (4680): No heartbeat from core client for 30 sec - exiting
16:01:03 (4680): No heartbeat from core client for 30 sec - exiting
16:01:04 (4680): No heartbeat from core client for 30 sec - exiting
16:01:05 (4680): No heartbeat from core client for 30 sec - exiting
16:01:06 (4680): No heartbeat from core client for 30 sec - exiting
16:01:07 (4680): No heartbeat from core client for 30 sec - exiting
16:01:08 (4680): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4884, iMonCtr=1
Model crash detected, will try to restart...
06:22:18 (4752): No heartbeat from core client for 30 sec - exiting
06:22:19 (4752): No heartbeat from core client for 30 sec - exiting
06:22:20 (4752): No heartbeat from core client for 30 sec - exiting
06:22:21 (4752): No heartbeat from core client for 30 sec - exiting
06:22:22 (4752): No heartbeat from core client for 30 sec - exiting
06:22:23 (4752): No heartbeat from core client for 30 sec - exiting
06:22:24 (4752): No heartbeat from core client for 30 sec - exiting
06:22:25 (4752): No heartbeat from core client for 30 sec - exiting
06:22:26 (4752): No heartbeat from core client for 30 sec - exiting
06:22:27 (4752): No heartbeat from core client for 30 sec - exiting
06:22:28 (4752): No heartbeat from core client for 30 sec - exiting
06:22:29 (4752): No heartbeat from core client for 30 sec - exiting
06:22:30 (4752): No heartbeat from core client for 30 sec - exiting
06:22:31 (4752): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:22:32 (4752): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
16:00:32 (408): No heartbeat from core client for 30 sec - exiting
16:00:33 (408): No heartbeat from core client for 30 sec - exiting
16:00:34 (408): No heartbeat from core client for 30 sec - exiting
16:00:35 (408): No heartbeat from core client for 30 sec - exiting
16:00:36 (408): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
22:29:01 (4628): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
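The stderr text above interleaves a few recurring message types. As a rough aid for reading it, the following Python sketch tallies them. The marker substrings are copied from the log itself; the function name `tally_events` and the category labels are hypothetical names chosen for this illustration and are not part of BOINC or the CPDN application.

```python
from collections import Counter

# Marker substrings copied from the stderr text above. The keys are
# hypothetical labels invented for this sketch.
MARKERS = {
    "suspend_request": "Suspend request from BOINC",
    "heartbeat_timeout": "No heartbeat from core client",
    "model_crash": "Model crash detected",
    "signal_22": "Signal 22 received",
}


def tally_events(stderr_text: str) -> Counter:
    """Count how many lines contain each marker substring."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for label, marker in MARKERS.items():
            if marker in line:
                counts[label] += 1
    return counts


if __name__ == "__main__":
    # A few lines copied from the log, used only as sample input.
    sample = "\n".join([
        "Suspended CPDN Monitor - Suspend request from BOINC...",
        "18:41:04 (4904): No heartbeat from core client for 30 sec - exiting",
        "Model crash detected, will try to restart...",
        "Signal 22 received, exiting...",
    ])
    for label, count in sorted(tally_events(sample).items()):
        print(label, count)
```

Pasting the full contents of the stderr section into `tally_events` would give a per-category count, which may make it easier to see how the heartbeat timeouts and model restarts are distributed before the final "too many model crashes" message.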
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
23 Jul 2013 15:56:38  1069778  15875263   hadcm3n_4mzu_1980_40_008392164_3  492,480   887,623         1.8024
23 Jul 2013 15:56:38  1069778  15875263   hadcm3n_4mzu_1980_40_008392164_3  466,560   840,588         1.8017
23 Jul 2013 15:56:38  1069778  15875263   hadcm3n_4mzu_1980_40_008392164_3  440,640   793,875         1.8016
23 Jul 2013 15:56:38  1069778  15875263   hadcm3n_4mzu_1980_40_008392164_3  414,720   747,205         1.8017
23 Jul 2013 15:56:38  1069778  15875263   hadcm3n_4mzu_1980_40_008392164_3  388,800   700,969         1.8029
11 Jul 2013 10:21:34  1069778  15875263   hadcm3n_4mzu_1980_40_008392164_3  362,880   654,436         1.8035
10 Jul 2013 16:27:36  1069778  15875263   hadcm3n_4mzu_1980_40_008392164_3  336,960   608,211         1.8050
10 Jul 2013 02:49:08  1069778  15875263   hadcm3n_4mzu_1980_40_008392164_3  311,040   561,678         1.8058
09 Jul 2013 08:42:01  1069778  15875263   hadcm3n_4mzu_1980_40_008392164_3  285,120   514,516         1.8046
08 Jul 2013 13:53:37  1069778  15875263   hadcm3n_4mzu_1980_40_008392164_3  259,200   468,547         1.8077
08 Jul 2013 01:15:43  1069778  15875263   hadcm3n_4mzu_1980_40_008392164_3  233,280   423,521         1.8155
07 Jul 2013 07:45:49  1069778  15875263   hadcm3n_4mzu_1980_40_008392164_3  207,360   378,422         1.8250
06 Jul 2013 13:18:46  1069778  15875263   hadcm3n_4mzu_1980_40_008392164_3  181,440   332,337         1.8317
06 Jul 2013 05:10:25  1069778  15875263   hadcm3n_4mzu_1980_40_008392164_3  155,520   285,800         1.8377
06 Jul 2013 04:24:22  1069778  15875263   hadcm3n_4mzu_1980_40_008392164_3  129,600   237,996         1.8364
04 Jul 2013 14:22:27  1069778  15875263   hadcm3n_4mzu_1980_40_008392164_3  103,680   190,189         1.8344
03 Jul 2013 11:41:13  1069778  15875263   hadcm3n_4mzu_1980_40_008392164_3  77,760    143,656         1.8474
02 Jul 2013 16:57:14  1069778  15875263   hadcm3n_4mzu_1980_40_008392164_3  51,840    95,804          1.8481
02 Jul 2013 12:03:27  1069778  15875263   hadcm3n_4mzu_1980_40_008392164_3  25,920    47,929          1.8491
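
In the trickle table, the Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The short Python sketch below checks that reading against the first and last rows. The function name `seconds_per_timestep` is a hypothetical name for this illustration, and the rounding rule is an assumption inferred from the table, not taken from the project's documentation.

```python
def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals.

    The 4-decimal rounding is an assumption inferred from the table.
    """
    return round(cpu_time_sec / timestep, 4)


if __name__ == "__main__":
    # (timestep, CPU time in seconds) copied from the newest and oldest rows.
    rows = [(492_480, 887_623), (25_920, 47_929)]
    for timestep, cpu_time_sec in rows:
        print(timestep, seconds_per_timestep(cpu_time_sec, timestep))
    # 492480 1.8024
    # 25920 1.8491
```

Both values match the Average column for those rows, which is consistent with the column being a running average over the whole task and not a per-trickle rate.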


©2024 climateprediction.net