climateprediction.net home page
Task 16046337

Task 16046337

Name hadcm3n_oftg_1900_40_008475623_0
Workunit 8626462
Created 27 Sep 2013, 10:38:29 UTC
Sent 27 Sep 2013, 12:58:05 UTC
Report deadline 27 Dec 2013, 20:25:16 UTC
Received 18 Oct 2013, 0:25:05 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1167410
Run time 18 days 5 hours 3 min 24 sec
CPU time 17 days 11 hours 4 min 13 sec
Validate state Invalid
Credit 9,642.24
Device peak FLOPS 2.69 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
10:13:47 (552): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
17 Oct 2013 14:58:08 1167410 16046337 hadcm3n_oftg_1900_40_008475623_0 803,520 1,506,279 1.8746
16 Oct 2013 23:53:50 1167410 16046337 hadcm3n_oftg_1900_40_008475623_0 777,600 1,456,832 1.8735
16 Oct 2013 09:19:36 1167410 16046337 hadcm3n_oftg_1900_40_008475623_0 751,680 1,407,248 1.8721
15 Oct 2013 17:48:56 1167410 16046337 hadcm3n_oftg_1900_40_008475623_0 725,760 1,358,038 1.8712
15 Oct 2013 03:07:53 1167410 16046337 hadcm3n_oftg_1900_40_008475623_0 699,840 1,308,943 1.8703
14 Oct 2013 12:29:08 1167410 16046337 hadcm3n_oftg_1900_40_008475623_0 673,920 1,259,295 1.8686
13 Oct 2013 23:12:13 1167410 16046337 hadcm3n_oftg_1900_40_008475623_0 648,000 1,211,519 1.8696
13 Oct 2013 09:54:38 1167410 16046337 hadcm3n_oftg_1900_40_008475623_0 622,080 1,163,690 1.8706
12 Oct 2013 20:38:57 1167410 16046337 hadcm3n_oftg_1900_40_008475623_0 596,160 1,115,862 1.8717
12 Oct 2013 07:24:16 1167410 16046337 hadcm3n_oftg_1900_40_008475623_0 570,240 1,068,017 1.8729
11 Oct 2013 18:06:48 1167410 16046337 hadcm3n_oftg_1900_40_008475623_0 544,320 1,020,224 1.8743
10 Oct 2013 21:24:14 1167410 16046337 hadcm3n_oftg_1900_40_008475623_0 518,400 971,312 1.8737
10 Oct 2013 05:24:17 1167410 16046337 hadcm3n_oftg_1900_40_008475623_0 492,480 923,426 1.8751
09 Oct 2013 16:09:52 1167410 16046337 hadcm3n_oftg_1900_40_008475623_0 466,560 875,801 1.8771
09 Oct 2013 02:50:14 1167410 16046337 hadcm3n_oftg_1900_40_008475623_0 440,640 828,151 1.8794
08 Oct 2013 13:35:16 1167410 16046337 hadcm3n_oftg_1900_40_008475623_0 414,720 780,567 1.8822
08 Oct 2013 00:13:45 1167410 16046337 hadcm3n_oftg_1900_40_008475623_0 388,800 732,958 1.8852
07 Oct 2013 10:41:39 1167410 16046337 hadcm3n_oftg_1900_40_008475623_0 362,880 684,664 1.8868
06 Oct 2013 21:27:29 1167410 16046337 hadcm3n_oftg_1900_40_008475623_0 336,960 637,270 1.8912
06 Oct 2013 07:08:59 1167410 16046337 hadcm3n_oftg_1900_40_008475623_0 311,040 588,555 1.8922


©2024 cpdn.org