Task 16002209

Name hadcm3n_4ig3_2020_40_008390175_1
Workunit 8541034
Created 3 Sep 2013, 18:49:24 UTC
Sent 3 Sep 2013, 18:59:54 UTC
Report deadline 4 Dec 2013, 2:27:05 UTC
Received 19 Sep 2013, 6:20:56 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1185663
Run time 13 days 4 hours 15 min 19 sec
CPU time 10 days 12 hours 49 min 8 sec
Validate state Invalid
Credit 12,130.56
Device peak FLOPS 3.22 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:23:01 (7948): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
02:18:55 (10072): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:18:56 (10072): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:32:16 (5496): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6096, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6096, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6096, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6096, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6096, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6096, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
16 Sep 2013 10:02:23 | 1185663 | 16002209 | hadcm3n_4ig3_2020_40_008390175_1 | 1,010,880 | 905,630 | 0.8959
16 Sep 2013 03:14:30 | 1185663 | 16002209 | hadcm3n_4ig3_2020_40_008390175_1 | 984,960 | 882,057 | 0.8955
15 Sep 2013 20:22:24 | 1185663 | 16002209 | hadcm3n_4ig3_2020_40_008390175_1 | 959,040 | 858,257 | 0.8949
15 Sep 2013 13:25:26 | 1185663 | 16002209 | hadcm3n_4ig3_2020_40_008390175_1 | 933,120 | 834,234 | 0.8940
15 Sep 2013 06:38:20 | 1185663 | 16002209 | hadcm3n_4ig3_2020_40_008390175_1 | 907,200 | 810,603 | 0.8935
14 Sep 2013 23:46:39 | 1185663 | 16002209 | hadcm3n_4ig3_2020_40_008390175_1 | 881,280 | 787,079 | 0.8931
14 Sep 2013 16:55:00 | 1185663 | 16002209 | hadcm3n_4ig3_2020_40_008390175_1 | 855,360 | 763,281 | 0.8924
14 Sep 2013 09:58:22 | 1185663 | 16002209 | hadcm3n_4ig3_2020_40_008390175_1 | 829,440 | 739,571 | 0.8917
14 Sep 2013 02:15:27 | 1185663 | 16002209 | hadcm3n_4ig3_2020_40_008390175_1 | 803,520 | 716,345 | 0.8915
13 Sep 2013 18:28:14 | 1185663 | 16002209 | hadcm3n_4ig3_2020_40_008390175_1 | 777,600 | 693,564 | 0.8919
13 Sep 2013 11:06:36 | 1185663 | 16002209 | hadcm3n_4ig3_2020_40_008390175_1 | 751,680 | 670,752 | 0.8923
13 Sep 2013 03:29:49 | 1185663 | 16002209 | hadcm3n_4ig3_2020_40_008390175_1 | 725,760 | 647,829 | 0.8926
12 Sep 2013 19:32:57 | 1185663 | 16002209 | hadcm3n_4ig3_2020_40_008390175_1 | 699,840 | 624,282 | 0.8920
12 Sep 2013 11:45:18 | 1185663 | 16002209 | hadcm3n_4ig3_2020_40_008390175_1 | 673,920 | 600,553 | 0.8911
12 Sep 2013 04:57:54 | 1185663 | 16002209 | hadcm3n_4ig3_2020_40_008390175_1 | 648,000 | 576,770 | 0.8901
11 Sep 2013 22:10:29 | 1185663 | 16002209 | hadcm3n_4ig3_2020_40_008390175_1 | 622,080 | 553,215 | 0.8893
11 Sep 2013 15:22:48 | 1185663 | 16002209 | hadcm3n_4ig3_2020_40_008390175_1 | 596,160 | 529,574 | 0.8883
11 Sep 2013 08:46:07 | 1185663 | 16002209 | hadcm3n_4ig3_2020_40_008390175_1 | 570,240 | 506,471 | 0.8882
11 Sep 2013 02:24:31 | 1185663 | 16002209 | hadcm3n_4ig3_2020_40_008390175_1 | 544,320 | 484,001 | 0.8892
10 Sep 2013 19:57:23 | 1185663 | 16002209 | hadcm3n_4ig3_2020_40_008390175_1 | 518,400 | 461,523 | 0.8903
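
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The sketch below checks that reading against two rows from the table. The function name `average_sec_per_timestep` is hypothetical, and the formula is inferred from the figures shown, not taken from the project's code.

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 places.

    Assumption: the page computes the column this way; this is inferred
    from the table values, not from climateprediction.net source code.
    """
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds) copied from the newest and oldest rows.
rows = [
    (1_010_880, 905_630),
    (518_400, 461_523),
]

for timestep, cpu_time_sec in rows:
    print(timestep, average_sec_per_timestep(cpu_time_sec, timestep))
```

Both rows reproduce the listed averages (0.8959 and 0.8903), which is consistent with the column being a running average over the whole task and not a per-trickle rate.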

