climateprediction.net home page
Task 16289953

Task 16289953

Name hadcm3n_84dx_1980_40_008463641_4
Workunit 8614480
Created 11 Feb 2014, 15:37:50 UTC
Sent 11 Feb 2014, 15:38:04 UTC
Report deadline 13 May 2014, 23:05:15 UTC
Received 19 Mar 2014, 11:26:55 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1376550
Run time 34 days 5 hours 25 min 12 sec
CPU time 28 days 10 hours 5 min 27 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 1.80 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
03:10:38 (5912): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:10:47 (5912): No heartbeat from core client for 30 sec - exiting
03:10:48 (5912): No heartbeat from core client for 30 sec - exiting
03:10:49 (5912): No heartbeat from core client for 30 sec - exiting
03:10:50 (5912): No heartbeat from core client for 30 sec - exiting
03:10:52 (5912): No heartbeat from core client for 30 sec - exiting
03:10:53 (5912): No heartbeat from core client for 30 sec - exiting
03:10:54 (5912): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
CPDN Monitor - Quit request from BOINC...
12:34:46 (4272): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
14:01:50 (5836): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10248, iMonCtr=1
Model crash detected, will try to restart...
02:03:31 (6904): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:03:32 (6904): No heartbeat from core client for 30 sec - exiting
02:03:34 (6904): No heartbeat from core client for 30 sec - exiting
02:03:35 (6904): No heartbeat from core client for 30 sec - exiting
02:03:36 (6904): No heartbeat from core client for 30 sec - exiting
02:03:37 (6904): No heartbeat from core client for 30 sec - exiting
02:03:38 (6904): No heartbeat from core client for 30 sec - exiting
02:03:39 (6904): No heartbeat from core client for 30 sec - exiting
02:03:40 (6904): No heartbeat from core client for 30 sec - exiting
02:03:41 (6904): No heartbeat from core client for 30 sec - exiting
02:03:42 (6904): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
21:28:50 (5568): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4624, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4624, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4624, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4624, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4624, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4624, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
19 Mar 2014 06:41:21 1274848 16289953 hadcm3n_84dx_1980_40_008463641_4 777,600 2,452,554 3.1540
18 Mar 2014 05:57:57 1274848 16289953 hadcm3n_84dx_1980_40_008463641_4 751,680 2,367,586 3.1497
17 Mar 2014 04:39:49 1274848 16289953 hadcm3n_84dx_1980_40_008463641_4 725,760 2,282,006 3.1443
16 Mar 2014 03:06:13 1274848 16289953 hadcm3n_84dx_1980_40_008463641_4 699,840 2,195,613 3.1373
14 Mar 2014 21:38:26 1274848 16289953 hadcm3n_84dx_1980_40_008463641_4 673,920 2,108,471 3.1287
13 Mar 2014 16:18:00 1274848 16289953 hadcm3n_84dx_1980_40_008463641_4 648,000 2,011,695 3.1045
12 Mar 2014 05:22:01 1274848 16289953 hadcm3n_84dx_1980_40_008463641_4 622,080 1,938,473 3.1161
11 Mar 2014 17:05:44 1274848 16289953 hadcm3n_84dx_1980_40_008463641_4 596,160 1,890,928 3.1718
10 Mar 2014 16:52:38 1274848 16289953 hadcm3n_84dx_1980_40_008463641_4 570,240 1,808,957 3.1723
09 Mar 2014 17:53:44 1274848 16289953 hadcm3n_84dx_1980_40_008463641_4 544,320 1,727,220 3.1732
08 Mar 2014 14:12:57 1274848 16289953 hadcm3n_84dx_1980_40_008463641_4 518,400 1,646,332 3.1758
07 Mar 2014 11:33:55 1274848 16289953 hadcm3n_84dx_1980_40_008463641_4 492,480 1,564,292 3.1764
06 Mar 2014 10:59:03 1274848 16289953 hadcm3n_84dx_1980_40_008463641_4 466,560 1,484,960 3.1828
05 Mar 2014 03:31:53 1274848 16289953 hadcm3n_84dx_1980_40_008463641_4 440,640 1,400,665 3.1787
03 Mar 2014 20:46:54 1274848 16289953 hadcm3n_84dx_1980_40_008463641_4 414,720 1,319,923 3.1827
02 Mar 2014 13:28:29 1274848 16289953 hadcm3n_84dx_1980_40_008463641_4 388,800 1,239,691 3.1885
01 Mar 2014 06:03:58 1274848 16289953 hadcm3n_84dx_1980_40_008463641_4 362,880 1,156,409 3.1868
28 Feb 2014 01:25:05 1274848 16289953 hadcm3n_84dx_1980_40_008463641_4 336,960 1,076,027 3.1933
27 Feb 2014 04:44:26 1274848 16289953 hadcm3n_84dx_1980_40_008463641_4 311,040 1,012,307 3.2546
25 Feb 2014 23:22:51 1274848 16289953 hadcm3n_84dx_1980_40_008463641_4 285,120 933,009 3.2723


©2024 climateprediction.net