climateprediction.net home page
Task 16045381

Task 16045381

Name hadcm3n_of31_1900_40_008474672_0
Workunit 8625511
Created 27 Sep 2013, 10:31:10 UTC
Sent 27 Sep 2013, 23:28:42 UTC
Report deadline 28 Dec 2013, 6:55:53 UTC
Received 1 Oct 2013, 16:01:49 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1291985
Run time 2 days 4 hours 41 min 31 sec
CPU time 2 days 3 hours 37 min 9 sec
Validate state Invalid
Credit 933.12
Device peak FLOPS 1.38 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
23:16:51 (5052): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:16:52 (5052): No heartbeat from core client for 30 sec - exiting
23:16:54 (5052): No heartbeat from core client for 30 sec - exiting
23:16:55 (5052): No heartbeat from core client for 30 sec - exiting
23:16:56 (5052): No heartbeat from core client for 30 sec - exiting
23:16:57 (5052): No heartbeat from core client for 30 sec - exiting
23:16:58 (5052): No heartbeat from core client for 30 sec - exiting
23:16:59 (5052): No heartbeat from core client for 30 sec - exiting
23:17:00 (5052): No heartbeat from core client for 30 sec - exiting
23:17:01 (5052): No heartbeat from core client for 30 sec - exiting
23:17:02 (5052): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
23:18:46 (9584): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
23:22:11 (5848): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:22:12 (5848): No heartbeat from core client for 30 sec - exiting
23:22:13 (5848): No heartbeat from core client for 30 sec - exiting
23:22:14 (5848): No heartbeat from core client for 30 sec - exiting
23:22:15 (5848): No heartbeat from core client for 30 sec - exiting
23:22:16 (5848): No heartbeat from core client for 30 sec - exiting
23:22:17 (5848): No heartbeat from core client for 30 sec - exiting
23:22:19 (5848): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
23:33:22 (5360): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:33:23 (5360): No heartbeat from core client for 30 sec - exiting
23:33:24 (5360): No heartbeat from core client for 30 sec - exiting
23:33:25 (5360): No heartbeat from core client for 30 sec - exiting
23:33:26 (5360): No heartbeat from core client for 30 sec - exiting
23:33:27 (5360): No heartbeat from core client for 30 sec - exiting
23:33:28 (5360): No heartbeat from core client for 30 sec - exiting
23:33:29 (5360): No heartbeat from core client for 30 sec - exiting
23:33:30 (5360): No heartbeat from core client for 30 sec - exiting
23:33:31 (5360): No heartbeat from core client for 30 sec - exiting
23:33:33 (5360): No heartbeat from core client for 30 sec - exiting
23:44:28 (8804): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:44:29 (8804): No heartbeat from core client for 30 sec - exiting
23:44:30 (8804): No heartbeat from core client for 30 sec - exiting
23:44:31 (8804): No heartbeat from core client for 30 sec - exiting
23:44:32 (8804): No heartbeat from core client for 30 sec - exiting
23:44:33 (8804): No heartbeat from core client for 30 sec - exiting
23:44:34 (8804): No heartbeat from core client for 30 sec - exiting
23:44:35 (8804): No heartbeat from core client for 30 sec - exiting
23:44:36 (8804): No heartbeat from core client for 30 sec - exiting
23:44:37 (8804): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:05:26 (8428): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:05:27 (8428): No heartbeat from core client for 30 sec - exiting
00:05:28 (8428): No heartbeat from core client for 30 sec - exiting
00:05:29 (8428): No heartbeat from core client for 30 sec - exiting
00:05:30 (8428): No heartbeat from core client for 30 sec - exiting
00:05:31 (8428): No heartbeat from core client for 30 sec - exiting
00:05:32 (8428): No heartbeat from core client for 30 sec - exiting
00:05:33 (8428): No heartbeat from core client for 30 sec - exiting
00:05:34 (8428): No heartbeat from core client for 30 sec - exiting
00:05:35 (8428): No heartbeat from core client for 30 sec - exiting
00:05:37 (8428): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4448, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4448, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4448, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4448, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4448, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4448, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
29 Sep 2013 23:46:06 1291985 16045381 hadcm3n_of31_1900_40_008474672_0 77,760 168,379 2.1654
29 Sep 2013 08:18:27 1291985 16045381 hadcm3n_of31_1900_40_008474672_0 51,840 113,122 2.1821
28 Sep 2013 16:52:18 1291985 16045381 hadcm3n_of31_1900_40_008474672_0 25,920 59,085 2.2795


©2024 climateprediction.net