climateprediction.net home page
Task 17806496

Task 17806496

Name hadcm3n_ld0k_1940_40_009465813_1
Workunit 9548047
Created 18 Jan 2015, 21:31:57 UTC
Sent 19 Jan 2015, 0:45:36 UTC
Report deadline 20 Apr 2015, 8:12:47 UTC
Received 23 Jan 2015, 19:31:09 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1352433
Run time 1 days 19 hours 40 min 47 sec
CPU time 1 days 19 hours 11 min 58 sec
Validate state Invalid
Credit 2,488.32
Device peak FLOPS 4.95 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.5.0</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
20:05:01 (5964): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:05:02 (5964): No heartbeat from core client for 30 sec - exiting
20:05:03 (5964): No heartbeat from core client for 30 sec - exiting
20:05:04 (5964): No heartbeat from core client for 30 sec - exiting
20:05:05 (5964): No heartbeat from core client for 30 sec - exiting
20:05:06 (5964): No heartbeat from core client for 30 sec - exiting
20:05:08 (5964): No heartbeat from core client for 30 sec - exiting
20:05:09 (5964): No heartbeat from core client for 30 sec - exiting
20:05:10 (5964): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
13:15:56 (5656): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:15:57 (5656): No heartbeat from core client for 30 sec - exiting
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3036, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3036, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3036, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3036, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3036, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3036, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
23 Jan 2015 16:29:20 1352433 17806496 hadcm3n_ld0k_1940_40_009465813_1 207,360 145,734 0.7028
23 Jan 2015 04:35:16 1352433 17806496 hadcm3n_ld0k_1940_40_009465813_1 181,440 126,763 0.6986
22 Jan 2015 22:38:55 1352433 17806496 hadcm3n_ld0k_1940_40_009465813_1 155,520 108,263 0.6961
22 Jan 2015 16:12:01 1352433 17806496 hadcm3n_ld0k_1940_40_009465813_1 129,600 89,796 0.6929
22 Jan 2015 04:07:09 1352433 17806496 hadcm3n_ld0k_1940_40_009465813_1 103,680 72,313 0.6975
21 Jan 2015 23:08:05 1352433 17806496 hadcm3n_ld0k_1940_40_009465813_1 77,760 54,888 0.7059
19 Jan 2015 21:36:58 1352433 17806496 hadcm3n_ld0k_1940_40_009465813_1 51,840 37,507 0.7235
19 Jan 2015 16:30:17 1352433 17806496 hadcm3n_ld0k_1940_40_009465813_1 25,920 19,517 0.7530


©2024 climateprediction.net