climateprediction.net home page
Task 16070280

Task 16070280

Name hadcm3n_n1zl_1880_40_008375971_3
Workunit 8526830
Created 19 Oct 2013, 8:35:59 UTC
Sent 19 Oct 2013, 8:36:16 UTC
Report deadline 18 Jan 2014, 16:03:27 UTC
Received 22 Nov 2013, 6:58:17 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1256758
Run time 6 days 15 hours 5 min 22 sec
CPU time 5 days 20 hours 55 min 33 sec
Validate state Invalid
Credit 3,732.48
Device peak FLOPS 2.50 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
02:22:49 (4184): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
02:22:50 (4184): No heartbeat from core client for 30 sec - exiting
02:22:51 (4184): No heartbeat from core client for 30 sec - exiting
02:22:52 (4184): No heartbeat from core client for 30 sec - exiting
02:22:53 (4184): No heartbeat from core client for 30 sec - exiting
02:22:54 (4184): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
06:59:57 (6176): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
10:00:55 (5212): No heartbeat from core client for 30 sec - exiting
10:00:56 (5212): No heartbeat from core client for 30 sec - exiting
10:00:57 (5212): No heartbeat from core client for 30 sec - exiting
10:00:58 (5212): No heartbeat from core client for 30 sec - exiting
10:00:59 (5212): No heartbeat from core client for 30 sec - exiting
10:01:01 (5212): No heartbeat from core client for 30 sec - exiting
10:01:02 (5212): No heartbeat from core client for 30 sec - exiting
10:01:03 (5212): No heartbeat from core client for 30 sec - exiting
10:01:04 (5212): No heartbeat from core client for 30 sec - exiting
10:01:05 (5212): No heartbeat from core client for 30 sec - exiting
10:01:06 (5212): No heartbeat from core client for 30 sec - exiting
10:01:07 (5212): No heartbeat from core client for 30 sec - exiting
10:01:08 (5212): No heartbeat from core client for 30 sec - exiting
10:01:09 (5212): No heartbeat from core client for 30 sec - exiting
10:01:10 (5212): No heartbeat from core client for 30 sec - exiting
10:01:12 (5212): No heartbeat from core client for 30 sec - exiting
10:01:13 (5212): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
06:03:49 (5928): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:03:51 (5928): No heartbeat from core client for 30 sec - exiting
06:03:52 (5928): No heartbeat from core client for 30 sec - exiting
06:03:53 (5928): No heartbeat from core client for 30 sec - exiting
06:03:54 (5928): No heartbeat from core client for 30 sec - exiting
06:03:55 (5928): No heartbeat from core client for 30 sec - exiting
06:12:59 (692): No heartbeat from core client for 30 sec - exiting
06:13:00 (692): No heartbeat from core client for 30 sec - exiting
06:13:01 (692): No heartbeat from core client for 30 sec - exiting
06:13:02 (692): No heartbeat from core client for 30 sec - exiting
06:13:03 (692): No heartbeat from core client for 30 sec - exiting
06:13:04 (692): No heartbeat from core client for 30 sec - exiting
06:13:05 (692): No heartbeat from core client for 30 sec - exiting
06:13:07 (692): No heartbeat from core client for 30 sec - exiting
06:13:08 (692): No heartbeat from core client for 30 sec - exiting
06:13:09 (692): No heartbeat from core client for 30 sec - exiting
06:13:10 (692): No heartbeat from core client for 30 sec - exiting
06:13:11 (692): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4352, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4352, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4352, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4352, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4352, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4352, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
17 Nov 2013 19:41:38 1256758 16070280 hadcm3n_n1zl_1880_40_008375971_3 311,040 484,240 1.5568
15 Nov 2013 22:52:54 1256758 16070280 hadcm3n_n1zl_1880_40_008375971_3 285,120 447,019 1.5678
04 Nov 2013 07:03:41 1256758 16070280 hadcm3n_n1zl_1880_40_008375971_3 259,200 406,471 1.5682
02 Nov 2013 06:37:31 1256758 16070280 hadcm3n_n1zl_1880_40_008375971_3 233,280 365,131 1.5652
31 Oct 2013 14:21:22 1256758 16070280 hadcm3n_n1zl_1880_40_008375971_3 207,360 326,362 1.5739
31 Oct 2013 03:22:31 1256758 16070280 hadcm3n_n1zl_1880_40_008375971_3 181,440 288,153 1.5881
30 Oct 2013 16:20:20 1256758 16070280 hadcm3n_n1zl_1880_40_008375971_3 155,520 249,865 1.6066
30 Oct 2013 05:56:39 1256758 16070280 hadcm3n_n1zl_1880_40_008375971_3 129,600 211,532 1.6322
29 Oct 2013 16:19:07 1256758 16070280 hadcm3n_n1zl_1880_40_008375971_3 103,680 170,396 1.6435
28 Oct 2013 09:26:11 1256758 16070280 hadcm3n_n1zl_1880_40_008375971_3 77,760 129,528 1.6657
27 Oct 2013 20:18:11 1256758 16070280 hadcm3n_n1zl_1880_40_008375971_3 51,840 87,833 1.6943
26 Oct 2013 16:41:00 1256758 16070280 hadcm3n_n1zl_1880_40_008375971_3 25,920 46,170 1.7813


©2024 climateprediction.net