Task 13394817


Name hadcm3n_o1qk_1940_40_007445716_3
Workunit 7643219
Created 18 Sep 2011, 1:16:01 UTC
Sent 18 Sep 2011, 1:23:47 UTC
Report deadline 18 Dec 2011, 8:50:58 UTC
Received 25 Oct 2011, 18:32:21 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1155912
Run time 4 days 17 hours 21 min 48 sec
CPU time 4 days 15 hours 31 min 20 sec
Validate state Invalid
Credit 4,043.52
Device peak FLOPS 2.99 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>6.12.26</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
21:20:28 (4808): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:20:29 (4808): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4296, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2652, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2652, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2652, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2652, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2652, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2652, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
31 Oct 2011 15:07:55 | 1155912 | 13394817 | hadcm3n_o1qk_1940_40_007445716_3 | 336,960 | 397,937 | 1.1810
31 Oct 2011 15:07:55 | 1155912 | 13394817 | hadcm3n_o1qk_1940_40_007445716_3 | 311,040 | 367,893 | 1.1828
31 Oct 2011 15:07:54 | 1155912 | 13394817 | hadcm3n_o1qk_1940_40_007445716_3 | 285,120 | 337,168 | 1.1825
31 Oct 2011 15:07:54 | 1155912 | 13394817 | hadcm3n_o1qk_1940_40_007445716_3 | 259,200 | 306,381 | 1.1820
11 Oct 2011 19:52:23 | 1155912 | 13394817 | hadcm3n_o1qk_1940_40_007445716_3 | 233,280 | 275,380 | 1.1805
11 Oct 2011 03:12:49 | 1155912 | 13394817 | hadcm3n_o1qk_1940_40_007445716_3 | 207,360 | 244,793 | 1.1805
10 Oct 2011 16:59:34 | 1155912 | 13394817 | hadcm3n_o1qk_1940_40_007445716_3 | 181,440 | 214,171 | 1.1804
07 Oct 2011 04:46:06 | 1155912 | 13394817 | hadcm3n_o1qk_1940_40_007445716_3 | 155,520 | 183,521 | 1.1800
06 Oct 2011 11:57:09 | 1155912 | 13394817 | hadcm3n_o1qk_1940_40_007445716_3 | 129,600 | 153,428 | 1.1839
06 Oct 2011 01:51:28 | 1155912 | 13394817 | hadcm3n_o1qk_1940_40_007445716_3 | 103,680 | 121,893 | 1.1757
05 Oct 2011 07:08:19 | 1155912 | 13394817 | hadcm3n_o1qk_1940_40_007445716_3 | 77,760 | 91,363 | 1.1749
04 Oct 2011 17:20:36 | 1155912 | 13394817 | hadcm3n_o1qk_1940_40_007445716_3 | 51,840 | 61,035 | 1.1774
28 Sep 2011 02:39:37 | 1155912 | 13394817 | hadcm3n_o1qk_1940_40_007445716_3 | 25,920 | 30,752 | 1.1864
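
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. The page does not state this formula, so treat it as an assumption, though it matches the listed rows to four decimal places. A minimal Python sketch of that check, using three rows copied from the table (the function name `seconds_per_timestep` is hypothetical, chosen for illustration):

```python
# Rows copied from the trickle table: (timestep, cpu_seconds, reported_average).
rows = [
    (336_960, 397_937, 1.1810),  # 31 Oct 2011 15:07:55
    (311_040, 367_893, 1.1828),  # 31 Oct 2011 15:07:55
    (25_920, 30_752, 1.1864),    # 28 Sep 2011 02:39:37
]


def seconds_per_timestep(cpu_seconds, timestep):
    """Assumed formula: cumulative CPU seconds per cumulative timestep."""
    return cpu_seconds / timestep


for timestep, cpu_seconds, reported in rows:
    computed = seconds_per_timestep(cpu_seconds, timestep)
    # Show the computed value next to the value reported on the page.
    print(f"{timestep:>7}  computed {computed:.4f}  reported {reported:.4f}")
```

The trickles also arrive at a fixed spacing of 25,920 timesteps, so the same division can be applied to any row in the table.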

