climateprediction.net home page
Task 13099027

Task 13099027

Name hadcm3n_ybcy_1900_40_007347580_1
Workunit 7545010
Created 6 Jul 2011, 13:45:07 UTC
Sent 18 Jul 2011, 23:22:38 UTC
Report deadline 18 Oct 2011, 6:49:49 UTC
Received 5 Aug 2011, 6:47:55 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1155912
Run time 3 days 23 hours 54 min 51 sec
CPU time 3 days 23 hours 31 min 1 sec
Validate state Invalid
Credit 3,421.44
Device peak FLOPS 2.99 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.26</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
14:32:42 (2820): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1212, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
00:09:13 (2556): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:09:14 (2556): No heartbeat from core client for 30 sec - exiting
00:09:15 (2556): No heartbeat from core client for 30 sec - exiting
00:09:16 (2556): No heartbeat from core client for 30 sec - exiting
00:09:17 (2556): No heartbeat from core client for 30 sec - exiting
00:09:18 (2556): No heartbeat from core client for 30 sec - exiting
00:09:19 (2556): No heartbeat from core client for 30 sec - exiting
00:09:20 (2556): No heartbeat from core client for 30 sec - exiting
00:09:21 (2556): No heartbeat from core client for 30 sec - exiting
00:09:22 (2556): No heartbeat from core client for 30 sec - exiting
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2652, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2652, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2652, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2652, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2652, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2652, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
30 Jul 2011 13:35:41 1155912 13099027 hadcm3n_ybcy_1900_40_007347580_1 285,120 337,171 1.1826
30 Jul 2011 04:56:32 1155912 13099027 hadcm3n_ybcy_1900_40_007347580_1 259,200 306,459 1.1823
29 Jul 2011 00:43:37 1155912 13099027 hadcm3n_ybcy_1900_40_007347580_1 233,280 275,270 1.1800
25 Jul 2011 22:49:40 1155912 13099027 hadcm3n_ybcy_1900_40_007347580_1 207,360 245,169 1.1823
25 Jul 2011 21:56:43 1155912 13099027 hadcm3n_ybcy_1900_40_007347580_1 181,440 214,907 1.1845
25 Jul 2011 20:59:40 1155912 13099027 hadcm3n_ybcy_1900_40_007347580_1 155,520 184,859 1.1887
25 Jul 2011 20:26:41 1155912 13099027 hadcm3n_ybcy_1900_40_007347580_1 129,600 154,630 1.1931
25 Jul 2011 19:26:26 1155912 13099027 hadcm3n_ybcy_1900_40_007347580_1 103,680 124,624 1.2020
25 Jul 2011 18:55:24 1155912 13099027 hadcm3n_ybcy_1900_40_007347580_1 77,760 93,786 1.2061
25 Jul 2011 18:45:25 1155912 13099027 hadcm3n_ybcy_1900_40_007347580_1 51,840 62,056 1.1971
25 Jul 2011 18:10:17 1155912 13099027 hadcm3n_ybcy_1900_40_007347580_1 25,920 31,204 1.2039


©2024 cpdn.org