climateprediction.net home page
Task 13567390

Task 13567390

Name hadcm3n_ydzx_1900_40_007525321_3
Workunit 7722796
Created 30 Oct 2011, 7:43:50 UTC
Sent 30 Oct 2011, 7:51:58 UTC
Report deadline 29 Jan 2012, 15:19:09 UTC
Received 6 Nov 2011, 11:47:01 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1144003
Run time 6 days 21 hours 39 min 31 sec
CPU time 6 days 17 hours 8 min 25 sec
Validate state Invalid
Credit 4,976.64
Device peak FLOPS 3.29 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4648, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
09:42:30 (5628): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:42:41 (5628): No heartbeat from core client for 30 sec - exiting
09:42:48 (5628): No heartbeat from core client for 30 sec - exiting
09:42:50 (5628): No heartbeat from core client for 30 sec - exiting
09:42:52 (5628): No heartbeat from core client for 30 sec - exiting
09:42:55 (5628): No heartbeat from core client for 30 sec - exiting
09:42:56 (5628): No heartbeat from core client for 30 sec - exiting
09:42:58 (5628): No heartbeat from core client for 30 sec - exiting
09:42:59 (5628): No heartbeat from core client for 30 sec - exiting
09:43:01 (5628): No heartbeat from core client for 30 sec - exiting
09:43:02 (5628): No heartbeat from core client for 30 sec - exiting
09:43:03 (5628): No heartbeat from core client for 30 sec - exiting
09:43:05 (5628): No heartbeat from core client for 30 sec - exiting
09:43:11 (5628): No heartbeat from core client for 30 sec - exiting
09:43:17 (5628): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
12:54:13 (6480): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:33:02 (5032): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:33:14 (5032): No heartbeat from core client for 30 sec - exiting
08:33:20 (5032): No heartbeat from core client for 30 sec - exiting
08:33:32 (5032): No heartbeat from core client for 30 sec - exiting
08:33:36 (5032): No heartbeat from core client for 30 sec - exiting
08:33:38 (5032): No heartbeat from core client for 30 sec - exiting
08:33:40 (5032): No heartbeat from core client for 30 sec - exiting
08:33:43 (5032): No heartbeat from core client for 30 sec - exiting
08:33:48 (5032): No heartbeat from core client for 30 sec - exiting
08:33:55 (5032): No heartbeat from core client for 30 sec - exiting
08:33:59 (5032): No heartbeat from core client for 30 sec - exiting
08:34:05 (5032): No heartbeat from core client for 30 sec - exiting
08:34:09 (5032): No heartbeat from core client for 30 sec - exiting
08:34:13 (5032): No heartbeat from core client for 30 sec - exiting
08:34:17 (5032): No heartbeat from core client for 30 sec - exiting
08:34:23 (5032): No heartbeat from core client for 30 sec - exiting
08:34:28 (5032): No heartbeat from core client for 30 sec - exiting
08:34:31 (5032): No heartbeat from core client for 30 sec - exiting
08:34:35 (5032): No heartbeat from core client for 30 sec - exiting
08:34:38 (5032): No heartbeat from core client for 30 sec - exiting
08:34:39 (5032): No heartbeat from core client for 30 sec - exiting
08:34:40 (5032): No heartbeat from core client for 30 sec - exiting
08:34:41 (5032): No heartbeat from core client for 30 sec - exiting
08:34:42 (5032): No heartbeat from core client for 30 sec - exiting
08:34:43 (5032): No heartbeat from core client for 30 sec - exiting
08:34:44 (5032): No heartbeat from core client for 30 sec - exiting
08:34:45 (5032): No heartbeat from core client for 30 sec - exiting
08:34:46 (5032): No heartbeat from core client for 30 sec - exiting
08:34:47 (5032): No heartbeat from core client for 30 sec - exiting
08:34:48 (5032): No heartbeat from core client for 30 sec - exiting
08:34:49 (5032): No heartbeat from core client for 30 sec - exiting
08:34:50 (5032): No heartbeat from core client for 30 sec - exiting
08:34:51 (5032): No heartbeat from core client for 30 sec - exiting
08:34:52 (5032): No heartbeat from core client for 30 sec - exiting
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8820, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8820, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8820, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2204, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2204, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2204, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
06 Nov 2011 00:27:35 1144003 13567390 hadcm3n_ydzx_1900_40_007525321_3 414,720 550,244 1.3268
05 Nov 2011 14:57:47 1144003 13567390 hadcm3n_ydzx_1900_40_007525321_3 388,800 515,912 1.3269
05 Nov 2011 05:08:17 1144003 13567390 hadcm3n_ydzx_1900_40_007525321_3 362,880 481,592 1.3271
04 Nov 2011 21:26:15 1144003 13567390 hadcm3n_ydzx_1900_40_007525321_3 336,960 447,098 1.3269
04 Nov 2011 08:25:19 1144003 13567390 hadcm3n_ydzx_1900_40_007525321_3 311,040 412,396 1.3259
03 Nov 2011 22:38:04 1144003 13567390 hadcm3n_ydzx_1900_40_007525321_3 285,120 377,678 1.3246
03 Nov 2011 13:04:36 1144003 13567390 hadcm3n_ydzx_1900_40_007525321_3 259,200 343,031 1.3234
03 Nov 2011 10:59:25 1144003 13567390 hadcm3n_ydzx_1900_40_007525321_3 233,280 308,291 1.3215
02 Nov 2011 17:54:35 1144003 13567390 hadcm3n_ydzx_1900_40_007525321_3 207,360 273,803 1.3204
02 Nov 2011 07:33:23 1144003 13567390 hadcm3n_ydzx_1900_40_007525321_3 181,440 239,228 1.3185
01 Nov 2011 21:53:27 1144003 13567390 hadcm3n_ydzx_1900_40_007525321_3 155,520 204,683 1.3161
01 Nov 2011 11:05:29 1144003 13567390 hadcm3n_ydzx_1900_40_007525321_3 129,600 170,671 1.3169
01 Nov 2011 01:23:56 1144003 13567390 hadcm3n_ydzx_1900_40_007525321_3 103,680 136,282 1.3144
31 Oct 2011 20:03:02 1144003 13567390 hadcm3n_ydzx_1900_40_007525321_3 77,760 101,985 1.3115
31 Oct 2011 19:35:40 1144003 13567390 hadcm3n_ydzx_1900_40_007525321_3 51,840 67,660 1.3052
31 Oct 2011 19:21:51 1144003 13567390 hadcm3n_ydzx_1900_40_007525321_3 25,920 33,893 1.3076


©2024 cpdn.org