climateprediction.net home page
Task 13773051

Name hadcm3n_yh89_1940_40_007454509_4
Workunit 7652012
Created 13 Dec 2011, 6:56:26 UTC
Sent 13 Dec 2011, 7:01:13 UTC
Report deadline 13 Mar 2012, 14:28:24 UTC
Received 9 Jan 2012, 4:19:07 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 807660
Run time 17 days 5 hours 11 min 41 sec
CPU time 14 days 2 hours 20 min 29 sec
Validate state Invalid
Credit 4,665.60
Device peak FLOPS 1.43 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:08:23 (1012): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:12:49 (1012): No heartbeat from core client for 30 sec - exiting
15:16:51 (1012): No heartbeat from core client for 30 sec - exiting
15:21:16 (1012): No heartbeat from core client for 30 sec - exiting
15:25:18 (1012): No heartbeat from core client for 30 sec - exiting
15:29:19 (1012): No heartbeat from core client for 30 sec - exiting
15:33:21 (1012): No heartbeat from core client for 30 sec - exiting
15:37:22 (1012): No heartbeat from core client for 30 sec - exiting
15:41:24 (1012): No heartbeat from core client for 30 sec - exiting
15:45:25 (1012): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
00:12:25 (5256): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:45:55 (3752): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:35:35 (4784): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:55:22 (2404): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:49:24 (3704): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:50:11 (4748): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:58:26 (1600): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:59:00 (300): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:54:34 (1688): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2520, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2520, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2520, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2520, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2520, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2520, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
29 Dec 2011 02:42:09  807660   13773051   hadcm3n_yh89_1940_40_007454509_4   388,800       1,145,549            2.9464
28 Dec 2011 02:58:34  807660   13773051   hadcm3n_yh89_1940_40_007454509_4   362,880       1,069,116            2.9462
27 Dec 2011 01:20:42  807660   13773051   hadcm3n_yh89_1940_40_007454509_4   336,960         992,644            2.9459
25 Dec 2011 22:38:05  807660   13773051   hadcm3n_yh89_1940_40_007454509_4   311,040         916,101            2.9453
24 Dec 2011 21:00:42  807660   13773051   hadcm3n_yh89_1940_40_007454509_4   285,120         839,470            2.9443
23 Dec 2011 18:49:19  807660   13773051   hadcm3n_yh89_1940_40_007454509_4   259,200         762,702            2.9425
22 Dec 2011 15:55:25  807660   13773051   hadcm3n_yh89_1940_40_007454509_4   233,280         686,042            2.9409
21 Dec 2011 15:04:33  807660   13773051   hadcm3n_yh89_1940_40_007454509_4   207,360         610,116            2.9423
20 Dec 2011 12:10:07  807660   13773051   hadcm3n_yh89_1940_40_007454509_4   181,440         533,707            2.9415
19 Dec 2011 10:18:01  807660   13773051   hadcm3n_yh89_1940_40_007454509_4   155,520         457,155            2.9395
18 Dec 2011 07:57:22  807660   13773051   hadcm3n_yh89_1940_40_007454509_4   129,600         380,490            2.9359
17 Dec 2011 08:18:49  807660   13773051   hadcm3n_yh89_1940_40_007454509_4   103,680         304,469            2.9366
16 Dec 2011 07:51:17  807660   13773051   hadcm3n_yh89_1940_40_007454509_4    77,760         228,130            2.9338
15 Dec 2011 05:58:36  807660   13773051   hadcm3n_yh89_1940_40_007454509_4    51,840         152,233            2.9366
14 Dec 2011 06:02:03  807660   13773051   hadcm3n_yh89_1940_40_007454509_4    25,920          76,076            2.9350
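
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that reading against three rows copied from the trickle table. The function name seconds_per_timestep and the row selection are illustrative assumptions, not part of BOINC or the CPDN server code.

```python
# Rows copied from the trickle table above:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS).
trickles = [
    (25_920, 76_076, 2.9350),
    (207_360, 610_116, 2.9423),
    (388_800, 1_145_549, 2.9464),
]


def seconds_per_timestep(cpu_seconds, timestep):
    """Assumed definition of the 'Average (sec/TS)' column (hypothetical helper)."""
    return cpu_seconds / timestep


for timestep, cpu_seconds, reported in trickles:
    computed = seconds_per_timestep(cpu_seconds, timestep)
    # Print the recomputed value next to the one shown on the page.
    print(f"{timestep:>8,d}  computed {computed:.4f}  reported {reported:.4f}")
```

If the assumption holds, the recomputed and reported values agree to four decimal places for each row, which suggests the model ran at a steady rate of roughly 2.94 CPU seconds per timestep until the last trickle.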