climateprediction.net home page
Task 14367106

Task 14367106

Name hadcm3n_y9f4_1940_40_007858633_1
Workunit 8013745
Created 5 Apr 2012, 19:55:35 UTC
Sent 5 Apr 2012, 21:17:35 UTC
Report deadline 6 Jul 2012, 4:44:46 UTC
Received 21 Apr 2012, 13:46:05 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1305670
Run time 8 days 23 hours 25 min 48 sec
CPU time 8 days 23 hours 25 min 48 sec
Validate state Invalid
Credit 10,575.36
Device peak FLOPS 3.80 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
11:01:09 (5904): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:01:10 (5904): No heartbeat from core client for 30 sec - exiting
11:01:11 (5904): No heartbeat from core client for 30 sec - exiting
11:01:13 (5904): No heartbeat from core client for 30 sec - exiting
11:01:14 (5904): No heartbeat from core client for 30 sec - exiting
11:01:15 (5904): No heartbeat from core client for 30 sec - exiting
11:01:16 (5904): No heartbeat from core client for 30 sec - exiting
11:01:17 (5904): No heartbeat from core client for 30 sec - exiting
11:01:18 (5904): No heartbeat from core client for 30 sec - exiting
11:01:19 (5904): No heartbeat from core client for 30 sec - exiting
11:01:20 (5904): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6416, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5256, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5256, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5256, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5256, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5256, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6064, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
20 Apr 2012 16:18:10 1136999 14367106 hadcm3n_y9f4_1940_40_007858633_1 881,280 774,127 0.8784
20 Apr 2012 09:29:08 1136999 14367106 hadcm3n_y9f4_1940_40_007858633_1 855,360 749,132 0.8758
19 Apr 2012 17:55:15 1136999 14367106 hadcm3n_y9f4_1940_40_007858633_1 829,440 724,516 0.8735
19 Apr 2012 10:52:17 1136999 14367106 hadcm3n_y9f4_1940_40_007858633_1 803,520 699,952 0.8711
19 Apr 2012 04:02:58 1136999 14367106 hadcm3n_y9f4_1940_40_007858633_1 777,600 675,404 0.8686
18 Apr 2012 21:24:22 1136999 14367106 hadcm3n_y9f4_1940_40_007858633_1 751,680 650,725 0.8657
18 Apr 2012 14:45:43 1136999 14367106 hadcm3n_y9f4_1940_40_007858633_1 725,760 625,977 0.8625
17 Apr 2012 22:39:26 1136999 14367106 hadcm3n_y9f4_1940_40_007858633_1 699,840 601,600 0.8596
17 Apr 2012 15:28:18 1136999 14367106 hadcm3n_y9f4_1940_40_007858633_1 673,920 577,312 0.8566
16 Apr 2012 17:46:28 1136999 14367106 hadcm3n_y9f4_1940_40_007858633_1 648,000 552,893 0.8532
16 Apr 2012 10:41:29 1136999 14367106 hadcm3n_y9f4_1940_40_007858633_1 622,080 528,389 0.8494
15 Apr 2012 18:18:22 1136999 14367106 hadcm3n_y9f4_1940_40_007858633_1 596,160 504,052 0.8455
14 Apr 2012 21:00:39 1136999 14367106 hadcm3n_y9f4_1940_40_007858633_1 570,240 479,292 0.8405
14 Apr 2012 13:46:41 1136999 14367106 hadcm3n_y9f4_1940_40_007858633_1 544,320 454,757 0.8355
13 Apr 2012 22:40:08 1136999 14367106 hadcm3n_y9f4_1940_40_007858633_1 518,400 432,489 0.8343
13 Apr 2012 16:36:12 1136999 14367106 hadcm3n_y9f4_1940_40_007858633_1 492,480 410,714 0.8340
13 Apr 2012 09:53:10 1136999 14367106 hadcm3n_y9f4_1940_40_007858633_1 466,560 388,954 0.8337
12 Apr 2012 18:40:28 1136999 14367106 hadcm3n_y9f4_1940_40_007858633_1 440,640 366,960 0.8328
12 Apr 2012 12:13:11 1136999 14367106 hadcm3n_y9f4_1940_40_007858633_1 414,720 345,012 0.8319
12 Apr 2012 06:26:43 1136999 14367106 hadcm3n_y9f4_1940_40_007858633_1 388,800 323,195 0.8313


©2024 cpdn.org