climateprediction.net home page
Task 13545055

Task 13545055

Name hadcm3n_yc2o_1900_40_007519684_1
Workunit 7717159
Created 28 Oct 2011, 13:04:07 UTC
Sent 4 Nov 2011, 20:18:13 UTC
Report deadline 4 Feb 2012, 3:45:24 UTC
Received 15 Dec 2011, 22:37:55 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1135179
Run time 19 days 17 hours 19 min 14 sec
CPU time 18 days 14 hours 57 min 18 sec
Validate state Invalid
Credit 8,709.12
Device peak FLOPS 2.67 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3944, iMonCtr=1
Model crash detected, will try to restart...
18:32:11 (4648): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4112, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4112, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4392, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4392, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4392, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4392, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4392, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4392, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
15 Dec 2011 09:49:49 1135179 13545055 hadcm3n_yc2o_1900_40_007519684_1 725,760 1,569,978 2.1632
14 Dec 2011 17:45:55 1135179 13545055 hadcm3n_yc2o_1900_40_007519684_1 699,840 1,515,165 2.1650
13 Dec 2011 21:29:36 1135179 13545055 hadcm3n_yc2o_1900_40_007519684_1 673,920 1,462,684 2.1704
13 Dec 2011 05:09:36 1135179 13545055 hadcm3n_yc2o_1900_40_007519684_1 648,000 1,406,689 2.1708
12 Dec 2011 07:18:01 1135179 13545055 hadcm3n_yc2o_1900_40_007519684_1 622,080 1,350,724 2.1713
10 Dec 2011 09:01:08 1135179 13545055 hadcm3n_yc2o_1900_40_007519684_1 596,160 1,293,759 2.1702
09 Dec 2011 12:43:37 1135179 13545055 hadcm3n_yc2o_1900_40_007519684_1 570,240 1,236,294 2.1680
08 Dec 2011 19:43:54 1135179 13545055 hadcm3n_yc2o_1900_40_007519684_1 544,320 1,178,642 2.1653
08 Dec 2011 00:10:19 1135179 13545055 hadcm3n_yc2o_1900_40_007519684_1 518,400 1,121,264 2.1629
07 Dec 2011 04:24:28 1135179 13545055 hadcm3n_yc2o_1900_40_007519684_1 492,480 1,063,715 2.1599
06 Dec 2011 05:53:57 1135179 13545055 hadcm3n_yc2o_1900_40_007519684_1 466,560 1,005,919 2.1560
05 Dec 2011 09:21:04 1135179 13545055 hadcm3n_yc2o_1900_40_007519684_1 440,640 948,154 2.1518
04 Dec 2011 12:35:49 1135179 13545055 hadcm3n_yc2o_1900_40_007519684_1 414,720 890,505 2.1472
03 Dec 2011 18:19:05 1135179 13545055 hadcm3n_yc2o_1900_40_007519684_1 388,800 834,009 2.1451
02 Dec 2011 22:23:30 1135179 13545055 hadcm3n_yc2o_1900_40_007519684_1 362,880 778,134 2.1443
02 Dec 2011 02:05:10 1135179 13545055 hadcm3n_yc2o_1900_40_007519684_1 336,960 722,097 2.1430
01 Dec 2011 06:35:01 1135179 13545055 hadcm3n_yc2o_1900_40_007519684_1 311,040 666,383 2.1424
30 Nov 2011 15:08:28 1135179 13545055 hadcm3n_yc2o_1900_40_007519684_1 285,120 610,430 2.1410
28 Nov 2011 12:11:34 1135179 13545055 hadcm3n_yc2o_1900_40_007519684_1 259,200 554,472 2.1392
23 Nov 2011 06:57:15 1135179 13545055 hadcm3n_yc2o_1900_40_007519684_1 233,280 498,014 2.1348


©2024 climateprediction.net