climateprediction.net home page
Task 15928986

Task 15928986

Name hadcm3n_n3e7_1960_40_008407318_0
Workunit 8558174
Created 20 Aug 2013, 11:17:11 UTC
Sent 20 Aug 2013, 11:23:47 UTC
Report deadline 19 Nov 2013, 18:50:58 UTC
Received 8 Sep 2013, 13:25:24 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1222669
Run time 6 days 23 hours 53 min 11 sec
CPU time 6 days 20 hours 27 min 32 sec
Validate state Invalid
Credit 5,287.68
Device peak FLOPS 2.86 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
10:15:16 (3596): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4104, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9032, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4700, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3496, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3636, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3472, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4112, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3996, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3996, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3996, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4132, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8176, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8100, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8100, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8100, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8100, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8100, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8100, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
05 Sep 2013 20:47:25 1222669 15928986 hadcm3n_n3e7_1960_40_008407318_0 440,640 560,098 1.2711
04 Sep 2013 18:14:37 1222669 15928986 hadcm3n_n3e7_1960_40_008407318_0 414,720 527,312 1.2715
02 Sep 2013 22:16:08 1222669 15928986 hadcm3n_n3e7_1960_40_008407318_0 388,800 494,718 1.2724
01 Sep 2013 18:43:27 1222669 15928986 hadcm3n_n3e7_1960_40_008407318_0 362,880 462,051 1.2733
31 Aug 2013 18:14:18 1222669 15928986 hadcm3n_n3e7_1960_40_008407318_0 336,960 428,993 1.2731
30 Aug 2013 23:55:58 1222669 15928986 hadcm3n_n3e7_1960_40_008407318_0 311,040 396,312 1.2742
29 Aug 2013 20:40:45 1222669 15928986 hadcm3n_n3e7_1960_40_008407318_0 285,120 363,958 1.2765
27 Aug 2013 21:44:35 1222669 15928986 hadcm3n_n3e7_1960_40_008407318_0 259,200 331,431 1.2787
26 Aug 2013 17:21:13 1222669 15928986 hadcm3n_n3e7_1960_40_008407318_0 233,280 298,783 1.2808
25 Aug 2013 16:14:38 1222669 15928986 hadcm3n_n3e7_1960_40_008407318_0 207,360 266,498 1.2852
24 Aug 2013 21:00:53 1222669 15928986 hadcm3n_n3e7_1960_40_008407318_0 181,440 234,153 1.2905
24 Aug 2013 11:48:52 1222669 15928986 hadcm3n_n3e7_1960_40_008407318_0 155,520 201,809 1.2976
23 Aug 2013 16:33:14 1222669 15928986 hadcm3n_n3e7_1960_40_008407318_0 129,600 169,577 1.3085
23 Aug 2013 10:55:33 1222669 15928986 hadcm3n_n3e7_1960_40_008407318_0 103,680 137,280 1.3241
22 Aug 2013 15:31:40 1222669 15928986 hadcm3n_n3e7_1960_40_008407318_0 77,760 104,888 1.3489
21 Aug 2013 18:23:51 1222669 15928986 hadcm3n_n3e7_1960_40_008407318_0 51,840 72,536 1.3992
20 Aug 2013 21:42:35 1222669 15928986 hadcm3n_n3e7_1960_40_008407318_0 25,920 35,686 1.3768


©2024 cpdn.org