climateprediction.net home page
Task 13661099

Task 13661099

Name hadcm3n_yl0i_1900_40_007514106_2
Workunit 7711581
Created 25 Nov 2011, 11:14:49 UTC
Sent 25 Nov 2011, 11:19:37 UTC
Report deadline 24 Feb 2012, 18:46:48 UTC
Received 19 Dec 2011, 12:53:04 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1159034
Run time 7 days 4 hours 17 min 46 sec
CPU time 6 days 18 hours 12 min 26 sec
Validate state Invalid
Credit 2,799.36
Device peak FLOPS 2.54 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
Het apparaat herkent de opdracht niet. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6680, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
15:04:08 (560): Can't acquire lockfile (32) - waiting 35s
15:04:30 (3428): No heartbeat from core client for 30 sec - exiting
15:04:31 (3428): No heartbeat from core client for 30 sec - exiting
15:04:32 (3428): No heartbeat from core client for 30 sec - exiting
15:04:33 (3428): No heartbeat from core client for 30 sec - exiting
15:04:34 (3428): No heartbeat from core client for 30 sec - exiting
15:04:35 (3428): No heartbeat from core client for 30 sec - exiting
15:04:36 (3428): No heartbeat from core client for 30 sec - exiting
15:04:37 (3428): No heartbeat from core client for 30 sec - exiting
15:04:38 (3428): No heartbeat from core client for 30 sec - exiting
15:04:39 (3428): No heartbeat from core client for 30 sec - exiting
15:04:41 (3428): No heartbeat from core client for 30 sec - exiting
15:04:42 (3428): No heartbeat from core client for 30 sec - exiting
15:04:43 (560): Can't acquire lockfile (32) - exiting
15:04:43 (560): Error: Het proces heeft geen toegang tot het bestand omdat het door een ander

proces wordt gebruikt. (0x20)
15:04:43 (3428): No heartbeat from core client for 30 sec - exiting
15:04:44 (3428): No heartbeat from core client for 30 sec - exiting
15:04:45 (3428): No heartbeat from core client for 30 sec - exiting
15:04:46 (3428): No heartbeat from core client for 30 sec - exiting
15:04:47 (3428): No heartbeat from core client for 30 sec - exiting
15:04:48 (3428): No heartbeat from core client for 30 sec - exiting
15:04:49 (3428): No heartbeat from core client for 30 sec - exiting
15:04:50 (3428): No heartbeat from core client for 30 sec - exiting
15:04:51 (3428): No heartbeat from core client for 30 sec - exiting
15:04:52 (3428): No heartbeat from core client for 30 sec - exiting
15:04:53 (6124): Can't set up shared mem: -1. Will run in standalone mode.
15:04:53 (3428): No heartbeat from core client for 30 sec - exiting
15:04:54 (3428): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7276, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8928, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
13:54:47 (1476): Can't acquire lockfile (32) - waiting 35s
13:55:16 (2224): No heartbeat from core client for 30 sec - exiting
13:55:17 (2224): No heartbeat from core client for 30 sec - exiting
13:55:18 (2224): No heartbeat from core client for 30 sec - exiting
13:55:19 (2224): No heartbeat from core client for 30 sec - exiting
13:55:20 (2224): No heartbeat from core client for 30 sec - exiting
13:55:21 (2224): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6956, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6956, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6956, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6956, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6956, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4944, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4944, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
16 Dec 2011 20:29:53 1159034 13661099 hadcm3n_yl0i_1900_40_007514106_2 233,280 580,796 2.4897
15 Dec 2011 14:42:12 1159034 13661099 hadcm3n_yl0i_1900_40_007514106_2 207,360 515,185 2.4845
12 Dec 2011 15:00:21 1159034 13661099 hadcm3n_yl0i_1900_40_007514106_2 181,440 451,785 2.4900
10 Dec 2011 18:13:35 1159034 13661099 hadcm3n_yl0i_1900_40_007514106_2 155,520 387,460 2.4914
09 Dec 2011 10:48:08 1159034 13661099 hadcm3n_yl0i_1900_40_007514106_2 129,600 322,589 2.4891
07 Dec 2011 12:53:08 1159034 13661099 hadcm3n_yl0i_1900_40_007514106_2 103,680 257,160 2.4803
05 Dec 2011 18:58:00 1159034 13661099 hadcm3n_yl0i_1900_40_007514106_2 77,760 192,729 2.4785
04 Dec 2011 13:10:58 1159034 13661099 hadcm3n_yl0i_1900_40_007514106_2 51,840 128,682 2.4823
26 Nov 2011 21:06:50 1159034 13661099 hadcm3n_yl0i_1900_40_007514106_2 25,920 64,380 2.4838


©2024 cpdn.org