climateprediction.net home page
Task 13664027

Task 13664027

Name hadcm3n_o5on_1940_40_007544654_3
Workunit 7741886
Created 26 Nov 2011, 20:44:31 UTC
Sent 26 Nov 2011, 20:44:38 UTC
Report deadline 26 Feb 2012, 4:11:49 UTC
Received 3 Apr 2012, 14:25:06 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1181710
Run time 6 days 22 hours 36 min 51 sec
CPU time 6 days 22 hours 36 min 51 sec
Validate state Invalid
Credit 2,177.28
Device peak FLOPS 1.98 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>5.3.19</core_client_version>
<message>The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:32:27 (5636): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:32:38 (5636): No heartbeat from core client for 30 sec - exiting
08:33:16 (5636): No heartbeat from core client for 30 sec - exiting
08:33:22 (5636): No heartbeat from core client for 30 sec - exiting
08:46:35 (5132): No heartbeat from core client for 30 sec - exiting
08:46:41 (5132):CPDN Monitor - No 'heartbeat' from BOINC...
 No heartbeat from core client for 30 sec - exiting
08:47:27 (5132): No heartbeat from core client for 30 sec - exiting
08:47:28 (5132): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2164, selfPID=2164, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3588, selfPID=3588, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=440, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=440, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=440, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=440, iMonCtr=1
Model crash detected, will try to restart...
14:25:26 (440): No heartbeat from core client for 30 sec - exiting
14:25:27 (440): No heartbeat from core client for 30 sec - exiting
14:25:28 (440): No heartbeat from core client for 30 sec - exiting
14:25:29 (440): No heartbeat from core client for 30 sec - exiting
14:25:30 (440): No heartbeat from core client for 30 sec - exiting
14:25:31 (440): No heartbeat from core client for 30 sec - exiting
14:25:33 (440): No heartbeat from core client for 30 sec - exiting
14:25:34 (440): No heartbeat from core client for 30 sec - exiting
14:25:35 (440): No heartbeat from core client for 30 sec - exiting
14:25:36 (440): No heartbeat from core client for 30 sec - exiting
14:25:37 (440): No heartbeat from core client for 30 sec - exiting
14:25:38 (440): No heartbeat from core client for 30 sec - exiting
14:25:39 (440): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5048, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5048, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
31 Mar 2012 12:13:26 1181710 13664027 hadcm3n_o5on_1940_40_007544654_3 181,440 565,336 3.1158
26 Mar 2012 22:46:52 1181710 13664027 hadcm3n_o5on_1940_40_007544654_3 155,520 486,287 3.1268
17 Mar 2012 19:41:00 1181710 13664027 hadcm3n_o5on_1940_40_007544654_3 129,600 406,483 3.1364
02 Mar 2012 17:57:43 1181710 13664027 hadcm3n_o5on_1940_40_007544654_3 103,680 326,248 3.1467
05 Feb 2012 20:04:38 1181710 13664027 hadcm3n_o5on_1940_40_007544654_3 77,760 244,089 3.1390
08 Jan 2012 19:29:06 1181710 13664027 hadcm3n_o5on_1940_40_007544654_3 51,840 160,715 3.1002
28 Dec 2011 20:09:39 1181710 13664027 hadcm3n_o5on_1940_40_007544654_3 25,920 82,272 3.1741


©2024 cpdn.org