Task 13018109

Name hadcm3n_t5eh_1940_40_007313200_1
Workunit 7510630
Created 28 Jun 2011, 6:29:49 UTC
Sent 28 Jun 2011, 7:48:41 UTC
Report deadline 27 Sep 2011, 15:15:52 UTC
Received 18 Jul 2011, 16:46:59 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1102272
Run time 12 days 20 hours 19 min 20 sec
CPU time 11 days 12 hours 42 min 51 sec
Validate state Invalid
Credit 4,665.60
Device peak FLOPS 2.51 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
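
The run time, CPU time, and exit status above are easier to compare once they are in plain numbers. The following is a minimal sketch, not part of the original page: the helper name `duration_to_seconds` is hypothetical, and the parsing assumes durations are written exactly in the "N days N hours N min N sec" form shown above.

```python
import re

# Seconds per unit, keyed by the singular unit names matched below.
UNIT_SECONDS = {"day": 86400, "hour": 3600, "min": 60, "sec": 1}

def duration_to_seconds(text):
    """Convert a string such as '12 days 20 hours 19 min 20 sec' to seconds.

    Hypothetical helper: assumes the unit spellings used on this page.
    """
    total = 0
    for value, unit in re.findall(r"(\d+)\s+(day|hour|min|sec)s?", text):
        total += int(value) * UNIT_SECONDS[unit]
    return total

run_time = duration_to_seconds("12 days 20 hours 19 min 20 sec")
cpu_time = duration_to_seconds("11 days 12 hours 42 min 51 sec")

print(run_time)                        # 1109960
print(cpu_time)                        # 996171
print(round(cpu_time / run_time, 3))   # fraction of wall-clock time spent on CPU

# Exit status 22 written as the 8-digit hexadecimal value shown on the page.
print(format(22, "#010x"))             # 0x00000016
```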
Stderr
<core_client_version>6.12.26</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
01:16:59 (3472): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:17:04 (3472): No heartbeat from core client for 30 sec - exiting
01:17:05 (3472): No heartbeat from core client for 30 sec - exiting
01:17:06 (3472): No heartbeat from core client for 30 sec - exiting
01:17:07 (3472): No heartbeat from core client for 30 sec - exiting
01:17:08 (3472): No heartbeat from core client for 30 sec - exiting
01:17:09 (3472): No heartbeat from core client for 30 sec - exiting
01:17:10 (3472): No heartbeat from core client for 30 sec - exiting
01:17:11 (3472): No heartbeat from core client for 30 sec - exiting
01:17:12 (3472): No heartbeat from core client for 30 sec - exiting
01:17:13 (3472): No heartbeat from core client for 30 sec - exiting
01:17:15 (3472): No heartbeat from core client for 30 sec - exiting
01:17:16 (3472): No heartbeat from core client for 30 sec - exiting
01:17:17 (3472): No heartbeat from core client for 30 sec - exiting
01:17:18 (3472): No heartbeat from core client for 30 sec - exiting
01:17:19 (3472): No heartbeat from core client for 30 sec - exiting
01:17:20 (3472): No heartbeat from core client for 30 sec - exiting
01:17:21 (3472): No heartbeat from core client for 30 sec - exiting
01:17:22 (3472): No heartbeat from core client for 30 sec - exiting
01:17:23 (3472): No heartbeat from core client for 30 sec - exiting
01:17:24 (3472): No heartbeat from core client for 30 sec - exiting
01:17:25 (3472): No heartbeat from core client for 30 sec - exiting
01:17:27 (3472): No heartbeat from core client for 30 sec - exiting
01:17:28 (3472): No heartbeat from core client for 30 sec - exiting
01:17:29 (3472): No heartbeat from core client for 30 sec - exiting
01:17:30 (3472): No heartbeat from core client for 30 sec - exiting
01:17:31 (3472): No heartbeat from core client for 30 sec - exiting
01:17:32 (3472): No heartbeat from core client for 30 sec - exiting
01:17:33 (3472): No heartbeat from core client for 30 sec - exiting
01:17:34 (3472): No heartbeat from core client for 30 sec - exiting
01:17:35 (3472): No heartbeat from core client for 30 sec - exiting
01:17:37 (3472): No heartbeat from core client for 30 sec - exiting
03:01:47 (2236): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:01:55 (2236): No heartbeat from core client for 30 sec - exiting
03:01:56 (2236): No heartbeat from core client for 30 sec - exiting
03:01:57 (2236): No heartbeat from core client for 30 sec - exiting
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
15:54:27 (4028): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:18:34 (3372): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
19:01:40 (2528): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:02:43 (1068): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
00:04:08 (2932): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:04:20 (2932): No heartbeat from core client for 30 sec - exiting
00:04:21 (2932): No heartbeat from core client for 30 sec - exiting
00:04:22 (2932): No heartbeat from core client for 30 sec - exiting
00:04:23 (2932): No heartbeat from core client for 30 sec - exiting
00:04:24 (2932): No heartbeat from core client for 30 sec - exiting
00:04:25 (2932): No heartbeat from core client for 30 sec - exiting
00:04:26 (2932): No heartbeat from core client for 30 sec - exiting
00:04:27 (2932): No heartbeat from core client for 30 sec - exiting
00:04:28 (2932): No heartbeat from core client for 30 sec - exiting
00:04:29 (2932): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5036, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5036, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5036, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5036, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5036, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1304, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
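
The stderr output above is long and repetitive, so a tally of message types can make it easier to read. The sketch below is an illustration only: the category names and the `tally_events` function are hypothetical, the substrings are copied from the log lines above, and the sample text is a short excerpt rather than the full log.

```python
from collections import Counter

# Hypothetical category names mapped to substrings taken from the log above.
PATTERNS = {
    "no_heartbeat": "No heartbeat from core client",
    "quit_request": "Quit request from BOINC",
    "suspend_request": "Suspend request from BOINC",
    "signal_22": "Signal 22 received",
    "restart_attempt": "Model crash detected",
}

def tally_events(stderr_text):
    """Count how many lines contain each known message substring."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for name, needle in PATTERNS.items():
            if needle in line:
                counts[name] += 1
    return counts

# Short excerpt of the stderr text, used only to demonstrate the tally.
sample = """\
00:04:29 (2932): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Sorry, too many model crashes! :-(
"""

for name, count in sorted(tally_events(sample).items()):
    print(name, count)
```

Running the same function over the full stderr text would give per-category totals for the whole task.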
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
25 Jul 2011 15:06:17 | 1102272 | 13018109 | hadcm3n_t5eh_1940_40_007313200_1 | 388,800 | 996,293 | 2.5625
25 Jul 2011 13:26:11 | 1102272 | 13018109 | hadcm3n_t5eh_1940_40_007313200_1 | 362,880 | 936,495 | 2.5807
25 Jul 2011 13:26:11 | 1102272 | 13018109 | hadcm3n_t5eh_1940_40_007313200_1 | 336,960 | 876,357 | 2.6008
25 Jul 2011 13:26:11 | 1102272 | 13018109 | hadcm3n_t5eh_1940_40_007313200_1 | 311,040 | 815,628 | 2.6223
25 Jul 2011 13:26:11 | 1102272 | 13018109 | hadcm3n_t5eh_1940_40_007313200_1 | 285,120 | 754,590 | 2.6466
25 Jul 2011 13:26:11 | 1102272 | 13018109 | hadcm3n_t5eh_1940_40_007313200_1 | 259,200 | 693,038 | 2.6738
25 Jul 2011 13:26:11 | 1102272 | 13018109 | hadcm3n_t5eh_1940_40_007313200_1 | 233,280 | 631,854 | 2.7086
25 Jul 2011 13:26:11 | 1102272 | 13018109 | hadcm3n_t5eh_1940_40_007313200_1 | 207,360 | 570,850 | 2.7529
10 Jul 2011 13:08:35 | 1102272 | 13018109 | hadcm3n_t5eh_1940_40_007313200_1 | 181,440 | 509,962 | 2.8106
09 Jul 2011 18:14:37 | 1102272 | 13018109 | hadcm3n_t5eh_1940_40_007313200_1 | 155,520 | 459,843 | 2.9568
08 Jul 2011 21:50:10 | 1102272 | 13018109 | hadcm3n_t5eh_1940_40_007313200_1 | 129,600 | 412,271 | 3.1811
08 Jul 2011 00:55:57 | 1102272 | 13018109 | hadcm3n_t5eh_1940_40_007313200_1 | 103,680 | 364,117 | 3.5119
30 Jun 2011 07:24:02 | 1102272 | 13018109 | hadcm3n_t5eh_1940_40_007313200_1 | 77,760 | 153,269 | 1.9711
29 Jun 2011 13:16:13 | 1102272 | 13018109 | hadcm3n_t5eh_1940_40_007313200_1 | 51,840 | 102,009 | 1.9678
28 Jun 2011 22:38:20 | 1102272 | 13018109 | hadcm3n_t5eh_1940_40_007313200_1 | 25,920 | 51,646 | 1.9925
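
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the timestep count. That is an inference from the figures, not something the page states. The sketch below checks it against a few rows and also derives a per-interval rate between consecutive trickles; the function names are hypothetical, and the rows are the four earliest trickles copied from the table.

```python
# (timestep, cumulative CPU seconds) for the four earliest trickles,
# oldest first, copied from the table above.
trickles = [
    (25920, 51646),
    (51840, 102009),
    (77760, 153269),
    (103680, 364117),
]

def cumulative_average(timestep, cpu_seconds):
    """Assumed definition of Average (sec/TS): total CPU time / timesteps."""
    return round(cpu_seconds / timestep, 4)

def interval_rates(rows):
    """CPU seconds per timestep within each interval between trickles."""
    rates = []
    prev_ts, prev_cpu = 0, 0
    for ts, cpu in rows:
        rates.append((cpu - prev_cpu) / (ts - prev_ts))
        prev_ts, prev_cpu = ts, cpu
    return rates

for ts, cpu in trickles:
    print(ts, cumulative_average(ts, cpu))

print([round(rate, 2) for rate in interval_rates(trickles)])
```

The per-interval figures separate the cost of each block of 25,920 timesteps from the running average, which helps show where CPU time per timestep changed between trickles.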


©2024 climateprediction.net