climateprediction.net home page
Task 13749993

Task 13749993

Name hadcm3n_t4w9_1940_40_007614343_3
Workunit 7792473
Created 8 Dec 2011, 3:09:53 UTC
Sent 8 Dec 2011, 8:29:24 UTC
Report deadline 8 Mar 2012, 15:56:35 UTC
Received 9 Jan 2012, 9:24:41 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1161345
Run time 8 days 4 hours 38 min 13 sec
CPU time 7 days 21 hours 29 min 19 sec
Validate state Invalid
Credit 5,598.72
Device peak FLOPS 2.64 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
04:51:28 (3252): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4640, iMonCtr=1
Model crash detected, will try to restart...
10:30:32 (3536): No heartbeat from core client for 30 sec - exiting
10:30:33 (3536): No heartbeat from core client for 30 sec - exiting
10:30:34 (3536): No heartbeat from core client for 30 sec - exiting
10:30:35 (3536): No heartbeat from core client for 30 sec - exiting
10:30:36 (3536): No heartbeat from core client for 30 sec - exiting
10:30:37 (3536): No heartbeat from core client for 30 sec - exiting
10:30:38 (3536): No heartbeat from core client for 30 sec - exiting
10:30:40 (3536): No heartbeat from core client for 30 sec - exiting
10:30:41 (3536): No heartbeat from core client for 30 sec - exiting
10:30:42 (3536): No heartbeat from core client for 30 sec - exiting
10:30:43 (3536): No heartbeat from core client for 30 sec - exiting
10:30:44 (3536): No heartbeat from core client for 30 sec - exiting
10:30:45 (3536): No heartbeat from core client for 30 sec - exiting
10:30:46 (3536): No heartbeat from core client for 30 sec - exiting
10:30:47 (3536): No heartbeat from core client for 30 sec - exiting
10:30:48 (3536): No heartbeat from core client for 30 sec - exiting
10:30:49 (3536): No heartbeat from core client for 30 sec - exiting
10:30:50 (3536): No heartbeat from core client for 30 sec - exiting
10:30:52 (3536): No heartbeat from core client for 30 sec - exiting
10:30:53 (3536): No heartbeat from core client for 30 sec - exiting
10:30:54 (3536): No heartbeat from core client for 30 sec - exiting
10:30:55 (3536): No heartbeat from core client for 30 sec - exiting
10:30:56 (3536): No heartbeat from core client for 30 sec - exiting
10:30:57 (3536): No heartbeat from core client for 30 sec - exiting
10:30:58 (3536): No heartbeat from core client for 30 sec - exiting
10:30:59 (3536): No heartbeat from core client for 30 sec - exiting
10:31:00 (3536): No heartbeat from core client for 30 sec - exiting
10:31:01 (3536): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:31:02 (3536): No heartbeat from core client for 30 sec - exiting
17:18:07 (4444): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:22:46 (1336): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3948, iMonCtr=1
Model crash detected, will try to restart...
18:08:31 (4588): No heartbeat from core client for 30 sec - exiting
18:08:32 (4588): No heartbeat from core client for 30 sec - exiting
18:08:33 (4588): No heartbeat from core client for 30 sec - exiting
18:08:34 (4588): No heartbeat from core client for 30 sec - exiting
18:08:35 (4588): No heartbeat from core client for 30 sec - exiting
18:08:36 (4588): No heartbeat from core client for 30 sec - exiting
18:08:37 (4588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:47:19 (4920): No heartbeat from core client for 30 sec - exiting
08:47:20 (4920): No heartbeat from core client for 30 sec - exiting
08:47:21 (4920): No heartbeat from core client for 30 sec - exiting
08:47:23 (4920): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3640, iMonCtr=1
Model crash detected, will try to restart...
21:42:59 (3568): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
08:19:25 (4560): No heartbeat from core client for 30 sec - exiting
08:19:27 (4560): No heartbeat from core client for 30 sec - exiting
08:19:28 (4560): No heartbeat from core client for 30 sec - exiting
08:19:29 (4560): No heartbeat from core client for 30 sec - exiting
08:19:30 (4560): No heartbeat from core client for 30 sec - exiting
08:19:31 (4560): No heartbeat from core client for 30 sec - exiting
08:19:32 (4560): No heartbeat from core client for 30 sec - exiting
08:19:34 (4560): No heartbeat from core client for 30 sec - exiting
08:19:35 (4560): No heartbeat from core client for 30 sec - exiting
08:19:36 (4560): No heartbeat from core client for 30 sec - exiting
08:19:37 (4560): No heartbeat from core client for 30 sec - exiting
08:19:38 (4560): No heartbeat from core client for 30 sec - exiting
08:19:39 (4560): No heartbeat from core client for 30 sec - exiting
08:19:40 (4560): No heartbeat from core client for 30 sec - exiting
08:19:41 (4560): No heartbeat from core client for 30 sec - exiting
08:19:42 (4560): No heartbeat from core client for 30 sec - exiting
08:19:43 (4560): No heartbeat from core client for 30 sec - exiting
08:19:44 (4560): No heartbeat from core client for 30 sec - exiting
08:19:46 (4560): No heartbeat from core client for 30 sec - exiting
08:19:47 (4560): No heartbeat from core client for 30 sec - exiting
08:19:48 (4560): No heartbeat from core client for 30 sec - exiting
08:19:49 (4560): No heartbeat from core client for 30 sec - exiting
08:19:50 (4560): No heartbeat from core client for 30 sec - exiting
08:19:51 (4560): No heartbeat from core client for 30 sec - exiting
08:19:52 (4560): No heartbeat from core client for 30 sec - exiting
08:19:53 (4560): No heartbeat from core client for 30 sec - exiting
08:19:54 (4560): No heartbeat from core client for 30 sec - exiting
08:19:55 (4560): No heartbeat from core client for 30 sec - exiting
08:19:56 (4560): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
11:05:10 (3588): No heartbeat from core client for 30 sec - exiting
11:05:12 (3588): No heartbeat from core client for 30 sec - exiting
11:05:13 (3588): No heartbeat from core client for 30 sec - exiting
11:05:14 (3588): No heartbeat from core client for 30 sec - exiting
11:05:15 (3588): No heartbeat from core client for 30 sec - exiting
11:05:16 (3588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
23:43:19 (1360): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
10:26:08 (4792): No heartbeat from core client for 30 sec - exiting
10:26:09 (4792): No heartbeat from core client for 30 sec - exiting
10:26:10 (4792): No heartbeat from core client for 30 sec - exiting
10:26:11 (4792): No heartbeat from core client for 30 sec - exiting
10:26:12 (4792): No heartbeat from core client for 30 sec - exiting
10:26:13 (4792): No heartbeat from core client for 30 sec - exiting
10:26:14 (4792): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
10:43:25 (4080): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
10:39:03 (4860): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:39:34 (4504): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:04:09 (4196): No heartbeat from core client for 30 sec - exiting
21:04:10 (4196): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4840, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4840, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4840, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4840, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4840, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4840, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
08 Jan 2012 06:54:31 1161345 13749993 hadcm3n_t4w9_1940_40_007614343_3 466,560 658,360 1.4111
07 Jan 2012 08:07:05 1161345 13749993 hadcm3n_t4w9_1940_40_007614343_3 440,640 622,146 1.4119
06 Jan 2012 08:37:58 1161345 13749993 hadcm3n_t4w9_1940_40_007614343_3 414,720 584,932 1.4104
05 Jan 2012 11:40:13 1161345 13749993 hadcm3n_t4w9_1940_40_007614343_3 388,800 548,087 1.4097
04 Jan 2012 22:41:42 1161345 13749993 hadcm3n_t4w9_1940_40_007614343_3 362,880 510,196 1.4060
04 Jan 2012 01:59:25 1161345 13749993 hadcm3n_t4w9_1940_40_007614343_3 336,960 473,382 1.4049
03 Jan 2012 03:04:55 1161345 13749993 hadcm3n_t4w9_1940_40_007614343_3 311,040 436,888 1.4046
01 Jan 2012 22:32:21 1161345 13749993 hadcm3n_t4w9_1940_40_007614343_3 285,120 400,698 1.4054
31 Dec 2011 09:11:33 1161345 13749993 hadcm3n_t4w9_1940_40_007614343_3 259,200 364,346 1.4057
30 Dec 2011 09:36:19 1161345 13749993 hadcm3n_t4w9_1940_40_007614343_3 233,280 327,233 1.4027
27 Dec 2011 04:36:37 1161345 13749993 hadcm3n_t4w9_1940_40_007614343_3 207,360 290,512 1.4010
25 Dec 2011 05:58:16 1161345 13749993 hadcm3n_t4w9_1940_40_007614343_3 181,440 254,444 1.4024
24 Dec 2011 01:33:36 1161345 13749993 hadcm3n_t4w9_1940_40_007614343_3 155,520 218,359 1.4041
23 Dec 2011 06:35:44 1161345 13749993 hadcm3n_t4w9_1940_40_007614343_3 129,600 182,229 1.4061
21 Dec 2011 05:42:11 1161345 13749993 hadcm3n_t4w9_1940_40_007614343_3 103,680 145,863 1.4069
20 Dec 2011 09:58:04 1161345 13749993 hadcm3n_t4w9_1940_40_007614343_3 77,760 109,577 1.4092
18 Dec 2011 10:07:46 1161345 13749993 hadcm3n_t4w9_1940_40_007614343_3 51,840 72,626 1.4010
17 Dec 2011 23:10:33 1161345 13749993 hadcm3n_t4w9_1940_40_007614343_3 25,920 36,475 1.4072


©2024 cpdn.org