climateprediction.net home page
Task 15492297

Task 15492297

Name hadcm3n_3bo1_1940_40_008263525_0
Workunit 8418649
Created 21 Dec 2012, 5:37:39 UTC
Sent 21 Dec 2012, 5:40:02 UTC
Report deadline 22 Mar 2013, 13:07:13 UTC
Received 12 Jan 2013, 17:47:36 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1056211
Run time 20 days 0 hours 23 min 43 sec
CPU time 16 days 21 hours 27 min 4 sec
Validate state Invalid
Credit 8,709.12
Device peak FLOPS 3.24 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
12:08:35 (6616): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:11:19 (7724): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:23:36 (5624): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:25:57 (5592): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:29:07 (1480): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:29:09 (1480): No heartbeat from core client for 30 sec - exiting
12:31:54 (2920): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:34:51 (6764): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:38:31 (8092): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:49:42 (5888): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:49:43 (5888): No heartbeat from core client for 30 sec - exiting
12:52:22 (5200): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:52:24 (5200): No heartbeat from core client for 30 sec - exiting
12:55:14 (6968): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:58:15 (6912): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:58:16 (6912): No heartbeat from core client for 30 sec - exiting
13:00:55 (6492): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:29:29 (1628): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:28:19 (6200): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:39:27 (4832): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:12:48 (6444): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:12:49 (6444): No heartbeat from core client for 30 sec - exiting
12:12:50 (6444): No heartbeat from core client for 30 sec - exiting
12:12:51 (6444): No heartbeat from core client for 30 sec - exiting
12:12:53 (6444): No heartbeat from core client for 30 sec - exiting
12:12:54 (6444): No heartbeat from core client for 30 sec - exiting
12:12:55 (6444): No heartbeat from core client for 30 sec - exiting
12:15:06 (7016): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:24:16 (5384): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:24:17 (5384): No heartbeat from core client for 30 sec - exiting
12:26:23 (5512): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:26:25 (5512): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7416, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7416, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7416, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7416, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7416, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7416, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
10 Jan 2013 09:03:41 1056211 15492297 hadcm3n_3bo1_1940_40_008263525_0 725,760 1,441,957 1.9868
09 Jan 2013 09:15:17 1056211 15492297 hadcm3n_3bo1_1940_40_008263525_0 699,840 1,391,434 1.9882
08 Jan 2013 14:25:32 1056211 15492297 hadcm3n_3bo1_1940_40_008263525_0 673,920 1,340,253 1.9887
07 Jan 2013 19:52:01 1056211 15492297 hadcm3n_3bo1_1940_40_008263525_0 648,000 1,289,369 1.9898
07 Jan 2013 02:24:39 1056211 15492297 hadcm3n_3bo1_1940_40_008263525_0 622,080 1,238,184 1.9904
06 Jan 2013 08:10:58 1056211 15492297 hadcm3n_3bo1_1940_40_008263525_0 596,160 1,184,773 1.9873
05 Jan 2013 15:30:50 1056211 15492297 hadcm3n_3bo1_1940_40_008263525_0 570,240 1,130,519 1.9825
05 Jan 2013 00:03:28 1056211 15492297 hadcm3n_3bo1_1940_40_008263525_0 544,320 1,079,645 1.9835
04 Jan 2013 07:55:04 1056211 15492297 hadcm3n_3bo1_1940_40_008263525_0 518,400 1,029,123 1.9852
03 Jan 2013 16:56:12 1056211 15492297 hadcm3n_3bo1_1940_40_008263525_0 492,480 979,166 1.9882
03 Jan 2013 00:24:05 1056211 15492297 hadcm3n_3bo1_1940_40_008263525_0 466,560 926,790 1.9864
02 Jan 2013 06:44:39 1056211 15492297 hadcm3n_3bo1_1940_40_008263525_0 440,640 875,688 1.9873
01 Jan 2013 14:05:39 1056211 15492297 hadcm3n_3bo1_1940_40_008263525_0 414,720 826,430 1.9927
31 Dec 2012 20:57:50 1056211 15492297 hadcm3n_3bo1_1940_40_008263525_0 388,800 777,039 1.9986
31 Dec 2012 03:57:29 1056211 15492297 hadcm3n_3bo1_1940_40_008263525_0 362,880 724,937 1.9977
30 Dec 2012 10:24:52 1056211 15492297 hadcm3n_3bo1_1940_40_008263525_0 336,960 672,461 1.9957
29 Dec 2012 18:53:24 1056211 15492297 hadcm3n_3bo1_1940_40_008263525_0 311,040 619,680 1.9923
29 Dec 2012 01:58:39 1056211 15492297 hadcm3n_3bo1_1940_40_008263525_0 285,120 566,310 1.9862
28 Dec 2012 07:54:12 1056211 15492297 hadcm3n_3bo1_1940_40_008263525_0 259,200 512,758 1.9782
27 Dec 2012 14:30:43 1056211 15492297 hadcm3n_3bo1_1940_40_008263525_0 233,280 459,529 1.9699


©2024 cpdn.org