climateprediction.net home page
Task 16083653

Task 16083653

Name hadcm3n_3e2t_2020_40_008389746_3
Workunit 8540605
Created 25 Nov 2013, 22:20:25 UTC
Sent 25 Nov 2013, 23:37:23 UTC
Report deadline 25 Feb 2014, 7:04:34 UTC
Received 9 Dec 2013, 7:21:23 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1158176
Run time 9 days 2 hours 24 min 12 sec
CPU time 8 days 8 hours 40 min 7 sec
Validate state Invalid
Credit 4,354.56
Device peak FLOPS 2.91 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
06:46:16 (4420): No heartbeat from core client for 30 sec - exiting
06:46:17 (4420): No heartbeat from core client for 30 sec - exiting
06:46:18 (4420): No heartbeat from core client for 30 sec - exiting
06:46:19 (4420): No heartbeat from core client for 30 sec - exiting
06:46:20 (4420): No heartbeat from core client for 30 sec - exiting
06:46:21 (4420): No heartbeat from core client for 30 sec - exiting
06:46:22 (4420): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:46:23 (4420): No heartbeat from core client for 30 sec - exiting
06:47:28 (5280): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:48:05 (5964): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:48:51 (2968): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:49:57 (6608): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:18:45 (900): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:23:31 (5288): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:05:16 (5944): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:27:46 (21756): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:31:44 (22276): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:18:24 (22300): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:04:02 (25776): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:05:50 (25640): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:07:04 (28696): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:07:44 (13356): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:25:54 (29544): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:14:04 (50496): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:22:41 (77300): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:35:41 (54784): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:38:13 (76768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:15:30 (80196): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:17:39 (101844): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:17:40 (101844): No heartbeat from core client for 30 sec - exiting
23:17:41 (101844): No heartbeat from core client for 30 sec - exiting
23:17:42 (101844): No heartbeat from core client for 30 sec - exiting
23:17:43 (101844): No heartbeat from core client for 30 sec - exiting
23:17:44 (101844): No heartbeat from core client for 30 sec - exiting
23:17:45 (101844): No heartbeat from core client for 30 sec - exiting
23:17:46 (101844): No heartbeat from core client for 30 sec - exiting
23:17:47 (101844): No heartbeat from core client for 30 sec - exiting
23:17:48 (101844): No heartbeat from core client for 30 sec - exiting
23:17:49 (101844): No heartbeat from core client for 30 sec - exiting
23:21:19 (103884): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:25:21 (104172): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:25:22 (104172): No heartbeat from core client for 30 sec - exiting
23:25:23 (104172): No heartbeat from core client for 30 sec - exiting
23:25:24 (104172): No heartbeat from core client for 30 sec - exiting
23:25:25 (104172): No heartbeat from core client for 30 sec - exiting
23:25:26 (104172): No heartbeat from core client for 30 sec - exiting
23:25:27 (104172): No heartbeat from core client for 30 sec - exiting
23:25:28 (104172): No heartbeat from core client for 30 sec - exiting
23:25:29 (104172): No heartbeat from core client for 30 sec - exiting
23:25:30 (104172): No heartbeat from core client for 30 sec - exiting
23:25:31 (104172): No heartbeat from core client for 30 sec - exiting
00:05:21 (100836): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:07:45 (104740): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:09:31 (102780): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:09:32 (102780): No heartbeat from core client for 30 sec - exiting
00:09:33 (102780): No heartbeat from core client for 30 sec - exiting
00:09:34 (102780): No heartbeat from core client for 30 sec - exiting
00:09:35 (102780): No heartbeat from core client for 30 sec - exiting
00:09:36 (102780): No heartbeat from core client for 30 sec - exiting
00:09:37 (102780): No heartbeat from core client for 30 sec - exiting
00:09:38 (102780): No heartbeat from core client for 30 sec - exiting
00:09:39 (102780): No heartbeat from core client for 30 sec - exiting
00:09:40 (102780): No heartbeat from core client for 30 sec - exiting
00:09:41 (102780): No heartbeat from core client for 30 sec - exiting
00:29:16 (5528): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:31:39 (102812): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:55:55 (6096): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3008, iMonCtr=1
Model crash detected, will try to restart...
07:11:29 (2892): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=900, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=900, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=900, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=900, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=900, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
17:44:38 (4920): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: C I/O Error feof - Unit 116 - Return code = 16

Model crashed: REPLANCA :I/O ERROR                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 116 - Return code = 16

Model crashed: REPLANCA :I/O ERROR                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 116 - Return code = 16

Model crashed: REPLANCA :I/O ERROR                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 116 - Return code = 16

Model crashed: REPLANCA :I/O ERROR                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 116 - Return code = 16

Model crashed: REPLANCA :I/O ERROR                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 116 - Return code = 16

Model crashed: REPLANCA :I/O ERROR                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
04 Dec 2013 21:59:05 1158176 16083653 hadcm3n_3e2t_2020_40_008389746_3 362,880 688,913 1.8985
04 Dec 2013 07:19:38 1158176 16083653 hadcm3n_3e2t_2020_40_008389746_3 336,960 640,789 1.9017
03 Dec 2013 17:24:43 1158176 16083653 hadcm3n_3e2t_2020_40_008389746_3 311,040 592,472 1.9048
03 Dec 2013 01:57:17 1158176 16083653 hadcm3n_3e2t_2020_40_008389746_3 285,120 543,663 1.9068
02 Dec 2013 11:44:42 1158176 16083653 hadcm3n_3e2t_2020_40_008389746_3 259,200 495,547 1.9118
01 Dec 2013 19:50:44 1158176 16083653 hadcm3n_3e2t_2020_40_008389746_3 233,280 443,347 1.9005
01 Dec 2013 04:47:44 1158176 16083653 hadcm3n_3e2t_2020_40_008389746_3 207,360 392,823 1.8944
30 Nov 2013 13:22:46 1158176 16083653 hadcm3n_3e2t_2020_40_008389746_3 181,440 340,857 1.8786
29 Nov 2013 22:22:13 1158176 16083653 hadcm3n_3e2t_2020_40_008389746_3 155,520 290,964 1.8709
29 Nov 2013 07:08:47 1158176 16083653 hadcm3n_3e2t_2020_40_008389746_3 129,600 240,637 1.8568
28 Nov 2013 16:18:43 1158176 16083653 hadcm3n_3e2t_2020_40_008389746_3 103,680 191,547 1.8475
28 Nov 2013 00:20:10 1158176 16083653 hadcm3n_3e2t_2020_40_008389746_3 77,760 145,499 1.8711
27 Nov 2013 09:26:02 1158176 16083653 hadcm3n_3e2t_2020_40_008389746_3 51,840 98,239 1.8950
26 Nov 2013 18:41:43 1158176 16083653 hadcm3n_3e2t_2020_40_008389746_3 25,920 50,302 1.9407


©2024 cpdn.org