climateprediction.net home page
Task 15510237

Task 15510237

Name hadcm3n_o6el_2140_40_008270186_1
Workunit 8425310
Created 24 Dec 2012, 22:51:48 UTC
Sent 25 Dec 2012, 0:19:46 UTC
Report deadline 26 Mar 2013, 7:46:57 UTC
Received 14 Jan 2013, 6:13:33 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1209461
Run time 15 days 20 hours 28 min 1 sec
CPU time 12 days 18 hours 28 min 44 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 2.68 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
10:55:16 (6608): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:48:27 (9736): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:51:30 (9708): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
16:00:42 (10004): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
10:59:57 (6584): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
11:06:00 (4996): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	01:32:05 AM	No files match the supplied pattern.
MainError:	01:32:05 AM	No files match the supplied pattern.
MainError:	02:06:07 PM	No files match the supplied pattern.
MainError:	02:06:07 PM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	03:59:43 PM	No files match the supplied pattern.
MainError:	03:59:43 PM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	02:09:11 AM	No files match the supplied pattern.
MainError:	02:09:11 AM	No files match the supplied pattern.
MainError:	12:01:42 AM	No files match the supplied pattern.
MainError:	12:01:42 AM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	03:46:15 AM	No files match the supplied pattern.
MainError:	03:46:15 AM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
11:39:05 (5476): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:33:42 (8816): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	07:43:03 PM	No files match the supplied pattern.
MainError:	07:43:03 PM	No files match the supplied pattern.
19:06:50 (6172): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	11:12:31 AM	No files match the supplied pattern.
MainError:	11:12:31 AM	No files match the supplied pattern.
MainError:	08:18:59 PM	No files match the supplied pattern.
MainError:	08:18:59 PM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	10:57:45 AM	No files match the supplied pattern.
MainError:	10:57:45 AM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
Error converting file to netcdf: dataout/o6elka.ph11c10
Error converting file to netcdf: dataout/o6elka.pg11c10
Error converting file to netcdf: dataout/o6elka.pe11c10
MainError:	04:26:26 AM	No files match the supplied pattern.
MainError:	04:26:26 AM	No files match the supplied pattern.
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
14 Jan 2013 05:12:52 1209461 15510237 hadcm3n_o6el_2140_40_008270186_1 777,600 1,227,680 1.5788
13 Jan 2013 11:04:47 1209461 15510237 hadcm3n_o6el_2140_40_008270186_1 751,680 1,186,845 1.5789
12 Jan 2013 20:25:55 1209461 15510237 hadcm3n_o6el_2140_40_008270186_1 725,760 1,147,864 1.5816
12 Jan 2013 11:19:49 1209461 15510237 hadcm3n_o6el_2140_40_008270186_1 699,840 1,115,368 1.5937
11 Jan 2013 20:35:54 1209461 15510237 hadcm3n_o6el_2140_40_008270186_1 673,920 1,073,909 1.5935
11 Jan 2013 03:53:28 1209461 15510237 hadcm3n_o6el_2140_40_008270186_1 648,000 1,033,634 1.5951
09 Jan 2013 12:20:03 1209461 15510237 hadcm3n_o6el_2140_40_008270186_1 622,080 991,929 1.5945
09 Jan 2013 02:13:52 1209461 15510237 hadcm3n_o6el_2140_40_008270186_1 596,160 956,633 1.6047
07 Jan 2013 16:06:49 1209461 15510237 hadcm3n_o6el_2140_40_008270186_1 570,240 912,784 1.6007
05 Jan 2013 14:10:36 1209461 15510237 hadcm3n_o6el_2140_40_008270186_1 544,320 870,825 1.5998
05 Jan 2013 02:29:23 1209461 15510237 hadcm3n_o6el_2140_40_008270186_1 518,400 826,578 1.5945
04 Jan 2013 08:35:19 1209461 15510237 hadcm3n_o6el_2140_40_008270186_1 492,480 781,862 1.5876
03 Jan 2013 18:26:50 1209461 15510237 hadcm3n_o6el_2140_40_008270186_1 466,560 739,366 1.5847
03 Jan 2013 04:25:02 1209461 15510237 hadcm3n_o6el_2140_40_008270186_1 440,640 694,070 1.5751
02 Jan 2013 14:21:17 1209461 15510237 hadcm3n_o6el_2140_40_008270186_1 414,720 646,760 1.5595
02 Jan 2013 04:09:08 1209461 15510237 hadcm3n_o6el_2140_40_008270186_1 388,800 611,649 1.5732
01 Jan 2013 18:28:17 1209461 15510237 hadcm3n_o6el_2140_40_008270186_1 362,880 576,949 1.5899
01 Jan 2013 08:41:20 1209461 15510237 hadcm3n_o6el_2140_40_008270186_1 336,960 541,890 1.6082
31 Dec 2012 20:37:27 1209461 15510237 hadcm3n_o6el_2140_40_008270186_1 311,040 503,685 1.6194
31 Dec 2012 07:18:10 1209461 15510237 hadcm3n_o6el_2140_40_008270186_1 285,120 460,292 1.6144


©2024 cpdn.org