climateprediction.net home page
Task 15503441

Task 15503441

Name hadcm3n_o3co_2140_40_008270409_0
Workunit 8425533
Created 24 Dec 2012, 2:21:05 UTC
Sent 24 Dec 2012, 2:53:16 UTC
Report deadline 25 Mar 2013, 10:20:27 UTC
Received 12 Jan 2013, 15:45:16 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1144715
Run time 19 days 3 hours 21 min 2 sec
CPU time 15 days 16 hours 23 min 11 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 2.61 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-pc-linux-gnu
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
12:15:29 (18073): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:19:10 (20095): No heartbeat from core client for 30 sec - exiting
12:19:13 (20095): No heartbeat from core client for 30 sec - exiting
12:19:14 (20095): No heartbeat from core client for 30 sec - exiting
12:19:15 (20095): No heartbeat from core client for 30 sec - exiting
12:19:16 (20095): No heartbeat from core client for 30 sec - exiting
12:19:20 (20095): No heartbeat from core client for 30 sec - exiting
12:19:21 (20095): No heartbeat from core client for 30 sec - exiting
12:19:22 (20095): No heartbeat from core client for 30 sec - exiting
12:19:23 (20095): No heartbeat from core client for 30 sec - exiting
12:19:24 (20095): No heartbeat from core client for 30 sec - exiting
12:19:26 (20095): No heartbeat from core client for 30 sec - exiting
12:19:27 (20095): No heartbeat from core client for 30 sec - exiting
12:19:28 (20095): No heartbeat from core client for 30 sec - exiting
12:19:29 (20095): No heartbeat from core client for 30 sec - exiting
12:19:30 (20095): No heartbeat from core client for 30 sec - exiting
12:19:33 (20095): No heartbeat from core client for 30 sec - exiting
12:19:34 (20095): No heartbeat from core client for 30 sec - exiting
12:19:35 (20095): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:39:36 (20104): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:39:39 (20104): No heartbeat from core client for 30 sec - exiting
12:40:04 (20104): No heartbeat from core client for 30 sec - exiting
12:40:05 (20104): No heartbeat from core client for 30 sec - exiting
12:40:06 (20104): No heartbeat from core client for 30 sec - exiting
12:40:07 (20104): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
06:25:53 (20117): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:26:01 (20117): No heartbeat from core client for 30 sec - exiting
06:26:02 (20117): No heartbeat from core client for 30 sec - exiting
06:26:03 (20117): No heartbeat from core client for 30 sec - exiting
06:26:04 (20117): No heartbeat from core client for 30 sec - exiting
06:26:05 (20117): No heartbeat from core client for 30 sec - exiting
06:26:06 (20117): No heartbeat from core client for 30 sec - exiting
06:26:13 (20117): No heartbeat from core client for 30 sec - exiting
06:26:14 (20117): No heartbeat from core client for 30 sec - exiting
06:26:15 (20117): No heartbeat from core client for 30 sec - exiting
06:26:16 (20117): No heartbeat from core client for 30 sec - exiting
06:26:17 (20117): No heartbeat from core client for 30 sec - exiting
06:27:15 (21116): No heartbeat from core client for 30 sec - exiting
06:27:16 (21116): No heartbeat from core client for 30 sec - exiting
06:27:23 (21116): No heartbeat from core client for 30 sec - exiting
06:27:24 (21116): No heartbeat from core client for 30 sec - exiting
06:27:25 (21116): No heartbeat from core client for 30 sec - exiting
06:27:35 (21116): No heartbeat from core client for 30 sec - exiting
06:27:36 (21116): No heartbeat from core client for 30 sec - exiting
06:27:41 (21116): No heartbeat from core client for 30 sec - exiting
06:27:43 (21116): No heartbeat from core client for 30 sec - exiting
06:27:44 (21116): No heartbeat from core client for 30 sec - exiting
06:27:45 (21116): No heartbeat from core client for 30 sec - exiting
06:27:46 (21116): No heartbeat from core client for 30 sec - exiting
06:27:47 (21116): No heartbeat from core client for 30 sec - exiting
06:27:52 (21116): No heartbeat from core client for 30 sec - exiting
06:27:56 (21116): No heartbeat from core client for 30 sec - exiting
06:27:57 (21116): No heartbeat from core client for 30 sec - exiting
06:28:04 (21116): No heartbeat from core client for 30 sec - exiting
06:28:05 (21116): No heartbeat from core client for 30 sec - exiting
06:28:06 (21116): No heartbeat from core client for 30 sec - exiting
06:28:07 (21116): No heartbeat from core client for 30 sec - exiting
06:28:08 (21116): No heartbeat from core client for 30 sec - exiting
06:28:09 (21116): No heartbeat from core client for 30 sec - exiting
06:28:10 (21116): No heartbeat from core client for 30 sec - exiting
06:28:11 (21116): No heartbeat from core client for 30 sec - exiting
06:28:17 (21116): No heartbeat from core client for 30 sec - exiting
06:28:21 (21116): No heartbeat from core client for 30 sec - exiting
06:28:22 (21116): No heartbeat from core client for 30 sec - exiting
06:28:23 (21116): No heartbeat from core client for 30 sec - exiting
06:28:27 (21116): No heartbeat from core client for 30 sec - exiting
06:28:28 (21116): No heartbeat from core client for 30 sec - exiting
06:28:32 (21116): No heartbeat from core client for 30 sec - exiting
06:28:33 (21116): No heartbeat from core client for 30 sec - exiting
06:28:37 (21116): No heartbeat from core client for 30 sec - exiting
06:28:38 (21116): No heartbeat from core client for 30 sec - exiting
06:28:49 (21116): No heartbeat from core client for 30 sec - exiting
06:28:50 (21116): No heartbeat from core client for 30 sec - exiting
06:28:55 (21116): No heartbeat from core client for 30 sec - exiting
06:28:56 (21116): No heartbeat from core client for 30 sec - exiting
06:28:57 (21116): No heartbeat from core client for 30 sec - exiting
06:29:00 (21116): No heartbeat from core client for 30 sec - exiting
06:29:01 (21116): No heartbeat from core client for 30 sec - exiting
06:29:02 (21116): No heartbeat from core client for 30 sec - exiting
06:29:03 (21116): No heartbeat from core client for 30 sec - exiting
06:29:10 (21116): No heartbeat from core client for 30 sec - exiting
06:29:11 (21116): No heartbeat from core client for 30 sec - exiting
06:29:12 (21116): No heartbeat from core client for 30 sec - exiting
06:29:13 (21116): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:29:16 (21116): No heartbeat from core client for 30 sec - exiting
06:29:17 (21116): No heartbeat from core client for 30 sec - exiting
06:29:18 (21116): No heartbeat from core client for 30 sec - exiting
06:29:19 (21116): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	01:57:39 AM	No files match the supplied pattern.
MainError:	01:57:39 AM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	04:43:02 PM	No files match the supplied pattern.
MainError:	04:43:02 PM	No files match the supplied pattern.
02:04:56 (21176): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	07:00:22 AM	No files match the supplied pattern.
MainError:	07:00:22 AM	No files match the supplied pattern.
MainError:	08:54:05 PM	No files match the supplied pattern.
MainError:	08:54:05 PM	No files match the supplied pattern.
MainError:	11:35:46 AM	No files match the supplied pattern.
MainError:	11:35:46 AM	No files match the supplied pattern.
02:06:15 (24802): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	03:48:02 AM	No files match the supplied pattern.
MainError:	03:48:02 AM	No files match the supplied pattern.
MainError:	06:03:03 PM	No files match the supplied pattern.
MainError:	06:03:03 PM	No files match the supplied pattern.
MainError:	08:54:51 AM	No files match the supplied pattern.
MainError:	08:54:51 AM	No files match the supplied pattern.
MainError:	12:59:12 AM	No files match the supplied pattern.
MainError:	12:59:12 AM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	04:58:29 PM	No files match the supplied pattern.
MainError:	04:58:29 PM	No files match the supplied pattern.
Error converting file to netcdf: dataout/o3coka.ph11c10
Error converting file to netcdf: dataout/o3coka.pg11c10
Error converting file to netcdf: dataout/o3coka.pe11c10
MainError:	08:52:59 AM	No files match the supplied pattern.
MainError:	08:52:59 AM	No files match the supplied pattern.

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 64 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 66 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

BUFFIN: Read Failed: Numerical result out of range
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    

BUFFIN: Read Failed: Inappropriate ioctl for device
BUFFIN: C I/O Error feof - Unit 64 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 66 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

BUFFIN: Read Failed: Numerical result out of range
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    

BUFFIN: Read Failed: Inappropriate ioctl for device
BUFFIN: C I/O Error feof - Unit 64 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 66 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

BUFFIN: Read Failed: Numerical result out of range
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    

BUFFIN: Read Failed: Inappropriate ioctl for device
BUFFIN: C I/O Error feof - Unit 64 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 66 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

BUFFIN: Read Failed: Numerical result out of range
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    

BUFFIN: Read Failed: Inappropriate ioctl for device
BUFFIN: C I/O Error feof - Unit 64 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 66 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

BUFFIN: Read Failed: Numerical result out of range
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
12 Jan 2013 08:54:16 1144715 15503441 hadcm3n_o3co_2140_40_008270409_0 777,600 1,640,474 2.1097
11 Jan 2013 17:00:58 1144715 15503441 hadcm3n_o3co_2140_40_008270409_0 751,680 1,583,515 2.1066
11 Jan 2013 01:02:59 1144715 15503441 hadcm3n_o3co_2140_40_008270409_0 725,760 1,526,336 2.1031
10 Jan 2013 08:58:40 1144715 15503441 hadcm3n_o3co_2140_40_008270409_0 699,840 1,468,660 2.0986
09 Jan 2013 18:06:24 1144715 15503441 hadcm3n_o3co_2140_40_008270409_0 673,920 1,415,642 2.1006
09 Jan 2013 03:49:11 1144715 15503441 hadcm3n_o3co_2140_40_008270409_0 648,000 1,364,487 2.1057
08 Jan 2013 11:37:35 1144715 15503441 hadcm3n_o3co_2140_40_008270409_0 622,080 1,306,530 2.1003
07 Jan 2013 20:58:28 1144715 15503441 hadcm3n_o3co_2140_40_008270409_0 596,160 1,253,987 2.1034
07 Jan 2013 07:00:37 1144715 15503441 hadcm3n_o3co_2140_40_008270409_0 570,240 1,204,090 2.1115
06 Jan 2013 16:47:51 1144715 15503441 hadcm3n_o3co_2140_40_008270409_0 544,320 1,153,075 2.1184
06 Jan 2013 01:59:33 1144715 15503441 hadcm3n_o3co_2140_40_008270409_0 518,400 1,100,185 2.1223
05 Jan 2013 11:46:26 1144715 15503441 hadcm3n_o3co_2140_40_008270409_0 492,480 1,049,089 2.1302
04 Jan 2013 22:10:06 1144715 15503441 hadcm3n_o3co_2140_40_008270409_0 466,560 997,756 2.1385
04 Jan 2013 07:14:57 1144715 15503441 hadcm3n_o3co_2140_40_008270409_0 440,640 946,842 2.1488
03 Jan 2013 17:26:27 1144715 15503441 hadcm3n_o3co_2140_40_008270409_0 414,720 897,257 2.1635
03 Jan 2013 00:24:05 1144715 15503441 hadcm3n_o3co_2140_40_008270409_0 388,800 836,722 2.1521
02 Jan 2013 07:19:46 1144715 15503441 hadcm3n_o3co_2140_40_008270409_0 362,880 775,454 2.1369
01 Jan 2013 14:00:38 1144715 15503441 hadcm3n_o3co_2140_40_008270409_0 336,960 713,754 2.1182
31 Dec 2012 21:43:31 1144715 15503441 hadcm3n_o3co_2140_40_008270409_0 311,040 655,648 2.1079
31 Dec 2012 06:07:57 1144715 15503441 hadcm3n_o3co_2140_40_008270409_0 285,120 599,526 2.1027


©2024 cpdn.org