Task 15505385

Name hadcm3n_o20p_2140_40_008269526_2
Workunit 8424650
Created 24 Dec 2012, 10:06:51 UTC
Sent 24 Dec 2012, 10:43:38 UTC
Report deadline 25 Mar 2013, 18:10:49 UTC
Received 24 Jan 2013, 13:45:28 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1244897
Run time 18 days 21 hours 48 min 37 sec
CPU time 16 days 1 hours 39 min 22 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 2.08 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:24:39 (3280): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2308, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3708, selfPID=3708, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
21:20:53 (3792): No heartbeat from core client for 30 sec - exiting
21:20:54 (3792): No heartbeat from core client for 30 sec - exiting
21:20:55 (3792): No heartbeat from core client for 30 sec - exiting
21:20:56 (3792): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:21:34 (1720): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	07:03:47 AM	No files match the supplied pattern.
MainError:	07:03:47 AM	No files match the supplied pattern.
MainError:	10:17:49 PM	No files match the supplied pattern.
MainError:	10:17:49 PM	No files match the supplied pattern.
MainError:	01:51:56 PM	No files match the supplied pattern.
MainError:	01:51:56 PM	No files match the supplied pattern.
MainError:	05:22:59 AM	No files match the supplied pattern.
MainError:	05:22:59 AM	No files match the supplied pattern.
MainError:	08:38:43 PM	No files match the supplied pattern.
MainError:	08:38:43 PM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	03:46:28 AM	No files match the supplied pattern.
MainError:	03:46:28 AM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
MainError:	06:12:29 PM	No files match the supplied pattern.
MainError:	06:12:29 PM	No files match the supplied pattern.
MainError:	08:51:02 AM	No files match the supplied pattern.
MainError:	08:51:02 AM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	12:34:29 AM	No files match the supplied pattern.
MainError:	12:34:29 AM	No files match the supplied pattern.
MainError:	03:46:35 PM	No files match the supplied pattern.
MainError:	03:46:35 PM	No files match the supplied pattern.
Error converting file to netcdf: dataout/o20pka.ph11c10
Error converting file to netcdf: dataout/o20pka.pg11c10
Error converting file to netcdf: dataout/o20pka.pe11c10
MainError:	06:59:41 AM	No files match the supplied pattern.
MainError:	06:59:41 AM	No files match the supplied pattern.
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
24 Jan 2013 07:00:51 1244897 15505385 hadcm3n_o20p_2140_40_008269526_2 777,600 1,528,016 1.9650
23 Jan 2013 15:50:39 1244897 15505385 hadcm3n_o20p_2140_40_008269526_2 751,680 1,477,366 1.9654
23 Jan 2013 00:36:41 1244897 15505385 hadcm3n_o20p_2140_40_008269526_2 725,760 1,427,541 1.9670
22 Jan 2013 08:55:31 1244897 15505385 hadcm3n_o20p_2140_40_008269526_2 699,840 1,377,820 1.9688
21 Jan 2013 18:16:32 1244897 15505385 hadcm3n_o20p_2140_40_008269526_2 673,920 1,328,789 1.9717
21 Jan 2013 03:50:14 1244897 15505385 hadcm3n_o20p_2140_40_008269526_2 648,000 1,280,208 1.9756
13 Jan 2013 20:43:37 1244897 15505385 hadcm3n_o20p_2140_40_008269526_2 622,080 1,229,897 1.9771
13 Jan 2013 05:23:57 1244897 15505385 hadcm3n_o20p_2140_40_008269526_2 596,160 1,179,630 1.9787
12 Jan 2013 13:55:18 1244897 15505385 hadcm3n_o20p_2140_40_008269526_2 570,240 1,129,328 1.9804
11 Jan 2013 22:22:04 1244897 15505385 hadcm3n_o20p_2140_40_008269526_2 544,320 1,078,927 1.9822
11 Jan 2013 07:09:00 1244897 15505385 hadcm3n_o20p_2140_40_008269526_2 518,400 1,028,675 1.9843
10 Jan 2013 15:45:46 1244897 15505385 hadcm3n_o20p_2140_40_008269526_2 492,480 978,712 1.9873
09 Jan 2013 21:00:46 1244897 15505385 hadcm3n_o20p_2140_40_008269526_2 466,560 928,415 1.9899
08 Jan 2013 00:10:08 1244897 15505385 hadcm3n_o20p_2140_40_008269526_2 440,640 878,563 1.9938
07 Jan 2013 09:31:12 1244897 15505385 hadcm3n_o20p_2140_40_008269526_2 414,720 827,987 1.9965
06 Jan 2013 16:02:43 1244897 15505385 hadcm3n_o20p_2140_40_008269526_2 388,800 777,150 1.9988
06 Jan 2013 01:34:27 1244897 15505385 hadcm3n_o20p_2140_40_008269526_2 362,880 726,060 2.0008
05 Jan 2013 07:00:24 1244897 15505385 hadcm3n_o20p_2140_40_008269526_2 336,960 675,135 2.0036
04 Jan 2013 16:44:29 1244897 15505385 hadcm3n_o20p_2140_40_008269526_2 311,040 624,882 2.0090
04 Jan 2013 02:13:42 1244897 15505385 hadcm3n_o20p_2140_40_008269526_2 285,120 573,622 2.0119
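
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. That definition is an assumption inferred from the figures, not something the page states. A minimal sketch that checks it against three rows copied from the table (the function name `average_sec_per_timestep` is hypothetical):

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
rows = [
    (777_600, 1_528_016, 1.9650),
    (751_680, 1_477_366, 1.9654),
    (285_120, 573_622, 2.0119),
]


def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Assumed definition: cumulative CPU seconds / cumulative timesteps."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in rows:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    # Print both values side by side so any mismatch is easy to spot.
    print(f"{timestep:>9,d}  computed={computed:.4f}  reported={reported:.4f}")
```

If the assumption holds, the computed and reported values agree to four decimal places for each row.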

