climateprediction.net home page
Task 15540708

Name hadcm3n_o7ho_2140_40_008269776_3
Workunit 8424900
Created 14 Jan 2013, 0:10:52 UTC
Sent 14 Jan 2013, 0:10:57 UTC
Report deadline 15 Apr 2013, 7:38:08 UTC
Received 24 Jan 2013, 17:11:13 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1229687
Run time 10 days 13 hours 15 min 6 sec
CPU time 10 days 11 hours 52 min 52 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 2.72 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
19:22:56 (4000): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:21:48 (2704): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:20:43 (6900): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:19:42 (6060): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:18:42 (5408): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:17:39 (8972): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:16:36 (9308): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:15:26 (10748): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:14:25 (12084): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:13:13 (9260): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:12:14 (13204): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:11:10 (15764): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:10:02 (15968): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
10:08:58 (4372): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:07:52 (6000): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:06:35 (14496): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:05:22 (14356): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
05:04:11 (3564): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:03:02 (13400): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:03:03 (13400): No heartbeat from core client for 30 sec - exiting
13:01:51 (13144): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:00:48 (14008): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:00:49 (14008): No heartbeat from core client for 30 sec - exiting
20:59:27 (2264): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:58:13 (9512): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:56:58 (9908): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:56:59 (9908): No heartbeat from core client for 30 sec - exiting
10:55:55 (5760): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:54:49 (8124): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:53:32 (5552): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:52:17 (10480): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:51:01 (11548): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:49:50 (7644): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	01:38:54 AM	No files match the supplied pattern.
MainError:	01:38:54 AM	No files match the supplied pattern.
19:48:44 (14620): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:47:39 (5604): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	10:22:31 AM	No files match the supplied pattern.
MainError:	10:22:31 AM	No files match the supplied pattern.
07:46:34 (6468): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	07:04:50 PM	No files match the supplied pattern.
MainError:	07:04:50 PM	No files match the supplied pattern.
12:45:32 (17348): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:44:25 (12060): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	03:20:12 AM	No files match the supplied pattern.
MainError:	03:20:12 AM	No files match the supplied pattern.
22:43:13 (6264): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:43:14 (6264): No heartbeat from core client for 30 sec - exiting
MainError:	11:40:03 AM	No files match the supplied pattern.
MainError:	11:40:03 AM	No files match the supplied pattern.
03:41:57 (7520): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:41:59 (7520): No heartbeat from core client for 30 sec - exiting
07:40:43 (12380): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	07:53:07 PM	No files match the supplied pattern.
MainError:	07:53:07 PM	No files match the supplied pattern.
12:39:33 (9728): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:38:24 (16452): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	04:12:31 AM	No files match the supplied pattern.
MainError:	04:12:31 AM	No files match the supplied pattern.
21:37:18 (7912): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:36:18 (15348): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	12:35:02 AM	No files match the supplied pattern.
MainError:	12:35:02 AM	No files match the supplied pattern.
06:35:09 (14672): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:34:04 (11728): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	08:49:49 PM	No files match the supplied pattern.
MainError:	08:49:49 PM	No files match the supplied pattern.
17:32:55 (16856): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	05:10:22 AM	No files match the supplied pattern.
MainError:	05:10:22 AM	No files match the supplied pattern.
23:31:52 (10848): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Error converting file to netcdf: dataout/o7hoka.ph11c10
Error converting file to netcdf: dataout/o7hoka.pg11c10
Error converting file to netcdf: dataout/o7hoka.pe11c10
MainError:	01:22:43 PM	No files match the supplied pattern.
MainError:	01:22:43 PM	No files match the supplied pattern.
05:30:40 (12364): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
24 Jan 2013 13:25:16 1229687 15540708 hadcm3n_o7ho_2140_40_008269776_3 777,600 906,276 1.1655
24 Jan 2013 05:10:29 1229687 15540708 hadcm3n_o7ho_2140_40_008269776_3 751,680 876,881 1.1666
23 Jan 2013 20:52:23 1229687 15540708 hadcm3n_o7ho_2140_40_008269776_3 725,760 846,975 1.1670
23 Jan 2013 12:35:06 1229687 15540708 hadcm3n_o7ho_2140_40_008269776_3 699,840 817,493 1.1681
23 Jan 2013 04:12:24 1229687 15540708 hadcm3n_o7ho_2140_40_008269776_3 673,920 787,567 1.1686
22 Jan 2013 19:53:41 1229687 15540708 hadcm3n_o7ho_2140_40_008269776_3 648,000 757,820 1.1695
22 Jan 2013 11:46:02 1229687 15540708 hadcm3n_o7ho_2140_40_008269776_3 622,080 728,511 1.1711
22 Jan 2013 03:24:30 1229687 15540708 hadcm3n_o7ho_2140_40_008269776_3 596,160 698,656 1.1719
21 Jan 2013 19:06:52 1229687 15540708 hadcm3n_o7ho_2140_40_008269776_3 570,240 669,094 1.1734
21 Jan 2013 10:26:38 1229687 15540708 hadcm3n_o7ho_2140_40_008269776_3 544,320 637,855 1.1718
21 Jan 2013 01:39:49 1229687 15540708 hadcm3n_o7ho_2140_40_008269776_3 518,400 606,599 1.1701
20 Jan 2013 16:56:12 1229687 15540708 hadcm3n_o7ho_2140_40_008269776_3 492,480 575,384 1.1683
20 Jan 2013 08:09:32 1229687 15540708 hadcm3n_o7ho_2140_40_008269776_3 466,560 543,854 1.1657
19 Jan 2013 23:07:39 1229687 15540708 hadcm3n_o7ho_2140_40_008269776_3 440,640 511,532 1.1609
19 Jan 2013 14:25:28 1229687 15540708 hadcm3n_o7ho_2140_40_008269776_3 414,720 480,424 1.1584
19 Jan 2013 06:04:37 1229687 15540708 hadcm3n_o7ho_2140_40_008269776_3 388,800 450,578 1.1589
18 Jan 2013 21:27:18 1229687 15540708 hadcm3n_o7ho_2140_40_008269776_3 362,880 419,495 1.1560
18 Jan 2013 13:10:17 1229687 15540708 hadcm3n_o7ho_2140_40_008269776_3 336,960 389,964 1.1573
18 Jan 2013 04:48:58 1229687 15540708 hadcm3n_o7ho_2140_40_008269776_3 311,040 360,195 1.1580
17 Jan 2013 20:31:28 1229687 15540708 hadcm3n_o7ho_2140_40_008269776_3 285,120 330,511 1.1592
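
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The page does not state this; it is an assumption that happens to agree with the rows checked. A minimal Python sketch for parsing one trickle row and recomputing that value follows. The names `Trickle`, `parse_trickle`, and `recomputed_average` are hypothetical helpers, not part of BOINC or CPDN.

```python
from typing import NamedTuple


class Trickle(NamedTuple):
    """One row of the 'Latest Trickles Received' table (hypothetical container)."""
    sent_utc: str
    host_id: int
    result_id: int
    result_name: str
    timestep: int
    cpu_time_sec: int
    avg_sec_per_ts: float


def parse_trickle(row: str) -> Trickle:
    """Parse one whitespace-separated trickle row as it appears in the table."""
    parts = row.split()
    if len(parts) != 10:
        raise ValueError(f"expected 10 fields, got {len(parts)}: {row!r}")
    # The first four tokens form the timestamp: day, month, year, HH:MM:SS.
    sent_utc = " ".join(parts[:4])
    host_id, result_id, name, timestep, cpu_time, average = parts[4:]
    return Trickle(
        sent_utc=sent_utc,
        host_id=int(host_id),
        result_id=int(result_id),
        result_name=name,
        # Timestep and CPU time use thousands separators on the page.
        timestep=int(timestep.replace(",", "")),
        cpu_time_sec=int(cpu_time.replace(",", "")),
        avg_sec_per_ts=float(average),
    )


def recomputed_average(t: Trickle) -> float:
    """Assumed formula: cumulative CPU seconds per timestep, to 4 decimals."""
    return round(t.cpu_time_sec / t.timestep, 4)


if __name__ == "__main__":
    row = ("24 Jan 2013 13:25:16 1229687 15540708 "
           "hadcm3n_o7ho_2140_40_008269776_3 777,600 906,276 1.1655")
    t = parse_trickle(row)
    print(t.avg_sec_per_ts, recomputed_average(t))
```

For the most recent trickle, 906,276 s / 777,600 timesteps is about 1.1655 s per timestep, which matches the reported value.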