climateprediction.net home page
Task 15555571

Task 15555571

Name hadcm3n_zjpl_1880_40_008253425_3
Workunit 8408549
Created 23 Jan 2013, 10:57:43 UTC
Sent 23 Jan 2013, 10:57:49 UTC
Report deadline 24 Apr 2013, 18:25:00 UTC
Received 11 Feb 2013, 16:44:19 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1103902
Run time 19 days 4 hours 25 min 47 sec
CPU time 17 days 4 hours 34 min 15 sec
Validate state Invalid
Credit 5,909.76
Device peak FLOPS 1.61 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
13:49:40 (4636): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:49:42 (4636): No heartbeat from core client for 30 sec - exiting
18:49:10 (3448): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:49:11 (3448): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
04:35:51 (5616): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:35:52 (5616): No heartbeat from core client for 30 sec - exiting
16:26:47 (4752): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:26:48 (4752): No heartbeat from core client for 30 sec - exiting
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/zjplko.pj86c10
Error converting file to netcdf: dataout/zjplko.pi86c10
Error converting file to netcdf: dataout/zjplko.pf86c10
Error converting file to netcdf: dataout/zjplka.ph86c10
Error converting file to netcdf: dataout/zjplka.pg86c10
Error converting file to netcdf: dataout/zjplka.pe86c10
Error converting file to netcdf: dataout/zjplka.pd86c10
06:34:22 (4780): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:34:24 (4780): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
18:02:28 (3992): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:02:29 (3992): No heartbeat from core client for 30 sec - exiting
18:02:30 (3992): No heartbeat from core client for 30 sec - exiting
18:02:31 (3992): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
18:06:59 (6036): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:07:00 (6036): No heartbeat from core client for 30 sec - exiting
18:07:01 (6036): No heartbeat from core client for 30 sec - exiting
18:07:02 (6036): No heartbeat from core client for 30 sec - exiting
18:07:03 (6036): No heartbeat from core client for 30 sec - exiting
18:07:04 (6036): No heartbeat from core client for 30 sec - exiting
18:07:05 (6036): No heartbeat from core client for 30 sec - exiting
18:07:06 (6036): No heartbeat from core client for 30 sec - exiting
18:07:07 (6036): No heartbeat from core client for 30 sec - exiting
18:07:08 (6036): No heartbeat from core client for 30 sec - exiting
18:07:09 (6036): No heartbeat from core client for 30 sec - exiting
18:07:10 (6036): No heartbeat from core client for 30 sec - exiting
18:07:11 (6036): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
Suspended CPDN Monitor - Suspend request from BOINC...
05:07:10 (4644): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:07:11 (4644): No heartbeat from core client for 30 sec - exiting
05:07:12 (4644): No heartbeat from core client for 30 sec - exiting
06:22:09 (5560): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:22:56 (5952): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:22:57 (5952): No heartbeat from core client for 30 sec - exiting
04:22:58 (5952): No heartbeat from core client for 30 sec - exiting
16:55:17 (3816): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:55:18 (3816): No heartbeat from core client for 30 sec - exiting
16:55:19 (3816): No heartbeat from core client for 30 sec - exiting

Model crashed: TEMPHIST: Failed in OPEN of history file                                                                                                                                                                                                                        tmp/pipe_dummy                                                                  2048    
22:48:57 (2904): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:48:59 (2904): No heartbeat from core client for 30 sec - exiting
22:49:00 (2904): No heartbeat from core client for 30 sec - exiting
14:04:04 (5076): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:04:05 (5076): No heartbeat from core client for 30 sec - exiting
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
11 Feb 2013 14:10:28 1103902 15555571 hadcm3n_zjpl_1880_40_008253425_3 492,480 1,485,208 3.0158
10 Feb 2013 10:32:33 1103902 15555571 hadcm3n_zjpl_1880_40_008253425_3 466,560 1,404,528 3.0104
09 Feb 2013 06:26:15 1103902 15555571 hadcm3n_zjpl_1880_40_008253425_3 440,640 1,315,392 2.9852
08 Feb 2013 03:55:13 1103902 15555571 hadcm3n_zjpl_1880_40_008253425_3 414,720 1,232,322 2.9715
07 Feb 2013 02:10:44 1103902 15555571 hadcm3n_zjpl_1880_40_008253425_3 388,800 1,150,539 2.9592
06 Feb 2013 00:46:21 1103902 15555571 hadcm3n_zjpl_1880_40_008253425_3 362,880 1,069,035 2.9460
04 Feb 2013 23:14:45 1103902 15555571 hadcm3n_zjpl_1880_40_008253425_3 336,960 985,941 2.9260
04 Feb 2013 06:02:10 1103902 15555571 hadcm3n_zjpl_1880_40_008253425_3 311,040 924,398 2.9720
03 Feb 2013 07:32:21 1103902 15555571 hadcm3n_zjpl_1880_40_008253425_3 285,120 852,014 2.9883
02 Feb 2013 05:44:56 1103902 15555571 hadcm3n_zjpl_1880_40_008253425_3 259,200 768,741 2.9658
01 Feb 2013 03:00:36 1103902 15555571 hadcm3n_zjpl_1880_40_008253425_3 233,280 683,071 2.9281
31 Jan 2013 00:14:10 1103902 15555571 hadcm3n_zjpl_1880_40_008253425_3 207,360 597,449 2.8812
29 Jan 2013 20:21:25 1103902 15555571 hadcm3n_zjpl_1880_40_008253425_3 181,440 508,171 2.8008
28 Jan 2013 16:30:44 1103902 15555571 hadcm3n_zjpl_1880_40_008253425_3 155,520 420,776 2.7056
27 Jan 2013 13:52:05 1103902 15555571 hadcm3n_zjpl_1880_40_008253425_3 129,600 336,695 2.5980
26 Jan 2013 12:00:23 1103902 15555571 hadcm3n_zjpl_1880_40_008253425_3 103,680 253,183 2.4420
25 Jan 2013 15:30:38 1103902 15555571 hadcm3n_zjpl_1880_40_008253425_3 77,760 183,847 2.3643
24 Jan 2013 21:57:56 1103902 15555571 hadcm3n_zjpl_1880_40_008253425_3 51,840 122,471 2.3625
24 Jan 2013 04:30:21 1103902 15555571 hadcm3n_zjpl_1880_40_008253425_3 25,920 61,244 2.3628


©2024 climateprediction.net