Task 13544214

Name: hadcm3n_ycli_1900_40_007519266_0
Workunit: 7716741
Created: 28 Oct 2011, 13:02:04 UTC
Sent: 19 Nov 2011, 12:36:03 UTC
Report deadline: 18 Feb 2012, 20:03:14 UTC
Received: 23 Jan 2012, 3:01:52 UTC
Server state: Over
Outcome: Computation error
Client state: Compute error
Exit status: 22 (0x00000016) Unknown error code
Computer ID: 1157390
Run time: 34 days 9 hours 30 min 15 sec
CPU time: 34 days 8 hours 10 min 30 sec
Validate state: Invalid
Credit: 12,441.60
Device peak FLOPS: 1.37 GFLOPS
Application version: UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
16:42:43 (13012): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
16:42:44 (13012): No heartbeat from core client for 30 sec - exiting
16:31:12 (20764): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:25:29 (21793): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
23:56:14 (24432): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:56:17 (24432): No heartbeat from core client for 30 sec - exiting
06:22:30 (25691): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:37:50 (27050): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:37:53 (27050): No heartbeat from core client for 30 sec - exiting
07:38:55 (31539): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:40:12 (31596): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:43:17 (31674): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
16:30:27 (18254): No heartbeat from core client for 30 sec - exiting
16:30:28 (18254): No heartbeat from core client for 30 sec - exiting
16:30:29 (18254): No heartbeat from core client for 30 sec - exiting
16:30:30 (18254): No heartbeat from core client for 30 sec - exiting
16:30:31 (18254): No heartbeat from core client for 30 sec - exiting
16:30:32 (18254): No heartbeat from core client for 30 sec - exiting
16:30:33 (18254): No heartbeat from core client for 30 sec - exiting
16:30:34 (18254): No heartbeat from core client for 30 sec - exiting
16:30:35 (18254): No heartbeat from core client for 30 sec - exiting
16:30:36 (18254): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_ycli_1900_40_007519266/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_ycli_1900_40_007519266/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_ycli_1900_40_007519266/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_ycli_1900_40_007519266/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_ycli_1900_40_007519266/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_ycli_1900_40_007519266/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_ycli_1900_40_007519266/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_ycli_1900_40_007519266/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_ycli_1900_40_007519266/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_ycli_1900_40_007519266/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_ycli_1900_40_007519266/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_ycli_1900_40_007519266/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
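
The stderr output above repeats a small set of messages, ending after six "Model crashed" lines, each preceded by two failed attempts to open a restart file. A minimal sketch for tallying such a log by message type follows. The category names and the tally() helper are hypothetical and not part of BOINC or the CPDN monitor; the patterns are copied from message text in the log, and the sample holds only a few representative lines, not the full output.

```python
import re
from collections import Counter

# Patterns taken from message text seen in the log; category names are made up here.
PATTERNS = {
    "heartbeat_lost": re.compile(r"No heartbeat from core client"),
    "suspend_request": re.compile(r"Suspend request from BOINC"),
    "restart_unreadable": re.compile(r"cannot open input file .*_restart\.day"),
    "model_crash": re.compile(r"^Model crashed:"),
}

def tally(lines):
    """Count how many lines fall into each category (first matching pattern wins)."""
    counts = Counter()
    for line in lines:
        for name, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[name] += 1
                break
    return counts

# A few representative lines from the log, not the complete stderr text.
sample = [
    "Suspended CPDN Monitor - Suspend request from BOINC...",
    "16:42:43 (13012): No heartbeat from core client for 30 sec - exiting",
    "cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/"
    "climateprediction.net/hadcm3n_ycli_1900_40_007519266/dataout/"
    "atmos_restart.day after 11 attempts",
    "Model crashed: READ_FLH: I/O error",
    "Sorry, too many model crashes! :-(",
]

counts = tally(sample)
for name, n in sorted(counts.items()):
    print(name, n)
```

Run against the full stderr text, a tally like this makes it easier to see how many heartbeat losses and suspend requests came before the run of crashes.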
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep   CPU Time (sec)  Average (sec/TS)
22 Jan 2012 22:32:50  1157390  13544214   hadcm3n_ycli_1900_40_007519266_0  1,036,800  2,967,094       2.8618
22 Jan 2012 02:04:46  1157390  13544214   hadcm3n_ycli_1900_40_007519266_0  1,010,880  2,893,717       2.8626
21 Jan 2012 05:39:52  1157390  13544214   hadcm3n_ycli_1900_40_007519266_0  984,960    2,820,506       2.8636
20 Jan 2012 09:19:56  1157390  13544214   hadcm3n_ycli_1900_40_007519266_0  959,040    2,747,284       2.8646
19 Jan 2012 12:52:46  1157390  13544214   hadcm3n_ycli_1900_40_007519266_0  933,120    2,674,017       2.8657
18 Jan 2012 16:41:08  1157390  13544214   hadcm3n_ycli_1900_40_007519266_0  907,200    2,600,717       2.8668
17 Jan 2012 20:00:46  1157390  13544214   hadcm3n_ycli_1900_40_007519266_0  881,280    2,527,329       2.8678
16 Jan 2012 23:41:09  1157390  13544214   hadcm3n_ycli_1900_40_007519266_0  855,360    2,454,239       2.8692
16 Jan 2012 03:18:27  1157390  13544214   hadcm3n_ycli_1900_40_007519266_0  829,440    2,381,256       2.8709
14 Jan 2012 22:43:17  1157390  13544214   hadcm3n_ycli_1900_40_007519266_0  803,520    2,307,170       2.8713
14 Jan 2012 01:25:34  1157390  13544214   hadcm3n_ycli_1900_40_007519266_0  777,600    2,231,376       2.8696
13 Jan 2012 03:56:40  1157390  13544214   hadcm3n_ycli_1900_40_007519266_0  751,680    2,157,158       2.8698
12 Jan 2012 07:22:50  1157390  13544214   hadcm3n_ycli_1900_40_007519266_0  725,760    2,082,545       2.8695
11 Jan 2012 10:49:35  1157390  13544214   hadcm3n_ycli_1900_40_007519266_0  699,840    2,008,023       2.8693
10 Jan 2012 14:19:42  1157390  13544214   hadcm3n_ycli_1900_40_007519266_0  673,920    1,933,502       2.8690
09 Jan 2012 17:40:45  1157390  13544214   hadcm3n_ycli_1900_40_007519266_0  648,000    1,858,618       2.8682
08 Jan 2012 20:44:46  1157390  13544214   hadcm3n_ycli_1900_40_007519266_0  622,080    1,783,813       2.8675
07 Jan 2012 23:07:50  1157390  13544214   hadcm3n_ycli_1900_40_007519266_0  596,160    1,709,390       2.8673
07 Jan 2012 02:25:56  1157390  13544214   hadcm3n_ycli_1900_40_007519266_0  570,240    1,634,905       2.8670
06 Jan 2012 05:47:24  1157390  13544214   hadcm3n_ycli_1900_40_007519266_0  544,320    1,560,960       2.8677
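
The "Average (sec/TS)" column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count. That relationship is inferred from the figures rather than stated on the page. A minimal sketch checking it against the newest and oldest rows follows; parse_trickle() and sec_per_timestep() are hypothetical helpers, and the field order is assumed from the header line.

```python
def parse_trickle(row: str) -> dict:
    """Split one whitespace-separated trickle row into named fields.

    Assumed field order: timestamp (4 tokens), host ID, result ID,
    result name, timestep, CPU seconds, reported average.
    """
    parts = row.split()
    return {
        "sent": " ".join(parts[0:4]),
        "host_id": int(parts[4]),
        "result_id": int(parts[5]),
        "result_name": parts[6],
        "timestep": int(parts[7].replace(",", "")),
        "cpu_sec": int(parts[8].replace(",", "")),
        "avg_reported": float(parts[9]),
    }

def sec_per_timestep(rec: dict) -> float:
    """Assumed definition: cumulative CPU seconds / cumulative timesteps."""
    return round(rec["cpu_sec"] / rec["timestep"], 4)

# Newest and oldest rows, copied from the table.
rows = [
    "22 Jan 2012 22:32:50 1157390 13544214 hadcm3n_ycli_1900_40_007519266_0 1,036,800 2,967,094 2.8618",
    "06 Jan 2012 05:47:24 1157390 13544214 hadcm3n_ycli_1900_40_007519266_0 544,320 1,560,960 2.8677",
]

records = [parse_trickle(r) for r in rows]
for rec in records:
    # Print the recomputed value next to the value reported in the table.
    print(rec["sent"], sec_per_timestep(rec), rec["avg_reported"])
```

For both rows the recomputed value agrees with the reported average to four decimal places, which supports the assumed definition without confirming it.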