climateprediction.net home page
Task 13102084

Task 13102084

Name hadcm3n_ycje_1900_40_007349108_1
Workunit 7546538
Created 6 Jul 2011, 13:55:52 UTC
Sent 17 Jul 2011, 23:12:54 UTC
Report deadline 17 Oct 2011, 6:40:05 UTC
Received 18 Sep 2011, 14:21:04 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1103902
Run time 48 days 12 hours 16 min 9 sec
CPU time 43 days 23 hours 31 min 14 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 0.69 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
18:35:02 (4032): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:35:03 (4032): No heartbeat from core client for 30 sec - exiting
18:35:04 (4032): No heartbeat from core client for 30 sec - exiting
18:35:06 (4032): No heartbeat from core client for 30 sec - exiting
04:24:18 (5976): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:24:20 (5976): No heartbeat from core client for 30 sec - exiting
04:24:21 (5976): No heartbeat from core client for 30 sec - exiting
04:24:22 (5976): No heartbeat from core client for 30 sec - exiting
04:27:17 (3772): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:28:26 (1096): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:34:29 (5436): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:34:30 (5436): No heartbeat from core client for 30 sec - exiting
04:40:10 (4788): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
16:58:23 (5196): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:58:24 (5196): No heartbeat from core client for 30 sec - exiting
16:58:25 (5196): No heartbeat from core client for 30 sec - exiting
16:58:27 (5196): No heartbeat from core client for 30 sec - exiting
16:58:28 (5196): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
04:22:50 (3540): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:22:52 (3540): No heartbeat from core client for 30 sec - exiting
04:27:33 (2424): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:27:35 (2424): No heartbeat from core client for 30 sec - exiting
04:27:36 (2424): No heartbeat from core client for 30 sec - exiting
04:27:37 (2424): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5364, iMonCtr=1
Model crash detected, will try to restart...
21:26:18 (2856): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2616, iMonCtr=1
Model crash detected, will try to restart...
03:59:51 (4068): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:59:52 (4068): No heartbeat from core client for 30 sec - exiting
03:59:53 (4068): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
05:22:01 (2512): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:22:02 (2512): No heartbeat from core client for 30 sec - exiting
05:28:28 (3864): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:54:08 (5028): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:54:09 (5028): No heartbeat from core client for 30 sec - exiting
17:54:10 (5028): No heartbeat from core client for 30 sec - exiting
17:54:11 (5028): No heartbeat from core client for 30 sec - exiting
17:54:12 (5028): No heartbeat from core client for 30 sec - exiting
17:54:13 (5028): No heartbeat from core client for 30 sec - exiting
17:54:14 (5028): No heartbeat from core client for 30 sec - exiting
17:54:15 (5028): No heartbeat from core client for 30 sec - exiting
17:54:16 (5028): No heartbeat from core client for 30 sec - exiting
17:54:17 (5028): No heartbeat from core client for 30 sec - exiting

Model crashed: TEMPHIST: Failed in OPEN of history file                                                                                                                                                                                                                        tmp/pipe_dummy                                                                  2048    
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:08:06 (4472): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:08:07 (4472): No heartbeat from core client for 30 sec - exiting
20:08:08 (4472): No heartbeat from core client for 30 sec - exiting
20:08:09 (4472): No heartbeat from core client for 30 sec - exiting
20:08:10 (4472): No heartbeat from core client for 30 sec - exiting
20:08:11 (4472): No heartbeat from core client for 30 sec - exiting
05:03:42 (4172): No heartbeat from core client for 30 sec - exiting
05:03:43 (4172): No heartbeat from core client for 30 sec - exiting
05:03:44 (4172): No heartbeat from core client for 30 sec - exiting
05:03:46 (4172): No heartbeat from core client for 30 sec - exiting
05:03:47 (4172): No heartbeat from core client for 30 sec - exiting
05:03:48 (4172): No heartbeat from core client for 30 sec - exiting
05:03:49 (4172): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

zip error: Could not create output file (was replacing the original zip file)
Suspended CPDN Monitor - Suspend request from BOINC...
18:39:08 (2056): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
17:56:08 (5532): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:56:09 (5532): No heartbeat from core client for 30 sec - exiting
17:56:10 (5532): No heartbeat from core client for 30 sec - exiting
17:56:11 (5532): No heartbeat from core client for 30 sec - exiting
17:56:12 (5532): No heartbeat from core client for 30 sec - exiting
12:39:21 (5340): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:39:23 (5340): No heartbeat from core client for 30 sec - exiting
12:39:24 (5340): No heartbeat from core client for 30 sec - exiting
12:39:25 (5340): No heartbeat from core client for 30 sec - exiting
12:39:26 (5340): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
14:15:25 (756): No heartbeat from core client for 30 sec - exiting
14:15:26 (756): No heartbeat from core client for 30 sec - exiting
14:15:27 (756): No heartbeat from core client for 30 sec - exiting
14:15:28 (756): No heartbeat from core client for 30 sec - exiting
14:15:29 (756): No heartbeat from core client for 30 sec - exiting
14:15:30 (756): No heartbeat from core client for 30 sec - exiting
14:15:31 (756): No heartbeat from core client for 30 sec - exiting
14:15:32 (756): No heartbeat from core client for 30 sec - exiting
14:15:33 (756): No heartbeat from core client for 30 sec - exiting
14:15:34 (756): No heartbeat from core client for 30 sec - exiting
14:15:35 (756): No heartbeat from core client for 30 sec - exiting
14:15:36 (756): No heartbeat from core client for 30 sec - exiting
14:15:37 (756): No heartbeat from core client for 30 sec - exiting
14:15:38 (756): No heartbeat from core client for 30 sec - exiting
14:15:39 (756): No heartbeat from core client for 30 sec - exiting
14:15:40 (756): No heartbeat from core client for 30 sec - exiting
14:15:41 (756): No heartbeat from core client for 30 sec - exiting
14:15:42 (756): No heartbeat from core client for 30 sec - exiting
14:15:43 (756): No heartbeat from core client for 30 sec - exiting
14:15:44 (756): No heartbeat from core client for 30 sec - exiting
14:15:45 (756): No heartbeat from core client for 30 sec - exiting
14:15:46 (756): No heartbeat from core client for 30 sec - exiting
14:15:47 (756): No heartbeat from core client for 30 sec - exiting
14:15:48 (756): No heartbeat from core client for 30 sec - exiting
14:15:49 (756): No heartbeat from core client for 30 sec - exiting
14:15:50 (756): No heartbeat from core client for 30 sec - exiting
14:15:51 (756): No heartbeat from core client for 30 sec - exiting
14:16:28 (756): No heartbeat from core client for 30 sec - exiting
14:16:29 (756): No heartbeat from core client for 30 sec - exiting
14:16:30 (756): No heartbeat from core client for 30 sec - exiting
14:16:31 (756): No heartbeat from core client for 30 sec - exiting
14:16:32 (756): No heartbeat from core client for 30 sec - exiting
14:16:33 (756): No heartbeat from core client for 30 sec - exiting
14:16:34 (756): No heartbeat from core client for 30 sec - exiting
14:16:35 (756): No heartbeat from core client for 30 sec - exiting
14:16:36 (756): No heartbeat from core client for 30 sec - exiting
14:16:37 (756): No heartbeat from core client for 30 sec - exiting
14:16:38 (756): No heartbeat from core client for 30 sec - exiting
14:16:40 (756): No heartbeat from core client for 30 sec - exiting
14:16:41 (756): No heartbeat from core client for 30 sec - exiting
14:16:42 (756): No heartbeat from core client for 30 sec - exiting
14:16:43 (756): No heartbeat from core client for 30 sec - exiting
14:16:44 (756): No heartbeat from core client for 30 sec - exiting
14:16:45 (756): No heartbeat from core client for 30 sec - exiting
14:16:46 (756): No heartbeat from core client for 30 sec - exiting
14:16:48 (756): No heartbeat from core client for 30 sec - exiting
14:16:49 (756): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycje_1900_40_007349108/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycje_1900_40_007349108/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycje_1900_40_007349108/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycje_1900_40_007349108/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycje_1900_40_007349108/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycje_1900_40_007349108/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycje_1900_40_007349108/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycje_1900_40_007349108/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycje_1900_40_007349108/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycje_1900_40_007349108/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycje_1900_40_007349108/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycje_1900_40_007349108/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
18 Sep 2011 13:21:22 1103902 13102084 hadcm3n_ycje_1900_40_007349108_1 777,600 3,799,731 4.8865
16 Sep 2011 11:12:10 1103902 13102084 hadcm3n_ycje_1900_40_007349108_1 751,680 3,638,086 4.8399
14 Sep 2011 17:13:56 1103902 13102084 hadcm3n_ycje_1900_40_007349108_1 725,760 3,501,383 4.8244
13 Sep 2011 06:27:34 1103902 13102084 hadcm3n_ycje_1900_40_007349108_1 699,840 3,390,393 4.8445
11 Sep 2011 21:25:52 1103902 13102084 hadcm3n_ycje_1900_40_007349108_1 673,920 3,280,918 4.8684
10 Sep 2011 08:10:18 1103902 13102084 hadcm3n_ycje_1900_40_007349108_1 648,000 3,163,551 4.8820
08 Sep 2011 23:42:52 1103902 13102084 hadcm3n_ycje_1900_40_007349108_1 622,080 3,058,377 4.9164
07 Sep 2011 17:44:05 1103902 13102084 hadcm3n_ycje_1900_40_007349108_1 596,160 2,961,044 4.9669
06 Sep 2011 11:15:00 1103902 13102084 hadcm3n_ycje_1900_40_007349108_1 570,240 2,862,588 5.0200
05 Sep 2011 00:48:52 1103902 13102084 hadcm3n_ycje_1900_40_007349108_1 544,320 2,750,899 5.0538
03 Sep 2011 04:08:13 1103902 13102084 hadcm3n_ycje_1900_40_007349108_1 518,400 2,606,822 5.0286
01 Sep 2011 19:12:37 1103902 13102084 hadcm3n_ycje_1900_40_007349108_1 492,480 2,499,712 5.0758
31 Aug 2011 15:14:59 1103902 13102084 hadcm3n_ycje_1900_40_007349108_1 466,560 2,405,993 5.1569
30 Aug 2011 10:09:29 1103902 13102084 hadcm3n_ycje_1900_40_007349108_1 440,640 2,315,720 5.2554
27 Aug 2011 17:05:02 1103902 13102084 hadcm3n_ycje_1900_40_007349108_1 414,720 2,251,400 5.4287
26 Aug 2011 12:30:37 1103902 13102084 hadcm3n_ycje_1900_40_007349108_1 388,800 2,199,126 5.6562
22 Aug 2011 07:48:40 1103902 13102084 hadcm3n_ycje_1900_40_007349108_1 362,880 2,064,859 5.6902
20 Aug 2011 15:28:34 1103902 13102084 hadcm3n_ycje_1900_40_007349108_1 336,960 1,929,871 5.7273
19 Aug 2011 04:47:00 1103902 13102084 hadcm3n_ycje_1900_40_007349108_1 311,040 1,812,127 5.8260
17 Aug 2011 18:40:31 1103902 13102084 hadcm3n_ycje_1900_40_007349108_1 285,120 1,698,700 5.9578


©2024 climateprediction.net