climateprediction.net home page
Task 15970499

Task 15970499

Name hadcm3n_7k4e_1980_40_008437361_0
Workunit 8588217
Created 30 Aug 2013, 7:22:13 UTC
Sent 30 Aug 2013, 7:35:41 UTC
Report deadline 29 Nov 2013, 15:02:52 UTC
Received 23 Sep 2013, 11:36:38 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1287172
Run time 7 days 19 hours 0 min 5 sec
CPU time 7 days 11 hours 14 min 34 sec
Validate state Invalid
Credit 8,398.08
Device peak FLOPS 2.41 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognise the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
18:21:20 (4312): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
21:08:03 (3316): No heartbeat from core client for 30 sec - exiting
21:08:04 (3316): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:08:05 (3316): No heartbeat from core client for 30 sec - exiting
21:08:47 (1968): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED.                                                                                                                                                                                                                     tmp/pipe_dummy                                                                  2048    
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
23:09:17 (320): No heartbeat from core client for 30 sec - exiting
23:09:18 (320): No heartbeat from core client for 30 sec - exiting
23:09:19 (320): No heartbeat from core client for 30 sec - exiting
23:09:20 (320): No heartbeat from core client for 30 sec - exiting
23:09:21 (320): No heartbeat from core client for 30 sec - exiting
23:09:23 (320): No heartbeat from core client for 30 sec - exiting
23:09:24 (320): No heartbeat from core client for 30 sec - exiting
23:09:25 (320): No heartbeat from core client for 30 sec - exiting
23:09:26 (320): No heartbeat from core client for 30 sec - exiting
23:09:27 (320): No heartbeat from core client for 30 sec - exiting
23:09:28 (320): No heartbeat from core client for 30 sec - exiting
23:09:29 (320): No heartbeat from core client for 30 sec - exiting
23:09:30 (320): No heartbeat from core client for 30 sec - exiting
23:09:31 (320): No heartbeat from core client for 30 sec - exiting
23:09:32 (320): No heartbeat from core client for 30 sec - exiting
23:09:33 (320): No heartbeat from core client for 30 sec - exiting
23:09:35 (320): No heartbeat from core client for 30 sec - exiting
23:09:36 (320): No heartbeat from core client for 30 sec - exiting
23:09:37 (320): No heartbeat from core client for 30 sec - exiting
23:09:38 (320): No heartbeat from core client for 30 sec - exiting
23:09:39 (320): No heartbeat from core client for 30 sec - exiting
23:09:40 (320): No heartbeat from core client for 30 sec - exiting
23:09:41 (320): No heartbeat from core client for 30 sec - exiting
23:09:42 (320): No heartbeat from core client for 30 sec - exiting
23:09:43 (320): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:09:50 (2984): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
15:54:44 (1880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
07:34:06 (3520): No heartbeat from core client for 30 sec - exiting
07:34:08 (3520): No heartbeat from core client for 30 sec - exiting
07:34:10 (3520): No heartbeat from core client for 30 sec - exiting
07:34:11 (3520): No heartbeat from core client for 30 sec - exiting
07:34:12 (3520): No heartbeat from core client for 30 sec - exiting
07:34:13 (3520): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:34:14 (3520): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
15:52:57 (4328): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
16:16:59 (3080): No heartbeat from core client for 30 sec - exiting
16:17:00 (3080): No heartbeat from core client for 30 sec - exiting
16:17:01 (3080): No heartbeat from core client for 30 sec - exiting
16:17:02 (3080): No heartbeat from core client for 30 sec - exiting
16:17:03 (3080): No heartbeat from core client for 30 sec - exiting
16:17:04 (3080): No heartbeat from core client for 30 sec - exiting
16:17:05 (3080): No heartbeat from core client for 30 sec - exiting
16:17:06 (3080): No heartbeat from core client for 30 sec - exiting
16:17:08 (3080): No heartbeat from core client for 30 sec - exiting
16:17:09 (3080): No heartbeat from core client for 30 sec - exiting
16:17:10 (3080): No heartbeat from core client for 30 sec - exiting
16:17:11 (3080): No heartbeat from core client for 30 sec - exiting
16:17:12 (3080): No heartbeat from core client for 30 sec - exiting
16:17:13 (3080): No heartbeat from core client for 30 sec - exiting
16:17:14 (3080): No heartbeat from core client for 30 sec - exiting
16:17:15 (3080): No heartbeat from core client for 30 sec - exiting
16:17:16 (3080): No heartbeat from core client for 30 sec - exiting
16:17:17 (3080): No heartbeat from core client for 30 sec - exiting
16:17:18 (3080): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
15:55:12 (4128): No heartbeat from core client for 30 sec - exiting
15:55:13 (4128): No heartbeat from core client for 30 sec - exiting
15:55:14 (4128): No heartbeat from core client for 30 sec - exiting
15:55:15 (4128): No heartbeat from core client for 30 sec - exiting
15:55:16 (4128): No heartbeat from core client for 30 sec - exiting
15:55:17 (4128): No heartbeat from core client for 30 sec - exiting
15:55:18 (4128): No heartbeat from core client for 30 sec - exiting
15:55:19 (4128): No heartbeat from core client for 30 sec - exiting
15:55:21 (4128): No heartbeat from core client for 30 sec - exiting
15:55:22 (4128): No heartbeat from core client for 30 sec - exiting
15:55:23 (4128): No heartbeat from core client for 30 sec - exiting
15:55:24 (4128): No heartbeat from core client for 30 sec - exiting
15:55:25 (4128): No heartbeat from core client for 30 sec - exiting
15:55:26 (4128): No heartbeat from core client for 30 sec - exiting
15:55:27 (4128): No heartbeat from core client for 30 sec - exiting
15:55:28 (4128): No heartbeat from core client for 30 sec - exiting
15:55:29 (4128): No heartbeat from core client for 30 sec - exiting
15:55:30 (4128): No heartbeat from core client for 30 sec - exiting
15:55:32 (4128): No heartbeat from core client for 30 sec - exiting
15:55:33 (4128): No heartbeat from core client for 30 sec - exiting
15:55:34 (4128): No heartbeat from core client for 30 sec - exiting
15:55:35 (4128): No heartbeat from core client for 30 sec - exiting
15:55:36 (4128): No heartbeat from core client for 30 sec - exiting
15:55:37 (4128): No heartbeat from core client for 30 sec - exiting
15:55:38 (4128): No heartbeat from core client for 30 sec - exiting
15:55:39 (4128): No heartbeat from core client for 30 sec - exiting
15:55:40 (4128): No heartbeat from core client for 30 sec - exiting
15:55:41 (4128): No heartbeat from core client for 30 sec - exiting
15:55:42 (4128): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
18:41:51 (792): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/7k4eko.pji7c10
Error converting file to netcdf: dataout/7k4eko.pii7c10
Error converting file to netcdf: dataout/7k4eko.pfi7c10
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/jobs/xabnk.ihist after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/jobs/xabnk.ihist after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/ocean_restart.day after 11 attempts
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2708, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/jobs/xabnk.ihist after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/ocean_restart.day after 11 attempts
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2708, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/jobs/xabnk.ihist after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/ocean_restart.day after 11 attempts
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2708, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/jobs/xabnk.ihist after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/ocean_restart.day after 11 attempts
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2708, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/jobs/xabnk.ihist after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/ocean_restart.day after 11 attempts
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2708, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/jobs/xabnk.ihist after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/ocean_restart.day after 11 attempts
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2708, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
23 Sep 2013 11:37:32 1287172 15970499 hadcm3n_7k4e_1980_40_008437361_0 699,840 642,069 0.9175
21 Sep 2013 08:50:33 1287172 15970499 hadcm3n_7k4e_1980_40_008437361_0 673,920 624,337 0.9264
21 Sep 2013 03:44:00 1287172 15970499 hadcm3n_7k4e_1980_40_008437361_0 648,000 606,694 0.9363
20 Sep 2013 22:42:29 1287172 15970499 hadcm3n_7k4e_1980_40_008437361_0 622,080 589,250 0.9472
20 Sep 2013 17:45:38 1287172 15970499 hadcm3n_7k4e_1980_40_008437361_0 596,160 571,978 0.9594
20 Sep 2013 10:19:42 1287172 15970499 hadcm3n_7k4e_1980_40_008437361_0 570,240 554,653 0.9727
19 Sep 2013 04:26:17 1287172 15970499 hadcm3n_7k4e_1980_40_008437361_0 544,320 537,101 0.9867
18 Sep 2013 22:59:39 1287172 15970499 hadcm3n_7k4e_1980_40_008437361_0 518,400 518,909 1.0010
18 Sep 2013 17:47:25 1287172 15970499 hadcm3n_7k4e_1980_40_008437361_0 492,480 500,940 1.0172
17 Sep 2013 12:11:59 1287172 15970499 hadcm3n_7k4e_1980_40_008437361_0 466,560 483,411 1.0361
17 Sep 2013 06:37:25 1287172 15970499 hadcm3n_7k4e_1980_40_008437361_0 440,640 465,946 1.0574
15 Sep 2013 21:17:38 1287172 15970499 hadcm3n_7k4e_1980_40_008437361_0 414,720 447,735 1.0796
15 Sep 2013 14:30:51 1287172 15970499 hadcm3n_7k4e_1980_40_008437361_0 388,800 429,346 1.1043
15 Sep 2013 09:34:05 1287172 15970499 hadcm3n_7k4e_1980_40_008437361_0 362,880 411,857 1.1350
14 Sep 2013 02:45:35 1287172 15970499 hadcm3n_7k4e_1980_40_008437361_0 336,960 394,193 1.1699
13 Sep 2013 21:44:01 1287172 15970499 hadcm3n_7k4e_1980_40_008437361_0 311,040 376,621 1.2108
13 Sep 2013 16:37:49 1287172 15970499 hadcm3n_7k4e_1980_40_008437361_0 285,120 358,872 1.2587
12 Sep 2013 16:22:01 1287172 15970499 hadcm3n_7k4e_1980_40_008437361_0 259,200 341,189 1.3163
12 Sep 2013 11:20:12 1287172 15970499 hadcm3n_7k4e_1980_40_008437361_0 233,280 323,689 1.3876
11 Sep 2013 16:58:17 1287172 15970499 hadcm3n_7k4e_1980_40_008437361_0 207,360 305,856 1.4750


©2024 cpdn.org