Name | hadcm3n_7k4e_1980_40_008437361_0 |
Workunit | 8588217 |
Created | 30 Aug 2013, 7:22:13 UTC |
Sent | 30 Aug 2013, 7:35:41 UTC |
Report deadline | 29 Nov 2013, 15:02:52 UTC |
Received | 23 Sep 2013, 11:36:38 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1287172 |
Run time | 7 days 19 hours 0 min 5 sec |
CPU time | 7 days 11 hours 14 min 34 sec |
Validate state | Invalid |
Credit | 8,398.08 |
Device peak FLOPS | 2.41 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The device does not recognise the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 18:21:20 (4312): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 21:08:03 (3316): No heartbeat from core client for 30 sec - exiting 21:08:04 (3316): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:08:05 (3316): No heartbeat from core client for 30 sec - exiting 21:08:47 (1968): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy 2048 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
23:09:17 (320): No heartbeat from core client for 30 sec - exiting 23:09:18 (320): No heartbeat from core client for 30 sec - exiting 23:09:19 (320): No heartbeat from core client for 30 sec - exiting 23:09:20 (320): No heartbeat from core client for 30 sec - exiting 23:09:21 (320): No heartbeat from core client for 30 sec - exiting 23:09:23 (320): No heartbeat from core client for 30 sec - exiting 23:09:24 (320): No heartbeat from core client for 30 sec - exiting 23:09:25 (320): No heartbeat from core client for 30 sec - exiting 23:09:26 (320): No heartbeat from core client for 30 sec - exiting 23:09:27 (320): No heartbeat from core client for 30 sec - exiting 23:09:28 (320): No heartbeat from core client for 30 sec - exiting 23:09:29 (320): No heartbeat from core client for 30 sec - exiting 23:09:30 (320): No heartbeat from core client for 30 sec - exiting 23:09:31 (320): No heartbeat from core client for 30 sec - exiting 23:09:32 (320): No heartbeat from core client for 30 sec - exiting 23:09:33 (320): No heartbeat from core client for 30 sec - exiting 23:09:35 (320): No heartbeat from core client for 30 sec - exiting 23:09:36 (320): No heartbeat from core client for 30 sec - exiting 23:09:37 (320): No heartbeat from core client for 30 sec - exiting 23:09:38 (320): No heartbeat from core client for 30 sec - exiting 23:09:39 (320): No heartbeat from core client for 30 sec - exiting 23:09:40 (320): No heartbeat from core client for 30 sec - exiting 23:09:41 (320): No heartbeat from core client for 30 sec - exiting 23:09:42 (320): No heartbeat from core client for 30 sec - exiting 23:09:43 (320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:09:50 (2984): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 15:54:44 (1880): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 07:34:06 (3520): No heartbeat from core client for 30 sec - exiting 07:34:08 (3520): No heartbeat from core client for 30 sec - exiting 07:34:10 (3520): No heartbeat from core client for 30 sec - exiting 07:34:11 (3520): No heartbeat from core client for 30 sec - exiting 07:34:12 (3520): No heartbeat from core client for 30 sec - exiting 07:34:13 (3520): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:34:14 (3520): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 15:52:57 (4328): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 16:16:59 (3080): No heartbeat from core client for 30 sec - exiting 16:17:00 (3080): No heartbeat from core client for 30 sec - exiting 16:17:01 (3080): No heartbeat from core client for 30 sec - exiting 16:17:02 (3080): No heartbeat from core client for 30 sec - exiting 16:17:03 (3080): No heartbeat from core client for 30 sec - exiting 16:17:04 (3080): No heartbeat from core client for 30 sec - exiting 16:17:05 (3080): No heartbeat from core client for 30 sec - exiting 16:17:06 (3080): No heartbeat from core client for 30 sec - exiting 16:17:08 (3080): No heartbeat from core client for 30 sec - exiting 16:17:09 (3080): No heartbeat from core client for 30 sec - exiting 16:17:10 (3080): No heartbeat from core client for 30 sec - exiting 16:17:11 (3080): No heartbeat from core client for 30 sec - exiting 16:17:12 (3080): No heartbeat from core client for 30 sec - exiting 16:17:13 (3080): No heartbeat from core client for 30 sec - exiting 16:17:14 (3080): No heartbeat from core client for 
30 sec - exiting 16:17:15 (3080): No heartbeat from core client for 30 sec - exiting 16:17:16 (3080): No heartbeat from core client for 30 sec - exiting 16:17:17 (3080): No heartbeat from core client for 30 sec - exiting 16:17:18 (3080): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 15:55:12 (4128): No heartbeat from core client for 30 sec - exiting 15:55:13 (4128): No heartbeat from core client for 30 sec - exiting 15:55:14 (4128): No heartbeat from core client for 30 sec - exiting 15:55:15 (4128): No heartbeat from core client for 30 sec - exiting 15:55:16 (4128): No heartbeat from core client for 30 sec - exiting 15:55:17 (4128): No heartbeat from core client for 30 sec - exiting 15:55:18 (4128): No heartbeat from core client for 30 sec - exiting 15:55:19 (4128): No heartbeat from core client for 30 sec - exiting 15:55:21 (4128): No heartbeat from core client for 30 sec - exiting 15:55:22 (4128): No heartbeat from core client for 30 sec - exiting 15:55:23 (4128): No heartbeat from core client for 30 sec - exiting 15:55:24 (4128): No heartbeat from core client for 30 sec - exiting 15:55:25 (4128): No heartbeat from core client for 30 sec - exiting 15:55:26 (4128): No heartbeat from core client for 30 sec - exiting 15:55:27 (4128): No heartbeat from core client for 30 sec - exiting 15:55:28 (4128): No heartbeat from core client for 30 sec - exiting 15:55:29 (4128): No heartbeat from core client for 30 sec - exiting 15:55:30 (4128): No heartbeat from core client for 30 sec - exiting 15:55:32 (4128): No heartbeat from core client for 30 sec - exiting 15:55:33 (4128): No heartbeat from core client for 30 sec - exiting 15:55:34 (4128): No heartbeat from core client for 30 sec - exiting 15:55:35 (4128): No heartbeat from core client for 30 sec - exiting 
15:55:36 (4128): No heartbeat from core client for 30 sec - exiting 15:55:37 (4128): No heartbeat from core client for 30 sec - exiting 15:55:38 (4128): No heartbeat from core client for 30 sec - exiting 15:55:39 (4128): No heartbeat from core client for 30 sec - exiting 15:55:40 (4128): No heartbeat from core client for 30 sec - exiting 15:55:41 (4128): No heartbeat from core client for 30 sec - exiting 15:55:42 (4128): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:41:51 (792): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/7k4eko.pji7c10 Error converting file to netcdf: dataout/7k4eko.pii7c10 Error converting file to netcdf: dataout/7k4eko.pfi7c10 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/jobs/xabnk.ihist after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/jobs/xabnk.ihist after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2708, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/jobs/xabnk.ihist after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2708, iMonCtr=1 Model crash detected, will try to restart... 
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/jobs/xabnk.ihist after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2708, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/jobs/xabnk.ihist after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2708, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/jobs/xabnk.ihist after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2708, iMonCtr=1 Model crash detected, will try to restart... 
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/jobs/xabnk.ihist after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7k4e_1980_40_008437361/dataout/ocean_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2708, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
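
The stderr text above is long and repetitive, so a tally of message types can make it easier to read. The following is a minimal sketch, not a project tool: the helper `summarize_stderr` and the category names are hypothetical, and the match strings are fragments copied from the log above.

```python
import re
from collections import Counter

# Message fragments copied from the stderr text; the category names are
# illustrative labels, not terms used by BOINC or CPDN.
PATTERNS = {
    "no_heartbeat": r"No heartbeat from core client",
    "quit_request": r"CPDN Monitor - Quit request from BOINC",
    "model_crash": r"Model crash(?:ed: | detected)",
    "cannot_open_file": r"cpdnmonitor: cannot open input file",
}


def summarize_stderr(text: str) -> Counter:
    """Count how often each known message fragment occurs in the log text."""
    counts = Counter()
    for name, pattern in PATTERNS.items():
        counts[name] = len(re.findall(pattern, text))
    return counts


# A short excerpt assembled from messages that appear in the log above.
sample = (
    "18:21:20 (4312): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
    "CPDN Monitor - Quit request from BOINC... "
    "Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. "
    "Model crash detected, will try to restart..."
)

for name, count in summarize_stderr(sample).items():
    print(f"{name}: {count}")
```

Running the same function over the full stderr text would give the totals for this task; the counts say nothing about cause, only about how often each message was logged.
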
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
23 Sep 2013 11:37:32 | 1287172 | 15970499 | hadcm3n_7k4e_1980_40_008437361_0 | 699,840 | 642,069 | 0.9175 |
21 Sep 2013 08:50:33 | 1287172 | 15970499 | hadcm3n_7k4e_1980_40_008437361_0 | 673,920 | 624,337 | 0.9264 |
21 Sep 2013 03:44:00 | 1287172 | 15970499 | hadcm3n_7k4e_1980_40_008437361_0 | 648,000 | 606,694 | 0.9363 |
20 Sep 2013 22:42:29 | 1287172 | 15970499 | hadcm3n_7k4e_1980_40_008437361_0 | 622,080 | 589,250 | 0.9472 |
20 Sep 2013 17:45:38 | 1287172 | 15970499 | hadcm3n_7k4e_1980_40_008437361_0 | 596,160 | 571,978 | 0.9594 |
20 Sep 2013 10:19:42 | 1287172 | 15970499 | hadcm3n_7k4e_1980_40_008437361_0 | 570,240 | 554,653 | 0.9727 |
19 Sep 2013 04:26:17 | 1287172 | 15970499 | hadcm3n_7k4e_1980_40_008437361_0 | 544,320 | 537,101 | 0.9867 |
18 Sep 2013 22:59:39 | 1287172 | 15970499 | hadcm3n_7k4e_1980_40_008437361_0 | 518,400 | 518,909 | 1.0010 |
18 Sep 2013 17:47:25 | 1287172 | 15970499 | hadcm3n_7k4e_1980_40_008437361_0 | 492,480 | 500,940 | 1.0172 |
17 Sep 2013 12:11:59 | 1287172 | 15970499 | hadcm3n_7k4e_1980_40_008437361_0 | 466,560 | 483,411 | 1.0361 |
17 Sep 2013 06:37:25 | 1287172 | 15970499 | hadcm3n_7k4e_1980_40_008437361_0 | 440,640 | 465,946 | 1.0574 |
15 Sep 2013 21:17:38 | 1287172 | 15970499 | hadcm3n_7k4e_1980_40_008437361_0 | 414,720 | 447,735 | 1.0796 |
15 Sep 2013 14:30:51 | 1287172 | 15970499 | hadcm3n_7k4e_1980_40_008437361_0 | 388,800 | 429,346 | 1.1043 |
15 Sep 2013 09:34:05 | 1287172 | 15970499 | hadcm3n_7k4e_1980_40_008437361_0 | 362,880 | 411,857 | 1.1350 |
14 Sep 2013 02:45:35 | 1287172 | 15970499 | hadcm3n_7k4e_1980_40_008437361_0 | 336,960 | 394,193 | 1.1699 |
13 Sep 2013 21:44:01 | 1287172 | 15970499 | hadcm3n_7k4e_1980_40_008437361_0 | 311,040 | 376,621 | 1.2108 |
13 Sep 2013 16:37:49 | 1287172 | 15970499 | hadcm3n_7k4e_1980_40_008437361_0 | 285,120 | 358,872 | 1.2587 |
12 Sep 2013 16:22:01 | 1287172 | 15970499 | hadcm3n_7k4e_1980_40_008437361_0 | 259,200 | 341,189 | 1.3163 |
12 Sep 2013 11:20:12 | 1287172 | 15970499 | hadcm3n_7k4e_1980_40_008437361_0 | 233,280 | 323,689 | 1.3876 |
11 Sep 2013 16:58:17 | 1287172 | 15970499 | hadcm3n_7k4e_1980_40_008437361_0 | 207,360 | 305,856 | 1.4750 |
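
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. This is inferred from the figures in the table rather than stated by the project, and the function name below is hypothetical. A minimal sketch that reproduces three of the rows:

```python
def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Return cumulative CPU seconds per model timestep, rounded to 4 places.

    Assumption: this mirrors how the Average (sec/TS) column is derived.
    """
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds) pairs taken from the trickle table above.
rows = [
    (699_840, 642_069),
    (518_400, 518_909),
    (207_360, 305_856),
]

for timestep, cpu_time in rows:
    average = seconds_per_timestep(cpu_time, timestep)
    print(f"timestep {timestep}: {average:.4f} sec/TS")
```

Because the figure is cumulative, it keeps falling across the listed trickles (from 1.4750 to 0.9175), which suggests the later timesteps ran faster than the early ones on this host.
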