Name | hadcm3n_p7op_1900_40_007227545_1 |
Workunit | 7425785 |
Created | 26 Apr 2011, 15:40:27 UTC |
Sent | 26 Apr 2011, 16:14:38 UTC |
Report deadline | 26 Jul 2011, 23:41:49 UTC |
Received | 5 Jun 2011, 12:44:53 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 859190 |
Run time | 25 days 17 hours 50 min 59 sec |
CPU time | 22 days 14 hours 51 min 9 sec |
Validate state | Invalid |
Credit | 7,776.00 |
Device peak FLOPS | 1.51 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>6.10.17</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Signal 15 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 1 received, exiting... Signal 15 received, exiting... Signal 15 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 1 received, exiting... Signal 1 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish Signal 15 received, exiting... 
Called boinc_finish 10:18:29 (2026): No heartbeat from core client for 30 sec - exiting 10:18:30 (2026): No heartbeat from core client for 30 sec - exiting 10:18:31 (2026): No heartbeat from core client for 30 sec - exiting 10:18:32 (2026): No heartbeat from core client for 30 sec - exiting 10:18:33 (2026): No heartbeat from core client for 30 sec - exiting 10:18:34 (2026): No heartbeat from core client for 30 sec - exiting 10:18:35 (2026): No heartbeat from core client for 30 sec - exiting 10:18:36 (2026): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Signal 15 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish CPDN Monitor - Quit request from BOINC... Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 1 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Signal 15 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 1 received, exiting... 
Called boinc_finish 10:15:43 (1977): No heartbeat from core client for 30 sec - exiting 10:15:44 (1977): No heartbeat from core client for 30 sec - exiting 10:15:46 (1977): No heartbeat from core client for 30 sec - exiting 10:15:47 (1977): No heartbeat from core client for 30 sec - exiting 10:15:48 (1977): No heartbeat from core client for 30 sec - exiting 10:15:49 (1977): No heartbeat from core client for 30 sec - exiting 10:15:50 (1977): No heartbeat from core client for 30 sec - exiting 10:15:51 (1977): No heartbeat from core client for 30 sec - exiting 10:15:52 (1977): No heartbeat from core client for 30 sec - exiting 10:15:53 (1977): No heartbeat from core client for 30 sec - exiting 10:15:54 (1977): No heartbeat from core client for 30 sec - exiting 10:15:55 (1977): No heartbeat from core client for 30 sec - exiting 10:15:56 (1977): No heartbeat from core client for 30 sec - exiting 10:15:57 (1977): No heartbeat from core client for 30 sec - exiting 10:15:58 (1977): No heartbeat from core client for 30 sec - exiting 10:15:59 (1977): No heartbeat from core client for 30 sec - exiting 10:16:03 (1977): No heartbeat from core client for 30 sec - exiting 10:16:04 (1977): No heartbeat from core client for 30 sec - exiting 10:16:05 (1977): No heartbeat from core client for 30 sec - exiting 10:16:06 (1977): No heartbeat from core client for 30 sec - exiting 10:16:07 (1977): No heartbeat from core client for 30 sec - exiting 10:16:08 (1977): No heartbeat from core client for 30 sec - exiting 10:16:09 (1977): No heartbeat from core client for 30 sec - exiting 10:16:10 (1977): No heartbeat from core client for 30 sec - exiting 10:16:11 (1977): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
10:40:21 (1894): No heartbeat from core client for 30 sec - exiting 10:40:22 (1894): No heartbeat from core client for 30 sec - exiting 10:40:23 (1894): No heartbeat from core client for 30 sec - exiting 10:40:24 (1894): No heartbeat from core client for 30 sec - exiting 10:40:25 (1894): No heartbeat from core client for 30 sec - exiting 10:40:26 (1894): No heartbeat from core client for 30 sec - exiting 10:40:27 (1894): No heartbeat from core client for 30 sec - exiting 10:40:28 (1894): No heartbeat from core client for 30 sec - exiting 10:40:29 (1894): No heartbeat from core client for 30 sec - exiting 10:40:30 (1894): No heartbeat from core client for 30 sec - exiting 10:40:31 (1894): No heartbeat from core client for 30 sec - exiting 10:40:32 (1894): No heartbeat from core client for 30 sec - exiting 10:40:33 (1894): No heartbeat from core client for 30 sec - exiting 10:40:34 (1894): No heartbeat from core client for 30 sec - exiting 10:40:35 (1894): No heartbeat from core client for 30 sec - exiting 10:40:36 (1894): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2163, iMonCtr=1 Model crash detected, will try to restart... Signal 15 received, exiting... Called boinc_finish CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16299, iMonCtr=1 Model crash detected, will try to restart... Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 1 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 1 received, exiting... Signal 1 received, exiting... 
Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 1 received, exiting... Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish cpdnmonitor: cannot open input file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/jobs/xabnk.ihist after 11 attempts cpdnmonitor: cannot open input file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/jobs/xabnk.ihist after 11 attempts cpdnmonitor: cannot open input file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/dataout/ocean_restart.day after 11 attempts forrtl: severe (24): end-of-file during read, unit 5, file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/jobs/climate.cpdc Image PC Routine Line Source hadcm3n_um_6.07_i 0848EB7D Unknown Unknown Unknown hadcm3n_um_6.07_i 0848D975 Unknown Unknown Unknown hadcm3n_um_6.07_i 0845F3CF Unknown Unknown Unknown hadcm3n_um_6.07_i 0841F90D Unknown Unknown Unknown hadcm3n_um_6.07_i 0841F257 Unknown Unknown Unknown hadcm3n_um_6.07_i 084399E4 Unknown Unknown Unknown hadcm3n_um_6.07_i 083403FC Unknown Unknown Unknown hadcm3n_um_6.07_i 0838F198 Unknown Unknown Unknown hadcm3n_um_6.07_i 0839BDF8 Unknown Unknown Unknown libc.so.6 B75E7BD6 Unknown Unknown Unknown hadcm3n_um_6.07_i 0804CB11 Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7215, iMonCtr=1 Model crash detected, will try to restart... 
cpdnmonitor: cannot open input file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/jobs/xabnk.ihist after 11 attempts cpdnmonitor: cannot open input file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/dataout/ocean_restart.day after 11 attempts forrtl: severe (24): end-of-file during read, unit 5, file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/jobs/climate.cpdc Image PC Routine Line Source hadcm3n_um_6.07_i 0848EB7D Unknown Unknown Unknown hadcm3n_um_6.07_i 0848D975 Unknown Unknown Unknown hadcm3n_um_6.07_i 0845F3CF Unknown Unknown Unknown hadcm3n_um_6.07_i 0841F90D Unknown Unknown Unknown hadcm3n_um_6.07_i 0841F257 Unknown Unknown Unknown hadcm3n_um_6.07_i 084399E4 Unknown Unknown Unknown hadcm3n_um_6.07_i 083403FC Unknown Unknown Unknown hadcm3n_um_6.07_i 0838F198 Unknown Unknown Unknown hadcm3n_um_6.07_i 0839BDF8 Unknown Unknown Unknown libc.so.6 B7696BD6 Unknown Unknown Unknown hadcm3n_um_6.07_i 0804CB11 Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7215, iMonCtr=1 Model crash detected, will try to restart... 
cpdnmonitor: cannot open input file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/jobs/xabnk.ihist after 11 attempts cpdnmonitor: cannot open input file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/dataout/ocean_restart.day after 11 attempts forrtl: severe (24): end-of-file during read, unit 5, file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/jobs/climate.cpdc Image PC Routine Line Source hadcm3n_um_6.07_i 0848EB7D Unknown Unknown Unknown hadcm3n_um_6.07_i 0848D975 Unknown Unknown Unknown hadcm3n_um_6.07_i 0845F3CF Unknown Unknown Unknown hadcm3n_um_6.07_i 0841F90D Unknown Unknown Unknown hadcm3n_um_6.07_i 0841F257 Unknown Unknown Unknown hadcm3n_um_6.07_i 084399E4 Unknown Unknown Unknown hadcm3n_um_6.07_i 083403FC Unknown Unknown Unknown hadcm3n_um_6.07_i 0838F198 Unknown Unknown Unknown hadcm3n_um_6.07_i 0839BDF8 Unknown Unknown Unknown libc.so.6 B757ABD6 Unknown Unknown Unknown hadcm3n_um_6.07_i 0804CB11 Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7215, iMonCtr=1 Model crash detected, will try to restart... 
cpdnmonitor: cannot open input file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/jobs/xabnk.ihist after 11 attempts cpdnmonitor: cannot open input file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/dataout/ocean_restart.day after 11 attempts forrtl: severe (24): end-of-file during read, unit 5, file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/jobs/climate.cpdc Image PC Routine Line Source hadcm3n_um_6.07_i 0848EB7D Unknown Unknown Unknown hadcm3n_um_6.07_i 0848D975 Unknown Unknown Unknown hadcm3n_um_6.07_i 0845F3CF Unknown Unknown Unknown hadcm3n_um_6.07_i 0841F90D Unknown Unknown Unknown hadcm3n_um_6.07_i 0841F257 Unknown Unknown Unknown hadcm3n_um_6.07_i 084399E4 Unknown Unknown Unknown hadcm3n_um_6.07_i 083403FC Unknown Unknown Unknown hadcm3n_um_6.07_i 0838F198 Unknown Unknown Unknown hadcm3n_um_6.07_i 0839BDF8 Unknown Unknown Unknown libc.so.6 B7702BD6 Unknown Unknown Unknown hadcm3n_um_6.07_i 0804CB11 Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7215, iMonCtr=1 Model crash detected, will try to restart... 
cpdnmonitor: cannot open input file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/jobs/xabnk.ihist after 11 attempts cpdnmonitor: cannot open input file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/dataout/ocean_restart.day after 11 attempts forrtl: severe (24): end-of-file during read, unit 5, file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/jobs/climate.cpdc Image PC Routine Line Source hadcm3n_um_6.07_i 0848EB7D Unknown Unknown Unknown hadcm3n_um_6.07_i 0848D975 Unknown Unknown Unknown hadcm3n_um_6.07_i 0845F3CF Unknown Unknown Unknown hadcm3n_um_6.07_i 0841F90D Unknown Unknown Unknown hadcm3n_um_6.07_i 0841F257 Unknown Unknown Unknown hadcm3n_um_6.07_i 084399E4 Unknown Unknown Unknown hadcm3n_um_6.07_i 083403FC Unknown Unknown Unknown hadcm3n_um_6.07_i 0838F198 Unknown Unknown Unknown hadcm3n_um_6.07_i 0839BDF8 Unknown Unknown Unknown libc.so.6 B7707BD6 Unknown Unknown Unknown hadcm3n_um_6.07_i 0804CB11 Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7215, iMonCtr=1 Model crash detected, will try to restart... 
cpdnmonitor: cannot open input file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/jobs/xabnk.ihist after 11 attempts cpdnmonitor: cannot open input file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/dataout/ocean_restart.day after 11 attempts forrtl: severe (24): end-of-file during read, unit 5, file /home/hans/BOINC/projects/climateprediction.net/hadcm3n_p7op_1900_40_007227545/jobs/climate.cpdc Image PC Routine Line Source hadcm3n_um_6.07_i 0848EB7D Unknown Unknown Unknown hadcm3n_um_6.07_i 0848D975 Unknown Unknown Unknown hadcm3n_um_6.07_i 0845F3CF Unknown Unknown Unknown hadcm3n_um_6.07_i 0841F90D Unknown Unknown Unknown hadcm3n_um_6.07_i 0841F257 Unknown Unknown Unknown hadcm3n_um_6.07_i 084399E4 Unknown Unknown Unknown hadcm3n_um_6.07_i 083403FC Unknown Unknown Unknown hadcm3n_um_6.07_i 0838F198 Unknown Unknown Unknown hadcm3n_um_6.07_i 0839BDF8 Unknown Unknown Unknown libc.so.6 B76C3BD6 Unknown Unknown Unknown hadcm3n_um_6.07_i 0804CB11 Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7215, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
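The stderr text above is long and repetitive, so a tally of its message types can make the failure sequence easier to read. The following is a minimal sketch, not part of the CPDN or BOINC tooling: the `PATTERNS` table, the `tally` helper, and the short `sample` string are all assumptions made for illustration, with the sample pieced together from phrases that appear in the log. On Linux, signal 15 is SIGTERM and signal 1 is SIGHUP.

```python
import re

# Hypothetical pattern table: the labels are ours, the regexes match phrases seen in the stderr text.
PATTERNS = {
    "SIGTERM (signal 15)": r"Signal 15 received",
    "SIGHUP (signal 1)": r"Signal 1 received",
    "missed heartbeat": r"No heartbeat from core client",
    "model crash": r"Model crash detected",
    "boinc_finish call": r"Called boinc_finish",
}


def tally(text):
    """Count how many times each message type occurs in a stderr dump."""
    return {label: len(re.findall(pattern, text)) for label, pattern in PATTERNS.items()}


# Short illustrative sample built from phrases in the log, not the full stderr text.
sample = (
    "Signal 15 received, exiting... Called boinc_finish "
    "Signal 1 received, exiting... Called boinc_finish "
    "10:18:29 (2026): No heartbeat from core client for 30 sec - exiting "
    "Model crash detected, will try to restart... "
    "Sorry, too many model crashes! :-( Called boinc_finish"
)

for label, count in tally(sample).items():
    print(f"{label}: {count}")
```

To apply it to the real log, the text between `<stderr_txt>` and `</stderr_txt>` would be passed to `tally` in place of `sample`.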
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
02 Jun 2011 21:27:29 | 859190 | 12835021 | hadcm3n_p7op_1900_40_007227545_1 | 648,000 | 1,884,009 | 2.9074 |
01 Jun 2011 19:25:43 | 859190 | 12835021 | hadcm3n_p7op_1900_40_007227545_1 | 622,080 | 1,808,696 | 2.9075 |
30 May 2011 23:00:31 | 859190 | 12835021 | hadcm3n_p7op_1900_40_007227545_1 | 596,160 | 1,732,475 | 2.9061 |
29 May 2011 11:53:51 | 859190 | 12835021 | hadcm3n_p7op_1900_40_007227545_1 | 570,240 | 1,657,269 | 2.9063 |
28 May 2011 10:30:08 | 859190 | 12835021 | hadcm3n_p7op_1900_40_007227545_1 | 544,320 | 1,582,264 | 2.9069 |
27 May 2011 03:32:58 | 859190 | 12835021 | hadcm3n_p7op_1900_40_007227545_1 | 518,400 | 1,506,041 | 2.9052 |
25 May 2011 18:38:45 | 859190 | 12835021 | hadcm3n_p7op_1900_40_007227545_1 | 492,480 | 1,430,003 | 2.9037 |
24 May 2011 00:05:18 | 859190 | 12835021 | hadcm3n_p7op_1900_40_007227545_1 | 466,560 | 1,355,391 | 2.9051 |
22 May 2011 14:32:27 | 859190 | 12835021 | hadcm3n_p7op_1900_40_007227545_1 | 440,640 | 1,280,015 | 2.9049 |
21 May 2011 08:10:38 | 859190 | 12835021 | hadcm3n_p7op_1900_40_007227545_1 | 414,720 | 1,204,496 | 2.9044 |
19 May 2011 18:55:12 | 859190 | 12835021 | hadcm3n_p7op_1900_40_007227545_1 | 388,800 | 1,129,237 | 2.9044 |
18 May 2011 00:18:07 | 859190 | 12835021 | hadcm3n_p7op_1900_40_007227545_1 | 362,880 | 1,053,688 | 2.9037 |
16 May 2011 05:25:17 | 859190 | 12835021 | hadcm3n_p7op_1900_40_007227545_1 | 336,960 | 977,796 | 2.9018 |
15 May 2011 06:15:17 | 859190 | 12835021 | hadcm3n_p7op_1900_40_007227545_1 | 311,040 | 902,448 | 2.9014 |
14 May 2011 05:37:17 | 859190 | 12835021 | hadcm3n_p7op_1900_40_007227545_1 | 285,120 | 827,319 | 2.9017 |
12 May 2011 23:01:01 | 859190 | 12835021 | hadcm3n_p7op_1900_40_007227545_1 | 259,200 | 752,413 | 2.9028 |
11 May 2011 01:34:43 | 859190 | 12835021 | hadcm3n_p7op_1900_40_007227545_1 | 233,280 | 676,526 | 2.9001 |
09 May 2011 05:54:16 | 859190 | 12835021 | hadcm3n_p7op_1900_40_007227545_1 | 207,360 | 602,873 | 2.9074 |
08 May 2011 04:55:01 | 859190 | 12835021 | hadcm3n_p7op_1900_40_007227545_1 | 181,440 | 530,479 | 2.9237 |
05 May 2011 18:06:01 | 859190 | 12835021 | hadcm3n_p7op_1900_40_007227545_1 | 155,520 | 451,954 | 2.9061 |
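The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep count. That relationship is an assumption inferred from the figures rather than something the page states, so the sketch below only checks it against three rows copied from the table. The `sec_per_timestep` helper is a hypothetical name.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average)
rows = [
    (648_000, 1_884_009, 2.9074),
    (622_080, 1_808_696, 2.9075),
    (155_520, 451_954, 2.9061),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Assumed definition of the Average column: CPU seconds per model timestep."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in rows:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    print(f"{timestep:>9,d} timesteps: computed {computed:.4f}, reported {reported}")
```

Consecutive trickles in the table are 25,920 timesteps apart (for example 648,000 minus 622,080), so the same division could be applied to the differences between rows to estimate the rate for each interval instead of the running average.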