Name | hadcm3n_o5aj_2020_40_008375992_1 |
Workunit | 8526851 |
Created | 9 Jun 2013, 17:51:56 UTC |
Sent | 9 Jun 2013, 18:11:53 UTC |
Report deadline | 9 Sep 2013, 1:39:04 UTC |
Received | 10 Jun 2013, 20:46:47 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1282401 |
Run time | 1 days 0 hours 9 min 25 sec |
CPU time | 23 hours 26 min 11 sec |
Validate state | Invalid |
Credit | 311.04 |
Device peak FLOPS | 2.00 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 20:53:26 (38198): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:05:51 (39150): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:05:52 (39150): No heartbeat from core client for 30 sec - exiting 21:05:53 (39150): No heartbeat from core client for 30 sec - exiting 21:05:54 (39150): No heartbeat from core client for 30 sec - exiting 21:05:55 (39150): No heartbeat from core client for 30 sec - exiting 21:05:56 (39150): No heartbeat from core client for 30 sec - exiting 21:05:57 (39150): No heartbeat from core client for 30 sec - exiting 21:05:58 (39150): No heartbeat from core client for 30 sec - exiting 21:05:59 (39150): No heartbeat from core client for 30 sec - exiting 21:06:00 (39150): No heartbeat from core client for 30 sec - exiting 21:06:01 (39150): No heartbeat from core client for 30 sec - exiting 21:06:02 (39150): No heartbeat from core client for 30 sec - exiting 21:06:03 (39150): No heartbeat from core client for 30 sec - exiting 21:10:10 (39371): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
21:10:11 (39371): No heartbeat from core client for 30 sec - exiting 21:10:12 (39371): No heartbeat from core client for 30 sec - exiting 21:10:13 (39371): No heartbeat from core client for 30 sec - exiting 21:10:14 (39371): No heartbeat from core client for 30 sec - exiting 21:10:15 (39371): No heartbeat from core client for 30 sec - exiting 21:10:16 (39371): No heartbeat from core client for 30 sec - exiting 21:10:17 (39371): No heartbeat from core client for 30 sec - exiting 21:10:18 (39371): No heartbeat from core client for 30 sec - exiting 21:10:19 (39371): No heartbeat from core client for 30 sec - exiting 21:10:20 (39371): No heartbeat from core client for 30 sec - exiting 21:10:21 (39371): No heartbeat from core client for 30 sec - exiting 21:10:22 (39371): No heartbeat from core client for 30 sec - exiting 21:10:23 (39371): No heartbeat from core client for 30 sec - exiting 21:10:24 (39371): No heartbeat from core client for 30 sec - exiting 21:10:25 (39371): No heartbeat from core client for 30 sec - exiting 21:10:26 (39371): No heartbeat from core client for 30 sec - exiting 21:10:27 (39371): No heartbeat from core client for 30 sec - exiting 21:14:31 (39520): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:14:32 (39520): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 22:28:19 (39682): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
22:28:20 (39682): No heartbeat from core client for 30 sec - exiting 22:28:21 (39682): No heartbeat from core client for 30 sec - exiting 22:28:22 (39682): No heartbeat from core client for 30 sec - exiting 22:28:23 (39682): No heartbeat from core client for 30 sec - exiting 22:28:24 (39682): No heartbeat from core client for 30 sec - exiting 22:28:25 (39682): No heartbeat from core client for 30 sec - exiting 22:28:26 (39682): No heartbeat from core client for 30 sec - exiting 22:28:27 (39682): No heartbeat from core client for 30 sec - exiting 22:28:28 (39682): No heartbeat from core client for 30 sec - exiting 22:32:17 (40409): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:32:18 (40409): No heartbeat from core client for 30 sec - exiting 22:32:19 (40409): No heartbeat from core client for 30 sec - exiting 22:32:20 (40409): No heartbeat from core client for 30 sec - exiting 22:32:21 (40409): No heartbeat from core client for 30 sec - exiting 22:32:22 (40409): No heartbeat from core client for 30 sec - exiting 22:32:23 (40409): No heartbeat from core client for 30 sec - exiting 22:32:24 (40409): No heartbeat from core client for 30 sec - exiting 22:32:25 (40409): No heartbeat from core client for 30 sec - exiting 22:32:26 (40409): No heartbeat from core client for 30 sec - exiting 22:32:27 (40409): No heartbeat from core client for 30 sec - exiting 22:32:28 (40409): No heartbeat from core client for 30 sec - exiting 22:32:29 (40409): No heartbeat from core client for 30 sec - exiting 22:32:30 (40409): No heartbeat from core client for 30 sec - exiting 22:32:31 (40409): No heartbeat from core client for 30 sec - exiting 22:32:32 (40409): No heartbeat from core client for 30 sec - exiting 23:07:16 (40575): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:11:00 (40953): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
23:11:01 (40953): No heartbeat from core client for 30 sec - exiting 23:11:02 (40953): No heartbeat from core client for 30 sec - exiting 23:11:03 (40953): No heartbeat from core client for 30 sec - exiting 23:11:04 (40953): No heartbeat from core client for 30 sec - exiting 23:11:05 (40953): No heartbeat from core client for 30 sec - exiting 23:11:06 (40953): No heartbeat from core client for 30 sec - exiting 23:11:07 (40953): No heartbeat from core client for 30 sec - exiting 23:11:08 (40953): No heartbeat from core client for 30 sec - exiting 23:11:09 (40953): No heartbeat from core client for 30 sec - exiting 23:11:10 (40953): No heartbeat from core client for 30 sec - exiting 23:11:11 (40953): No heartbeat from core client for 30 sec - exiting 23:11:12 (40953): No heartbeat from core client for 30 sec - exiting 23:11:13 (40953): No heartbeat from core client for 30 sec - exiting 23:11:14 (40953): No heartbeat from core client for 30 sec - exiting 23:11:15 (40953): No heartbeat from core client for 30 sec - exiting 23:11:16 (40953): No heartbeat from core client for 30 sec - exiting 23:11:17 (40953): No heartbeat from core client for 30 sec - exiting 23:11:18 (40953): No heartbeat from core client for 30 sec - exiting 23:11:19 (40953): No heartbeat from core client for 30 sec - exiting 23:11:20 (40953): No heartbeat from core client for 30 sec - exiting 23:11:21 (40953): No heartbeat from core client for 30 sec - exiting 23:11:22 (40953): No heartbeat from core client for 30 sec - exiting 23:11:23 (40953): No heartbeat from core client for 30 sec - exiting 23:11:24 (40953): No heartbeat from core client for 30 sec - exiting 23:11:25 (40953): No heartbeat from core client for 30 sec - exiting 23:11:26 (40953): No heartbeat from core client for 30 sec - exiting 23:11:27 (40953): No heartbeat from core client for 30 sec - exiting 23:11:28 (40953): No heartbeat from core client for 30 sec - exiting 00:50:21 (41099): No heartbeat from core client for 30 sec - 
exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:50:22 (41099): No heartbeat from core client for 30 sec - exiting 00:50:23 (41099): No heartbeat from core client for 30 sec - exiting 01:10:03 (41977): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:14:20 (42256): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:14:21 (42256): No heartbeat from core client for 30 sec - exiting 01:14:22 (42256): No heartbeat from core client for 30 sec - exiting 01:14:23 (42256): No heartbeat from core client for 30 sec - exiting 01:14:24 (42256): No heartbeat from core client for 30 sec - exiting 01:14:25 (42256): No heartbeat from core client for 30 sec - exiting 01:14:26 (42256): No heartbeat from core client for 30 sec - exiting 01:14:27 (42256): No heartbeat from core client for 30 sec - exiting 01:14:28 (42256): No heartbeat from core client for 30 sec - exiting 01:14:29 (42256): No heartbeat from core client for 30 sec - exiting 01:14:30 (42256): No heartbeat from core client for 30 sec - exiting 01:14:31 (42256): No heartbeat from core client for 30 sec - exiting 01:14:32 (42256): No heartbeat from core client for 30 sec - exiting 01:14:33 (42256): No heartbeat from core client for 30 sec - exiting 01:18:19 (42394): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
01:18:20 (42394): No heartbeat from core client for 30 sec - exiting 01:18:21 (42394): No heartbeat from core client for 30 sec - exiting 01:18:22 (42394): No heartbeat from core client for 30 sec - exiting 01:18:23 (42394): No heartbeat from core client for 30 sec - exiting 01:18:24 (42394): No heartbeat from core client for 30 sec - exiting 01:18:25 (42394): No heartbeat from core client for 30 sec - exiting 01:18:26 (42394): No heartbeat from core client for 30 sec - exiting 01:18:27 (42394): No heartbeat from core client for 30 sec - exiting 01:18:28 (42394): No heartbeat from core client for 30 sec - exiting 01:18:29 (42394): No heartbeat from core client for 30 sec - exiting 01:18:30 (42394): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 01:22:27 (42565): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:22:28 (42565): No heartbeat from core client for 30 sec - exiting 01:22:29 (42565): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 05:34:53 (42708): No heartbeat from core client for 30 sec - exiting 05:34:54 (42708): No heartbeat from core client for 30 sec - exiting 05:34:55 (42708): No heartbeat from core client for 30 sec - exiting 05:34:56 (42708): No heartbeat from core client for 30 sec - exiting 05:34:57 (42708): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:39:17 (44928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
05:39:18 (44928): No heartbeat from core client for 30 sec - exiting 05:39:19 (44928): No heartbeat from core client for 30 sec - exiting 05:39:20 (44928): No heartbeat from core client for 30 sec - exiting 05:39:21 (44928): No heartbeat from core client for 30 sec - exiting 05:39:22 (44928): No heartbeat from core client for 30 sec - exiting 05:39:23 (44928): No heartbeat from core client for 30 sec - exiting 05:39:24 (44928): No heartbeat from core client for 30 sec - exiting 05:39:25 (44928): No heartbeat from core client for 30 sec - exiting 05:39:26 (44928): No heartbeat from core client for 30 sec - exiting 05:39:27 (44928): No heartbeat from core client for 30 sec - exiting 05:39:28 (44928): No heartbeat from core client for 30 sec - exiting 05:39:29 (44928): No heartbeat from core client for 30 sec - exiting 05:39:30 (44928): No heartbeat from core client for 30 sec - exiting 05:39:31 (44928): No heartbeat from core client for 30 sec - exiting 05:39:32 (44928): No heartbeat from core client for 30 sec - exiting 05:39:33 (44928): No heartbeat from core client for 30 sec - exiting 05:39:34 (44928): No heartbeat from core client for 30 sec - exiting 05:39:35 (44928): No heartbeat from core client for 30 sec - exiting 05:39:36 (44928): No heartbeat from core client for 30 sec - exiting 05:39:37 (44928): No heartbeat from core client for 30 sec - exiting 05:39:38 (44928): No heartbeat from core client for 30 sec - exiting 05:39:39 (44928): No heartbeat from core client for 30 sec - exiting 05:39:40 (44928): No heartbeat from core client for 30 sec - exiting 05:44:09 (45079): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:49:14 (45255): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:53:27 (49129): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
12:57:09 (49275): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:57:10 (49275): No heartbeat from core client for 30 sec - exiting 21:38:24 (49425): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:38:25 (49425): No heartbeat from core client for 30 sec - exiting 21:38:26 (49425): No heartbeat from core client for 30 sec - exiting 21:38:27 (49425): No heartbeat from core client for 30 sec - exiting SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77bf400] [0xf77bf425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75dc1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75df825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75c74d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=53884, iMonCtr=1 Model crash detected, will try to restart... 
SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77b3400] [0xf77b3425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75d01df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75d3825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75bb4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=53884, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf76f6400] [0xf76f6425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75131df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7516825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf74fe4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=53884, iMonCtr=1 Model crash detected, will try to restart... 
SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf76e5400] [0xf76e5425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75021df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7505825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf74ed4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=53884, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7728400] [0xf7728425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75451df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7548825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75304d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=53884, iMonCtr=1 Model crash detected, will try to restart... 
SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77a5400] [0xf77a5425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75c21df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75c5825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75ad4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=53884, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
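The stderr message reports the exit status three ways: `22 (0x16, -234)`. A minimal sketch of how the three numbers relate is given here, assuming the third value is the same low byte with all higher bits set (that is, 22 - 256). The function name `exit_code_views` is hypothetical and is not part of BOINC.

```python
def exit_code_views(code: int) -> tuple[int, str, int]:
    """Return the decimal, hex, and sign-extended views of an exit code.

    Assumption: the negative value in the log is the low byte of the
    exit code with every higher bit set, i.e. (code & 0xFF) - 256.
    """
    low_byte = code & 0xFF
    return low_byte, hex(low_byte), low_byte | ~0xFF


if __name__ == "__main__":
    # Exit status taken from the "Exit status" row above.
    print(exit_code_views(22))
```

Under that assumption the three figures in the log are mutually consistent and all describe the same exit status, 22.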
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
10 Jun 2013 14:15:00 | 1282401 | 15836688 | hadcm3n_o5aj_2020_40_008375992_1 | 25,920 | 63,369 | 2.4448 |
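The "Average (sec/TS)" column appears to be the CPU time divided by the timestep count. A small sketch to check that against the row above follows; the helper names are hypothetical, and rounding to four decimal places is an assumption based on how the value is displayed.

```python
def parse_count(text: str) -> int:
    """Parse an integer written with thousands separators, e.g. '25,920'."""
    return int(text.replace(",", ""))


def average_sec_per_timestep(cpu_time_sec: int, timesteps: int) -> float:
    """CPU seconds per model timestep, rounded to four decimal places."""
    return round(cpu_time_sec / timesteps, 4)


if __name__ == "__main__":
    # Values copied from the trickle row above.
    timesteps = parse_count("25,920")
    cpu_time = parse_count("63,369")
    print(average_sec_per_timestep(cpu_time, timesteps))
```

With the values from the trickle row, the quotient rounds to the 2.4448 sec/TS shown in the table.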
©2024 climateprediction.net