Name | hadcm3n_o2mc_1980_40_008387945_3 |
Workunit | 8538804 |
Created | 7 Jun 2013, 4:35:35 UTC |
Sent | 7 Jun 2013, 4:50:49 UTC |
Report deadline | 6 Sep 2013, 12:18:00 UTC |
Received | 8 Jun 2013, 16:37:51 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1282401 |
Run time | 1 days 4 hours 43 min 39 sec |
CPU time | 1 days 4 hours 13 min |
Validate state | Invalid |
Credit | 311.04 |
Device peak FLOPS | 1.99 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 05:59:37 (44600): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:56:02 (44748): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:56:03 (44748): No heartbeat from core client for 30 sec - exiting 11:56:04 (44748): No heartbeat from core client for 30 sec - exiting 11:59:42 (48202): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 15:46:57 (2771): No heartbeat from core client for 30 sec - exiting 15:47:27 (2771): No heartbeat from core client for 30 sec - exiting 15:47:28 (2771): No heartbeat from core client for 30 sec - exiting 15:47:29 (2771): No heartbeat from core client for 30 sec - exiting 15:47:30 (2771): No heartbeat from core client for 30 sec - exiting 15:47:31 (2771): No heartbeat from core client for 30 sec - exiting 15:47:33 (2771): No heartbeat from core client for 30 sec - exiting 15:47:34 (2771): No heartbeat from core client for 30 sec - exiting 15:47:35 (2771): No heartbeat from core client for 30 sec - exiting 15:47:36 (2771): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:43:43 (3060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:56:29 (3601): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:01:16 (5448): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:06:33 (5662): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:11:20 (5869): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
20:16:18 (6086): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:20:13 (6250): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:20:14 (6250): No heartbeat from core client for 30 sec - exiting 20:20:15 (6250): No heartbeat from core client for 30 sec - exiting 20:20:16 (6250): No heartbeat from core client for 30 sec - exiting 20:20:17 (6250): No heartbeat from core client for 30 sec - exiting 20:20:18 (6250): No heartbeat from core client for 30 sec - exiting 20:20:19 (6250): No heartbeat from core client for 30 sec - exiting 20:20:20 (6250): No heartbeat from core client for 30 sec - exiting 20:20:21 (6250): No heartbeat from core client for 30 sec - exiting 20:20:22 (6250): No heartbeat from core client for 30 sec - exiting 20:20:23 (6250): No heartbeat from core client for 30 sec - exiting 20:20:24 (6250): No heartbeat from core client for 30 sec - exiting 20:20:25 (6250): No heartbeat from core client for 30 sec - exiting 20:20:26 (6250): No heartbeat from core client for 30 sec - exiting 20:24:17 (6378): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:24:18 (6378): No heartbeat from core client for 30 sec - exiting 20:24:19 (6378): No heartbeat from core client for 30 sec - exiting 20:24:20 (6378): No heartbeat from core client for 30 sec - exiting 20:24:21 (6378): No heartbeat from core client for 30 sec - exiting 20:24:22 (6378): No heartbeat from core client for 30 sec - exiting 20:24:23 (6378): No heartbeat from core client for 30 sec - exiting 21:13:37 (6560): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:18:38 (7121): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:01:14 (7329): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
22:05:12 (7853): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:13:14 (8021): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:16:59 (13889): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:17:00 (13889): No heartbeat from core client for 30 sec - exiting 08:17:01 (13889): No heartbeat from core client for 30 sec - exiting 08:43:36 (14078): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:48:12 (14444): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:51:59 (14634): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:52:00 (14634): No heartbeat from core client for 30 sec - exiting 08:52:01 (14634): No heartbeat from core client for 30 sec - exiting 08:52:02 (14634): No heartbeat from core client for 30 sec - exiting 08:52:03 (14634): No heartbeat from core client for 30 sec - exiting 08:52:04 (14634): No heartbeat from core client for 30 sec - exiting 08:52:05 (14634): No heartbeat from core client for 30 sec - exiting 08:52:06 (14634): No heartbeat from core client for 30 sec - exiting 08:52:07 (14634): No heartbeat from core client for 30 sec - exiting 08:52:08 (14634): No heartbeat from core client for 30 sec - exiting 08:52:09 (14634): No heartbeat from core client for 30 sec - exiting 08:52:10 (14634): No heartbeat from core client for 30 sec - exiting 08:52:11 (14634): No heartbeat from core client for 30 sec - exiting 08:52:12 (14634): No heartbeat from core client for 30 sec - exiting 08:52:13 (14634): No heartbeat from core client for 30 sec - exiting 08:52:14 (14634): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 09:43:32 (14817): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 
'heartbeat' from BOINC... 09:48:15 (15398): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:48:16 (15398): No heartbeat from core client for 30 sec - exiting 09:48:17 (15398): No heartbeat from core client for 30 sec - exiting 09:48:18 (15398): No heartbeat from core client for 30 sec - exiting 09:48:19 (15398): No heartbeat from core client for 30 sec - exiting 09:48:20 (15398): No heartbeat from core client for 30 sec - exiting 09:48:21 (15398): No heartbeat from core client for 30 sec - exiting 09:48:22 (15398): No heartbeat from core client for 30 sec - exiting 09:48:23 (15398): No heartbeat from core client for 30 sec - exiting 09:48:24 (15398): No heartbeat from core client for 30 sec - exiting 09:48:25 (15398): No heartbeat from core client for 30 sec - exiting 09:48:26 (15398): No heartbeat from core client for 30 sec - exiting 09:48:27 (15398): No heartbeat from core client for 30 sec - exiting 09:48:28 (15398): No heartbeat from core client for 30 sec - exiting 09:48:29 (15398): No heartbeat from core client for 30 sec - exiting 09:48:30 (15398): No heartbeat from core client for 30 sec - exiting 09:48:31 (15398): No heartbeat from core client for 30 sec - exiting 09:48:32 (15398): No heartbeat from core client for 30 sec - exiting 09:48:33 (15398): No heartbeat from core client for 30 sec - exiting 09:48:34 (15398): No heartbeat from core client for 30 sec - exiting 09:48:35 (15398): No heartbeat from core client for 30 sec - exiting 09:48:36 (15398): No heartbeat from core client for 30 sec - exiting 09:48:37 (15398): No heartbeat from core client for 30 sec - exiting 09:48:38 (15398): No heartbeat from core client for 30 sec - exiting 09:48:39 (15398): No heartbeat from core client for 30 sec - exiting 09:48:40 (15398): No heartbeat from core client for 30 sec - exiting 09:48:41 (15398): No heartbeat from core client for 30 sec - exiting 09:48:42 (15398): No heartbeat from core client for 30 sec - 
exiting 09:48:43 (15398): No heartbeat from core client for 30 sec - exiting 09:48:44 (15398): No heartbeat from core client for 30 sec - exiting 09:48:45 (15398): No heartbeat from core client for 30 sec - exiting 09:48:46 (15398): No heartbeat from core client for 30 sec - exiting 09:48:47 (15398): No heartbeat from core client for 30 sec - exiting 09:48:48 (15398): No heartbeat from core client for 30 sec - exiting 09:48:49 (15398): No heartbeat from core client for 30 sec - exiting 09:48:50 (15398): No heartbeat from core client for 30 sec - exiting 09:48:51 (15398): No heartbeat from core client for 30 sec - exiting 09:48:52 (15398): No heartbeat from core client for 30 sec - exiting 09:52:58 (15597): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:59:37 (15801): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:03:41 (18714): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:08:06 (18891): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:13:02 (19066): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:20:20 (19250): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:24:25 (19993): No heartbeat from core client for 30 sec - exiting 16:24:26 (19993): No heartbeat from core client for 30 sec - exiting 16:24:27 (19993): No heartbeat from core client for 30 sec - exiting 16:24:28 (19993): No heartbeat from core client for 30 sec - exiting 16:24:29 (19993): No heartbeat from core client for 30 sec - exiting 16:24:30 (19993): No heartbeat from core client for 30 sec - exiting 16:24:31 (19993): No heartbeat from core client for 30 sec - exiting 16:24:32 (19993): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
16:29:21 (20167): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:18:46 (20336): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:23:13 (20892): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:27:16 (21053): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:27:17 (21053): No heartbeat from core client for 30 sec - exiting 17:27:18 (21053): No heartbeat from core client for 30 sec - exiting 17:27:19 (21053): No heartbeat from core client for 30 sec - exiting 17:27:20 (21053): No heartbeat from core client for 30 sec - exiting 17:27:21 (21053): No heartbeat from core client for 30 sec - exiting 17:27:22 (21053): No heartbeat from core client for 30 sec - exiting 17:27:23 (21053): No heartbeat from core client for 30 sec - exiting 17:27:24 (21053): No heartbeat from core client for 30 sec - exiting 17:27:25 (21053): No heartbeat from core client for 30 sec - exiting 17:27:26 (21053): No heartbeat from core client for 30 sec - exiting 17:27:27 (21053): No heartbeat from core client for 30 sec - exiting 17:27:28 (21053): No heartbeat from core client for 30 sec - exiting 17:27:29 (21053): No heartbeat from core client for 30 sec - exiting 17:27:30 (21053): No heartbeat from core client for 30 sec - exiting 17:27:31 (21053): No heartbeat from core client for 30 sec - exiting 17:27:32 (21053): No heartbeat from core client for 30 sec - exiting 17:27:33 (21053): No heartbeat from core client for 30 sec - exiting 17:27:34 (21053): No heartbeat from core client for 30 sec - exiting 17:27:35 (21053): No heartbeat from core client for 30 sec - exiting 17:27:36 (21053): No heartbeat from core client for 30 sec - exiting 17:27:37 (21053): No heartbeat from core client for 30 sec - exiting 17:27:38 (21053): No heartbeat from core client for 30 sec - exiting SIGABRT: abort called Stack 
trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77a0400] [0xf77a0425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75bd1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75c0825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75a84d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=21221, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7796400] [0xf7796425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75b31df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75b6825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf759e4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=21221, iMonCtr=1 Model crash detected, will try to restart... 
SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf76f0400] [0xf76f0425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf750d1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7510825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf74f84d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=21221, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf773a400] [0xf773a425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75571df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf755a825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75424d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=21221, iMonCtr=1 Model crash detected, will try to restart... 
SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf773e400] [0xf773e425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf755b1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf755e825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75464d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=21221, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf779f400] [0xf779f425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75bc1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75bf825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75a74d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=21221, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
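The stderr text above is long and repetitive, so a short script can make its shape easier to see. The following is a minimal sketch, not part of BOINC or CPDN tooling. It assumes the number in parentheses after each timestamp is the process ID of the reporting process, and the function name `summarize_stderr` is hypothetical. The `sample` string is a shortened excerpt stitched together from messages in the log above, not the full output.

```python
import re
from collections import Counter

# Matches lines such as "11:56:02 (44748): No heartbeat from core client ..."
# Assumption: the parenthesised number is the PID of the reporting process.
HEARTBEAT_RE = re.compile(
    r"(\d{2}:\d{2}:\d{2}) \((\d+)\): No heartbeat from core client"
)


def summarize_stderr(text: str) -> dict:
    """Count heartbeat losses per PID and model aborts in a stderr dump."""
    losses = Counter(m.group(2) for m in HEARTBEAT_RE.finditer(text))
    return {
        "heartbeat_losses_by_pid": dict(losses),
        "sigabrt_count": text.count("SIGABRT: abort called"),
        "gave_up": "too many model crashes" in text,
    }


# Shortened excerpt built from messages that appear in the log above.
sample = (
    "05:59:37 (44600): No heartbeat from core client for 30 sec - exiting "
    "11:56:02 (44748): No heartbeat from core client for 30 sec - exiting "
    "11:56:03 (44748): No heartbeat from core client for 30 sec - exiting "
    "SIGABRT: abort called Stack trace (9 frames): "
    "Sorry, too many model crashes! :-( Called boinc_finish"
)

summary = summarize_stderr(sample)
print(summary)
```

Run against the full stderr text, the same function would show how many heartbeat-loss messages each process logged before the run ended with repeated `SIGABRT` aborts.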
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
08 Jun 2013 07:02:55 | 1282401 | 15833766 | hadcm3n_o2mc_1980_40_008387945_3 | 25,920 | 70,828 | 2.7326 |
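The "Average (sec/TS)" figure in the trickle row appears to be the CPU time divided by the timestep count. The page does not state this, so treat it as an assumption inferred from the numbers. A small check, with the hypothetical helper name `seconds_per_timestep`:

```python
def seconds_per_timestep(cpu_time_sec: float, timesteps: int) -> float:
    """Return average CPU seconds per model timestep.

    Assumption: the table's "Average (sec/TS)" is CPU time / timestep.
    """
    if timesteps <= 0:
        raise ValueError("timesteps must be positive")
    return cpu_time_sec / timesteps


# Values taken from the trickle row above: 70,828 s over 25,920 timesteps.
avg = seconds_per_timestep(70_828, 25_920)
print(f"{avg:.4f}")  # prints 2.7326
```

The result agrees with the reported 2.7326 sec/TS to four decimal places.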