Name | hadcm3n_o702_1980_40_008386173_0 |
Workunit | 8537032 |
Created | 3 Jun 2013, 5:39:12 UTC |
Sent | 8 Jun 2013, 22:28:52 UTC |
Report deadline | 8 Sep 2013, 5:56:03 UTC |
Received | 9 Jun 2013, 22:20:12 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1282401 |
Run time | 21 hours 0 min 28 sec |
CPU time | 20 hours 31 min 20 sec |
Validate state | Invalid |
Credit | 311.04 |
Device peak FLOPS | 2.00 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 23:31:05 (25368): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:06:05 (25506): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:24:51 (25899): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:28:43 (26646): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:28:44 (26646): No heartbeat from core client for 30 sec - exiting 01:28:45 (26646): No heartbeat from core client for 30 sec - exiting 01:32:22 (26801): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:32:59 (26801): No heartbeat from core client for 30 sec - exiting 01:37:44 (26970): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:37:45 (26970): No heartbeat from core client for 30 sec - exiting 01:37:46 (26970): No heartbeat from core client for 30 sec - exiting 01:45:28 (27148): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:45:29 (27148): No heartbeat from core client for 30 sec - exiting 01:45:30 (27148): No heartbeat from core client for 30 sec - exiting 01:45:31 (27148): No heartbeat from core client for 30 sec - exiting 01:45:32 (27148): No heartbeat from core client for 30 sec - exiting 01:45:33 (27148): No heartbeat from core client for 30 sec - exiting 01:45:34 (27148): No heartbeat from core client for 30 sec - exiting 01:45:35 (27148): No heartbeat from core client for 30 sec - exiting 01:45:36 (27148): No heartbeat from core client for 30 sec - exiting 01:45:37 (27148): No heartbeat from core client for 30 sec - exiting 01:45:38 (27148): No heartbeat from core client for 30 sec - exiting 01:45:39 (27148): No heartbeat from core client for 30 sec - exiting 01:45:40 (27148): No heartbeat from core client for 30 sec - exiting 01:45:41 (27148): No heartbeat from core client for 30 sec - exiting 01:49:44 (27355): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:57:46 (27527): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:50:51 (27756): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:50:52 (27756): No heartbeat from core client for 30 sec - exiting 13:50:53 (27756): No heartbeat from core client for 30 sec - exiting 13:50:54 (27756): No heartbeat from core client for 30 sec - exiting 13:50:55 (27756): No heartbeat from core client for 30 sec - exiting 13:50:56 (27756): No heartbeat from core client for 30 sec - exiting 13:50:57 (27756): No heartbeat from core client for 30 sec - exiting 13:50:58 (27756): No heartbeat from core client for 30 sec - exiting 13:55:21 (34123): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:55:22 (34123): No heartbeat from core client for 30 sec - exiting 13:55:23 (34123): No heartbeat from core client for 30 sec - exiting 13:55:24 (34123): No heartbeat from core client for 30 sec - exiting 13:55:25 (34123): No heartbeat from core client for 30 sec - exiting 13:55:26 (34123): No heartbeat from core client for 30 sec - exiting 13:55:27 (34123): No heartbeat from core client for 30 sec - exiting 13:55:28 (34123): No heartbeat from core client for 30 sec - exiting 13:55:29 (34123): No heartbeat from core client for 30 sec - exiting 13:55:30 (34123): No heartbeat from core client for 30 sec - exiting 13:55:31 (34123): No heartbeat from core client for 30 sec - exiting 13:55:32 (34123): No heartbeat from core client for 30 sec - exiting 13:55:33 (34123): No heartbeat from core client for 30 sec - exiting 13:55:34 (34123): No heartbeat from core client for 30 sec - exiting 13:55:35 (34123): No heartbeat from core client for 30 sec - exiting 13:55:36 (34123): No heartbeat from core client for 30 sec - exiting 13:59:40 (34280): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:36:04 (34453): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:36:05 (34453): No heartbeat from core client for 30 sec - exiting 15:36:06 (34453): No heartbeat from core client for 30 sec - exiting 15:36:07 (34453): No heartbeat from core client for 30 sec - exiting 15:36:08 (34453): No heartbeat from core client for 30 sec - exiting 15:40:17 (35368): No heartbeat from core client for 30 sec - exiting 15:40:18 (35368): No heartbeat from core client for 30 sec - exiting 15:40:19 (35368): No heartbeat from core client for 30 sec - exiting 15:40:20 (35368): No heartbeat from core client for 30 sec - exiting 15:40:21 (35368): No heartbeat from core client for 30 sec - exiting 15:40:22 (35368): No heartbeat from core client for 30 sec - exiting 15:40:23 (35368): No heartbeat from core client for 30 sec - exiting 15:40:24 (35368): No heartbeat from core client for 30 sec - exiting 15:40:25 (35368): No heartbeat from core client for 30 sec - exiting 15:40:26 (35368): No heartbeat from core client for 30 sec - exiting 15:40:27 (35368): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:49:04 (35526): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:53:15 (35717): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:57:07 (35862): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:57:08 (35862): No heartbeat from core client for 30 sec - exiting 15:57:09 (35862): No heartbeat from core client for 30 sec - exiting 15:57:10 (35862): No heartbeat from core client for 30 sec - exiting 15:57:11 (35862): No heartbeat from core client for 30 sec - exiting 15:57:12 (35862): No heartbeat from core client for 30 sec - exiting 15:57:13 (35862): No heartbeat from core client for 30 sec - exiting 15:57:14 (35862): No heartbeat from core client for 30 sec - exiting 15:57:15 (35862): No heartbeat from core client for 30 sec - exiting 15:57:16 (35862): No heartbeat from core client for 30 sec - exiting 15:57:17 (35862): No heartbeat from core client for 30 sec - exiting 15:57:18 (35862): No heartbeat from core client for 30 sec - exiting 15:57:19 (35862): No heartbeat from core client for 30 sec - exiting 15:57:20 (35862): No heartbeat from core client for 30 sec - exiting 15:57:21 (35862): No heartbeat from core client for 30 sec - exiting 15:57:22 (35862): No heartbeat from core client for 30 sec - exiting 16:01:27 (36009): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:46:58 (36150): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:50:48 (36665): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:50:49 (36665): No heartbeat from core client for 30 sec - exiting 16:50:50 (36665): No heartbeat from core client for 30 sec - exiting 16:50:51 (36665): No heartbeat from core client for 30 sec - exiting 16:50:52 (36665): No heartbeat from core client for 30 sec - exiting 16:50:53 (36665): No heartbeat from core client for 30 sec - exiting 16:50:54 (36665): No heartbeat from core client for 30 sec - exiting 16:50:55 (36665): No heartbeat from core client for 30 sec - exiting 16:50:56 (36665): No heartbeat from core client for 30 sec - exiting 16:50:57 (36665): No heartbeat from core client for 30 sec - exiting 16:50:58 (36665): No heartbeat from core client for 30 sec - exiting 16:55:00 (36810): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:53:26 (36964): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:53:27 (36964): No heartbeat from core client for 30 sec - exiting 20:53:28 (36964): No heartbeat from core client for 30 sec - exiting 20:53:29 (36964): No heartbeat from core client for 30 sec - exiting 20:53:30 (36964): No heartbeat from core client for 30 sec - exiting 21:05:51 (39074): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:10:10 (39296): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:14:31 (39447): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:14:32 (39447): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 22:28:19 (39598): No heartbeat from core client for 30 sec - exiting 22:28:20 (39598): No heartbeat from core client for 30 sec - exiting 22:28:21 (39598): No heartbeat from core client for 30 sec - exiting 22:28:22 (39598): No heartbeat from core client for 30 sec - exiting 22:28:23 (39598): No heartbeat from core client for 30 sec - exiting 22:28:24 (39598): No heartbeat from core client for 30 sec - exiting 22:28:25 (39598): No heartbeat from core client for 30 sec - exiting 22:28:26 (39598): No heartbeat from core client for 30 sec - exiting 22:28:27 (39598): No heartbeat from core client for 30 sec - exiting 22:28:28 (39598): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:32:17 (40336): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:32:18 (40336): No heartbeat from core client for 30 sec - exiting 22:32:19 (40336): No heartbeat from core client for 30 sec - exiting 22:32:20 (40336): No heartbeat from core client for 30 sec - exiting 22:32:21 (40336): No heartbeat from core client for 30 sec - exiting 22:32:22 (40336): No heartbeat from core client for 30 sec - exiting 22:32:23 (40336): No heartbeat from core client for 30 sec - exiting 22:32:24 (40336): No heartbeat from core client for 30 sec - exiting 22:32:25 (40336): No heartbeat from core client for 30 sec - exiting 22:32:26 (40336): No heartbeat from core client for 30 sec - exiting 22:32:27 (40336): No heartbeat from core client for 30 sec - exiting 22:32:28 (40336): No heartbeat from core client for 30 sec - exiting 22:32:29 (40336): No heartbeat from core client for 30 sec - exiting 22:32:30 (40336): No heartbeat from core client for 30 sec - exiting 22:32:31 (40336): No heartbeat from core client for 30 sec - exiting 22:32:32 (40336): No heartbeat from core client for 30 sec - exiting SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7704400] [0xf7704425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75211df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7524825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf750c4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=40488, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7700400] [0xf7700425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf751d1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7520825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75084d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=40488, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7774400] [0xf7774425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75911df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7594825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf757c4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=40488, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77c5400] [0xf77c5425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75e21df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75e5825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75cd4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=40488, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77d3400] [0xf77d3425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75f01df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75f3825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75db4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=40488, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77af400] [0xf77af425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75cc1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75cf825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75b74d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=40488, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
09 Jun 2013 18:13:48 | 1282401 | 15821672 | hadcm3n_o702_1980_40_008386173_0 | 25,920 | 60,739 | 2.3433 |
©2024 climateprediction.net