Name | hadcm3n_o5lf_2020_40_008373894_2 |
Workunit | 8524753 |
Created | 8 Jun 2013, 19:26:13 UTC |
Sent | 8 Jun 2013, 20:11:12 UTC |
Report deadline | 8 Sep 2013, 3:38:23 UTC |
Received | 9 Jun 2013, 22:20:12 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1282401 |
Run time | 23 hours 15 min 54 sec |
CPU time | 22 hours 43 min 2 sec |
Validate state | Invalid |
Credit | 311.04 |
Device peak FLOPS | 2.00 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 21:36:17 (23701): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:40:42 (24029): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:45:05 (24204): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 21:57:11 (24361): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:31:06 (24590): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:06:04 (25493): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:24:52 (25888): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:24:53 (25888): No heartbeat from core client for 30 sec - exiting 01:24:54 (25888): No heartbeat from core client for 30 sec - exiting 01:24:55 (25888): No heartbeat from core client for 30 sec - exiting 01:24:56 (25888): No heartbeat from core client for 30 sec - exiting 01:28:43 (26638): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:28:44 (26638): No heartbeat from core client for 30 sec - exiting 01:32:21 (26793): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:32:59 (26793): No heartbeat from core client for 30 sec - exiting 01:37:43 (26960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:37:44 (26960): No heartbeat from core client for 30 sec - exiting 01:37:45 (26960): No heartbeat from core client for 30 sec - exiting 01:37:46 (26960): No heartbeat from core client for 30 sec - exiting 01:45:29 (27134): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:49:45 (27347): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:57:47 (27517): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:50:51 (27750): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:55:20 (34118): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:55:21 (34118): No heartbeat from core client for 30 sec - exiting 13:55:22 (34118): No heartbeat from core client for 30 sec - exiting 13:55:23 (34118): No heartbeat from core client for 30 sec - exiting 13:55:24 (34118): No heartbeat from core client for 30 sec - exiting 13:55:25 (34118): No heartbeat from core client for 30 sec - exiting 13:55:26 (34118): No heartbeat from core client for 30 sec - exiting 13:55:27 (34118): No heartbeat from core client for 30 sec - exiting 13:55:28 (34118): No heartbeat from core client for 30 sec - exiting 13:55:29 (34118): No heartbeat from core client for 30 sec - exiting 13:55:30 (34118): No heartbeat from core client for 30 sec - exiting 13:55:31 (34118): No heartbeat from core client for 30 sec - exiting 13:55:32 (34118): No heartbeat from core client for 30 sec - exiting 13:55:33 (34118): No heartbeat from core client for 30 sec - exiting 13:55:34 (34118): No heartbeat from core client for 30 sec - exiting 13:55:35 (34118): No heartbeat from core client for 30 sec - exiting 13:55:36 (34118): No heartbeat from core client for 30 sec - exiting 13:59:40 (34276): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:36:04 (34447): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:40:18 (35364): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:49:04 (35522): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:53:15 (35712): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:57:07 (35858): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:57:08 (35858): No heartbeat from core client for 30 sec - exiting 15:57:09 (35858): No heartbeat from core client for 30 sec - exiting 15:57:10 (35858): No heartbeat from core client for 30 sec - exiting 15:57:11 (35858): No heartbeat from core client for 30 sec - exiting 15:57:12 (35858): No heartbeat from core client for 30 sec - exiting 15:57:13 (35858): No heartbeat from core client for 30 sec - exiting 15:57:14 (35858): No heartbeat from core client for 30 sec - exiting 15:57:15 (35858): No heartbeat from core client for 30 sec - exiting 15:57:16 (35858): No heartbeat from core client for 30 sec - exiting 15:57:17 (35858): No heartbeat from core client for 30 sec - exiting 15:57:18 (35858): No heartbeat from core client for 30 sec - exiting 15:57:19 (35858): No heartbeat from core client for 30 sec - exiting 15:57:20 (35858): No heartbeat from core client for 30 sec - exiting 15:57:21 (35858): No heartbeat from core client for 30 sec - exiting 15:57:22 (35858): No heartbeat from core client for 30 sec - exiting 16:01:26 (36005): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:01:27 (36005): No heartbeat from core client for 30 sec - exiting 16:01:28 (36005): No heartbeat from core client for 30 sec - exiting 16:01:29 (36005): No heartbeat from core client for 30 sec - exiting 16:01:30 (36005): No heartbeat from core client for 30 sec - exiting 16:01:31 (36005): No heartbeat from core client for 30 sec - exiting 16:01:32 (36005): No heartbeat from core client for 30 sec - exiting 16:01:33 (36005): No heartbeat from core client for 30 sec - exiting 16:46:58 (36144): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:50:48 (36661): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:50:49 (36661): No heartbeat from core client for 30 sec - exiting 16:50:50 (36661): No heartbeat from core client for 30 sec - exiting 16:50:51 (36661): No heartbeat from core client for 30 sec - exiting 16:50:52 (36661): No heartbeat from core client for 30 sec - exiting 16:50:53 (36661): No heartbeat from core client for 30 sec - exiting 16:50:54 (36661): No heartbeat from core client for 30 sec - exiting 16:50:55 (36661): No heartbeat from core client for 30 sec - exiting 16:50:56 (36661): No heartbeat from core client for 30 sec - exiting 16:50:57 (36661): No heartbeat from core client for 30 sec - exiting 16:50:58 (36661): No heartbeat from core client for 30 sec - exiting 16:55:00 (36806): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:53:26 (36959): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:53:27 (36959): No heartbeat from core client for 30 sec - exiting 20:53:28 (36959): No heartbeat from core client for 30 sec - exiting 20:53:29 (36959): No heartbeat from core client for 30 sec - exiting 20:53:30 (36959): No heartbeat from core client for 30 sec - exiting 21:05:51 (39070): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:05:52 (39070): No heartbeat from core client for 30 sec - exiting 21:05:53 (39070): No heartbeat from core client for 30 sec - exiting 21:05:54 (39070): No heartbeat from core client for 30 sec - exiting 21:05:55 (39070): No heartbeat from core client for 30 sec - exiting 21:05:56 (39070): No heartbeat from core client for 30 sec - exiting 21:05:57 (39070): No heartbeat from core client for 30 sec - exiting 21:05:58 (39070): No heartbeat from core client for 30 sec - exiting 21:05:59 (39070): No heartbeat from core client for 30 sec - exiting 21:06:00 (39070): No heartbeat from core client for 30 sec - exiting 21:06:01 (39070): No heartbeat from core client for 30 sec - exiting 21:06:02 (39070): No heartbeat from core client for 30 sec - exiting 21:06:03 (39070): No heartbeat from core client for 30 sec - exiting 21:10:10 (39292): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:10:11 (39292): No heartbeat from core client for 30 sec - exiting 21:10:12 (39292): No heartbeat from core client for 30 sec - exiting 21:10:13 (39292): No heartbeat from core client for 30 sec - exiting 21:10:14 (39292): No heartbeat from core client for 30 sec - exiting 21:10:15 (39292): No heartbeat from core client for 30 sec - exiting 21:10:16 (39292): No heartbeat from core client for 30 sec - exiting 21:10:17 (39292): No heartbeat from core client for 30 sec - exiting 21:10:18 (39292): No heartbeat from core client for 30 sec - exiting 21:10:19 (39292): No heartbeat from core client for 30 sec - exiting 21:10:20 (39292): No heartbeat from core client for 30 sec - exiting 21:10:21 (39292): No heartbeat from core client for 30 sec - exiting 21:10:22 (39292): No heartbeat from core client for 30 sec - exiting 21:10:23 (39292): No heartbeat from core client for 30 sec - exiting 21:10:24 (39292): No heartbeat from core client for 30 sec - exiting 21:10:25 (39292): No heartbeat from core client for 30 sec - exiting 21:10:26 (39292): No heartbeat from core client for 30 sec - exiting 21:10:27 (39292): No heartbeat from core client for 30 sec - exiting 21:14:31 (39443): No heartbeat from core client for 30 sec - exiting 21:14:32 (39443): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:28:20 (39594): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:28:21 (39594): No heartbeat from core client for 30 sec - exiting 22:28:22 (39594): No heartbeat from core client for 30 sec - exiting 22:28:23 (39594): No heartbeat from core client for 30 sec - exiting 22:28:24 (39594): No heartbeat from core client for 30 sec - exiting 22:28:25 (39594): No heartbeat from core client for 30 sec - exiting 22:28:26 (39594): No heartbeat from core client for 30 sec - exiting 22:28:27 (39594): No heartbeat from core client for 30 sec - exiting 22:28:28 (39594): No heartbeat from core client for 30 sec - exiting 22:28:29 (39594): No heartbeat from core client for 30 sec - exiting 22:32:17 (40331): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:32:18 (40331): No heartbeat from core client for 30 sec - exiting 22:32:19 (40331): No heartbeat from core client for 30 sec - exiting 22:32:20 (40331): No heartbeat from core client for 30 sec - exiting 22:32:21 (40331): No heartbeat from core client for 30 sec - exiting 22:32:22 (40331): No heartbeat from core client for 30 sec - exiting 22:32:23 (40331): No heartbeat from core client for 30 sec - exiting 22:32:24 (40331): No heartbeat from core client for 30 sec - exiting 22:32:25 (40331): No heartbeat from core client for 30 sec - exiting 22:32:26 (40331): No heartbeat from core client for 30 sec - exiting 22:32:27 (40331): No heartbeat from core client for 30 sec - exiting 22:32:28 (40331): No heartbeat from core client for 30 sec - exiting 22:32:29 (40331): No heartbeat from core client for 30 sec - exiting 22:32:30 (40331): No heartbeat from core client for 30 sec - exiting 22:32:31 (40331): No heartbeat from core client for 30 sec - exiting 22:32:32 (40331): No heartbeat from core client for 30 sec - exiting SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf771f400] [0xf771f425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf753c1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf753f825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75274d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=40484, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7735400] [0xf7735425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75521df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7555825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf753d4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=40484, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf778f400] [0xf778f425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75ac1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75af825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75974d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=40484, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7714400] [0xf7714425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75311df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7534825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf751c4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=40484, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77a8400] [0xf77a8425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75c51df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75c8825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75b04d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=40484, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf777a400] [0xf777a425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75971df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf759a825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75824d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=40484, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
09 Jun 2013 16:12:51 | 1282401 | 15835667 | hadcm3n_o5lf_2020_40_008373894_2 | 25,920 | 64,246 | 2.4786 |
©2024 climateprediction.net