Name | hadcm3n_o7fm_2140_40_008269643_1 |
Workunit | 8424767 |
Created | 5 Jan 2013, 13:15:07 UTC |
Sent | 5 Jan 2013, 13:15:16 UTC |
Report deadline | 6 Apr 2013, 20:42:27 UTC |
Received | 1 Feb 2013, 6:26:03 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1218276 |
Run time | 20 hours 22 min 57 sec |
CPU time | 16 hours 53 min 18 sec |
Validate state | Invalid |
Credit | 311.04 |
Device peak FLOPS | 2.52 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... 22:09:50 (22601): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:09:52 (22601): No heartbeat from core client for 30 sec - exiting 22:09:53 (22601): No heartbeat from core client for 30 sec - exiting 22:09:54 (22601): No heartbeat from core client for 30 sec - exiting 22:09:55 (22601): No heartbeat from core client for 30 sec - exiting 22:09:56 (22601): No heartbeat from core client for 30 sec - exiting 07:29:18 (22630): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:29:19 (22630): No heartbeat from core client for 30 sec - exiting 07:29:20 (22630): No heartbeat from core client for 30 sec - exiting 07:29:21 (22630): No heartbeat from core client for 30 sec - exiting 07:29:22 (22630): No heartbeat from core client for 30 sec - exiting 07:29:23 (22630): No heartbeat from core client for 30 sec - exiting 07:29:24 (22630): No heartbeat from core client for 30 sec - exiting 08:33:50 (22766): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:33:52 (22766): No heartbeat from core client for 30 sec - exiting 08:33:53 (22766): No heartbeat from core client for 30 sec - exiting 14:14:17 (23086): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:14:18 (23086): No heartbeat from core client for 30 sec - exiting 14:14:19 (23086): No heartbeat from core client for 30 sec - exiting 14:14:20 (23086): No heartbeat from core client for 30 sec - exiting 14:14:21 (23086): No heartbeat from core client for 30 sec - exiting 14:34:11 (23249): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
14:34:12 (23249): No heartbeat from core client for 30 sec - exiting 14:34:13 (23249): No heartbeat from core client for 30 sec - exiting 14:56:18 (23291): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:56:19 (23291): No heartbeat from core client for 30 sec - exiting 14:56:20 (23291): No heartbeat from core client for 30 sec - exiting 14:56:21 (23291): No heartbeat from core client for 30 sec - exiting 14:56:22 (23291): No heartbeat from core client for 30 sec - exiting 14:56:23 (23291): No heartbeat from core client for 30 sec - exiting 14:56:24 (23291): No heartbeat from core client for 30 sec - exiting 14:56:25 (23291): No heartbeat from core client for 30 sec - exiting 14:56:26 (23291): No heartbeat from core client for 30 sec - exiting 14:56:27 (23291): No heartbeat from core client for 30 sec - exiting 14:56:28 (23291): No heartbeat from core client for 30 sec - exiting 14:56:29 (23291): No heartbeat from core client for 30 sec - exiting 14:56:30 (23291): No heartbeat from core client for 30 sec - exiting 14:56:31 (23291): No heartbeat from core client for 30 sec - exiting 14:56:32 (23291): No heartbeat from core client for 30 sec - exiting 14:56:33 (23291): No heartbeat from core client for 30 sec - exiting 14:56:34 (23291): No heartbeat from core client for 30 sec - exiting 14:56:35 (23291): No heartbeat from core client for 30 sec - exiting 16:04:48 (23330): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:04:49 (23330): No heartbeat from core client for 30 sec - exiting 16:04:50 (23330): No heartbeat from core client for 30 sec - exiting 16:04:51 (23330): No heartbeat from core client for 30 sec - exiting 16:04:52 (23330): No heartbeat from core client for 30 sec - exiting 16:14:10 (23386): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
16:14:11 (23386): No heartbeat from core client for 30 sec - exiting 16:14:12 (23386): No heartbeat from core client for 30 sec - exiting 16:14:13 (23386): No heartbeat from core client for 30 sec - exiting 16:14:14 (23386): No heartbeat from core client for 30 sec - exiting 16:14:15 (23386): No heartbeat from core client for 30 sec - exiting 16:14:16 (23386): No heartbeat from core client for 30 sec - exiting 16:14:17 (23386): No heartbeat from core client for 30 sec - exiting 16:14:18 (23386): No heartbeat from core client for 30 sec - exiting 16:14:19 (23386): No heartbeat from core client for 30 sec - exiting 16:14:20 (23386): No heartbeat from core client for 30 sec - exiting 16:14:21 (23386): No heartbeat from core client for 30 sec - exiting 16:14:22 (23386): No heartbeat from core client for 30 sec - exiting 16:14:23 (23386): No heartbeat from core client for 30 sec - exiting 16:14:24 (23386): No heartbeat from core client for 30 sec - exiting 16:14:25 (23386): No heartbeat from core client for 30 sec - exiting 16:14:26 (23386): No heartbeat from core client for 30 sec - exiting SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77c2400] [0xf77c2430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75e31ef] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75e6835] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75ce4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=23423, iMonCtr=1 Model crash detected, will try to restart... 
SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf778e400] [0xf778e430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75af1ef] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75b2835] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf759a4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=23423, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7754400] [0xf7754430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75751ef] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7578835] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75604d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=23423, iMonCtr=1 Model crash detected, will try to restart... 
SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77db400] [0xf77db430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75fc1ef] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75ff835] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75e74d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=23423, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf771b400] [0xf771b430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf753c1ef] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf753f835] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75274d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=23423, iMonCtr=1 Model crash detected, will try to restart... 
SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7744400] [0xf7744430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75651ef] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7568835] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75504d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=23423, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
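
The Stderr field above is a flattened log, so the repeated messages are hard to tally by eye. The following is a minimal sketch, not part of the original page, of one way to count the heartbeat timeouts per process ID and the SIGABRT crashes. The `SAMPLE` string is a short excerpt copied from the log rather than the full text, and the names `summarize`, `HEARTBEAT`, and `SAMPLE` are hypothetical.

```python
import re
from collections import Counter

# Short excerpt copied from the Stderr field (not the full log).
# The original is flattened onto a few long lines, so this sketch
# searches the whole text instead of assuming one message per line.
SAMPLE = (
    "22:09:50 (22601): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
    "22:09:52 (22601): No heartbeat from core client for 30 sec - exiting "
    "07:29:18 (22630): No heartbeat from core client for 30 sec - exiting "
    "SIGABRT: abort called Stack trace (9 frames): "
    "Model crash detected, will try to restart... "
    "Sorry, too many model crashes! :-( Called boinc_finish"
)

# Captures the process ID shown in parentheses before each heartbeat message.
HEARTBEAT = re.compile(r"\((\d+)\): No heartbeat from core client")


def summarize(log_text):
    """Count heartbeat timeouts per PID and SIGABRT reports in a log string."""
    per_pid = Counter(HEARTBEAT.findall(log_text))
    return {
        "heartbeat_timeouts": dict(per_pid),
        "sigabrt": log_text.count("SIGABRT: abort called"),
    }


summary = summarize(SAMPLE)
print(summary)
```

Pasting the complete Stderr text into `SAMPLE` would give the totals for this task; the sketch only counts messages and does not attempt to explain why the heartbeat was lost.
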
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
31 Jan 2013 14:45:32 | 1218276 | 15523532 | hadcm3n_o7fm_2140_40_008269643_1 | 25,920 | 39,746 | 1.5334 |
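
The Average (sec/TS) column appears to be the CPU time divided by the timestep count. A minimal sketch of that arithmetic, assuming this interpretation of the columns (the helper `parse_count` is hypothetical):

```python
def parse_count(text):
    """Convert a comma-grouped number such as '25,920' to an int."""
    return int(text.replace(",", ""))


# Values copied from the trickle row above.
timesteps = parse_count("25,920")
cpu_seconds = parse_count("39,746")

# Assumed relationship: average seconds of CPU time per model timestep.
average = cpu_seconds / timesteps
print(f"{average:.4f}")  # 1.5334, matching the table
```
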