Name | hadcm3n_o4pd_1980_40_008386063_2 |
Workunit | 8536922 |
Created | 10 Jun 2013, 2:09:55 UTC |
Sent | 10 Jun 2013, 2:24:03 UTC |
Report deadline | 9 Sep 2013, 9:51:14 UTC |
Received | 11 Jun 2013, 1:44:00 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1282401 |
Run time | 19 hours 58 min 2 sec |
CPU time | 19 hours 24 min 45 sec |
Validate state | Invalid |
Credit | 311.04 |
Device peak FLOPS | 2.00 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 05:34:53 (43774): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:39:16 (44955): No heartbeat from core client for 30 sec - exiting 05:39:17 (44955): No heartbeat from core client for 30 sec - exiting 05:39:18 (44955): No heartbeat from core client for 30 sec - exiting 05:39:19 (44955): No heartbeat from core client for 30 sec - exiting 05:39:20 (44955): No heartbeat from core client for 30 sec - exiting 05:39:21 (44955): No heartbeat from core client for 30 sec - exiting 05:39:22 (44955): No heartbeat from core client for 30 sec - exiting 05:39:23 (44955): No heartbeat from core client for 30 sec - exiting 05:39:24 (44955): No heartbeat from core client for 30 sec - exiting 05:39:25 (44955): No heartbeat from core client for 30 sec - exiting 05:39:26 (44955): No heartbeat from core client for 30 sec - exiting 05:39:27 (44955): No heartbeat from core client for 30 sec - exiting 05:39:28 (44955): No heartbeat from core client for 30 sec - exiting 05:39:29 (44955): No heartbeat from core client for 30 sec - exiting 05:39:30 (44955): No heartbeat from core client for 30 sec - exiting 05:39:31 (44955): No heartbeat from core client for 30 sec - exiting 05:39:32 (44955): No heartbeat from core client for 30 sec - exiting 05:39:33 (44955): No heartbeat from core client for 30 sec - exiting 05:39:34 (44955): No heartbeat from core client for 30 sec - exiting 05:39:35 (44955): No heartbeat from core client for 30 sec - exiting 05:39:36 (44955): No heartbeat from core client for 30 sec - exiting 05:39:37 (44955): No heartbeat from core client for 30 sec - exiting 05:39:38 (44955): No heartbeat from core client for 30 sec - exiting 05:39:39 (44955): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:44:10 (45114): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:49:14 (45293): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:53:27 (49154): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:53:28 (49154): No heartbeat from core client for 30 sec - exiting 12:57:09 (49303): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:38:25 (49442): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:38:26 (49442): No heartbeat from core client for 30 sec - exiting 21:38:27 (49442): No heartbeat from core client for 30 sec - exiting 21:38:28 (49442): No heartbeat from core client for 30 sec - exiting 21:42:18 (53902): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:42:19 (53902): No heartbeat from core client for 30 sec - exiting 21:42:20 (53902): No heartbeat from core client for 30 sec - exiting 21:42:21 (53902): No heartbeat from core client for 30 sec - exiting 21:42:22 (53902): No heartbeat from core client for 30 sec - exiting 21:42:23 (53902): No heartbeat from core client for 30 sec - exiting 21:42:24 (53902): No heartbeat from core client for 30 sec - exiting 21:42:25 (53902): No heartbeat from core client for 30 sec - exiting 21:42:26 (53902): No heartbeat from core client for 30 sec - exiting 21:42:27 (53902): No heartbeat from core client for 30 sec - exiting 21:42:28 (53902): No heartbeat from core client for 30 sec - exiting 21:42:29 (53902): No heartbeat from core client for 30 sec - exiting 21:58:24 (54073): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:02:06 (54322): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:02:07 (54322): No heartbeat from core client for 30 sec - exiting 22:02:08 (54322): No heartbeat from core client for 30 sec - exiting 22:02:09 (54322): No heartbeat from core client for 30 sec - exiting 22:02:10 (54322): No heartbeat from core client for 30 sec - exiting 22:02:11 (54322): No heartbeat from core client for 30 sec - exiting 22:02:12 (54322): No heartbeat from core client for 30 sec - exiting 22:02:13 (54322): No heartbeat from core client for 30 sec - exiting 22:02:14 (54322): No heartbeat from core client for 30 sec - exiting 22:02:15 (54322): No heartbeat from core client for 30 sec - exiting 22:02:16 (54322): No heartbeat from core client for 30 sec - exiting 22:02:17 (54322): No heartbeat from core client for 30 sec - exiting 22:02:18 (54322): No heartbeat from core client for 30 sec - exiting 22:02:19 (54322): No heartbeat from core client for 30 sec - exiting 22:02:20 (54322): No heartbeat from core client for 30 sec - exiting 22:02:21 (54322): No heartbeat from core client for 30 sec - exiting 22:02:22 (54322): No heartbeat from core client for 30 sec - exiting 22:02:23 (54322): No heartbeat from core client for 30 sec - exiting 22:02:24 (54322): No heartbeat from core client for 30 sec - exiting 22:02:25 (54322): No heartbeat from core client for 30 sec - exiting 22:06:18 (54480): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:06:19 (54480): No heartbeat from core client for 30 sec - exiting 22:06:20 (54480): No heartbeat from core client for 30 sec - exiting 22:06:21 (54480): No heartbeat from core client for 30 sec - exiting 22:06:22 (54480): No heartbeat from core client for 30 sec - exiting 22:06:23 (54480): No heartbeat from core client for 30 sec - exiting 22:06:24 (54480): No heartbeat from core client for 30 sec - exiting 22:06:25 (54480): No heartbeat from core client for 30 sec - exiting 22:06:26 (54480): No heartbeat from core client for 30 sec - exiting 22:06:27 (54480): No heartbeat from core client for 30 sec - exiting 22:06:28 (54480): No heartbeat from core client for 30 sec - exiting 22:06:29 (54480): No heartbeat from core client for 30 sec - exiting 22:06:30 (54480): No heartbeat from core client for 30 sec - exiting 22:06:31 (54480): No heartbeat from core client for 30 sec - exiting 22:06:32 (54480): No heartbeat from core client for 30 sec - exiting 22:06:33 (54480): No heartbeat from core client for 30 sec - exiting 22:06:34 (54480): No heartbeat from core client for 30 sec - exiting 22:10:59 (54649): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:11:00 (54649): No heartbeat from core client for 30 sec - exiting 22:11:01 (54649): No heartbeat from core client for 30 sec - exiting 22:11:02 (54649): No heartbeat from core client for 30 sec - exiting 00:32:53 (54831): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:36:30 (56180): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:55:17 (56334): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:59:17 (56625): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:59:18 (56625): No heartbeat from core client for 30 sec - exiting 00:59:19 (56625): No heartbeat from core client for 30 sec - exiting 00:59:20 (56625): No heartbeat from core client for 30 sec - exiting 00:59:21 (56625): No heartbeat from core client for 30 sec - exiting 00:59:22 (56625): No heartbeat from core client for 30 sec - exiting 01:03:52 (56791): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:03:53 (56791): No heartbeat from core client for 30 sec - exiting 01:03:54 (56791): No heartbeat from core client for 30 sec - exiting 01:03:55 (56791): No heartbeat from core client for 30 sec - exiting 01:03:56 (56791): No heartbeat from core client for 30 sec - exiting SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7780400] [0xf7780425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf759d1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75a0825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75884d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=56911, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7742400] [0xf7742425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf755f1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7562825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf754a4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=56911, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf776d400] [0xf776d425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf758a1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf758d825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75754d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=56911, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf76f6400] [0xf76f6425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75131df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7516825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf74fe4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=56911, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7772400] [0xf7772425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf758f1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7592825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf757a4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=56911, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf76fb400] [0xf76fb425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75181df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf751b825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75034d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=56911, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
10 Jun 2013 21:47:47 | 1282401 | 15837039 | hadcm3n_o4pd_1980_40_008386063_2 | 25,920 | 61,820 | 2.3850 |
©2024 climateprediction.net