Name | hadcm3n_zduj_1920_40_008362273_2 |
Workunit | 8513132 |
Created | 7 Jun 2013, 23:58:17 UTC |
Sent | 8 Jun 2013, 0:18:46 UTC |
Report deadline | 7 Sep 2013, 7:45:57 UTC |
Received | 9 Jun 2013, 3:35:19 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1282401 |
Run time | 21 hours 0 min 50 sec |
CPU time | 20 hours 30 min 32 sec |
Validate state | Invalid |
Credit | 311.04 |
Device peak FLOPS | 2.00 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 02:32:39 (9802): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:12:23 (10467): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:13:15 (10862): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:17:00 (13950): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:43:35 (14146): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:48:13 (14505): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:48:14 (14505): No heartbeat from core client for 30 sec - exiting 08:48:15 (14505): No heartbeat from core client for 30 sec - exiting 08:48:16 (14505): No heartbeat from core client for 30 sec - exiting 08:48:17 (14505): No heartbeat from core client for 30 sec - exiting 08:48:18 (14505): No heartbeat from core client for 30 sec - exiting 08:48:19 (14505): No heartbeat from core client for 30 sec - exiting 08:48:20 (14505): No heartbeat from core client for 30 sec - exiting 08:48:21 (14505): No heartbeat from core client for 30 sec - exiting 08:48:22 (14505): No heartbeat from core client for 30 sec - exiting 08:48:23 (14505): No heartbeat from core client for 30 sec - exiting 08:51:59 (14697): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:52:00 (14697): No heartbeat from core client for 30 sec - exiting 08:52:01 (14697): No heartbeat from core client for 30 sec - exiting 08:52:02 (14697): No heartbeat from core client for 30 sec - exiting 08:52:03 (14697): No heartbeat from core client for 30 sec - exiting 08:52:04 (14697): No heartbeat from core client for 30 sec - exiting 08:52:05 (14697): No heartbeat from core client for 30 sec - exiting 08:52:06 (14697): No heartbeat from core client for 30 sec - exiting 08:52:07 (14697): No heartbeat from core client for 30 sec - exiting 08:52:08 (14697): No heartbeat from core client for 30 sec - exiting 08:52:09 (14697): No heartbeat from core client for 30 sec - exiting 08:52:10 (14697): No heartbeat from core client for 30 sec - exiting 08:52:11 (14697): No heartbeat from core client for 30 sec - exiting 08:52:12 (14697): No heartbeat from core client for 30 sec - exiting 08:52:13 (14697): No heartbeat from core client for 30 sec - exiting 08:52:14 (14697): No heartbeat from core client for 30 sec - exiting 08:52:15 (14697): No heartbeat from core client for 30 sec - exiting 08:52:16 (14697): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 09:43:32 (14883): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:48:14 (15462): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:52:58 (15683): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:50:57 (15857): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:59:36 (17011): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:03:41 (18778): No heartbeat from core client for 30 sec - exiting 15:03:42 (18778): No heartbeat from core client for 30 sec - exiting 15:03:43 (18778): No heartbeat from core client for 30 sec - exiting 15:03:44 (18778): No heartbeat from core client for 30 sec - exiting 15:03:45 (18778): No heartbeat from core client for 30 sec - exiting 15:03:46 (18778): No heartbeat from core client for 30 sec - exiting 15:03:47 (18778): No heartbeat from core client for 30 sec - exiting 15:03:48 (18778): No heartbeat from core client for 30 sec - exiting 15:03:49 (18778): No heartbeat from core client for 30 sec - exiting 15:03:50 (18778): No heartbeat from core client for 30 sec - exiting 15:03:51 (18778): No heartbeat from core client for 30 sec - exiting 15:03:52 (18778): No heartbeat from core client for 30 sec - exiting 15:03:53 (18778): No heartbeat from core client for 30 sec - exiting 15:03:54 (18778): No heartbeat from core client for 30 sec - exiting 15:03:55 (18778): No heartbeat from core client for 30 sec - exiting 15:03:56 (18778): No heartbeat from core client for 30 sec - exiting 15:03:57 (18778): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:08:06 (18955): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:13:02 (19133): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:13:33 (19133): No heartbeat from core client for 30 sec - exiting 15:23:04 (19320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:20:20 (19458): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:25:03 (20070): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:25:05 (20070): No heartbeat from core client for 30 sec - exiting 16:29:21 (20254): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:18:46 (20408): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:23:13 (20966): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:27:16 (21125): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:27:17 (21125): No heartbeat from core client for 30 sec - exiting 17:27:18 (21125): No heartbeat from core client for 30 sec - exiting 17:27:19 (21125): No heartbeat from core client for 30 sec - exiting 17:27:20 (21125): No heartbeat from core client for 30 sec - exiting 17:27:21 (21125): No heartbeat from core client for 30 sec - exiting 17:27:22 (21125): No heartbeat from core client for 30 sec - exiting 17:27:23 (21125): No heartbeat from core client for 30 sec - exiting 17:27:24 (21125): No heartbeat from core client for 30 sec - exiting 17:27:25 (21125): No heartbeat from core client for 30 sec - exiting 17:27:26 (21125): No heartbeat from core client for 30 sec - exiting 17:27:27 (21125): No heartbeat from core client for 30 sec - exiting 17:27:28 (21125): No heartbeat from core client for 30 sec - exiting 17:27:29 (21125): No heartbeat from core client for 30 sec - exiting 17:27:30 (21125): No heartbeat from core client for 30 sec - exiting 17:27:31 (21125): No heartbeat from core client for 30 sec - exiting 17:27:32 (21125): No heartbeat from core client for 30 sec - exiting 17:27:33 (21125): No heartbeat from core client for 30 sec - exiting 17:27:34 (21125): No heartbeat from core client for 30 sec - exiting 17:27:35 (21125): No heartbeat from core client for 30 sec - exiting 17:27:36 (21125): No heartbeat from core client for 30 sec - exiting 17:27:37 (21125): No heartbeat from core client for 30 sec - exiting 17:27:38 (21125): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 18:41:25 (21314): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:45:26 (22018): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:45:27 (22018): No heartbeat from core client for 30 sec - exiting 18:45:28 (22018): No heartbeat from core client for 30 sec - exiting 18:45:29 (22018): No heartbeat from core client for 30 sec - exiting 18:45:30 (22018): No heartbeat from core client for 30 sec - exiting 18:45:31 (22018): No heartbeat from core client for 30 sec - exiting 18:45:32 (22018): No heartbeat from core client for 30 sec - exiting 18:45:33 (22018): No heartbeat from core client for 30 sec - exiting 18:45:34 (22018): No heartbeat from core client for 30 sec - exiting 18:45:35 (22018): No heartbeat from core client for 30 sec - exiting 18:45:36 (22018): No heartbeat from core client for 30 sec - exiting 18:45:37 (22018): No heartbeat from core client for 30 sec - exiting 18:45:38 (22018): No heartbeat from core client for 30 sec - exiting 18:45:39 (22018): No heartbeat from core client for 30 sec - exiting 18:45:40 (22018): No heartbeat from core client for 30 sec - exiting 18:54:18 (22181): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:54:19 (22181): No heartbeat from core client for 30 sec - exiting 18:54:20 (22181): No heartbeat from core client for 30 sec - exiting 18:58:28 (22376): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:58:29 (22376): No heartbeat from core client for 30 sec - exiting 21:36:18 (22535): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:40:42 (23968): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:45:04 (24126): No heartbeat from core client for 30 sec - exiting 21:45:05 (24126): No heartbeat from core client for 30 sec - exiting 21:45:06 (24126): No heartbeat from core client for 30 sec - exiting 21:45:07 (24126): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:57:11 (24307): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:31:06 (24525): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:06:04 (25441): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:24:52 (25840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:28:43 (26621): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:28:44 (26621): No heartbeat from core client for 30 sec - exiting 01:32:22 (26776): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:32:59 (26776): No heartbeat from core client for 30 sec - exiting 01:37:43 (26938): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:37:44 (26938): No heartbeat from core client for 30 sec - exiting 01:37:45 (26938): No heartbeat from core client for 30 sec - exiting SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7780400] [0xf7780425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf759d1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75a0825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75884d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=27110, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7775400] [0xf7775425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75921df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7595825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf757d4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=27110, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf76e8400] [0xf76e8425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75051df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7508825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf74f04d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=27110, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77af400] [0xf77af425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75cc1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75cf825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75b74d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=27110, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7796400] [0xf7796425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75b31df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75b6825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf759e4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=27110, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7744400] [0xf7744425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75611df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7564825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf754c4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=27110, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
08 Jun 2013 22:30:31 | 1282401 | 15834669 | hadcm3n_zduj_1920_40_008362273_2 | 25,920 | 65,540 | 2.5285 |
©2024 climateprediction.net