Name | hadcm3n_o0hg_1980_40_007958621_1 |
Workunit | 8113733 |
Created | 9 May 2012, 21:22:47 UTC |
Sent | 10 May 2012, 3:48:40 UTC |
Report deadline | 9 Aug 2012, 11:15:51 UTC |
Received | 16 Jun 2012, 4:48:01 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1179592 |
Run time | 29 days 19 hours 53 min 21 sec |
CPU time | 29 days 16 hours 53 min 24 sec |
Validate state | Invalid |
Credit | 9,331.20 |
Device peak FLOPS | 1.37 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:34:52 (19502): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:04:10 (23694): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:24:50 (24187): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:24:57 (24187): No heartbeat from core client for 30 sec - exiting 17:27:20 (26416): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:20:18 (26640): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:17:05 (26874): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:22:33 (27177): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:27:53 (27386): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:33:58 (27592): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:36:32 (27804): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:39:39 (28209): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:47:17 (28433): No heartbeat from core client for 30 sec - exiting 17:47:23 (28433): No heartbeat from core client for 30 sec - exiting 17:47:24 (28433): No heartbeat from core client for 30 sec - exiting 17:47:25 (28433): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:06:00 (28677): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:47:29 (29060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:53:18 (29431): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:57:56 (29636): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 23:45:19 (29840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:32:12 (30139): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:48:22 (30357): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 15:49:03 (31407): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:03:07 (33366): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:35:17 (33618): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:47:09 (34086): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:39:54 (34335): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:43:47 (34553): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... *** glibc detected *** ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu: double free or corruption (out): 0x09cfe958 *** ======= Backtrace: ========= /lib32/libc.so.6(+0x6bff1)[0xf7569ff1] /lib32/libc.so.6(+0x6d880)[0xf756b880] /lib32/libc.so.6(cfree+0x6d)[0xf756e92d] /usr/lib32/libstdc++.so.6(_ZdlPv+0x21)[0xf77457b1] /usr/lib32/libstdc++.so.6(_ZdaPv+0x1d)[0xf774580d] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8053e8e] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8057bc4] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x804f232] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8050491] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805112c] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805137a] /lib32/libc.so.6(__libc_start_main+0xe7)[0xf7514ce7] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(__gxx_personality_v0+0x169)[0x804cb51] ======= Memory map: ======== 08048000-080e3000 r-xp 00000000 08:02 3514599 /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu 080e3000-080e4000 rw-p 0009b000 08:02 3514599 /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu 080e4000-0813b000 rw-p 00000000 00:00 0 09ca9000-09d0e000 rw-p 00000000 00:00 0 [heap] f6f00000-f6f21000 rw-p 00000000 00:00 0 f6f21000-f7000000 ---p 00000000 00:00 0 f7080000-f74fb000 rw-s 00000000 08:02 8716313 /var/lib/boinc-client/slots/13/137030 f74fb000-f74fe000 rw-p 00000000 00:00 0 f74fe000-f7652000 r-xp 00000000 08:02 9879703 /lib32/libc-2.12.1.so f7652000-f7653000 ---p 00154000 08:02 9879703 /lib32/libc-2.12.1.so f7653000-f7655000 r--p 00154000 08:02 9879703 /lib32/libc-2.12.1.so f7655000-f7656000 rw-p 00156000 08:02 9879703 /lib32/libc-2.12.1.so f7656000-f7659000 rw-p 00000000 00:00 0 f7659000-f7673000 r-xp 00000000 08:02 13436533 /usr/lib32/libgcc_s.so.1 f7673000-f7674000 r--p 00019000 08:02 13436533 /usr/lib32/libgcc_s.so.1 f7674000-f7675000 rw-p 0001a000 08:02 13436533 /usr/lib32/libgcc_s.so.1 f7675000-f7699000 r-xp 00000000 08:02 9879707 /lib32/libm-2.12.1.so f7699000-f769a000 r--p 00023000 08:02 9879707 /lib32/libm-2.12.1.so f769a000-f769b000 rw-p 00024000 08:02 9879707 /lib32/libm-2.12.1.so f769b000-f777a000 r-xp 00000000 08:02 13436540 /usr/lib32/libstdc++.so.6.0.14 f777a000-f777e000 r--p 000de000 08:02 13436540 /usr/lib32/libstdc++.so.6.0.14 f777e000-f777f000 rw-p 000e2000 08:02 13436540 /usr/lib32/libstdc++.so.6.0.14 f777f000-f7786000 rw-p 00000000 00:00 0 f7786000-f7788000 r-xp 00000000 08:02 9879706 /lib32/libdl-2.12.1.so f7788000-f7789000 r--p 00001000 08:02 9879706 /lib32/libdl-2.12.1.so f7789000-f778a000 rw-p 00002000 08:02 9879706 /lib32/libdl-2.12.1.so f778a000-f778b000 rw-p 00000000 00:00 0 f778b000-f77a0000 r-xp 00000000 08:02 9879717 /lib32/libpthread-2.12.1.so f77a0000-f77a1000 r--p 00014000 08:02 9879717 /lib32/libpthread-2.12.1.so f77a1000-f77a2000 rw-p 00015000 08:02 9879717 /lib32/libpthread-2.12.1.so f77a2000-f77a4000 rw-p 00000000 00:00 0 f77b5000-f77b6000 rw-p 00000000 00:00 0 f77b6000-f77b7000 ---p 00000000 00:00 0 f77b7000-f77ba000 rw-p 00000000 00:00 0 f77ba000-f77bc000 rw-s 00000000 08:02 8716310 /var/lib/boinc-client/slots/13/boinc_mmap_file f77bc000-f77be000 rw-p 00000000 00:00 0 f77be000-f77bf000 r-xp 00000000 00:00 0 [vdso] f77bf000-f77db000 r-xp 00000000 08:02 9879700 /lib32/ld-2.12.1.so f77db000-f77dc000 r--p 0001b000 08:02 9879700 /lib32/ld-2.12.1.so f77dc000-f77dd000 rw-p 0001c000 08:02 9879700 /lib32/ld-2.12.1.so ff8ce000-ff93d000 rw-p 00000000 00:00 0 [stack] SIGABRT: abort called Stack trace (19 frames): ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x80b80df] [0xf77be400] [0xf77be425] /lib32/libc.so.6(gsignal+0x51)[0xf7528a01] /lib32/libc.so.6(abort+0x182)[0xf752be42] /lib32/libc.so.6(+0x61f15)[0xf755ff15] /lib32/libc.so.6(+0x6bff1)[0xf7569ff1] /lib32/libc.so.6(+0x6d880)[0xf756b880] /lib32/libc.so.6(cfree+0x6d)[0xf756e92d] /usr/lib32/libstdc++.so.6(_ZdlPv+0x21)[0xf77457b1] /usr/lib32/libstdc++.so.6(_ZdaPv+0x1d)[0xf774580d] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8053e8e] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8057bc4] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x804f232] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8050491] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805112c] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805137a] /lib32/libc.so.6(__libc_start_main+0xe7)[0xf7514ce7] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(__gxx_personality_v0+0x169)[0x804cb51] Exiting... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
16 Jun 2012 04:51:07 | 1179592 | 14652125 | hadcm3n_o0hg_1980_40_007958621_1 | 777,600 | 2,566,367 | 3.3004 |
15 Jun 2012 05:25:24 | 1179592 | 14652125 | hadcm3n_o0hg_1980_40_007958621_1 | 751,680 | 2,480,752 | 3.3003 |
14 Jun 2012 02:41:02 | 1179592 | 14652125 | hadcm3n_o0hg_1980_40_007958621_1 | 725,760 | 2,389,416 | 3.2923 |
13 Jun 2012 01:18:27 | 1179592 | 14652125 | hadcm3n_o0hg_1980_40_007958621_1 | 699,840 | 2,295,693 | 3.2803 |
11 Jun 2012 23:11:36 | 1179592 | 14652125 | hadcm3n_o0hg_1980_40_007958621_1 | 673,920 | 2,204,578 | 3.2713 |
10 Jun 2012 21:54:13 | 1179592 | 14652125 | hadcm3n_o0hg_1980_40_007958621_1 | 648,000 | 2,113,713 | 3.2619 |
09 Jun 2012 21:20:37 | 1179592 | 14652125 | hadcm3n_o0hg_1980_40_007958621_1 | 622,080 | 2,024,350 | 3.2542 |
08 Jun 2012 20:52:47 | 1179592 | 14652125 | hadcm3n_o0hg_1980_40_007958621_1 | 596,160 | 1,936,645 | 3.2485 |
07 Jun 2012 20:08:13 | 1179592 | 14652125 | hadcm3n_o0hg_1980_40_007958621_1 | 570,240 | 1,849,110 | 3.2427 |
06 Jun 2012 19:43:19 | 1179592 | 14652125 | hadcm3n_o0hg_1980_40_007958621_1 | 544,320 | 1,762,465 | 3.2379 |
05 Jun 2012 16:54:47 | 1179592 | 14652125 | hadcm3n_o0hg_1980_40_007958621_1 | 518,400 | 1,669,675 | 3.2208 |
04 Jun 2012 18:33:36 | 1179592 | 14652125 | hadcm3n_o0hg_1980_40_007958621_1 | 492,480 | 1,588,629 | 3.2258 |
03 Jun 2012 16:30:11 | 1179592 | 14652125 | hadcm3n_o0hg_1980_40_007958621_1 | 466,560 | 1,496,147 | 3.2068 |
02 Jun 2012 14:05:28 | 1179592 | 14652125 | hadcm3n_o0hg_1980_40_007958621_1 | 440,640 | 1,402,552 | 3.1830 |
01 Jun 2012 11:15:48 | 1179592 | 14652125 | hadcm3n_o0hg_1980_40_007958621_1 | 414,720 | 1,308,999 | 3.1563 |
31 May 2012 09:16:56 | 1179592 | 14652125 | hadcm3n_o0hg_1980_40_007958621_1 | 388,800 | 1,216,123 | 3.1279 |
30 May 2012 07:32:45 | 1179592 | 14652125 | hadcm3n_o0hg_1980_40_007958621_1 | 362,880 | 1,123,674 | 3.0965 |
29 May 2012 07:02:05 | 1179592 | 14652125 | hadcm3n_o0hg_1980_40_007958621_1 | 336,960 | 1,034,459 | 3.0700 |
28 May 2012 08:23:58 | 1179592 | 14652125 | hadcm3n_o0hg_1980_40_007958621_1 | 311,040 | 954,973 | 3.0703 |
27 May 2012 09:45:10 | 1179592 | 14652125 | hadcm3n_o0hg_1980_40_007958621_1 | 285,120 | 874,194 | 3.0661 |
©2024 cpdn.org