Name | hadcm3n_o2wz_1940_40_007447424_1 |
Workunit | 7644927 |
Created | 9 Sep 2011, 17:58:24 UTC |
Sent | 16 Sep 2011, 5:43:37 UTC |
Report deadline | 16 Dec 2011, 13:10:48 UTC |
Received | 4 Oct 2011, 16:26:31 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 689064 |
Run time | 15 days 9 hours 55 min |
CPU time | 15 days 9 hours 55 min |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 1.28 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>6.2.14</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1) </message> <stderr_txt> 09:45:56 (5897): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:44:51 (15554): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:54:57 (11003): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:49:39 (11906): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:03:34 (20752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:08:24 (21866): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:22:19 (26969): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:06:48 (28062): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:11:24 (22548): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... SIGSEGV: segmentation violation Stack trace (11 frames): CPDN Monitor - Quit request from BOINC... 20:03:07 (8806): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... *** glibc detected *** hadcm3n_6.07_i686-pc-linux-gnu: double free or corruption (out): 0x08190a70 *** ======= Backtrace: ========= /lib/i686/cmov/libc.so.6[0xb7d4ceed] /lib/i686/cmov/libc.so.6(cfree+0x90)[0xb7d50530] /usr/lib/libstdc++.so.6(_ZdlPv+0x21)[0xb7f0e611] /usr/lib/libstdc++.so.6(_ZdaPv+0x1d)[0xb7f0e66d] hadcm3n_6.07_i686-pc-linux-gnu[0x8053e8e] hadcm3n_6.07_i686-pc-linux-gnu[0x8057bc4] hadcm3n_6.07_i686-pc-linux-gnu[0x804f232] hadcm3n_6.07_i686-pc-linux-gnu[0x8050491] hadcm3n_6.07_i686-pc-linux-gnu[0x805112c] hadcm3n_6.07_i686-pc-linux-gnu[0x805137a] /lib/i686/cmov/libc.so.6(__libc_start_main+0xdc)[0xb7cfaebc] hadcm3n_6.07_i686-pc-linux-gnu(__gxx_personality_v0+0x169)[0x804cb51] ======= Memory map: ======== 08048000-080e3000 r-xp 00000000 08:01 14157209 /mnt/raid/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu 080e3000-080e4000 rwxp 0009b000 08:01 14157209 /mnt/raid/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu 080e4000-081a1000 rwxp 080e4000 00:00 0 [heap] b7700000-b7721000 rwxp b7700000 00:00 0 b7721000-b7800000 ---p b7721000 00:00 0 b7868000-b7ce3000 rwxs 00000000 08:01 14155879 /mnt/raid/var/lib/boinc-client/slots/1/137925 b7ce3000-b7ce5000 rwxp b7ce3000 00:00 0 b7ce5000-b7e22000 r-xp 00000000 08:17 231370 /lib/i686/cmov/libc-2.5.so b7e22000-b7e23000 r-xp 0013d000 08:17 231370 /lib/i686/cmov/libc-2.5.so b7e23000-b7e25000 rwxp 0013e000 08:17 231370 /lib/i686/cmov/libc-2.5.so b7e25000-b7e28000 rwxp b7e25000 00:00 0 b7e28000-b7e32000 r-xp 00000000 08:17 231149 /lib/libgcc_s.so.1 b7e32000-b7e33000 rwxp 00009000 08:17 231149 /lib/libgcc_s.so.1 b7e33000-b7e58000 r-xp 00000000 08:17 231374 /lib/i686/cmov/libm-2.5.so b7e58000-b7e5a000 rwxp 00024000 08:17 231374 /lib/i686/cmov/libm-2.5.so b7e5a000-b7f38000 r-xp 00000000 08:17 443734 /usr/lib/libstdc++.so.6.0.9 b7f38000-b7f3b000 r-xp 000dd000 08:17 443734 /usr/lib/libstdc++.so.6.0.9 b7f3b000-b7f3d000 rwxp 000e0000 08:17 443734 /usr/lib/libstdc++.so.6.0.9 b7f3d000-b7f44000 rwxp b7f3d000 00:00 0 b7f44000-b7f46000 r-xp 00000000 08:17 231373 /lib/i686/cmov/libdl-2.5.so b7f46000-b7f48000 rwxp 00001000 08:17 231373 /lib/i686/cmov/libdl-2.5.so b7f48000-b7f5b000 r-xp 00000000 08:17 231384 /lib/i686/cmov/libpthread-2.5.so b7f5b000-b7f5d000 rwxp 00013000 08:17 231384 /lib/i686/cmov/libpthread-2.5.so b7f5d000-b7f5f000 rwxp b7f5d000 00:00 0 b7f70000-b7f71000 rwxp b7f70000 00:00 0 b7f71000-b7f72000 ---p b7f71000 00:00 0 b7f72000-b7f75000 rwxp b7f72000 00:00 0 b7f75000-b7f77000 rwxs 00000000 00:08 294914 /SYSV010100af (deleted) b7f77000-b7f79000 rwxp b7f77000 00:00 0 b7f79000-b7f7a000 r-xp b7f79000 00:00 0 [vdso] b7f7a000-b7f95000 r-xp 00000000 08:17 979753 /lib/ld-2.5.so b7f95000-b7f97000 rwxp 0001b000 08:17 979753 /lib/ld-2.5.so bfc1f000-bfc90000 rw-p bfc1f000 00:00 0 [stack] SIGABRT: abort called Stack trace (17 frames): hadcm3n_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x80b80df] [0xb7f79420] [0xb7f79410] /lib/i686/cmov/libc.so.6(abort+0x101)[0xb7d105b1] /lib/i686/cmov/libc.so.6[0xb7d4508b] /lib/i686/cmov/libc.so.6[0xb7d4ceed] /lib/i686/cmov/libc.so.6(cfree+0x90)[0xb7d50530] /usr/lib/libstdc++.so.6(_ZdlPv+0x21)[0xb7f0e611] /usr/lib/libstdc++.so.6(_ZdaPv+0x1d)[0xb7f0e66d] hadcm3n_6.07_i686-pc-linux-gnu[0x8053e8e] hadcm3n_6.07_i686-pc-linux-gnu[0x8057bc4] hadcm3n_6.07_i686-pc-linux-gnu[0x804f232] hadcm3n_6.07_i686-pc-linux-gnu[0x8050491] hadcm3n_6.07_i686-pc-linux-gnu[0x805112c] hadcm3n_6.07_i686-pc-linux-gnu[0x805137a] /lib/i686/cmov/libc.so.6(__libc_start_main+0xdc)[0xb7cfaebc] hadcm3n_6.07_i686-pc-linux-gnu(__gxx_personality_v0+0x169)[0x804cb51] Exiting... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
04 Oct 2011 16:24:06 | 689064 | 13359161 | hadcm3n_o2wz_1940_40_007447424_1 | 259,200 | 1,331,686 | 5.1377 |
04 Oct 2011 16:24:06 | 689064 | 13359161 | hadcm3n_o2wz_1940_40_007447424_1 | 233,280 | 1,216,012 | 5.2127 |
04 Oct 2011 16:24:06 | 689064 | 13359161 | hadcm3n_o2wz_1940_40_007447424_1 | 207,360 | 1,100,742 | 5.3084 |
04 Oct 2011 16:24:06 | 689064 | 13359161 | hadcm3n_o2wz_1940_40_007447424_1 | 181,440 | 985,443 | 5.4312 |
04 Oct 2011 16:24:06 | 689064 | 13359161 | hadcm3n_o2wz_1940_40_007447424_1 | 155,520 | 869,808 | 5.5929 |
04 Oct 2011 16:24:06 | 689064 | 13359161 | hadcm3n_o2wz_1940_40_007447424_1 | 129,600 | 753,018 | 5.8103 |
04 Oct 2011 16:24:05 | 689064 | 13359161 | hadcm3n_o2wz_1940_40_007447424_1 | 103,680 | 637,354 | 6.1473 |
04 Oct 2011 16:24:05 | 689064 | 13359161 | hadcm3n_o2wz_1940_40_007447424_1 | 77,760 | 520,901 | 6.6988 |
04 Oct 2011 16:24:06 | 689064 | 13359161 | hadcm3n_o2wz_1940_40_007447424_1 | 51,840 | 404,198 | 7.7970 |
04 Oct 2011 16:24:06 | 689064 | 13359161 | hadcm3n_o2wz_1940_40_007447424_1 | 25,920 | 216,975 | 8.3709 |
©2024 climateprediction.net