Name | hadcm3n_48x9_1940_40_008308006_0 |
Workunit | 8459141 |
Created | 7 Feb 2013, 15:47:29 UTC |
Sent | 7 Feb 2013, 15:53:02 UTC |
Report deadline | 9 May 2013, 23:20:13 UTC |
Received | 15 Feb 2013, 20:45:18 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 383878 |
Run time | 6 days 23 hours 31 min 25 sec |
CPU time | 5 days 13 hours 52 min 10 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 2.48 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-apple-darwin |
Stderr | <core_client_version>7.0.31</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63) </message> <stderr_txt> 05:21:00 (81996): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:54:13 (39932): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:54:15 (39932): No heartbeat from core client for 30 sec - exiting 19:54:16 (39932): No heartbeat from core client for 30 sec - exiting 19:54:17 (39932): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:09:08 (58340): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:09:10 (58340): No heartbeat from core client for 30 sec - exiting 19:09:11 (58340): No heartbeat from core client for 30 sec - exiting 19:09:12 (58340): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:07:20 (76571): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:46:17 (91454): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:48:19 (92922): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:50:15 (94021): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... hadcm3n_6.07_i686-apple-darwin(2476,0xa0b41540) malloc: *** error for object 0x831004: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(2476,0xa0b41540) malloc: *** error for object 0x701f604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(2476,0xa0b41540) malloc: *** error for object 0x701f600: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(2476,0xa0b41540) malloc: *** error for object 0x481b604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(2476,0xa0b41540) malloc: *** error for object 0x701f604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(2476,0xa0b41540) malloc: *** error for object 0x5832204: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(2476,0xa0b41540) malloc: *** error for object 0x8800e04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(2476,0xa0b41540) malloc: *** error for object 0x8801c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(2476,0xa0b41540) malloc: *** error for object 0x8801c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(2476,0xa0b41540) malloc: *** error for object 0x7004e04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(2476,0xa0b41540) malloc: *** error for object 0x7004e00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(2476,0xa0b41540) malloc: *** error for object 0x7004e04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(2476,0xa0b41540) malloc: *** error for object 0x7004e00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(2476,0xa0b41540) malloc: *** error for object 0x7004e04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(2476,0xa0b41540) malloc: *** error for object 0x7004e00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(2476,0xa0b41540) malloc: *** error for object 0x7004e04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug SIGSEGV: segmentation violation Crashed executable name: hadcm3n_6.07_i686-apple-darwin built using BOINC library version 6.13.0 Machine type Intel 80486 (32-bit executable) System version: Macintosh OS 10.6.8 build 10K549 Fri Feb 15 12:38:45 2013 13:01:25 (2476): No heartbeat from core client for 30 sec - exiting SIGSEGV: segmentation violation Crashed executable name: hadcm3n_6.07_i686-apple-darwin built using BOINC library version 6.13.0 Machine type Intel 80486 (32-bit executable) hadcm3n_6.07_i686-apple-darwin(3410,0xa0b41540) malloc: *** error for object 0x2843604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(3410,0xa0b41540) malloc: *** error for object 0x301f604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(3410,0xa0b41540) malloc: *** error for object 0x2843604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(3410,0xa0b41540) malloc: *** error for object 0x3825e04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug SIGSEGV: segmentation violation hadcm3n_6.07_i686-apple-darwin(3754,0xa0b41540) malloc: *** error for object 0x601b604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(3754,0xa0b41540) malloc: *** error for object 0x501b604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(3754,0xa0b41540) malloc: *** error for object 0x601b604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(3754,0xa0b41540) malloc: *** error for object 0x681b604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(3754,0xa0b41540) malloc: *** error for object 0x681c404: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(3754,0xa0b41540) malloc: *** error for object 0x681d204: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(3754,0xa0b41540) malloc: *** error for object 0x681d200: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(3754,0xa0b41540) malloc: *** error for object 0x681d204: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(3754,0xa0b41540) malloc: *** error for object 0x681d200: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(3754,0xa0b41540) malloc: *** error for object 0x681d200: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug SIGSEGV: segmentation violation Crashed executable name: hadcm3n_6.07_i686-apple-darwin built using BOINC library version 6.13.0 Machine type Intel 80486 (32-bit executable) System version: Macintosh OS 10.6.8 build 10K549 Fri Feb 15 14:02:53 2013 Thread 0 Crashed: atos cannot load symbols for the file hadcm3n_6.07_i686-apple-darwin. 0 libSystem.B.dylib 0x9a157b0f small_free_list_remove_ptr + 246 1 libSystem.B.dylib 0x9a1545cc szone_free_definite_size + 3457 2 libSystem.B.dylib 0x9a1535e8 free + 244 3 hadcm3n_6.07_i686-apple-darwin 0x0000ba58 annual_cycle(std::vector<std::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::allocator<std::basic_string<char, std::char_traits<char>, std::allocator<char> > > > const*, char const*, int, int) + 3482 4 hadcm3n_6.07_i686-apple-darwin 0x0000d36b decadalMeans(int, char const*) + 957 5 hadcm3n_6.07_i686-apple-darwin 0x000067ff doCM3Proc() + 185 6 hadcm3n_6.07_i686-apple-darwin 0x0000876a worker() + 2896 7 hadcm3n_6.07_i686-apple-darwin 0x00008aa9 main + 491 8 hadcm3n_6.07_i686-apple-darwin 0x00002676 start + 54 Thread 1: 0 libSystem.B.dylib 0x9a14cc0e mach_wait_until + 10 1 libSystem.B.dylib 0x9a1d4429 nanosleep + 345 2 libSystem.B.dylib 0x9a1d42ca usleep + 61 3 hadcm3n_6.07_i686-apple-darwin 0x00071a7c boinc_sleep(double) + 188 4 hadcm3n_6.07_i686-apple-darwin 0x00067282 timer_thread(void*) + 78 5 libSystem.B.dylib 0x9a17a259 _pthread_start + 345 6 libSystem.B.dylib 0x9a17a0de thread_start + 34 Thread 0 crashed with X86 Thread State (32-bit): eax: 0x00000000 ebx: 0x00000000 ecx: 0x00000000 edx: 0x00000000 edi: 0x00000000 esi: 0x00000000 ebp: 0xbff8f978 esp: 0x00000000 ss: 0x00000000 efl: 0x00000000 eip: 0x0007697e cs: 0x00000000 ds: 0x00000000 es: 0x00000000 fs: 0x00000000 gs: 0x00000000 Binary Images Description: 0x1000 - 0x93fff /Library/Application Support/BOINC Data/slots/11/../../projects/climateprediction.net/hadcm3n_6.07_i686-apple-darwin 0x93436000 - 0x934a0fff /usr/lib/libstdc++.6.dylib 0x94873000 - 0x94881fff /usr/lib/libz.1.dylib 0x98242000 - 0x98245fff /usr/lib/system/libmathCommon.A.dylib 0x9a14c000 - 0x9a2f3fff /usr/lib/libSystem.B.dylib Exiting... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
15 Feb 2013 18:57:25 | 383878 | 15594684 | hadcm3n_48x9_1940_40_008308006_0 | 259,200 | 479,289 | 1.8491 |
15 Feb 2013 01:40:39 | 383878 | 15594684 | hadcm3n_48x9_1940_40_008308006_0 | 233,280 | 435,464 | 1.8667 |
14 Feb 2013 02:19:18 | 383878 | 15594684 | hadcm3n_48x9_1940_40_008308006_0 | 207,360 | 390,347 | 1.8825 |
13 Feb 2013 02:04:57 | 383878 | 15594684 | hadcm3n_48x9_1940_40_008308006_0 | 181,440 | 345,140 | 1.9022 |
12 Feb 2013 00:47:41 | 383878 | 15594684 | hadcm3n_48x9_1940_40_008308006_0 | 155,520 | 298,690 | 1.9206 |
11 Feb 2013 02:34:52 | 383878 | 15594684 | hadcm3n_48x9_1940_40_008308006_0 | 129,600 | 249,021 | 1.9215 |
10 Feb 2013 09:32:20 | 383878 | 15594684 | hadcm3n_48x9_1940_40_008308006_0 | 103,680 | 199,231 | 1.9216 |
09 Feb 2013 16:00:48 | 383878 | 15594684 | hadcm3n_48x9_1940_40_008308006_0 | 77,760 | 149,657 | 1.9246 |
08 Feb 2013 22:23:59 | 383878 | 15594684 | hadcm3n_48x9_1940_40_008308006_0 | 51,840 | 99,916 | 1.9274 |
08 Feb 2013 07:06:25 | 383878 | 15594684 | hadcm3n_48x9_1940_40_008308006_0 | 25,920 | 50,155 | 1.9350 |
©2024 climateprediction.net