Name | hadcm3n_4h6z_1940_40_008312414_0 |
Workunit | 8463549 |
Created | 8 Feb 2013, 8:03:58 UTC |
Sent | 8 Feb 2013, 13:20:18 UTC |
Report deadline | 10 May 2013, 20:47:29 UTC |
Received | 16 Feb 2013, 23:44:27 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 383878 |
Run time | 6 days 21 hours 36 min 43 sec |
CPU time | 5 days 12 hours 38 min 54 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 2.48 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-apple-darwin |
Stderr | <core_client_version>7.0.31</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:54:13 (39943): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:54:16 (39943): No heartbeat from core client for 30 sec - exiting 19:54:17 (39943): No heartbeat from core client for 30 sec - exiting 19:54:18 (39943): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:09:09 (58351): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:09:10 (58351): No heartbeat from core client for 30 sec - exiting 19:09:11 (58351): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:07:20 (76582): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:46:16 (91465): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:48:19 (92900): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:50:15 (94043): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 13:01:26 (2486): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... hadcm3n_6.07_i686-apple-darwin(6416,0xa0b41540) malloc: *** error for object 0x2020a04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(6416,0xa0b41540) malloc: *** error for object 0x2020a00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(6416,0xa0b41540) malloc: *** error for object 0x5832a04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(6416,0xa0b41540) malloc: *** error for object 0x782e804: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug SIGSEGV: segmentation violation Crashed executable name: hadcm3n_6.07_i686-apple-darwin built using BOINC library version 6.13.0 Machine type Intel 80486 (32-bit executable) System version: Macintosh OS 10.6.8 build 10K549 Sat Feb 16 06:32:41 2013 hadcm3n_6.07_i686-apple-darwin(18917,0xa0b41540) malloc: *** error for object 0x801b604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(18917,0xa0b41540) malloc: *** error for object 0x901c404: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(18917,0xa0b41540) malloc: *** error for object 0x901d204: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(18917,0xa0b41540) malloc: *** error for object 0x901d200: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(18917,0xa0b41540) malloc: *** error for object 0x901d200: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug SIGSEGV: segmentation violation Crashed executable name: hadcm3n_6.07_i686-apple-darwin built using BOINC library version 6.13.0 Machine type Intel 80486 (32-bit executable) System version: Macintosh OS 10.6.8 build 10K549 Sat Feb 16 17:08:42 2013 Thread 0 Crashed: atos cannot load symbols for the file hadcm3n_6.07_i686-apple-darwin. 0 libSystem.B.dylib 0x9a157b0f small_free_list_remove_ptr + 246 1 libSystem.B.dylib 0x9a1545cc szone_free_definite_size + 3457 2 libSystem.B.dylib 0x9a1535e8 free + 244 3 hadcm3n_6.07_i686-apple-darwin 0x0000ba58 annual_cycle(std::vector<std::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::allocator<std::basic_string<char, std::char_traits<char>, std::allocator<char> > > > const*, char const*, int, int) + 3482 4 hadcm3n_6.07_i686-apple-darwin 0x0000d36b decadalMeans(int, char const*) + 957 5 hadcm3n_6.07_i686-apple-darwin 0x000067ff doCM3Proc() + 185 6 hadcm3n_6.07_i686-apple-darwin 0x0000876a worker() + 2896 7 hadcm3n_6.07_i686-apple-darwin 0x00008aa9 main + 491 8 hadcm3n_6.07_i686-apple-darwin 0x00002676 start + 54 Thread 1: hadcm3n_6.07_i686-apple-darwin(18917,0xa0b41540) malloc: *** error for object 0x8830404: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug 0 libSystem.B.dylib 0x9a14cc0e mach_wait_until + 10 1 libSystem.B.dylib 0x9a1d4429 nanosleep + 345 2 libSystem.B.dylib 0x9a1d42ca usleep + 61 3 hadcm3n_6.07_i686-apple-darwin 0x00071a7c boinc_sleep(double) + 188 4 hadcm3n_6.07_i686-apple-darwin 0x00067282 timer_thread(void*) + 78 5 libSystem.B.dylib 0x9a17a259 _pthread_start + 345 6 libSystem.B.dylib 0x9a17a0de thread_start + 34 Thread 0 crashed with X86 Thread State (32-bit): eax: 0x00000000 ebx: 0x00000000 ecx: 0x00000000 edx: 0x00000000 edi: 0x00000000 esi: 0x00000000 ebp: 0xbff8f978 esp: 0x00000000 ss: 0x00000000 efl: 0x00000000 eip: 0x0007697e cs: 0x00000000 ds: 0x00000000 es: 0x00000000 fs: 0x00000000 gs: 0x00000000 Binary Images Description: 0x1000 - 0x93fff /Library/Application Support/BOINC Data/slots/4/../../projects/climateprediction.net/hadcm3n_6.07_i686-apple-darwin 0x93436000 - 0x934a0fff /usr/lib/libstdc++.6.dylib 0x94873000 - 0x94881fff /usr/lib/libz.1.dylib 0x98242000 - 0x98245fff /usr/lib/system/libmathCommon.A.dylib 0x9a14c000 - 0x9a2f3fff /usr/lib/libSystem.B.dylib Exiting... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
16 Feb 2013 12:45:08 | 383878 | 15600257 | hadcm3n_4h6z_1940_40_008312414_0 | 259,200 | 462,515 | 1.7844 |
15 Feb 2013 22:51:44 | 383878 | 15600257 | hadcm3n_4h6z_1940_40_008312414_0 | 233,280 | 420,779 | 1.8038 |
15 Feb 2013 05:41:32 | 383878 | 15600257 | hadcm3n_4h6z_1940_40_008312414_0 | 207,360 | 378,573 | 1.8257 |
14 Feb 2013 06:25:12 | 383878 | 15600257 | hadcm3n_4h6z_1940_40_008312414_0 | 181,440 | 334,171 | 1.8418 |
13 Feb 2013 06:40:56 | 383878 | 15600257 | hadcm3n_4h6z_1940_40_008312414_0 | 155,520 | 289,627 | 1.8623 |
12 Feb 2013 06:04:02 | 383878 | 15600257 | hadcm3n_4h6z_1940_40_008312414_0 | 129,600 | 244,954 | 1.8901 |
11 Feb 2013 08:01:02 | 383878 | 15600257 | hadcm3n_4h6z_1940_40_008312414_0 | 103,680 | 196,303 | 1.8934 |
10 Feb 2013 15:15:47 | 383878 | 15600257 | hadcm3n_4h6z_1940_40_008312414_0 | 77,760 | 147,400 | 1.8956 |
09 Feb 2013 22:14:11 | 383878 | 15600257 | hadcm3n_4h6z_1940_40_008312414_0 | 51,840 | 98,476 | 1.8996 |
09 Feb 2013 05:21:01 | 383878 | 15600257 | hadcm3n_4h6z_1940_40_008312414_0 | 25,920 | 49,439 | 1.9074 |
©2024 cpdn.org