Name | hadcm3n_o619_1980_40_007858715_1 |
Workunit | 8013827 |
Created | 5 Apr 2012, 20:30:58 UTC |
Sent | 5 Apr 2012, 20:39:21 UTC |
Report deadline | 6 Jul 2012, 4:06:32 UTC |
Received | 21 Apr 2012, 10:49:07 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1179768 |
Run time | 6 days 20 hours 10 min 39 sec |
CPU time | 6 days 19 hours 56 min 4 sec |
Validate state | Invalid |
Credit | 6,220.80 |
Device peak FLOPS | 3.48 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-apple-darwin |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:00:32 (34668): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... hadcm3n_6.07_i686-apple-darwin(80061,0xa014f540) malloc: *** error for object 0x680ce04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(80061,0xa014f540) malloc: *** error for object 0x680dc04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(80061,0xa014f540) malloc: *** error for object 0x680dc00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(80061,0xa014f540) malloc: *** error for object 0x8808e04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(80061,0xa014f540) malloc: *** error for object 0x8809c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(80061,0xa014f540) malloc: *** error for object 0x8809c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(80061,0xa014f540) malloc: *** error for object 0x8809c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(80061,0xa014f540) malloc: *** error for object 0x8809c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(80061,0xa014f540) malloc: *** error for object 0x8809c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(80061,0xa014f540) malloc: *** error for object 0x8809c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(80061,0xa014f540) malloc: *** error for object 0x2800e00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug SIGSEGV: segmentation violation Crashed executable name: hadcm3n_6.07_i686-apple-darwin built using BOINC library version 6.13.0 Machine type Intel 80486 (32-bit executable) System version: Macintosh OS 10.6.8 build 10K549 Sat Apr 21 11:48:27 2012 Thread 0 Crashed: atos cannot load symbols for the file hadcm3n_6.07_i686-apple-darwin. 0 libSystem.B.dylib 0x91075b0f small_free_list_remove_ptr + 246 1 libSystem.B.dylib 0x910725cc szone_free_definite_size + 3457 2 libSystem.B.dylib 0x910715e8 free + 244 3 hadcm3n_6.07_i686-apple-darwin 0x00022341 ncio_px_free + 97 4 hadcm3n_6.07_i686-apple-darwin 0x0002290c ncio_free + 28 5 hadcm3n_6.07_i686-apple-darwin 0x00022f09 ncio_close + 57 6 hadcm3n_6.07_i686-apple-darwin 0x0001cdd7 nc_close + 103 7 hadcm3n_6.07_i686-apple-darwin 0x0000ba9f annual_cycle(std::vector<std::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::allocator<std::basic_string<char, std::char_traits<char>, std::allocator<char> > > > const*, char const*, int, int) + 3553 8 hadcm3n_6.07_i686-apple-darwin 0x0000d36b decadalMeans(int, char const*) + 957 9 hadcm3n_6.07_i686-apple-darwin 0x000067ff doCM3Proc() + 185 10 hadcm3n_6.07_i686-apple-darwin 0x0000791c mainLoop() + 410 11 hadcm3n_6.07_i686-apple-darwin 0x000087c7 worker() + 2989 12 hadcm3n_6.07_i686-apple-darwin 0x00008aa9 main + 491 13 hadcm3n_6.07_i686-apple-darwin 0x00002676 start + 54 Thread 1: 0 libSystem.B.dylib 0x9106ac0e mach_wait_until + 10 1 libSystem.B.dylib 0x910f2429 nanosleep + 345 2 libSystem.B.dylib 0x910f22ca usleep + 61 3 hadcm3n_6.07_i686-apple-darwin 0x00071a7c boinc_sleep(double) + 188 4 hadcm3n_6.07_i686-apple-darwin 0x00067282 timer_thread(void*) + 78 5 libSystem.B.dylib 0x91098259 _pthread_start + 345 6 libSystem.B.dylib 0x910980de thread_start + 34 Thread 0 crashed with X86 Thread State (32-bit): eax: 0x00000000 ebx: 0x00000000 ecx: 0x00000000 edx: 0x00000000 edi: 0x00000000 esi: 0x00000000 ebp: 0xbff8f618 esp: 0x00000000 ss: 0x00000000 efl: 0x00000000 eip: 0x0007697e cs: 0x00000000 ds: 0x00000000 es: 0x00000000 fs: 0x00000000 gs: 0x00000000 Binary Images Description: 0x1000 - 0x93fff /Library/Application Support/BOINC Data/slots/1/../../projects/climateprediction.net/hadcm3n_6.07_i686-apple-darwin 0x9106a000 - 0x91211fff /usr/lib/libSystem.B.dylib 0x973b7000 - 0x973bafff /usr/lib/system/libmathCommon.A.dylib 0x9a54e000 - 0x9a55cfff /usr/lib/libz.1.dylib 0x9ab23000 - 0x9ab8dfff /usr/lib/libstdc++.6.dylib Exiting... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
21 Apr 2012 09:52:29 | 1179768 | 14367421 | hadcm3n_o619_1980_40_007858715_1 | 518,400 | 590,161 | 1.1384 |
21 Apr 2012 01:37:34 | 1179768 | 14367421 | hadcm3n_o619_1980_40_007858715_1 | 492,480 | 560,603 | 1.1383 |
20 Apr 2012 05:23:22 | 1179768 | 14367421 | hadcm3n_o619_1980_40_007858715_1 | 466,560 | 531,051 | 1.1382 |
19 Apr 2012 21:22:17 | 1179768 | 14367421 | hadcm3n_o619_1980_40_007858715_1 | 440,640 | 501,331 | 1.1377 |
19 Apr 2012 00:54:24 | 1179768 | 14367421 | hadcm3n_o619_1980_40_007858715_1 | 414,720 | 471,842 | 1.1377 |
18 Apr 2012 04:37:38 | 1179768 | 14367421 | hadcm3n_o619_1980_40_007858715_1 | 388,800 | 442,182 | 1.1373 |
17 Apr 2012 20:31:51 | 1179768 | 14367421 | hadcm3n_o619_1980_40_007858715_1 | 362,880 | 412,691 | 1.1373 |
17 Apr 2012 00:44:39 | 1179768 | 14367421 | hadcm3n_o619_1980_40_007858715_1 | 336,960 | 383,112 | 1.1370 |
16 Apr 2012 03:58:30 | 1179768 | 14367421 | hadcm3n_o619_1980_40_007858715_1 | 311,040 | 353,714 | 1.1372 |
15 Apr 2012 20:18:29 | 1179768 | 14367421 | hadcm3n_o619_1980_40_007858715_1 | 285,120 | 324,396 | 1.1378 |
15 Apr 2012 11:47:25 | 1179768 | 14367421 | hadcm3n_o619_1980_40_007858715_1 | 259,200 | 294,950 | 1.1379 |
15 Apr 2012 04:14:34 | 1179768 | 14367421 | hadcm3n_o619_1980_40_007858715_1 | 233,280 | 265,536 | 1.1383 |
14 Apr 2012 19:21:48 | 1179768 | 14367421 | hadcm3n_o619_1980_40_007858715_1 | 207,360 | 236,327 | 1.1397 |
14 Apr 2012 11:08:51 | 1179768 | 14367421 | hadcm3n_o619_1980_40_007858715_1 | 181,440 | 206,880 | 1.1402 |
14 Apr 2012 03:20:50 | 1179768 | 14367421 | hadcm3n_o619_1980_40_007858715_1 | 155,520 | 177,402 | 1.1407 |
13 Apr 2012 19:03:27 | 1179768 | 14367421 | hadcm3n_o619_1980_40_007858715_1 | 129,600 | 147,901 | 1.1412 |
12 Apr 2012 23:11:44 | 1179768 | 14367421 | hadcm3n_o619_1980_40_007858715_1 | 103,680 | 118,502 | 1.1430 |
12 Apr 2012 02:58:02 | 1179768 | 14367421 | hadcm3n_o619_1980_40_007858715_1 | 77,760 | 88,940 | 1.1438 |
11 Apr 2012 06:45:01 | 1179768 | 14367421 | hadcm3n_o619_1980_40_007858715_1 | 51,840 | 59,416 | 1.1461 |
10 Apr 2012 05:09:37 | 1179768 | 14367421 | hadcm3n_o619_1980_40_007858715_1 | 25,920 | 29,794 | 1.1495 |
©2024 cpdn.org