Name | hadcm3n_u0yq_2020_40_008337985_1 |
Workunit | 8488846 |
Created | 6 Apr 2013, 21:35:42 UTC |
Sent | 7 Apr 2013, 9:02:56 UTC |
Report deadline | 7 Jul 2013, 16:30:07 UTC |
Received | 17 Apr 2013, 1:04:26 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1276234 |
Run time | 9 days 13 hours 22 min 58 sec |
CPU time | 7 days 11 hours 4 min 20 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 1.86 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-apple-darwin |
Stderr | <core_client_version>7.0.31</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... 18:40:01 (80312): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:13:40 (3815): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:24:14 (15234): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:14:13 (36114): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:14:17 (36114): No heartbeat from core client for 30 sec - exiting 20:15:33 (42738): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:15:34 (42738): No heartbeat from core client for 30 sec - exiting 06:23:46 (42748): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:19:38 (80605): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 Model crashed: WRITHEAD: I/O error tmp/pipe_dummy 2048 Suspended CPDN Monitor - Suspend request from BOINC... 14:13:43 (96974): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:13:45 (96974): No heartbeat from core client for 30 sec - exiting 14:15:57 (13332): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:17:31 (15226): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 01:15:18 (27682): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:21:08 (51635): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:17:16 (55225): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 12:20:04 (57813): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 Model crashed: WRITHEAD: I/O error tmp/pipe_dummy 2048 17:17:42 (37484): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 Model crashed: WRITHEAD: I/O error tmp/pipe_dummy 2048 OPEN: File Creation Failed: No05:15:35 (48853): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 13:12:08 (80718): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:12:34 (170): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:22:07 (36374): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:18:27 (55276): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 11:20:37 (61435): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:11:48 (98566): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... hadcm3n_6.07_i686-apple-darwin(5545,0xacda82c0) malloc: *** error for object 0x2039c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(5545,0xacda82c0) malloc: *** error for object 0x2055204: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(5545,0xacda82c0) malloc: *** error for object 0x2055200: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(5545,0xacda82c0) malloc: *** error for object 0x833604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(5545,0xacda82c0) malloc: *** error for object 0x833600: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(5545,0xacda82c0) malloc: *** error for object 0x84ec04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(5545,0xacda82c0) malloc: *** error for object 0x84ec00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(5545,0xacda82c0) malloc: *** error for object 0x382dc04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(5545,0xacda82c0) malloc: *** error for object 0x3849204: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(5545,0xacda82c0) malloc: *** error for object 0x3849200: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug SIGSEGV: segmentation violation Crashed executable name: hadcm3n_6.07_i686-apple-darwin built using BOINC library version 6.13.0 Machine type Intel 80486 (32-bit executable) System version: Macintosh OS 10.7.5 build 11G63 Tue Apr 16 19:39:35 2013 Thread 0 Crashed: Thread 1: atos cannot load symbols for the file hadcm3n_6.07_i686-apple-darwin for architecture i386. Thread 0 crashed with X86 Thread State (32-bit): eax: 0x00000000 ebx: 0x00000000 ecx: 0x00000000 edx: 0x00000000 edi: 0x00000000 esi: 0x00000000 ebp: 0xbff8fa38 esp: 0x00000000 ss: 0x00000000 efl: 0x00000000 eip: 0x0007697e cs: 0x00000000 ds: 0x00000000 es: 0x00000000 fs: 0x00000000 gs: 0x00000000 Binary Images Description: 0x1000 - 0x93fff /Library/Application Support/BOINC Data/slots/11/../../projects/climateprediction.net/hadcm3n_6.07_i686-apple-darwin 0x90486000 - 0x9048dfff /usr/lib/system/libsystem_notify.dylib 0x9097d000 - 0x9097efff /usr/lib/system/libquarantine.dylib 0x91e38000 - 0x91e41fff /usr/lib/libc++abi.dylib 0x91fca000 - 0x91ff8fff /usr/lib/libSystem.B.dylib 0x92bc4000 - 0x92bc5fff /usr/lib/system/libunc.dylib 0x930c6000 - 0x930c7fff /usr/lib/system/libsystem_sandbox.dylib 0x930d4000 - 0x930d6fff /usr/lib/system/libdyld.dylib 0x932ec000 - 0x932f4fff /usr/lib/system/libunwind.dylib 0x93fa4000 - 0x93fb2fff /usr/lib/system/libdispatch.dylib 0x94259000 - 0x94288fff /usr/lib/system/libsystem_info.dylib 0x94289000 - 0x942ebfff /usr/lib/libstdc++.6.dylib 0x95404000 - 0x95404fff /usr/lib/system/libdnsinfo.dylib 0x95702000 - 0x9570afff /usr/lib/system/libcopyfile.dylib 0x95747000 - 0x9574ffff /usr/lib/system/liblaunch.dylib 0x9593a000 - 0x9593efff /usr/lib/system/libsystem_network.dylib 0x95c5c000 - 0x95c5dfff /usr/lib/system/libremovefile.dylib 0x95cfc000 - 0x95d0afff /usr/lib/libz.1.dylib 0x95d0b000 - 0x95d12fff /usr/lib/system/libsystem_dnssd.dylib 0x96139000 - 0x9614ffff /usr/lib/system/libxpc.dylib 0x96150000 - 0x96193fff /usr/lib/system/libcommonCrypto.dylib 0x98bd0000 - 0x98c9bfff /usr/lib/system/libsystem_c.dylib 0x99282000 - 0x99285fff /usr/lib/system/libmathCommon.A.dylib 0x99901000 - 0x99901fff /usr/lib/system/libkeymgr.dylib 0x9aa05000 - 0x9aa06fff /usr/lib/system/libsystem_blocks.dylib 0x9aa1a000 - 0x9aa1efff /usr/lib/system/libcache.dylib 0x9ab1c000 - 0x9ab1ffff /usr/lib/system/libcompiler_rt.dylib 0x9b990000 - 0x9b995fff /usr/lib/system/libmacho.dylib 0x9cacd000 - 0x9caebfff /usr/lib/system/libsystem_kernel.dylib Exiting... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
17 Apr 2013 01:09:18 | 1276234 | 15713435 | hadcm3n_u0yq_2020_40_008337985_1 | 259,200 | 644,657 | 2.4871 |
16 Apr 2013 02:47:42 | 1276234 | 15713435 | hadcm3n_u0yq_2020_40_008337985_1 | 233,280 | 570,646 | 2.4462 |
15 Apr 2013 03:33:39 | 1276234 | 15713435 | hadcm3n_u0yq_2020_40_008337985_1 | 207,360 | 493,326 | 2.3791 |
14 Apr 2013 04:25:06 | 1276234 | 15713435 | hadcm3n_u0yq_2020_40_008337985_1 | 181,440 | 415,414 | 2.2895 |
13 Apr 2013 05:37:29 | 1276234 | 15713435 | hadcm3n_u0yq_2020_40_008337985_1 | 155,520 | 449,774 | 2.8921 |
12 Apr 2013 06:08:27 | 1276234 | 15713435 | hadcm3n_u0yq_2020_40_008337985_1 | 129,600 | 371,366 | 2.8655 |
11 Apr 2013 06:33:18 | 1276234 | 15713435 | hadcm3n_u0yq_2020_40_008337985_1 | 103,680 | 307,220 | 2.9632 |
10 Apr 2013 07:07:39 | 1276234 | 15713435 | hadcm3n_u0yq_2020_40_008337985_1 | 77,760 | 230,328 | 2.9620 |
09 Apr 2013 08:04:44 | 1276234 | 15713435 | hadcm3n_u0yq_2020_40_008337985_1 | 51,840 | 154,685 | 2.9839 |
08 Apr 2013 09:22:20 | 1276234 | 15713435 | hadcm3n_u0yq_2020_40_008337985_1 | 25,920 | 77,641 | 2.9954 |
©2024 climateprediction.net