Task 15915888

Name: hadcm3n_zgat_1920_40_008365430_4
Workunit: 8516289
Created: 14 Aug 2013, 11:40:47 UTC
Sent: 14 Aug 2013, 17:55:31 UTC
Report deadline: 14 Nov 2013, 1:22:42 UTC
Received: 24 Aug 2013, 13:54:51 UTC
Server state: Over
Outcome: Computation error
Client state: Compute error
Exit status: 193 (0x000000C1) EXIT_SIGNAL
Computer ID: 1190785
Run time: 9 days 12 hours 24 min 2 sec
CPU time: 8 days 2 hours 27 min 12 sec
Validate state: Invalid
Credit: 9,331.20
Device peak FLOPS: 3.32 GFLOPS
Application version: UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-apple-darwin
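
Note on the exit status: the same code appears in three notations on this page (193, 0xC1, and -63 in the stderr message). The sketch below shows how they relate, assuming -63 is simply 193 read as a signed 8-bit value. The helper name `as_signed_byte` is made up for illustration and is not part of BOINC.

```python
def as_signed_byte(code):
    """Interpret an exit status in 0..255 as a signed 8-bit integer."""
    if not 0 <= code <= 255:
        raise ValueError("exit status must fit in one byte")
    # Values with the top bit set (128..255) wrap around to negatives.
    return code - 256 if code >= 128 else code


exit_status = 193  # value reported in the task summary
print(hex(exit_status), as_signed_byte(exit_status))
```

All three notations therefore describe one exit status, which the page labels EXIT_SIGNAL.
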
Stderr
<core_client_version>7.0.65</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
15:21:57 (25686): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:23:46 (37570): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:30:22 (37588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:34:40 (37628): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:37:14 (37658): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:49:04 (37677): No heartbeat from core client for 30 sec - exiting
15:49:05 (37677): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 63 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 64 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 65 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 66 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
Error: Input file: dataout/zgatko.pjd2c10 is not a valid UM file.
Error converting file to netcdf: dataout/zgatko.pjd2c10
Error: Input file: dataout/zgatko.pid2c10 is not a valid UM file.
Error converting file to netcdf: dataout/zgatko.pid2c10
Error: Input file: dataout/zgatko.pfd2c10 is not a valid UM file.
Error converting file to netcdf: dataout/zgatko.pfd2c10
Error: Input file: dataout/zgatka.phd2c10 is not a valid UM file.
Error converting file to netcdf: dataout/zgatka.phd2c10
Error: Input file: dataout/zgatka.pgd2c10 is not a valid UM file.
Error converting file to netcdf: dataout/zgatka.pgd2c10
Error: Input file: dataout/zgatka.ped2c10 is not a valid UM file.
Error converting file to netcdf: dataout/zgatka.ped2c10
Error: Input file: dataout/zgatka.pdd2c10 is not a valid UM file.
Error converting file to netcdf: dataout/zgatka.pdd2c10
15:53:11 (37740): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:18:12 (37782): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2b8b204: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2b8b200: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2b8b204: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2b8b200: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x138a804: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x138a800: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2b6fc04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2383004: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2383000: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2383004: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2383000: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2383004: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2383000: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0xb59c04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x136f204: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x136f200: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2b45604: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2b45600: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2b3b804: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2b3b800: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2b3b804: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
SIGSEGV: segmentation violation

Crashed executable name: hadcm3n_6.07_i686-apple-darwin
built using BOINC library version 6.13.0
Machine type Intel 80486 (32-bit executable)
System version: Macintosh OS 10.7.5 build 11G63
Sat Aug 24 14:10:02 2013

Thread 0 Crashed:

Thread 1:

atos cannot load symbols for the file hadcm3n_6.07_i686-apple-darwin for architecture i386.
Thread 0 crashed with X86 Thread State (32-bit):
  eax: 0x00000000 ebx: 0x00000000 ecx: 0x00000000 edx: 0x00000000
  edi: 0x00000000 esi: 0x00000000 ebp: 0xbff8faa8 esp: 0x00000000
   ss: 0x00000000 efl: 0x00000000 eip: 0x0007697e  cs: 0x00000000
   ds: 0x00000000  es: 0x00000000  fs: 0x00000000  gs: 0x00000000

Binary Images Description:
    0x1000 -    0x93fff /Library/Application Support/BOINC Data/slots/2/../../projects/climateprediction.net/hadcm3n_6.07_i686-apple-darwin
0x9082c000 - 0x9082cfff /usr/lib/system/libkeymgr.dylib
0x91556000 - 0x91621fff /usr/lib/system/libsystem_c.dylib
0x92d8c000 - 0x92d93fff /usr/lib/system/libsystem_notify.dylib
0x92ffa000 - 0x92ffffff /usr/lib/system/libmacho.dylib
0x93091000 - 0x930bffff /usr/lib/libSystem.B.dylib
0x931cc000 - 0x931eafff /usr/lib/system/libsystem_kernel.dylib
0x94316000 - 0x94324fff /usr/lib/system/libdispatch.dylib
0x94325000 - 0x9432efff /usr/lib/libc++abi.dylib
0x94815000 - 0x94818fff /usr/lib/system/libcompiler_rt.dylib
0x95961000 - 0x95965fff /usr/lib/system/libsystem_network.dylib
0x95f17000 - 0x95f25fff /usr/lib/libz.1.dylib
0x95f8c000 - 0x95fa2fff /usr/lib/system/libxpc.dylib
0x9747e000 - 0x97486fff /usr/lib/system/liblaunch.dylib
0x9754a000 - 0x9754bfff /usr/lib/system/libunc.dylib
0x978c1000 - 0x978c5fff /usr/lib/system/libcache.dylib
0x991bd000 - 0x9921ffff /usr/lib/libstdc++.6.dylib
0x99221000 - 0x99221fff /usr/lib/system/libdnsinfo.dylib
0x99742000 - 0x99743fff /usr/lib/system/libquarantine.dylib
0x99ed1000 - 0x99ed8fff /usr/lib/system/libsystem_dnssd.dylib
0x99ed9000 - 0x99f1cfff /usr/lib/system/libcommonCrypto.dylib
0x9a010000 - 0x9a03ffff /usr/lib/system/libsystem_info.dylib
0x9a4f3000 - 0x9a4f4fff /usr/lib/system/libsystem_blocks.dylib
0x9bcdd000 - 0x9bce5fff /usr/lib/system/libcopyfile.dylib
0x9c196000 - 0x9c197fff /usr/lib/system/libsystem_sandbox.dylib
0x9c77b000 - 0x9c77efff /usr/lib/system/libmathCommon.A.dylib
0x9c7dd000 - 0x9c7defff /usr/lib/system/libremovefile.dylib
0x9ca05000 - 0x9ca07fff /usr/lib/system/libdyld.dylib
0x9ca08000 - 0x9ca10fff /usr/lib/system/libunwind.dylib


Exiting...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
24 Aug 2013 13:59:24  1190785  15915888   hadcm3n_zgat_1920_40_008365430_4  777,600   700,027         0.9002
24 Aug 2013 05:27:18  1190785  15915888   hadcm3n_zgat_1920_40_008365430_4  751,680   676,616         0.9001
23 Aug 2013 21:54:50  1190785  15915888   hadcm3n_zgat_1920_40_008365430_4  725,760   653,157         0.9000
23 Aug 2013 13:17:17  1190785  15915888   hadcm3n_zgat_1920_40_008365430_4  699,840   629,741         0.8998
23 Aug 2013 10:48:49  1190785  15915888   hadcm3n_zgat_1920_40_008365430_4  673,920   606,356         0.8997
23 Aug 2013 10:48:49  1190785  15915888   hadcm3n_zgat_1920_40_008365430_4  648,000   582,926         0.8996
22 Aug 2013 15:42:13  1190785  15915888   hadcm3n_zgat_1920_40_008365430_4  622,080   559,450         0.8993
22 Aug 2013 07:03:59  1190785  15915888   hadcm3n_zgat_1920_40_008365430_4  596,160   536,046         0.8992
21 Aug 2013 23:25:37  1190785  15915888   hadcm3n_zgat_1920_40_008365430_4  570,240   512,682         0.8991
21 Aug 2013 15:33:17  1190785  15915888   hadcm3n_zgat_1920_40_008365430_4  544,320   489,306         0.8989
21 Aug 2013 08:01:36  1190785  15915888   hadcm3n_zgat_1920_40_008365430_4  518,400   465,989         0.8989
21 Aug 2013 00:18:28  1190785  15915888   hadcm3n_zgat_1920_40_008365430_4  492,480   442,596         0.8987
20 Aug 2013 16:50:34  1190785  15915888   hadcm3n_zgat_1920_40_008365430_4  466,560   419,272         0.8986
20 Aug 2013 09:54:06  1190785  15915888   hadcm3n_zgat_1920_40_008365430_4  440,640   395,882         0.8984
20 Aug 2013 01:46:07  1190785  15915888   hadcm3n_zgat_1920_40_008365430_4  414,720   372,479         0.8981
19 Aug 2013 15:26:01  1190785  15915888   hadcm3n_zgat_1920_40_008365430_4  388,800   349,108         0.8979
19 Aug 2013 06:39:15  1190785  15915888   hadcm3n_zgat_1920_40_008365430_4  362,880   325,774         0.8977
18 Aug 2013 23:07:32  1190785  15915888   hadcm3n_zgat_1920_40_008365430_4  336,960   302,399         0.8974
18 Aug 2013 15:36:10  1190785  15915888   hadcm3n_zgat_1920_40_008365430_4  311,040   278,989         0.8970
18 Aug 2013 07:54:35  1190785  15915888   hadcm3n_zgat_1920_40_008365430_4  285,120   255,840         0.8973
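
Note on the Average (sec/TS) column: the figures are consistent with cumulative CPU time divided by the cumulative timestep count, rounded to four decimals. The sketch below checks that reading against three rows copied from the table. The formula is inferred from the numbers, not documented on this page, and the function name is hypothetical.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
rows = [
    (777_600, 700_027, 0.9002),
    (751_680, 676_616, 0.9001),
    (285_120, 255_840, 0.8973),
]


def average_sec_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds divided by cumulative timesteps (assumed formula)."""
    return cpu_time_sec / timestep


for timestep, cpu_sec, reported in rows:
    computed = average_sec_per_timestep(cpu_sec, timestep)
    print(f"{timestep:>9,}  computed {computed:.4f}  reported {reported:.4f}")
```

At roughly 0.9 CPU seconds per timestep, the rate is steady across the listed trickles, so the table itself shows no slowdown before the crash.
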


©2024 climateprediction.net