Task 15713435

Name hadcm3n_u0yq_2020_40_008337985_1
Workunit 8488846
Created 6 Apr 2013, 21:35:42 UTC
Sent 7 Apr 2013, 9:02:56 UTC
Report deadline 7 Jul 2013, 16:30:07 UTC
Received 17 Apr 2013, 1:04:26 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 1276234
Run time 9 days 13 hours 22 min 58 sec
CPU time 7 days 11 hours 4 min 20 sec
Validate state Invalid
Credit 3,110.40
Device peak FLOPS 1.86 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-apple-darwin
Stderr
<core_client_version>7.0.31</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
18:40:01 (80312): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:13:40 (3815): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:24:14 (15234): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:14:13 (36114): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:14:17 (36114): No heartbeat from core client for 30 sec - exiting
20:15:33 (42738): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:15:34 (42738): No heartbeat from core client for 30 sec - exiting
06:23:46 (42748): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:19:38 (80605): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1

Model crashed: WRITHEAD: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
Suspended CPDN Monitor - Suspend request from BOINC...
14:13:43 (96974): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:13:45 (96974): No heartbeat from core client for 30 sec - exiting
14:15:57 (13332): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:17:31 (15226): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
01:15:18 (27682): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:21:08 (51635): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:17:16 (55225): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/pipe_dummy                                                                  2048    
12:20:04 (57813): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1

Model crashed: WRITHEAD: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
17:17:42 (37484): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1

Model crashed: WRITHEAD: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
OPEN:  File Creation Failed: No05:15:35 (48853): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
13:12:08 (80718): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:12:34 (170): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:22:07 (36374): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:18:27 (55276): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
11:20:37 (61435): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:11:48 (98566): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
hadcm3n_6.07_i686-apple-darwin(5545,0xacda82c0) malloc: *** error for object 0x2039c04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(5545,0xacda82c0) malloc: *** error for object 0x2055204: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(5545,0xacda82c0) malloc: *** error for object 0x2055200: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(5545,0xacda82c0) malloc: *** error for object 0x833604: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(5545,0xacda82c0) malloc: *** error for object 0x833600: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(5545,0xacda82c0) malloc: *** error for object 0x84ec04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(5545,0xacda82c0) malloc: *** error for object 0x84ec00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(5545,0xacda82c0) malloc: *** error for object 0x382dc04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(5545,0xacda82c0) malloc: *** error for object 0x3849204: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(5545,0xacda82c0) malloc: *** error for object 0x3849200: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
SIGSEGV: segmentation violation

Crashed executable name: hadcm3n_6.07_i686-apple-darwin
built using BOINC library version 6.13.0
Machine type Intel 80486 (32-bit executable)
System version: Macintosh OS 10.7.5 build 11G63
Tue Apr 16 19:39:35 2013

Thread 0 Crashed:

Thread 1:

atos cannot load symbols for the file hadcm3n_6.07_i686-apple-darwin for architecture i386.
Thread 0 crashed with X86 Thread State (32-bit):
  eax: 0x00000000 ebx: 0x00000000 ecx: 0x00000000 edx: 0x00000000
  edi: 0x00000000 esi: 0x00000000 ebp: 0xbff8fa38 esp: 0x00000000
   ss: 0x00000000 efl: 0x00000000 eip: 0x0007697e  cs: 0x00000000
   ds: 0x00000000  es: 0x00000000  fs: 0x00000000  gs: 0x00000000

Binary Images Description:
    0x1000 -    0x93fff /Library/Application Support/BOINC Data/slots/11/../../projects/climateprediction.net/hadcm3n_6.07_i686-apple-darwin
0x90486000 - 0x9048dfff /usr/lib/system/libsystem_notify.dylib
0x9097d000 - 0x9097efff /usr/lib/system/libquarantine.dylib
0x91e38000 - 0x91e41fff /usr/lib/libc++abi.dylib
0x91fca000 - 0x91ff8fff /usr/lib/libSystem.B.dylib
0x92bc4000 - 0x92bc5fff /usr/lib/system/libunc.dylib
0x930c6000 - 0x930c7fff /usr/lib/system/libsystem_sandbox.dylib
0x930d4000 - 0x930d6fff /usr/lib/system/libdyld.dylib
0x932ec000 - 0x932f4fff /usr/lib/system/libunwind.dylib
0x93fa4000 - 0x93fb2fff /usr/lib/system/libdispatch.dylib
0x94259000 - 0x94288fff /usr/lib/system/libsystem_info.dylib
0x94289000 - 0x942ebfff /usr/lib/libstdc++.6.dylib
0x95404000 - 0x95404fff /usr/lib/system/libdnsinfo.dylib
0x95702000 - 0x9570afff /usr/lib/system/libcopyfile.dylib
0x95747000 - 0x9574ffff /usr/lib/system/liblaunch.dylib
0x9593a000 - 0x9593efff /usr/lib/system/libsystem_network.dylib
0x95c5c000 - 0x95c5dfff /usr/lib/system/libremovefile.dylib
0x95cfc000 - 0x95d0afff /usr/lib/libz.1.dylib
0x95d0b000 - 0x95d12fff /usr/lib/system/libsystem_dnssd.dylib
0x96139000 - 0x9614ffff /usr/lib/system/libxpc.dylib
0x96150000 - 0x96193fff /usr/lib/system/libcommonCrypto.dylib
0x98bd0000 - 0x98c9bfff /usr/lib/system/libsystem_c.dylib
0x99282000 - 0x99285fff /usr/lib/system/libmathCommon.A.dylib
0x99901000 - 0x99901fff /usr/lib/system/libkeymgr.dylib
0x9aa05000 - 0x9aa06fff /usr/lib/system/libsystem_blocks.dylib
0x9aa1a000 - 0x9aa1efff /usr/lib/system/libcache.dylib
0x9ab1c000 - 0x9ab1ffff /usr/lib/system/libcompiler_rt.dylib
0x9b990000 - 0x9b995fff /usr/lib/system/libmacho.dylib
0x9cacd000 - 0x9caebfff /usr/lib/system/libsystem_kernel.dylib


Exiting...

</stderr_txt>
]]>
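
The exit status above is printed three ways: 193, 0xc1, and -63. The following is a minimal sketch of how those values relate, assuming that -63 is the same byte reinterpreted as a signed 8-bit integer (the log does not state this, and the helper name `exit_code_views` is hypothetical):

```python
def exit_code_views(code):
    """Return (unsigned, hex, signed 8-bit) views of a one-byte exit code."""
    unsigned = code & 0xFF  # keep only the low byte
    # Assumption: the negative value in the log is the two's-complement reading.
    signed = unsigned - 256 if unsigned >= 128 else unsigned
    return unsigned, hex(unsigned), signed


print(exit_code_views(193))  # (193, '0xc1', -63)
```
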
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
17 Apr 2013 01:09:18  1276234  15713435   hadcm3n_u0yq_2020_40_008337985_1   259,200         644,657            2.4871
16 Apr 2013 02:47:42  1276234  15713435   hadcm3n_u0yq_2020_40_008337985_1   233,280         570,646            2.4462
15 Apr 2013 03:33:39  1276234  15713435   hadcm3n_u0yq_2020_40_008337985_1   207,360         493,326            2.3791
14 Apr 2013 04:25:06  1276234  15713435   hadcm3n_u0yq_2020_40_008337985_1   181,440         415,414            2.2895
13 Apr 2013 05:37:29  1276234  15713435   hadcm3n_u0yq_2020_40_008337985_1   155,520         449,774            2.8921
12 Apr 2013 06:08:27  1276234  15713435   hadcm3n_u0yq_2020_40_008337985_1   129,600         371,366            2.8655
11 Apr 2013 06:33:18  1276234  15713435   hadcm3n_u0yq_2020_40_008337985_1   103,680         307,220            2.9632
10 Apr 2013 07:07:39  1276234  15713435   hadcm3n_u0yq_2020_40_008337985_1    77,760         230,328            2.9620
09 Apr 2013 08:04:44  1276234  15713435   hadcm3n_u0yq_2020_40_008337985_1    51,840         154,685            2.9839
08 Apr 2013 09:22:20  1276234  15713435   hadcm3n_u0yq_2020_40_008337985_1    25,920          77,641            2.9954
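
The Average (sec/TS) values in the trickle table are consistent with CPU time divided by timestep, rounded to four decimal places. The sketch below checks that reading against three rows copied from the table. The function name `average_sec_per_timestep` is hypothetical, and the formula is inferred from the numbers, not taken from the project's code:

```python
def average_sec_per_timestep(cpu_time_sec, timestep):
    """CPU seconds per model timestep, rounded to 4 decimals like the table."""
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds) pairs copied from the trickle table.
rows = [
    (259_200, 644_657),
    (233_280, 570_646),
    (25_920, 77_641),
]

for timestep, cpu_time_sec in rows:
    print(timestep, average_sec_per_timestep(cpu_time_sec, timestep))
```
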


©2024 climateprediction.net