climateprediction.net home page
Task 13795055

Task 13795055

Name hadcm3n_yexd_1900_40_007517684_3
Workunit 7715159
Created 18 Dec 2011, 23:45:17 UTC
Sent 18 Dec 2011, 23:46:06 UTC
Report deadline 19 Mar 2012, 7:13:17 UTC
Received 14 Jan 2012, 22:43:29 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1184254
Run time 13 days 6 hours 14 min 37 sec
CPU time 11 days 13 hours 57 min 34 sec
Validate state Invalid
Credit 6,220.80
Device peak FLOPS 2.15 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-apple-darwin
Stderr
<core_client_version>6.12.35</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 63 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 64 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 65 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 66 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 63 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 64 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 65 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 66 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 63 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 64 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 65 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 66 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
hadcm3n_6.07_i686-apple-darwin(50690,0xa088c540) malloc: *** error for object 0x825604: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(50690,0xa088c540) malloc: *** error for object 0x825600: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(50690,0xa088c540) malloc: *** error for object 0x802604: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
SIGSEGV: segmentation violation
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=50697, selfPID=50697, iMonCtr=1
hadcm3n_6.07_i686-apple-darwin(90987,0xa088c540) malloc: *** error for object 0x83de04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(90987,0xa088c540) malloc: *** error for object 0x83de00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(90987,0xa088c540) malloc: *** error for object 0x83de04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(90987,0xa088c540) malloc: *** error for object 0x83de00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(90987,0xa088c540) malloc: *** error for object 0x83de04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(90987,0xa088c540) malloc: *** error for object 0x83de00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(90987,0xa088c540) malloc: *** error for object 0x83de04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(90987,0xa088c540) malloc: *** error for object 0x83de00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(90987,0xa088c540) malloc: *** error for object 0x83de00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
SIGSEGV: segmentation violation
08:59:46 (90931): Can't acquire lockfile (-154) - waiting 35s
09:00:21 (90931): Can't acquire lockfile (-154) - exiting
10:19:15 (14153): Can't acquire lockfile (-154) - waiting 35s
10:19:50 (14153): Can't acquire lockfile (-154) - exiting
11:47:03 (43351): Can't acquire lockfile (-154) - waiting 35s
11:47:38 (43351): Can't acquire lockfile (-154) - exiting
13:15:32 (75509): Can't acquire lockfile (-154) - waiting 35s
13:16:07 (75509): Can't acquire lockfile (-154) - exiting
13:38:35 (83186): Can't acquire lockfile (-154) - waiting 35s
13:39:10 (83186): Can't acquire lockfile (-154) - exiting
14:17:34 (96131): Can't acquire lockfile (-154) - waiting 35s
14:18:09 (96131): Can't acquire lockfile (-154) - exiting
14:40:20 (4025): Can't acquire lockfile (-154) - waiting 35s
14:40:55 (4025): Can't acquire lockfile (-154) - exiting
15:19:06 (17540): Can't acquire lockfile (-154) - waiting 35s
15:19:41 (17540): Can't acquire lockfile (-154) - exiting
hadcm3n_6.07_i686-apple-darwin(384,0xa088c540) malloc: *** error for object 0x83de04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(384,0xa088c540) malloc: *** error for object 0x83de00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(384,0xa088c540) malloc: *** error for object 0x83de04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(384,0xa088c540) malloc: *** error for object 0x83de00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(384,0xa088c540) malloc: *** error for object 0x83de04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(384,0xa088c540) malloc: *** error for object 0x83de00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(384,0xa088c540) malloc: *** error for object 0x83de04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(384,0xa088c540) malloc: *** error for object 0x83de00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(384,0xa088c540) malloc: *** error for object 0x83de04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(384,0xa088c540) malloc: *** error for object 0x83de00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(384,0xa088c540) malloc: *** error for object 0x83de04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
SIGSEGV: segmentation violation
Signal 15 received, exiting...
Called boinc_finish
Signal 15 received, exiting...
Called boinc_finish
hadcm3n_6.07_i686-apple-darwin(392,0xa088c540) malloc: *** error for object 0x103de04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(392,0xa088c540) malloc: *** error for object 0x103de00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(392,0xa088c540) malloc: *** error for object 0x802000: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
SIGSEGV: segmentation violation
hadcm3n_6.07_i686-apple-darwin(15603,0xa088c540) malloc: *** error for object 0x82a204: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(15603,0xa088c540) malloc: *** error for object 0x82a200: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(15603,0xa088c540) malloc: *** error for object 0x101a204: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(15603,0xa088c540) malloc: *** error for object 0x101a200: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(15603,0xa088c540) malloc: *** error for object 0x1016204: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(15603,0xa088c540) malloc: *** error for object 0x1016200: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(15603,0xa088c540) malloc: *** error for object 0x1016204: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(15603,0xa088c540) malloc: *** error for object 0x1016200: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(15603,0xa088c540) malloc: *** error for object 0x1016204: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(15603,0xa088c540) malloc: *** error for object 0x1016200: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(15603,0xa088c540) malloc: *** error for object 0x1016204: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(15603,0xa088c540) malloc: *** error for object 0x1016200: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yexd_1900_40_007517684/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yexd_1900_40_007517684/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yexd_1900_40_007517684/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yexd_1900_40_007517684/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yexd_1900_40_007517684/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yexd_1900_40_007517684/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yexd_1900_40_007517684/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yexd_1900_40_007517684/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yexd_1900_40_007517684/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yexd_1900_40_007517684/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yexd_1900_40_007517684/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_yexd_1900_40_007517684/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
11 Jan 2012 05:08:07 1184254 13795055 hadcm3n_yexd_1900_40_007517684_3 518,400 1,000,690 1.9303
10 Jan 2012 01:46:46 1184254 13795055 hadcm3n_yexd_1900_40_007517684_3 492,480 952,394 1.9339
08 Jan 2012 21:35:05 1184254 13795055 hadcm3n_yexd_1900_40_007517684_3 466,560 904,064 1.9377
07 Jan 2012 17:44:16 1184254 13795055 hadcm3n_yexd_1900_40_007517684_3 440,640 855,577 1.9417
06 Jan 2012 10:33:22 1184254 13795055 hadcm3n_yexd_1900_40_007517684_3 414,720 807,647 1.9475
05 Jan 2012 17:26:51 1184254 13795055 hadcm3n_yexd_1900_40_007517684_3 388,800 757,296 1.9478
05 Jan 2012 02:17:54 1184254 13795055 hadcm3n_yexd_1900_40_007517684_3 362,880 706,604 1.9472
04 Jan 2012 10:16:32 1184254 13795055 hadcm3n_yexd_1900_40_007517684_3 336,960 655,803 1.9462
03 Jan 2012 10:21:36 1184254 13795055 hadcm3n_yexd_1900_40_007517684_3 311,040 606,260 1.9491
02 Jan 2012 09:19:56 1184254 13795055 hadcm3n_yexd_1900_40_007517684_3 285,120 558,139 1.9576
01 Jan 2012 06:21:53 1184254 13795055 hadcm3n_yexd_1900_40_007517684_3 259,200 510,277 1.9687
31 Dec 2011 05:45:43 1184254 13795055 hadcm3n_yexd_1900_40_007517684_3 233,280 461,566 1.9786
28 Dec 2011 06:34:16 1184254 13795055 hadcm3n_yexd_1900_40_007517684_3 207,360 412,928 1.9914
27 Dec 2011 13:18:33 1184254 13795055 hadcm3n_yexd_1900_40_007517684_3 181,440 361,148 1.9905
26 Dec 2011 22:09:38 1184254 13795055 hadcm3n_yexd_1900_40_007517684_3 155,520 309,382 1.9893
26 Dec 2011 06:39:45 1184254 13795055 hadcm3n_yexd_1900_40_007517684_3 129,600 257,540 1.9872
25 Dec 2011 15:35:19 1184254 13795055 hadcm3n_yexd_1900_40_007517684_3 103,680 205,614 1.9832
24 Dec 2011 22:41:13 1184254 13795055 hadcm3n_yexd_1900_40_007517684_3 77,760 153,661 1.9761
22 Dec 2011 12:24:29 1184254 13795055 hadcm3n_yexd_1900_40_007517684_3 51,840 101,756 1.9629
20 Dec 2011 01:21:04 1184254 13795055 hadcm3n_yexd_1900_40_007517684_3 25,920 50,469 1.9471


©2024 climateprediction.net