Name | hadcm3n_o51b_1940_40_007301741_1 |
Workunit | 7499165 |
Created | 22 Jun 2011, 14:52:11 UTC |
Sent | 22 Jun 2011, 14:52:23 UTC |
Report deadline | 21 Sep 2011, 22:19:34 UTC |
Received | 13 Aug 2011, 17:51:55 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1075230 |
Run time | 21 days 0 hours 40 min 50 sec |
CPU time | 18 days 2 hours 35 min 28 sec |
Validate state | Invalid |
Credit | 9,020.16 |
Device peak FLOPS | 2.72 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-apple-darwin |
Stderr | <core_client_version>6.12.33</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:44:00 (249): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... hadcm3n_6.07_i686-apple-darwin(254,0xa0387540) malloc: *** error for object 0x1013604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(254,0xa0387540) malloc: *** error for object 0x1013600: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(254,0xa0387540) malloc: *** error for object 0x812c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(254,0xa0387540) malloc: *** error for object 0x812c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(254,0xa0387540) malloc: *** error for object 0x812c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(254,0xa0387540) malloc: *** error for object 0x812c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug SIGSEGV: segmentation violation Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=271, selfPID=271, iMonCtr=1 hadcm3n_6.07_i686-apple-darwin(14948,0xa0387540) malloc: *** error for object 0x1001c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(14948,0xa0387540) malloc: *** error for object 0x1001c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(14948,0xa0387540) malloc: *** error for object 0x1001c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(14948,0xa0387540) malloc: *** error for object 0x1001c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(14948,0xa0387540) malloc: *** error for object 0x1001c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(14948,0xa0387540) malloc: *** error for object 0x1001c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(14948,0xa0387540) malloc: *** error for object 0x1001c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(14948,0xa0387540) malloc: *** error for object 0x1001c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(14948,0xa0387540) malloc: *** error for object 0x1001c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(14948,0xa0387540) malloc: *** error for object 0x1001c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(14948,0xa0387540) malloc: *** error for object 0x1001c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(14948,0xa0387540) malloc: *** error for object 0x1001c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 12:44:02 (11784): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... hadcm3n_6.07_i686-apple-darwin(246,0xa0901540) malloc: *** error for object 0x83c004: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(246,0xa0901540) malloc: *** error for object 0x83c004: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(246,0xa0901540) malloc: *** error for object 0x857604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(246,0xa0901540) malloc: *** error for object 0x857600: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(246,0xa0901540) malloc: *** error for object 0x857604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(246,0xa0901540) malloc: *** error for object 0x857600: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(246,0xa0901540) malloc: *** error for object 0x1053804: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(246,0xa0901540) malloc: *** error for object 0x83c004: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(246,0xa0901540) malloc: *** error for object 0x835804: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(246,0xa0901540) malloc: *** error for object 0x835800: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(246,0xa0901540) malloc: *** error for object 0x835804: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(246,0xa0901540) malloc: *** error for object 0x835800: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(246,0xa0901540) malloc: *** error for object 0x100e204: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(246,0xa0901540) malloc: *** error for object 0x100e200: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(246,0xa0901540) malloc: *** error for object 0x835804: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(246,0xa0901540) malloc: *** error for object 0x835800: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(246,0xa0901540) malloc: *** error for object 0x835804: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(246,0xa0901540) malloc: *** error for object 0x835800: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(246,0xa0901540) malloc: *** error for object 0x812804: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(246,0xa0901540) malloc: *** error for object 0x812800: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 13:43:05 (67396): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 132090) failed! Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1538, iMonCtr=1 Model crash detected, will try to restart... execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 132090) failed! Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1538, iMonCtr=1 Model crash detected, will try to restart... execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 132090) failed! Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1538, iMonCtr=1 Model crash detected, will try to restart... execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 132090) failed! Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1538, iMonCtr=1 Model crash detected, will try to restart... execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 132090) failed! Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1538, iMonCtr=1 Model crash detected, will try to restart... execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 132090) failed! Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1538, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
11 Aug 2011 15:30:48 | 1075230 | 12995527 | hadcm3n_o51b_1940_40_007301741_1 | 751,680 | 1,518,488 | 2.0201 |
10 Aug 2011 14:46:03 | 1075230 | 12995527 | hadcm3n_o51b_1940_40_007301741_1 | 725,760 | 1,460,493 | 2.0124 |
08 Aug 2011 04:15:40 | 1075230 | 12995527 | hadcm3n_o51b_1940_40_007301741_1 | 699,840 | 1,401,597 | 2.0027 |
07 Aug 2011 02:50:10 | 1075230 | 12995527 | hadcm3n_o51b_1940_40_007301741_1 | 673,920 | 1,330,850 | 1.9748 |
04 Aug 2011 23:12:49 | 1075230 | 12995527 | hadcm3n_o51b_1940_40_007301741_1 | 648,000 | 1,276,244 | 1.9695 |
01 Aug 2011 06:21:54 | 1075230 | 12995527 | hadcm3n_o51b_1940_40_007301741_1 | 622,080 | 1,230,843 | 1.9786 |
27 Jul 2011 06:59:03 | 1075230 | 12995527 | hadcm3n_o51b_1940_40_007301741_1 | 596,160 | 1,181,746 | 1.9823 |
26 Jul 2011 03:02:15 | 1075230 | 12995527 | hadcm3n_o51b_1940_40_007301741_1 | 570,240 | 1,134,818 | 1.9901 |
25 Jul 2011 22:45:39 | 1075230 | 12995527 | hadcm3n_o51b_1940_40_007301741_1 | 544,320 | 1,085,120 | 1.9935 |
25 Jul 2011 20:39:16 | 1075230 | 12995527 | hadcm3n_o51b_1940_40_007301741_1 | 518,400 | 1,031,292 | 1.9894 |
25 Jul 2011 19:08:45 | 1075230 | 12995527 | hadcm3n_o51b_1940_40_007301741_1 | 492,480 | 980,767 | 1.9915 |
25 Jul 2011 18:58:01 | 1075230 | 12995527 | hadcm3n_o51b_1940_40_007301741_1 | 466,560 | 920,872 | 1.9737 |
25 Jul 2011 16:35:40 | 1075230 | 12995527 | hadcm3n_o51b_1940_40_007301741_1 | 440,640 | 864,955 | 1.9630 |
25 Jul 2011 15:01:30 | 1075230 | 12995527 | hadcm3n_o51b_1940_40_007301741_1 | 414,720 | 804,098 | 1.9389 |
10 Jul 2011 19:17:55 | 1075230 | 12995527 | hadcm3n_o51b_1940_40_007301741_1 | 388,800 | 752,432 | 1.9353 |
10 Jul 2011 04:05:07 | 1075230 | 12995527 | hadcm3n_o51b_1940_40_007301741_1 | 362,880 | 701,737 | 1.9338 |
07 Jul 2011 23:35:32 | 1075230 | 12995527 | hadcm3n_o51b_1940_40_007301741_1 | 336,960 | 652,694 | 1.9370 |
07 Jul 2011 15:40:20 | 1075230 | 12995527 | hadcm3n_o51b_1940_40_007301741_1 | 311,040 | 595,884 | 1.9158 |
05 Jul 2011 08:36:08 | 1075230 | 12995527 | hadcm3n_o51b_1940_40_007301741_1 | 285,120 | 542,987 | 1.9044 |
03 Jul 2011 23:21:49 | 1075230 | 12995527 | hadcm3n_o51b_1940_40_007301741_1 | 259,200 | 491,460 | 1.8961 |
©2024 cpdn.org