Name | hadcm3n_t5ng_1940_40_007742351_2 |
Workunit | 7897459 |
Created | 27 Jan 2012, 20:15:28 UTC |
Sent | 27 Jan 2012, 23:22:12 UTC |
Report deadline | 28 Apr 2012, 6:49:23 UTC |
Received | 15 Feb 2012, 20:43:18 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1193024 |
Run time | 6 days 15 hours 47 min 42 sec |
CPU time | 5 days 8 hours 52 min 26 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 3.02 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-apple-darwin |
Stderr | <core_client_version>6.12.35</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 05:34:59 (33708): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... hadcm3n_6.07_i686-apple-darwin(10660,0xac9822c0) malloc: *** error for object 0x3ab5a04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(10660,0xac9822c0) malloc: *** error for object 0x32d1004: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(10660,0xac9822c0) malloc: *** error for object 0x32d1000: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(10660,0xac9822c0) malloc: *** error for object 0x12d3204: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(10660,0xac9822c0) malloc: *** error for object 0x12ab004: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(10660,0xac9822c0) malloc: *** error for object 0x12ab000: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(10660,0xac9822c0) malloc: *** error for object 0x12ab004: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(10660,0xac9822c0) malloc: *** error for object 0x12ab000: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(10660,0xac9822c0) malloc: *** error for object 0x12ab004: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(10660,0xac9822c0) malloc: *** error for object 0x12ab000: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(10660,0xac9822c0) malloc: *** error for object 0x12ab004: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(10660,0xac9822c0) malloc: *** error for object 0x12ab000: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(10660,0xac9822c0) malloc: *** error for object 0x12ab004: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(10660,0xac9822c0) malloc: *** error for object 0x12ab000: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(10660,0xac9822c0) malloc: *** error for object 0x329b204: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(10660,0xac9822c0) malloc: *** error for object 0x32a9804: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(10660,0xac9822c0) malloc: *** error for object 0x32a9800: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug 14:21:38 (10660): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_t5ng_1940_40_007742351/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_t5ng_1940_40_007742351/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_t5ng_1940_40_007742351/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_t5ng_1940_40_007742351/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_t5ng_1940_40_007742351/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_t5ng_1940_40_007742351/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_t5ng_1940_40_007742351/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_t5ng_1940_40_007742351/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_t5ng_1940_40_007742351/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_t5ng_1940_40_007742351/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_t5ng_1940_40_007742351/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_t5ng_1940_40_007742351/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
11 Feb 2012 16:38:26 | 1193024 | 14020523 | hadcm3n_t5ng_1940_40_007742351_2 | 259,200 | 463,978 | 1.7900 |
10 Feb 2012 04:59:18 | 1193024 | 14020523 | hadcm3n_t5ng_1940_40_007742351_2 | 233,280 | 415,987 | 1.7832 |
08 Feb 2012 07:42:00 | 1193024 | 14020523 | hadcm3n_t5ng_1940_40_007742351_2 | 207,360 | 368,554 | 1.7774 |
02 Feb 2012 23:53:24 | 1193024 | 14020523 | hadcm3n_t5ng_1940_40_007742351_2 | 181,440 | 320,673 | 1.7674 |
02 Feb 2012 06:39:49 | 1193024 | 14020523 | hadcm3n_t5ng_1940_40_007742351_2 | 155,520 | 272,903 | 1.7548 |
01 Feb 2012 08:14:13 | 1193024 | 14020523 | hadcm3n_t5ng_1940_40_007742351_2 | 129,600 | 224,521 | 1.7324 |
31 Jan 2012 00:22:15 | 1193024 | 14020523 | hadcm3n_t5ng_1940_40_007742351_2 | 103,680 | 177,035 | 1.7075 |
29 Jan 2012 21:05:43 | 1193024 | 14020523 | hadcm3n_t5ng_1940_40_007742351_2 | 77,760 | 131,894 | 1.6962 |
29 Jan 2012 07:42:06 | 1193024 | 14020523 | hadcm3n_t5ng_1940_40_007742351_2 | 51,840 | 87,922 | 1.6960 |
28 Jan 2012 14:09:33 | 1193024 | 14020523 | hadcm3n_t5ng_1940_40_007742351_2 | 25,920 | 43,521 | 1.6791 |
©2024 cpdn.org