Name | hadcm3n_n7ea_1920_40_008394049_3 |
Workunit | 8544908 |
Created | 17 Oct 2013, 18:35:34 UTC |
Sent | 17 Oct 2013, 18:35:38 UTC |
Report deadline | 17 Jan 2014, 2:02:49 UTC |
Received | 8 Nov 2013, 20:33:22 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1281076 |
Run time | 22 days 1 hours 5 min 44 sec |
CPU time | 21 days 21 hours 34 min 20 sec |
Validate state | Invalid |
Credit | 12,441.60 |
Device peak FLOPS | 2.82 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 19:00:05 (1550): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:31:02 (25979): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:31:43 (28860): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:35:43 (28880): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:39:40 (28939): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:54:17 (28964): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:02:31 (29127): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:04:53 (29217): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:07:13 (29239): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:09:18 (29292): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:11:37 (29312): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:13:45 (29361): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:15:40 (29378): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:18:17 (29394): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:20:16 (29446): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:22:39 (29459): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:24:32 (29506): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:27:00 (29519): No heartbeat from core client for 30 sec - exiting 01:27:01 (29519): No heartbeat from core client for 30 sec - exiting 01:27:02 (29519): No heartbeat from core client for 30 sec - exiting 01:27:03 (29519): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 Error: Input file: dataout/n7eako.pjf7c10 is not a valid UM file. Error converting file to netcdf: dataout/n7eako.pjf7c10 Error: Input file: dataout/n7eako.pif7c10 is not a valid UM file. Error converting file to netcdf: dataout/n7eako.pif7c10 Error: Input file: dataout/n7eako.pff7c10 is not a valid UM file. Error converting file to netcdf: dataout/n7eako.pff7c10 Error: Input file: dataout/n7eaka.phf7c10 is not a valid UM file. Error converting file to netcdf: dataout/n7eaka.phf7c10 Error: Input file: dataout/n7eaka.pgf7c10 is not a valid UM file. Error converting file to netcdf: dataout/n7eaka.pgf7c10 Error: Input file: dataout/n7eaka.pef7c10 is not a valid UM file. Error converting file to netcdf: dataout/n7eaka.pef7c10 Error: Input file: dataout/n7eaka.pdf7c10 is not a valid UM file. Error converting file to netcdf: dataout/n7eaka.pdf7c10 01:28:08 (29566): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 01:29:05 (29579): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 04:43:29 (29593): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:44:42 (31049): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... SIGSEGV: segmentation violation Stack trace (14 frames): ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x80b80df] [0xf776a400] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x806c0d5] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x806e5f2] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8072509] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8077f47] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x80781a3] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8053e1b] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8057bc4] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x804f232] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8050491] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805112c] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805137a] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf747a4d3] Exiting... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
08 Nov 2013 20:38:15 | 1281076 | 16069072 | hadcm3n_n7ea_1920_40_008394049_3 | 1,036,800 | 1,892,058 | 1.8249 |
08 Nov 2013 07:42:01 | 1281076 | 16069072 | hadcm3n_n7ea_1920_40_008394049_3 | 1,010,880 | 1,845,919 | 1.8261 |
07 Nov 2013 18:58:14 | 1281076 | 16069072 | hadcm3n_n7ea_1920_40_008394049_3 | 984,960 | 1,800,483 | 1.8280 |
07 Nov 2013 06:35:14 | 1281076 | 16069072 | hadcm3n_n7ea_1920_40_008394049_3 | 959,040 | 1,756,290 | 1.8313 |
06 Nov 2013 17:25:11 | 1281076 | 16069072 | hadcm3n_n7ea_1920_40_008394049_3 | 933,120 | 1,711,678 | 1.8344 |
06 Nov 2013 04:46:14 | 1281076 | 16069072 | hadcm3n_n7ea_1920_40_008394049_3 | 907,200 | 1,666,448 | 1.8369 |
05 Nov 2013 15:50:25 | 1281076 | 16069072 | hadcm3n_n7ea_1920_40_008394049_3 | 881,280 | 1,620,080 | 1.8383 |
05 Nov 2013 03:01:21 | 1281076 | 16069072 | hadcm3n_n7ea_1920_40_008394049_3 | 855,360 | 1,574,299 | 1.8405 |
04 Nov 2013 14:15:58 | 1281076 | 16069072 | hadcm3n_n7ea_1920_40_008394049_3 | 829,440 | 1,528,657 | 1.8430 |
04 Nov 2013 01:26:44 | 1281076 | 16069072 | hadcm3n_n7ea_1920_40_008394049_3 | 803,520 | 1,482,895 | 1.8455 |
03 Nov 2013 12:28:54 | 1281076 | 16069072 | hadcm3n_n7ea_1920_40_008394049_3 | 777,600 | 1,436,533 | 1.8474 |
02 Nov 2013 23:29:49 | 1281076 | 16069072 | hadcm3n_n7ea_1920_40_008394049_3 | 751,680 | 1,390,327 | 1.8496 |
02 Nov 2013 10:33:52 | 1281076 | 16069072 | hadcm3n_n7ea_1920_40_008394049_3 | 725,760 | 1,344,191 | 1.8521 |
01 Nov 2013 21:53:52 | 1281076 | 16069072 | hadcm3n_n7ea_1920_40_008394049_3 | 699,840 | 1,298,716 | 1.8557 |
01 Nov 2013 08:47:16 | 1281076 | 16069072 | hadcm3n_n7ea_1920_40_008394049_3 | 673,920 | 1,252,000 | 1.8578 |
31 Oct 2013 19:47:58 | 1281076 | 16069072 | hadcm3n_n7ea_1920_40_008394049_3 | 648,000 | 1,205,475 | 1.8603 |
31 Oct 2013 06:53:45 | 1281076 | 16069072 | hadcm3n_n7ea_1920_40_008394049_3 | 622,080 | 1,159,245 | 1.8635 |
30 Oct 2013 17:36:59 | 1281076 | 16069072 | hadcm3n_n7ea_1920_40_008394049_3 | 596,160 | 1,111,853 | 1.8650 |
30 Oct 2013 04:41:10 | 1281076 | 16069072 | hadcm3n_n7ea_1920_40_008394049_3 | 570,240 | 1,065,632 | 1.8687 |
29 Oct 2013 15:39:14 | 1281076 | 16069072 | hadcm3n_n7ea_1920_40_008394049_3 | 544,320 | 1,019,362 | 1.8727 |
©2024 climateprediction.net