Name | hadcm3n_o9yr_1900_40_008468038_0 |
Workunit | 8618877 |
Created | 27 Sep 2013, 9:36:28 UTC |
Sent | 5 Oct 2013, 15:26:24 UTC |
Report deadline | 4 Jan 2014, 22:53:35 UTC |
Received | 21 Oct 2013, 20:39:59 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1142583 |
Run time | 14 days 3 hours 28 min 4 sec |
CPU time | 13 days 15 hours 40 min 25 sec |
Validate state | Invalid |
Credit | 9,331.20 |
Device peak FLOPS | 2.36 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> (unknown error) - exit code 193 (0xc1) </message> <stderr_txt> 09:13:10 (4064): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:12:01 (2456): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 22:09:58 (5948): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:08:53 (5140): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:44:14 (6244): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 20:00:04 (6984): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:58:49 (2568): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:57:50 (6760): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:50:51 (6964): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:47:20 (4620): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:45:07 (6864): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:43:48 (4804): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:40:15 (6380): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:38:54 (6652): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:37:36 (2452): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:35:12 (6504): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:33:52 (7068): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:32:42 (4072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 12:20:32 (5684): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 13:07:23 (3560): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 19:32:34 (2584): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:31:30 (4972): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 01:30:29 (952): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/o9yrko.pjb4c10 Error converting file to netcdf: dataout/o9yrko.pib4c10 Error converting file to netcdf: dataout/o9yrko.pfb4c10 Error converting file to netcdf: dataout/o9yrka.phb4c10 Error converting file to netcdf: dataout/o9yrka.pgb4c10 Error converting file to netcdf: dataout/o9yrka.peb4c10 Error converting file to netcdf: dataout/o9yrka.pdb4c10 04:29:28 (3588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:28:22 (3432): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 14:26:46 (4876): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:25:42 (4412): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:24:26 (5064): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:23:20 (3212): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:22:06 (5084): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:20:51 (3560): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:19:32 (2204): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:18:13 (3740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:17:01 (4964): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:14:23 (5616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:13:20 (1260): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/o9yrko.pjb8c10 Error converting file to netcdf: dataout/o9yrko.pib8c10 Error converting file to netcdf: dataout/o9yrko.pfb8c10 Error converting file to netcdf: dataout/o9yrka.phb8c10 Error converting file to netcdf: dataout/o9yrka.pgb8c10 Error converting file to netcdf: dataout/o9yrka.peb8c10 Error converting file to netcdf: dataout/o9yrka.pdb8c10 13:08:16 (6072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 07:08:35 (3412): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:16:12 (5192): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:13:39 (5364): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:12:16 (1756): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:10:55 (1184): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:09:39 (4932): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:04:17 (2112): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:03:00 (3488): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:01:39 (5524): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:59:04 (3012): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:57:42 (3192): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:56:32 (4720): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 15:51:54 (2468): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:50:34 (4272): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:49:14 (2724): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:44:08 (3804): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77A17383 read attempt to address 0xFFFFFFF8 Engaging BOINC Windows Runtime Debugger... Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o9yr_1900_40_008468038/dataout/shmem_restart.day Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
21 Oct 2013 19:42:28 | 1142583 | 16038705 | hadcm3n_o9yr_1900_40_008468038_0 | 777,600 | 1,179,619 | 1.5170 |
21 Oct 2013 08:18:23 | 1142583 | 16038705 | hadcm3n_o9yr_1900_40_008468038_0 | 751,680 | 1,140,599 | 1.5174 |
20 Oct 2013 21:19:06 | 1142583 | 16038705 | hadcm3n_o9yr_1900_40_008468038_0 | 725,760 | 1,101,763 | 1.5181 |
20 Oct 2013 10:15:04 | 1142583 | 16038705 | hadcm3n_o9yr_1900_40_008468038_0 | 699,840 | 1,062,661 | 1.5184 |
19 Oct 2013 23:24:49 | 1142583 | 16038705 | hadcm3n_o9yr_1900_40_008468038_0 | 673,920 | 1,023,709 | 1.5190 |
19 Oct 2013 12:26:41 | 1142583 | 16038705 | hadcm3n_o9yr_1900_40_008468038_0 | 648,000 | 984,808 | 1.5198 |
19 Oct 2013 01:32:49 | 1142583 | 16038705 | hadcm3n_o9yr_1900_40_008468038_0 | 622,080 | 945,877 | 1.5205 |
18 Oct 2013 14:42:49 | 1142583 | 16038705 | hadcm3n_o9yr_1900_40_008468038_0 | 596,160 | 906,941 | 1.5213 |
18 Oct 2013 03:43:43 | 1142583 | 16038705 | hadcm3n_o9yr_1900_40_008468038_0 | 570,240 | 868,123 | 1.5224 |
17 Oct 2013 16:52:56 | 1142583 | 16038705 | hadcm3n_o9yr_1900_40_008468038_0 | 544,320 | 829,259 | 1.5235 |
17 Oct 2013 06:06:51 | 1142583 | 16038705 | hadcm3n_o9yr_1900_40_008468038_0 | 518,400 | 790,502 | 1.5249 |
16 Oct 2013 18:01:16 | 1142583 | 16038705 | hadcm3n_o9yr_1900_40_008468038_0 | 492,480 | 751,796 | 1.5266 |
16 Oct 2013 07:19:05 | 1142583 | 16038705 | hadcm3n_o9yr_1900_40_008468038_0 | 466,560 | 713,027 | 1.5283 |
15 Oct 2013 20:25:41 | 1142583 | 16038705 | hadcm3n_o9yr_1900_40_008468038_0 | 440,640 | 674,370 | 1.5304 |
15 Oct 2013 09:36:21 | 1142583 | 16038705 | hadcm3n_o9yr_1900_40_008468038_0 | 414,720 | 635,871 | 1.5333 |
14 Oct 2013 22:19:40 | 1142583 | 16038705 | hadcm3n_o9yr_1900_40_008468038_0 | 388,800 | 596,816 | 1.5350 |
14 Oct 2013 08:36:56 | 1142583 | 16038705 | hadcm3n_o9yr_1900_40_008468038_0 | 362,880 | 558,039 | 1.5378 |
13 Oct 2013 20:20:42 | 1142583 | 16038705 | hadcm3n_o9yr_1900_40_008468038_0 | 336,960 | 518,943 | 1.5401 |
11 Oct 2013 14:59:32 | 1142583 | 16038705 | hadcm3n_o9yr_1900_40_008468038_0 | 311,040 | 479,566 | 1.5418 |
11 Oct 2013 03:47:37 | 1142583 | 16038705 | hadcm3n_o9yr_1900_40_008468038_0 | 285,120 | 440,462 | 1.5448 |
©2024 cpdn.org