Name | hadam3p_pnw_auqd_1973_1_008034635_0 |
Workunit | 8189749 |
Created | 9 Jul 2012, 9:22:45 UTC |
Sent | 9 Jul 2012, 9:25:19 UTC |
Report deadline | 21 Jun 2013, 14:45:19 UTC |
Received | 23 Jul 2012, 12:06:31 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1212089 |
Run time | 4 days 4 hours 2 min 56 sec |
CPU time | 3 days 22 hours 4 min 34 sec |
Validate state | Invalid |
Credit | 3,001.70 |
Device peak FLOPS | 2.86 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.29</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... 05:22:26 (1046): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:21:13 (3252): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 09:23:58 (4336): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:23:59 (4336): No heartbeat from core client for 30 sec - exiting 09:24:00 (4336): No heartbeat from core client for 30 sec - exiting 09:24:03 (4336): No heartbeat from core client for 30 sec - exiting 09:24:04 (4336): No heartbeat from core client for 30 sec - exiting 09:24:05 (4336): No heartbeat from core client for 30 sec - exiting 09:24:06 (4336): No heartbeat from core client for 30 sec - exiting 09:24:07 (4336): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:24:06 (4369): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:42:25 (9341): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:42:26 (9341): No heartbeat from core client for 30 sec - exiting 20:42:27 (9341): No heartbeat from core client for 30 sec - exiting 22:01:05 (9528): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:01:06 (9528): No heartbeat from core client for 30 sec - exiting 22:01:07 (9528): No heartbeat from core client for 30 sec - exiting 22:01:08 (9528): No heartbeat from core client for 30 sec - exiting 22:01:09 (9528): No heartbeat from core client for 30 sec - exiting 22:01:10 (9528): No heartbeat from core client for 30 sec - exiting 22:01:11 (9528): No heartbeat from core client for 30 sec - exiting 22:01:12 (9528): No heartbeat from core client for 30 sec - exiting 22:01:13 (9528): No heartbeat from core client for 30 sec - exiting 22:01:15 (9528): No heartbeat from core client for 30 sec - exiting 22:01:16 (9528): No heartbeat from core client for 30 sec - exiting 22:01:17 (9528): No heartbeat from core client for 30 sec - exiting 22:01:18 (9528): No heartbeat from core client for 30 sec - exiting 22:01:22 (9528): No heartbeat from core client for 30 sec - exiting 22:01:23 (9528): No heartbeat from core client for 30 sec - exiting 22:01:24 (9528): No heartbeat from core client for 30 sec - exiting 22:01:25 (9528): No heartbeat from core client for 30 sec - exiting 22:01:28 (9528): No heartbeat from core client for 30 sec - exiting 22:01:29 (9528): No heartbeat from core client for 30 sec - exiting 22:01:30 (9528): No heartbeat from core client for 30 sec - exiting 22:01:31 (9528): No heartbeat from core client for 30 sec - exiting 22:01:32 (9528): No heartbeat from core client for 30 sec - exiting 22:01:34 (9528): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:41:47 (9905): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 03:06:36 (23878): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 15:19:48 (25862): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:19:50 (25862): No heartbeat from core client for 30 sec - exiting 16:29:12 (18574): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:23:24 (19242): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 00:30:24 (19616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:30:25 (19616): No heartbeat from core client for 30 sec - exiting 00:30:26 (19616): No heartbeat from core client for 30 sec - exiting 00:30:27 (19616): No heartbeat from core client for 30 sec - exiting 00:30:28 (19616): No heartbeat from core client for 30 sec - exiting 00:30:31 (19616): No heartbeat from core client for 30 sec - exiting 00:30:32 (19616): No heartbeat from core client for 30 sec - exiting 00:30:33 (19616): No heartbeat from core client for 30 sec - exiting 00:30:34 (19616): No heartbeat from core client for 30 sec - exiting 00:30:35 (19616): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Error converting file to netcdf: /home/boinc/projects/climateprediction.net/hadam3p_pnw_auqd_1973_1_008034635/dataout/auqdma.pch4apr Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 13:17:31 (20694): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Error converting file to netcdf: /home/boinc/projects/climateprediction.net/hadam3p_pnw_auqd_1973_1_008034635/dataout/auqdma.pch4may Error converting file to netcdf: /home/boinc/projects/climateprediction.net/hadam3p_pnw_auqd_1973_1_008034635/dataout/auqdma.pch4jun Error converting file to netcdf: /home/boinc/projects/climateprediction.net/hadam3p_pnw_auqd_1973_1_008034635/dataout/auqdma.pch4jul Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:47:37 (11209): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:29:17 (3732): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:29:18 (3732): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Error converting file to netcdf: /home/boinc/projects/climateprediction.net/hadam3p_pnw_auqd_1973_1_008034635/dataout/auqdma.pch4aug Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:42:37 (4773): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:42:38 (4773): No heartbeat from core client for 30 sec - exiting 11:41:36 (8428): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Error converting file to netcdf: /home/boinc/projects/climateprediction.net/hadam3p_pnw_auqd_1973_1_008034635/dataout/auqdma.pch4sep 20:05:41 (9486): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:30:10 (25407): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Error converting file to netcdf: /home/boinc/projects/climateprediction.net/hadam3p_pnw_auqd_1973_1_008034635/dataout/auqdma.pch4oct Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:25:03 (25543): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Error converting file to netcdf: /home/boinc/projects/climateprediction.net/hadam3p_pnw_auqd_1973_1_008034635/dataout/auqdma.pch4nov Suspended CPDN Monitor - Suspend request from BOINC... 17:14:45 (5543): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_pnw_auqd_1973_1_008034635_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
22 Jul 2012 08:56:55 | 1212089 | 14881696 | hadam3p_pnw_auqd_1973_1_008034635_0 | 138,144 | 337,741 | 2.4448 |
21 Jul 2012 23:04:55 | 1212089 | 14881696 | hadam3p_pnw_auqd_1973_1_008034635_0 | 126,624 | 310,872 | 2.4551 |
21 Jul 2012 10:06:33 | 1212089 | 14881696 | hadam3p_pnw_auqd_1973_1_008034635_0 | 115,104 | 283,650 | 2.4643 |
20 Jul 2012 22:29:08 | 1212089 | 14881696 | hadam3p_pnw_auqd_1973_1_008034635_0 | 103,584 | 256,715 | 2.4783 |
20 Jul 2012 04:56:21 | 1212089 | 14881696 | hadam3p_pnw_auqd_1973_1_008034635_0 | 92,064 | 230,163 | 2.5000 |
19 Jul 2012 21:29:36 | 1212089 | 14881696 | hadam3p_pnw_auqd_1973_1_008034635_0 | 80,544 | 203,979 | 2.5325 |
19 Jul 2012 13:57:13 | 1212089 | 14881696 | hadam3p_pnw_auqd_1973_1_008034635_0 | 69,024 | 177,638 | 2.5736 |
19 Jul 2012 01:04:39 | 1212089 | 14881696 | hadam3p_pnw_auqd_1973_1_008034635_0 | 57,504 | 150,508 | 2.6173 |
18 Jul 2012 14:15:14 | 1212089 | 14881696 | hadam3p_pnw_auqd_1973_1_008034635_0 | 46,083 | 121,950 | 2.6463 |
18 Jul 2012 13:09:59 | 1212089 | 14881696 | hadam3p_pnw_auqd_1973_1_008034635_0 | 46,080 | 121,609 | 2.6391 |
17 Jul 2012 22:41:43 | 1212089 | 14881696 | hadam3p_pnw_auqd_1973_1_008034635_0 | 34,560 | 90,825 | 2.6280 |
15 Jul 2012 21:53:35 | 1212089 | 14881696 | hadam3p_pnw_auqd_1973_1_008034635_0 | 23,040 | 60,045 | 2.6061 |
15 Jul 2012 10:12:34 | 1212089 | 14881696 | hadam3p_pnw_auqd_1973_1_008034635_0 | 11,616 | 30,860 | 2.6567 |
©2024 cpdn.org