Name | hadam3p_anz_m3rx_2013_1_009741017_0 |
Workunit | 9812015 |
Created | 9 Apr 2015, 11:36:33 UTC |
Sent | 12 Apr 2015, 1:55:22 UTC |
Report deadline | 24 Mar 2016, 7:15:22 UTC |
Received | 18 Jun 2015, 1:52:05 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1285809 |
Run time | 17 days 16 hours 49 min 2 sec |
CPU time | 16 days 16 hours 39 min 51 sec |
Validate state | Invalid |
Credit | 2,000.18 |
Device peak FLOPS | 1.32 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... 01:23:14 (3480): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 02:14:06 (4268): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:03:50 (8368): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:30:08 (6928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:27:31 (480): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 02:32:20 (6360): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:32:21 (6360): No heartbeat from core client for 30 sec - exiting 02:32:22 (6360): No heartbeat from core client for 30 sec - exiting 02:32:23 (6360): No heartbeat from core client for 30 sec - exiting 02:32:24 (6360): No heartbeat from core client for 30 sec - exiting 02:32:25 (6360): No heartbeat from core client for 30 sec - exiting 02:32:26 (6360): No heartbeat from core client for 30 sec - exiting 02:32:27 (6360): No heartbeat from core client for 30 sec - exiting 02:32:28 (6360): No heartbeat from core client for 30 sec - exiting 02:32:29 (6360): No heartbeat from core client for 30 sec - exiting 02:32:30 (6360): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:37:21 (6688): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 00:03:29 (4176): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:19:42 (7096): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 21:19:43 (7096): No heartbeat from core client for 30 sec - exiting 01:14:28 (11672): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 02:19:47 (8496): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 03:05:45 (7444): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:20:00 (11504): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 15:25:45 (4628): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 15:25:46 (4628): No heartbeat from core client for 30 sec - exiting 15:25:47 (4628): No heartbeat from core client for 30 sec - exiting 15:25:48 (4628): No heartbeat from core client for 30 sec - exiting 15:25:49 (4628): No heartbeat from core client for 30 sec - exiting 15:25:50 (4628): No heartbeat from core client for 30 sec - exiting 15:25:51 (4628): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 01:17:26 (7152): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 22:41:39 (4152): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:41:40 (4152): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 22:10:05 (3404): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 22:10:06 (3404): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 23:42:33 (10980): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:35:00 (8400): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:35:01 (8400): No heartbeat from core client for 30 sec - exiting 21:14:58 (10452): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:14:59 (10452): No heartbeat from core client for 30 sec - exiting 21:15:00 (10452): No heartbeat from core client for 30 sec - exiting 21:15:01 (10452): No heartbeat from core client for 30 sec - exiting 21:15:02 (10452): No heartbeat from core client for 30 sec - exiting 21:15:03 (10452): No heartbeat from core client for 30 sec - exiting 21:15:04 (10452): No heartbeat from core client for 30 sec - exiting 23:57:21 (11452): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 22:43:33 (7800): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:19:28 (1304): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:19:29 (1304): No heartbeat from core client for 30 sec - exiting 10:05:27 (6256): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:05:28 (6256): No heartbeat from core client for 30 sec - exiting 10:05:29 (6256): No heartbeat from core client for 30 sec - exiting 10:05:30 (6256): No heartbeat from core client for 30 sec - exiting 10:05:31 (6256): No heartbeat from core client for 30 sec - exiting 10:05:32 (6256): No heartbeat from core client for 30 sec - exiting 11:35:27 (7988): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:42:26 (9592): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:42:28 (9592): No heartbeat from core client for 30 sec - exiting Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7960, selfPID=7960, iMonCtr=2 00:23:24 (4324): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:23:25 (4324): No heartbeat from core client for 30 sec - exiting 00:27:07 (3032): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:27:15 (3032): No heartbeat from core client for 30 sec - exiting 00:30:57 (492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2992, selfPID=2992, iMonCtr=2 00:30:58 (492): No heartbeat from core client for 30 sec - exiting 00:30:59 (492): No heartbeat from core client for 30 sec - exiting 00:31:00 (492): No heartbeat from core client for 30 sec - exiting 00:31:01 (492): No heartbeat from core client for 30 sec - exiting 09:11:45 (3000): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:11:46 (3000): No heartbeat from core client for 30 sec - exiting 23:16:41 (5488): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3924, selfPID=3924, iMonCtr=2 19:58:35 (8480): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:00:44 (4948): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:17:21 (5000): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:30:42 (5240): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_anz_m3rx_2013_1_009741017_0_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m3rx_2013_1_009741017_0_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m3rx_2013_1_009741017_0_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m3rx_2013_1_009741017_0_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m3rx_2013_1_009741017_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m3rx_2013_1_009741017_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m3rx_2013_1_009741017_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m3rx_2013_1_009741017_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
10 Jun 2015 22:17:53 | 1285809 | 18294951 | hadam3p_anz_m3rx_2013_1_009741017_0 | 46,379 | 1,100,464 | 23.7276 |
08 May 2015 19:11:09 | 1285809 | 18294951 | hadam3p_anz_m3rx_2013_1_009741017_0 | 34,859 | 755,907 | 21.6847 |
27 Apr 2015 09:29:04 | 1285809 | 18294951 | hadam3p_anz_m3rx_2013_1_009741017_0 | 23,339 | 489,363 | 20.9676 |
15 Apr 2015 10:35:19 | 1285809 | 18294951 | hadam3p_anz_m3rx_2013_1_009741017_0 | 11,819 | 218,043 | 18.4485 |
©2024 cpdn.org