Name | hadam3p_anz_n9mp_2012_1_008600097_1 |
Workunit | 8746609 |
Created | 27 Mar 2014, 13:17:01 UTC |
Sent | 27 Mar 2014, 13:26:19 UTC |
Report deadline | 9 Mar 2015, 18:46:19 UTC |
Received | 31 Mar 2014, 16:33:45 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1299063 |
Run time | 3 days 13 hours 30 min 41 sec |
CPU time | 3 days 7 hours 18 min 17 sec |
Validate state | Invalid |
Credit | 1,503.36 |
Device peak FLOPS | 2.23 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 11:30:58 (7608): No heartbeat from core client for 30 sec - exiting 11:30:59 (7608): No heartbeat from core client for 30 sec - exiting 11:31:00 (7608): No heartbeat from core client for 30 sec - exiting 11:31:01 (7608): No heartbeat from core client for 30 sec - exiting 11:31:02 (7608): No heartbeat from core client for 30 sec - exiting 11:31:03 (7608): No heartbeat from core client for 30 sec - exiting 11:31:04 (7608): No heartbeat from core client for 30 sec - exiting 11:31:05 (7608): No heartbeat from core client for 30 sec - exiting 11:31:06 (7608): No heartbeat from core client for 30 sec - exiting 11:31:07 (7608): No heartbeat from core client for 30 sec - exiting 11:31:08 (7608): No heartbeat from core client for 30 sec - exiting 11:31:09 (7608): No heartbeat from core client for 30 sec - exiting 11:31:10 (7608): No heartbeat from core client for 30 sec - exiting 11:31:11 (7608): No heartbeat from core client for 30 sec - exiting 11:31:12 (7608): No heartbeat from core client for 30 sec - exiting 11:31:13 (7608): No heartbeat from core client for 30 sec - exiting 11:31:14 (7608): No heartbeat from core client for 30 sec - exiting 11:31:15 (7608): No heartbeat from core client for 30 sec - exiting 11:31:16 (7608): No heartbeat from core client for 30 sec - exiting 11:31:17 (7608): No heartbeat from core client for 30 sec - exiting 11:31:18 (7608): No heartbeat from core client for 30 sec - exiting 11:31:19 (7608): No heartbeat from core client for 30 sec - exiting 11:31:20 (7608): No heartbeat from core client for 30 sec - exiting 11:31:21 (7608): No 
heartbeat from core client for 30 sec - exiting 11:31:22 (7608): No heartbeat from core client for 30 sec - exiting 11:31:23 (7608): No heartbeat from core client for 30 sec - exiting 11:31:24 (7608): No heartbeat from core client for 30 sec - exiting 11:31:25 (7608): No heartbeat from core client for 30 sec - exiting 11:31:26 (7608): No heartbeat from core client for 30 sec - exiting 11:31:27 (7608): No heartbeat from core client for 30 sec - exiting 11:31:28 (7608): No heartbeat from core client for 30 sec - exiting 11:31:29 (7608): No heartbeat from core client for 30 sec - exiting 11:31:30 (7608): No heartbeat from core client for 30 sec - exiting 11:31:31 (7608): No heartbeat from core client for 30 sec - exiting 11:31:32 (7608): No heartbeat from core client for 30 sec - exiting 11:31:33 (7608): No heartbeat from core client for 30 sec - exiting 11:31:34 (7608): No heartbeat from core client for 30 sec - exiting 11:31:35 (7608): No heartbeat from core client for 30 sec - exiting 11:31:36 (7608): No heartbeat from core client for 30 sec - exiting 11:31:37 (7608): No heartbeat from core client for 30 sec - exiting 11:31:38 (7608): No heartbeat from core client for 30 sec - exiting 11:31:39 (7608): No heartbeat from core client for 30 sec - exiting 11:31:40 (7608): No heartbeat from core client for 30 sec - exiting 11:31:41 (7608): No heartbeat from core client for 30 sec - exiting 11:31:42 (7608): No heartbeat from core client for 30 sec - exiting 11:31:43 (7608): No heartbeat from core client for 30 sec - exiting 11:31:44 (7608): No heartbeat from core client for 30 sec - exiting 11:31:45 (7608): No heartbeat from core client for 30 sec - exiting 11:31:46 (7608): No heartbeat from core client for 30 sec - exiting 11:31:47 (7608): No heartbeat from core client for 30 sec - exiting 11:31:48 (7608): No heartbeat from core client for 30 sec - exiting 11:31:49 (7608): No heartbeat from core client for 30 sec - exiting 11:31:50 (7608): No heartbeat from core client 
for 30 sec - exiting 11:31:51 (7608): No heartbeat from core client for 30 sec - exiting 11:31:52 (7608): No heartbeat from core client for 30 sec - exiting 11:31:53 (7608): No heartbeat from core client for 30 sec - exiting 11:31:54 (7608): No heartbeat from core client for 30 sec - exiting 11:31:55 (7608): No heartbeat from core client for 30 sec - exiting 11:31:56 (7608): No heartbeat from core client for 30 sec - exiting 11:31:57 (7608): No heartbeat from core client for 30 sec - exiting 11:31:58 (7608): No heartbeat from core client for 30 sec - exiting 11:31:59 (7608): No heartbeat from core client for 30 sec - exiting 11:32:00 (7608): No heartbeat from core client for 30 sec - exiting 11:32:01 (7608): No heartbeat from core client for 30 sec - exiting 11:32:02 (7608): No heartbeat from core client for 30 sec - exiting 11:32:03 (7608): No heartbeat from core client for 30 sec - exiting 11:32:04 (7608): No heartbeat from core client for 30 sec - exiting 11:32:05 (7608): No heartbeat from core client for 30 sec - exiting 11:32:06 (7608): No heartbeat from core client for 30 sec - exiting 11:32:07 (7608): No heartbeat from core client for 30 sec - exiting 11:32:08 (7608): No heartbeat from core client for 30 sec - exiting 11:32:09 (7608): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8696, iMonCtr=2 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 17:40:33 (10604): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1828, iMonCtr=2 08:12:25 (9808): No heartbeat from core client for 30 sec - exiting 08:12:26 (9808): No heartbeat from core client for 30 sec - exiting 08:12:27 (9808): No heartbeat from core client for 30 sec - exiting 08:12:28 (9808): No heartbeat from core client for 30 sec - exiting 08:12:29 (9808): No heartbeat from core client for 30 sec - exiting 08:12:30 (9808): No heartbeat from core client for 30 sec - exiting 08:12:31 (9808): No heartbeat from core client for 30 sec - exiting 08:12:32 (9808): No heartbeat from core client for 30 sec - exiting 08:12:33 (9808): No heartbeat from core client for 30 sec - exiting 08:12:34 (9808): No heartbeat from core client for 30 sec - exiting 08:12:35 (9808): No heartbeat from core client for 30 sec - exiting 08:12:36 (9808): No heartbeat from core client for 30 sec - exiting 08:12:37 (9808): No heartbeat from core client for 30 sec - exiting 08:12:38 (9808): No heartbeat from core client for 30 sec - exiting 08:12:39 (9808): No heartbeat from core client for 30 sec - exiting 08:12:40 (9808): No heartbeat from core client for 30 sec - exiting 08:12:41 (9808): No heartbeat from core client for 30 sec - exiting 08:12:42 (9808): No heartbeat from core client for 30 sec - exiting 08:12:43 (9808): No heartbeat from core client for 30 sec - exiting 08:12:44 (9808): No heartbeat from core client for 30 sec - exiting 08:12:45 (9808): No heartbeat from core client for 30 sec - exiting 08:12:46 (9808): No heartbeat from core client for 30 sec - exiting 08:12:47 (9808): No heartbeat from core client for 30 sec - exiting 08:12:48 (9808): No heartbeat from core client for 30 sec - exiting 08:12:49 (9808): No heartbeat from core client for 30 sec - exiting 08:12:50 (9808): No heartbeat from core client for 30 sec - exiting 08:12:51 (9808): No heartbeat from core client for 30 sec - exiting 08:12:52 (9808): No heartbeat from core client for 30 sec - 
exiting 08:12:53 (9808): No heartbeat from core client for 30 sec - exiting 08:12:54 (9808): No heartbeat from core client for 30 sec - exiting 08:12:55 (9808): No heartbeat from core client for 30 sec - exiting 08:12:56 (9808): No heartbeat from core client for 30 sec - exiting 08:12:57 (9808): No heartbeat from core client for 30 sec - exiting 08:12:58 (9808): No heartbeat from core client for 30 sec - exiting 08:12:59 (9808): No heartbeat from core client for 30 sec - exiting 08:13:00 (9808): No heartbeat from core client for 30 sec - exiting 08:13:01 (9808): No heartbeat from core client for 30 sec - exiting 08:13:02 (9808): No heartbeat from core client for 30 sec - exiting 08:13:03 (9808): No heartbeat from core client for 30 sec - exiting 08:13:04 (9808): No heartbeat from core client for 30 sec - exiting 08:13:05 (9808): No heartbeat from core client for 30 sec - exiting 08:13:06 (9808): No heartbeat from core client for 30 sec - exiting 08:13:07 (9808): No heartbeat from core client for 30 sec - exiting 08:13:08 (9808): No heartbeat from core client for 30 sec - exiting 08:13:09 (9808): No heartbeat from core client for 30 sec - exiting 08:13:10 (9808): No heartbeat from core client for 30 sec - exiting 08:13:11 (9808): No heartbeat from core client for 30 sec - exiting 08:13:12 (9808): No heartbeat from core client for 30 sec - exiting 08:13:13 (9808): No heartbeat from core client for 30 sec - exiting 08:13:14 (9808): No heartbeat from core client for 30 sec - exiting 08:13:15 (9808): No heartbeat from core client for 30 sec - exiting 08:13:16 (9808): No heartbeat from core client for 30 sec - exiting 08:13:17 (9808): No heartbeat from core client for 30 sec - exiting 08:13:18 (9808): No heartbeat from core client for 30 sec - exiting 08:13:19 (9808): No heartbeat from core client for 30 sec - exiting 08:13:20 (9808): No heartbeat from core client for 30 sec - exiting 08:13:21 (9808): No heartbeat from core client for 30 sec - exiting 08:13:22 (9808): No 
heartbeat from core client for 30 sec - exiting 08:13:23 (9808): No heartbeat from core client for 30 sec - exiting 08:13:24 (9808): No heartbeat from core client for 30 sec - exiting 08:13:25 (9808): No heartbeat from core client for 30 sec - exiting 08:13:26 (9808): No heartbeat from core client for 30 sec - exiting 08:13:27 (9808): No heartbeat from core client for 30 sec - exiting 08:13:28 (9808): No heartbeat from core client for 30 sec - exiting 08:13:29 (9808): No heartbeat from core client for 30 sec - exiting 08:13:30 (9808): No heartbeat from core client for 30 sec - exiting 08:13:31 (9808): No heartbeat from core client for 30 sec - exiting 08:13:32 (9808): No heartbeat from core client for 30 sec - exiting 08:13:33 (9808): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:13:34 (9808): No heartbeat from core client for 30 sec - exiting Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=11496, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8576, selfPID=8576, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8576, selfPID=10788, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_anz_n9mp_2012_1_008600097_1_4.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_n9mp_2012_1_008600097_1_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_n9mp_2012_1_008600097_1_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_n9mp_2012_1_008600097_1_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_n9mp_2012_1_008600097_1_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_n9mp_2012_1_008600097_1_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_n9mp_2012_1_008600097_1_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_n9mp_2012_1_008600097_1_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_n9mp_2012_1_008600097_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
30 Mar 2014 19:43:14 | 1299063 | 16422214 | hadam3p_anz_n9mp_2012_1_008600097_1 | 34,859 | 225,226 | 6.4611 |
29 Mar 2014 22:45:22 | 1299063 | 16422214 | hadam3p_anz_n9mp_2012_1_008600097_1 | 23,339 | 150,689 | 6.4565 |
28 Mar 2014 23:20:20 | 1299063 | 16422214 | hadam3p_anz_n9mp_2012_1_008600097_1 | 11,819 | 76,461 | 6.4693 |
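
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep reached. That relationship is inferred from the three rows above, not stated by the page itself. The short sketch below recomputes it from the table values; the names `TRICKLES` and `seconds_per_timestep` are made up for illustration.

```python
# Rows copied from the trickle table above:
# (time sent in UTC, timestep, CPU time in seconds, reported average in sec/TS)
TRICKLES = [
    ("30 Mar 2014 19:43:14", 34859, 225226, 6.4611),
    ("29 Mar 2014 22:45:22", 23339, 150689, 6.4565),
    ("28 Mar 2014 23:20:20", 11819, 76461, 6.4693),
]


def seconds_per_timestep(cpu_seconds, timestep):
    """Assumed definition of the average: cumulative CPU seconds per timestep."""
    return cpu_seconds / timestep


for sent, timestep, cpu_seconds, reported in TRICKLES:
    computed = seconds_per_timestep(cpu_seconds, timestep)
    print(f"{sent}: computed {computed:.4f} sec/TS, reported {reported:.4f} sec/TS")
```

If the assumption holds, each computed value matches the reported one to four decimal places, which suggests the host was advancing at a steady rate of roughly 6.46 CPU seconds per timestep up to the last trickle.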
©2024 cpdn.org