Name | hadam3p_pnw_x6ti_2006_1_010136494_0 |
Workunit | 10097894 |
Created | 21 Aug 2015, 14:22:25 UTC |
Sent | 27 Aug 2015, 3:56:18 UTC |
Report deadline | 8 Aug 2016, 9:16:18 UTC |
Received | 30 Aug 2015, 11:27:41 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 957844 |
Run time | 12 hours 14 min 26 sec |
CPU time | 12 hours 14 min 26 sec |
Validate state | Invalid |
Credit | 256.81 |
Device peak FLOPS | 2.93 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v7.27 windows_intelx86 |
Stderr | <core_client_version>6.4.7</core_client_version> <![CDATA[ <stderr_txt> Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7884, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1000, selfPID=5460, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... 17:49:42 (5140): No heartbeat from client for 30 sec - exiting 17:49:42 (5140): timer handler: client dead, exiting 17:49:43 (5140): No heartbeat from client for 30 sec - exiting 17:49:43 (5140): timer handler: client dead, exiting 17:49:44 (5140): No heartbeat from client for 30 sec - exiting 17:49:44 (5140): timer handler: client dead, exiting 17:49:45 (5140): No heartbeat from client for 30 sec - exiting 17:49:45 (5140): timer handler: client dead, exiting 17:49:46 (5140): No heartbeat from client for 30 sec - exiting 17:49:46 (5140): timer handler: client dead, exiting 17:49:47 (5140): No heartbeat from client for 30 sec - exiting 17:49:47 (5140): timer handler: client dead, exiting 17:49:48 (5140): No heartbeat from client for 30 sec - exiting 17:49:48 (5140): timer handler: client dead, exiting 17:49:49 (5140): No heartbeat from client for 30 sec - exiting 17:49:49 (5140): timer handler: client dead, exiting 17:49:50 (5140): No heartbeat from client for 30 sec - exiting 17:49:50 (5140): timer handler: client dead, exiting 17:49:51 (5140): No heartbeat from client for 30 sec - exiting 17:49:51 (5140): timer handler: client dead, exiting 17:49:52 (5140): No heartbeat from client for 30 sec - exiting 17:49:52 (5140): timer handler: client dead, exiting 17:49:53 (5140): No heartbeat from client for 30 sec - exiting 17:49:53 (5140): timer handler: client dead, exiting 17:49:54 (5140): No heartbeat from client for 30 sec - exiting 17:49:54 (5140): timer handler: client dead, exiting 17:49:55 (5140): No heartbeat from client for 30 sec - exiting 17:49:55 (5140): timer handler: client dead, exiting 17:49:56 (5140): No heartbeat from client for 30 sec - exiting 17:49:56 (5140): timer handler: client dead, exiting 17:49:57 (5140): No heartbeat from client for 30 sec - exiting 17:49:57 (5140): timer handler: client dead, exiting 17:49:58 (5140): No heartbeat from client for 30 sec - exiting 17:49:58 (5140): timer handler: client dead, exiting 17:49:59 (5140): No heartbeat from client for 30 sec - exiting 17:49:59 (5140): timer handler: client dead, exiting 17:50:00 (5140): No heartbeat from client for 30 sec - exiting 17:50:00 (5140): timer handler: client dead, exiting 17:50:01 (5140): No heartbeat from client for 30 sec - exiting 17:50:01 (5140): timer handler: client dead, exiting 17:50:02 (5140): No heartbeat from client for 30 sec - exiting 17:50:02 (5140): timer handler: client dead, exiting 17:50:03 (5140): No heartbeat from client for 30 sec - exiting 17:50:03 (5140): timer handler: client dead, exiting 17:50:04 (5140): No heartbeat from client for 30 sec - exiting 17:50:04 (5140): timer handler: client dead, exiting 17:50:05 (5140): No heartbeat from client for 30 sec - exiting 17:50:05 (5140): timer handler: client dead, exiting 17:50:06 (5140): No heartbeat from client for 30 sec - exiting 17:50:06 (5140): timer handler: client dead, exiting 17:50:07 (5140): No heartbeat from client for 30 sec - exiting 17:50:07 (5140): timer handler: client dead, exiting 17:50:08 (5140): No heartbeat from client for 30 sec - exiting 17:50:08 (5140): timer handler: client dead, exiting 17:50:09 (5140): No heartbeat from client for 30 sec - exiting 17:50:09 (5140): timer handler: client dead, exiting 17:50:10 (5140): No heartbeat from client for 30 sec - exiting 17:50:10 (5140): timer handler: client dead, exiting 17:50:11 (5140): No heartbeat from client for 30 sec - exiting 17:50:11 (5140): timer handler: client dead, exiting 17:50:12 (5140): No heartbeat from client for 30 sec - exiting 17:50:12 (5140): timer handler: client dead, exiting 17:50:13 (5140): No heartbeat from client for 30 sec - exiting 17:50:13 (5140): timer handler: client dead, exiting 17:50:14 (5140): No heartbeat from client for 30 sec - exiting 17:50:14 (5140): timer handler: client dead, exiting 17:50:15 (5140): No heartbeat from client for 30 sec - exiting 17:50:15 (5140): timer handler: client dead, exiting 17:50:16 (5140): No heartbeat from client for 30 sec - exiting 17:50:16 (5140): timer handler: client dead, exiting 17:50:17 (5140): No heartbeat from client for 30 sec - exiting 17:50:17 (5140): timer handler: client dead, exiting 17:50:18 (5140): No heartbeat from client for 30 sec - exiting 17:50:18 (5140): timer handler: client dead, exiting 17:50:19 (5140): No heartbeat from client for 30 sec - exiting 17:50:19 (5140): timer handler: client dead, exiting 17:50:20 (5140): No heartbeat from client for 30 sec - exiting 17:50:20 (5140): timer handler: client dead, exiting 17:50:21 (5140): No heartbeat from client for 30 sec - exiting 17:50:21 (5140): timer handler: client dead, exiting 17:50:22 (5140): No heartbeat from client for 30 sec - exiting 17:50:22 (5140): timer handler: client dead, exiting 17:50:23 (5140): No heartbeat from client for 30 sec - exiting 17:50:23 (5140): timer handler: client dead, exiting 17:50:24 (5140): No heartbeat from client for 30 sec - exiting 17:50:24 (5140): timer handler: client dead, exiting 17:50:25 (5140): No heartbeat from client for 30 sec - exiting 17:50:25 (5140): timer handler: client dead, exiting 17:50:26 (5140): No heartbeat from client for 30 sec - exiting 17:50:26 (5140): timer handler: client dead, exiting 17:50:27 (5140): No heartbeat from client for 30 sec - exiting 17:50:27 (5140): timer handler: client dead, exiting 17:50:28 (5140): No heartbeat from client for 30 sec - exiting 17:50:28 (5140): timer handler: client dead, exiting 17:50:29 (5140): No heartbeat from client for 30 sec - exiting 17:50:29 (5140): timer handler: client dead, exiting 17:50:30 (5140): No heartbeat from client for 30 sec - exiting 17:50:30 (5140): timer handler: client dead, exiting 17:50:31 (5140): No heartbeat from client for 30 sec - exiting 17:50:31 (5140): timer handler: client dead, exiting 17:50:32 (5140): No heartbeat from client for 30 sec - exiting 17:50:32 (5140): timer handler: client dead, exiting 17:50:33 (5140): No heartbeat from client for 30 sec - exiting 17:50:33 (5140): timer handler: client dead, exiting 17:50:34 (5140): No heartbeat from client for 30 sec - exiting 17:50:34 (5140): timer handler: client dead, exiting 17:50:35 (5140): No heartbeat from client for 30 sec - exiting 17:50:35 (5140): timer handler: client dead, exiting 17:50:36 (5140): No heartbeat from client for 30 sec - exiting 17:50:36 (5140): timer handler: client dead, exiting 17:50:37 (5140): No heartbeat from client for 30 sec - exiting 17:50:37 (5140): timer handler: client dead, exiting 17:50:38 (5140): No heartbeat from client for 30 sec - exiting 17:50:38 (5140): timer handler: client dead, exiting 17:50:39 (5140): No heartbeat from client for 30 sec - exiting 17:50:39 (5140): timer handler: client dead, exiting 17:50:40 (5140): No heartbeat from client for 30 sec - exiting 17:50:40 (5140): timer handler: client dead, exiting 17:50:41 (5140): No heartbeat from client for 30 sec - exiting 17:50:41 (5140): timer handler: client dead, exiting 17:50:42 (5140): No heartbeat from client for 30 sec - exiting 17:50:42 (5140): timer handler: client dead, exiting 17:50:43 (5140): No heartbeat from client for 30 sec - exiting 17:50:43 (5140): timer handler: client dead, exiting 17:50:44 (5140): No heartbeat from client for 30 sec - exiting 17:50:44 (5140): timer handler: client dead, exiting 17:50:45 (5140): No heartbeat from client for 30 sec - exiting 17:50:45 (5140): timer handler: client dead, exiting 17:50:46 (5140): No heartbeat from client for 30 sec - exiting 17:50:46 (5140): timer handler: client dead, exiting 17:50:47 (5140): No heartbeat from client for 30 sec - exiting 17:50:47 (5140): timer handler: client dead, exiting 17:50:48 (5140): No heartbeat from client for 30 sec - exiting 17:50:48 (5140): timer handler: client dead, exiting 17:50:49 (5140): No heartbeat from client for 30 sec - exiting 17:50:49 (5140): timer handler: client dead, exiting 17:50:50 (5140): No heartbeat from client for 30 sec - exiting 17:50:50 (5140): timer handler: client dead, exiting 17:50:51 (5140): No heartbeat from client for 30 sec - exiting 17:50:51 (5140): timer handler: client dead, exiting 17:50:52 (5140): No heartbeat from client for 30 sec - exiting 17:50:52 (5140): timer handler: client dead, exiting 17:50:53 (5140): No heartbeat from client for 30 sec - exiting 17:50:53 (5140): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3828, selfPID=3828, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3364, iMonCtr=2 Model crash detected, will try to restart... 22:55:00 (5916): No heartbeat from client for 30 sec - exiting 22:55:00 (5916): timer handler: client dead, exiting 22:55:01 (5916): No heartbeat from client for 30 sec - exiting 22:55:01 (5916): timer handler: client dead, exiting 22:55:02 (5916): No heartbeat from client for 30 sec - exiting 22:55:02 (5916): timer handler: client dead, exiting 22:55:03 (5916): No heartbeat from client for 30 sec - exiting 22:55:03 (5916): timer handler: client dead, exiting 22:55:04 (5916): No heartbeat from client for 30 sec - exiting 22:55:04 (5916): timer handler: client dead, exiting 22:55:05 (5916): No heartbeat from client for 30 sec - exiting 22:55:05 (5916): timer handler: client dead, exiting 22:55:06 (5916): No heartbeat from client for 30 sec - exiting 22:55:06 (5916): timer handler: client dead, exiting 22:55:07 (5916): No heartbeat from client for 30 sec - exiting 22:55:07 (5916): timer handler: client dead, exiting 22:55:08 (5916): No heartbeat from client for 30 sec - exiting 22:55:08 (5916): timer handler: client dead, exiting 22:55:09 (5916): No heartbeat from client for 30 sec - exiting 22:55:09 (5916): timer handler: client dead, exiting 22:55:10 (5916): No heartbeat from client for 30 sec - exiting 22:55:10 (5916): timer handler: client dead, exiting 22:55:11 (5916): No heartbeat from client for 30 sec - exiting 22:55:11 (5916): timer handler: client dead, exiting 22:55:12 (5916): No heartbeat from client for 30 sec - exiting 22:55:12 (5916): timer handler: client dead, exiting 22:55:13 (5916): No heartbeat from client for 30 sec - exiting 22:55:13 (5916): timer handler: client dead, exiting 22:55:14 (5916): No heartbeat from client for 30 sec - exiting 22:55:14 (5916): timer handler: client dead, exiting 22:55:15 (5916): No heartbeat from client for 30 sec - exiting 22:55:15 (5916): timer handler: client dead, exiting 22:55:16 (5916): No heartbeat from client for 30 sec - exiting 22:55:16 (5916): timer handler: client dead, exiting 22:55:17 (5916): No heartbeat from client for 30 sec - exiting 22:55:17 (5916): timer handler: client dead, exiting 22:55:18 (5916): No heartbeat from client for 30 sec - exiting 22:55:18 (5916): timer handler: client dead, exiting 22:55:19 (5916): No heartbeat from client for 30 sec - exiting 22:55:19 (5916): timer handler: client dead, exiting 22:55:20 (5916): No heartbeat from client for 30 sec - exiting 22:55:20 (5916): timer handler: client dead, exiting 22:55:21 (5916): No heartbeat from client for 30 sec - exiting 22:55:21 (5916): timer handler: client dead, exiting 22:55:22 (5916): No heartbeat from client for 30 sec - exiting 22:55:22 (5916): timer handler: client dead, exiting 22:55:24 (5916): No heartbeat from client for 30 sec - exiting 22:55:24 (5916): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:14:37 (1568): No heartbeat from client for 30 sec - exiting 02:14:37 (1568): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:26:38 (2932): No heartbeat from client for 30 sec - exiting 03:26:38 (2932): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 11 received, exiting... 03:26:41 (3824): called boinc_finish(193) </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_pnw_x6ti_2006_1_010136494_0_2.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_x6ti_2006_1_010136494_0_3.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_x6ti_2006_1_010136494_0_4.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_x6ti_2006_1_010136494_0_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_x6ti_2006_1_010136494_0_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_x6ti_2006_1_010136494_0_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_x6ti_2006_1_010136494_0_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_x6ti_2006_1_010136494_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_x6ti_2006_1_010136494_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_x6ti_2006_1_010136494_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_x6ti_2006_1_010136494_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_x6ti_2006_1_010136494_0_13.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
29 Aug 2015 01:09:26 | 957844 | 18845803 | hadam3p_pnw_x6ti_2006_1_010136494_0 | 11,819 | 43,889 | 3.7134 |
©2024 cpdn.org