Name | hadam3p_pnw_yt35_1976_1_006883161_2 |
Workunit | 7086477 |
Created | 24 Apr 2012, 15:27:55 UTC |
Sent | 24 Apr 2012, 15:40:37 UTC |
Report deadline | 6 Apr 2013, 21:00:37 UTC |
Received | 28 Apr 2012, 17:14:03 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 962831 |
Run time | 1 days 9 hours 34 min 31 sec |
CPU time | 1 days 9 hours 0 min 37 sec |
Validate state | Invalid |
Credit | 1,003.35 |
Device peak FLOPS | 3.09 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:40:10 (18980): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:40:11 (18980): No heartbeat from core client for 30 sec - exiting 16:41:41 (8588): No heartbeat from core client for 30 sec - exiting 16:41:42 (8588): No heartbeat from core client for 30 sec - exiting 16:41:43 (8588): No heartbeat from core client for 30 sec - exiting 16:41:44 (8588): No heartbeat from core client for 30 sec - exiting 16:41:45 (8588): No heartbeat from core client for 30 sec - exiting 16:41:46 (8588): No heartbeat from core client for 30 sec - exiting 16:41:47 (8588): No heartbeat from core client for 30 sec - exiting 16:41:48 (8588): No heartbeat from core client for 30 sec - exiting 16:41:49 (8588): No heartbeat from core client for 30 sec - exiting 16:41:50 (8588): No heartbeat from core client for 30 sec - exiting 16:41:51 (8588): No heartbeat from core client for 30 sec - exiting 16:41:52 (8588): No heartbeat from core client for 30 sec - exiting 16:41:53 (8588): No heartbeat from core client for 30 sec - exiting 16:41:54 (8588): No heartbeat from core client for 30 sec - exiting 16:41:56 (8588): No heartbeat from core client for 30 sec - exiting 16:41:57 (8588): No heartbeat from core client for 30 sec - exiting 16:41:58 (8588): No heartbeat from core client for 30 sec - exiting 16:41:59 (8588): No heartbeat from core client for 30 sec - exiting 16:42:00 (8588): No heartbeat from core client for 30 sec - exiting 16:42:01 (8588): No heartbeat from core client for 30 sec - exiting 16:42:02 (8588): No heartbeat from core client for 30 sec - exiting 16:42:03 (8588): No heartbeat from core client for 30 sec - exiting 16:42:04 (8588): No heartbeat from core client for 30 sec - exiting 16:42:05 (8588): No heartbeat from core client for 30 sec - exiting 16:42:06 (8588): No heartbeat from core client for 30 sec - exiting 16:42:07 (8588): No heartbeat from core client for 30 sec - exiting 16:42:08 (8588): No heartbeat from core client for 30 sec - exiting 16:42:09 (8588): No heartbeat from core client for 30 sec - exiting 16:42:10 (8588): No heartbeat from core client for 30 sec - exiting 16:42:11 (8588): No heartbeat from core client for 30 sec - exiting 16:42:12 (8588): No heartbeat from core client for 30 sec - exiting 16:42:13 (8588): No heartbeat from core client for 30 sec - exiting 16:42:14 (8588): No heartbeat from core client for 30 sec - exiting 16:42:15 (8588): No heartbeat from core client for 30 sec - exiting 16:42:16 (8588): No heartbeat from core client for 30 sec - exiting 16:42:17 (8588): No heartbeat from core client for 30 sec - exiting 16:42:18 (8588): No heartbeat from core client for 30 sec - exiting 16:42:19 (8588): No heartbeat from core client for 30 sec - exiting 16:42:20 (8588): No heartbeat from core client for 30 sec - exiting 16:42:21 (8588): No heartbeat from core client for 30 sec - exiting 16:42:22 (8588): No heartbeat from core client for 30 sec - exiting 16:42:23 (8588): No heartbeat from core client for 30 sec - exiting 16:42:24 (8588): No heartbeat from core client for 30 sec - exiting 16:42:25 (8588): No heartbeat from core client for 30 sec - exiting 16:42:26 (8588): No heartbeat from core client for 30 sec - exiting 16:42:27 (8588): No heartbeat from core client for 30 sec - exiting 16:42:28 (8588): No heartbeat from core client for 30 sec - exiting 16:42:29 (8588): No heartbeat from core client for 30 sec - exiting 16:42:30 (8588): No heartbeat from core client for 30 sec - exiting 16:42:31 (8588): No heartbeat from core client for 30 sec - exiting 16:42:32 (8588): No heartbeat from core client for 30 sec - exiting 16:42:33 (8588): No heartbeat from core client for 30 sec - exiting 16:42:34 (8588): No heartbeat from core client for 30 sec - exiting 16:42:35 (8588): No heartbeat from core client for 30 sec - exiting 16:42:36 (8588): No heartbeat from core client for 30 sec - exiting Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=27188, iMonCtr=1 16:43:38 (8588): No heartbeat from core client for 30 sec - exiting 16:43:39 (8588): No heartbeat from core client for 30 sec - exiting 16:43:40 (8588): No heartbeat from core client for 30 sec - exiting 16:43:41 (8588): No heartbeat from core client for 30 sec - exiting 16:43:42 (8588): No heartbeat from core client for 30 sec - exiting 16:43:43 (8588): No heartbeat from core client for 30 sec - exiting 16:43:44 (8588): No heartbeat from core client for 30 sec - exiting 16:43:45 (8588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:45:20 (28184): No heartbeat from core client for 30 sec - exiting Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=23632, iMonCtr=1 CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=27140, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=28224, selfPID=15080, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 4 Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_pnw_yt35_1976_1_006883161_2_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_yt35_1976_1_006883161_2_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_yt35_1976_1_006883161_2_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_yt35_1976_1_006883161_2_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_yt35_1976_1_006883161_2_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_yt35_1976_1_006883161_2_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_yt35_1976_1_006883161_2_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_yt35_1976_1_006883161_2_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
28 Apr 2012 11:24:55 | 962831 | 14585281 | hadam3p_pnw_yt35_1976_1_006883161_2 | 46,176 | 112,613 | 2.4388 |
27 Apr 2012 16:45:45 | 962831 | 14585281 | hadam3p_pnw_yt35_1976_1_006883161_2 | 34,656 | 84,830 | 2.4478 |
27 Apr 2012 06:24:02 | 962831 | 14585281 | hadam3p_pnw_yt35_1976_1_006883161_2 | 23,136 | 56,678 | 2.4498 |
26 Apr 2012 12:00:58 | 962831 | 14585281 | hadam3p_pnw_yt35_1976_1_006883161_2 | 11,616 | 29,038 | 2.4998 |
©2024 cpdn.org