Name | hadam3p_eu_n8e9_2013_1_008805377_0 |
Workunit | 8951355 |
Created | 7 Jul 2014, 16:35:32 UTC |
Sent | 2 Aug 2014, 9:21:11 UTC |
Report deadline | 15 Jul 2015, 14:41:11 UTC |
Received | 5 Aug 2014, 8:59:04 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1328006 |
Run time | 16 hours 19 min 2 sec |
CPU time | 14 hours 59 min 22 sec |
Validate state | Invalid |
Credit | 399.11 |
Device peak FLOPS | 3.05 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 23:45:47 (7060): No heartbeat from core client for 30 sec - exiting 23:45:48 (7060): No heartbeat from core client for 30 sec - exiting 23:45:49 (7060): No heartbeat from core client for 30 sec - exiting 23:45:50 (7060): No heartbeat from core client for 30 sec - exiting 23:45:51 (7060): No heartbeat from core client for 30 sec - exiting 23:45:52 (7060): No heartbeat from core client for 30 sec - exiting 23:45:53 (7060): No heartbeat from core client for 30 sec - exiting 23:45:54 (7060): No heartbeat from core client for 30 sec - exiting 23:45:55 (7060): No heartbeat from core client for 30 sec - exiting 23:45:56 (7060): No heartbeat from core client for 30 sec - exiting 23:45:57 (7060): No heartbeat from core client for 30 sec - exiting 23:45:58 (7060): No heartbeat from core client for 30 sec - exiting 23:45:59 (7060): No heartbeat from core client for 30 sec - exiting 23:46:00 (7060): No heartbeat from core client for 30 sec - exiting 23:46:01 (7060): No heartbeat from core client for 30 sec - exiting 23:46:02 (7060): No heartbeat from core client for 30 sec - exiting 23:46:03 (7060): No heartbeat from core client for 30 sec - exiting 23:46:04 (7060): No heartbeat from core client for 30 sec - exiting 23:46:05 (7060): No heartbeat from core client for 30 sec - exiting 23:46:06 (7060): No heartbeat from core client for 30 sec - exiting 
23:46:07 (7060): No heartbeat from core client for 30 sec - exiting 23:46:08 (7060): No heartbeat from core client for 30 sec - exiting 23:46:09 (7060): No heartbeat from core client for 30 sec - exiting 23:46:10 (7060): No heartbeat from core client for 30 sec - exiting 23:46:11 (7060): No heartbeat from core client for 30 sec - exiting 23:46:12 (7060): No heartbeat from core client for 30 sec - exiting 23:46:13 (7060): No heartbeat from core client for 30 sec - exiting 23:46:14 (7060): No heartbeat from core client for 30 sec - exiting 23:46:15 (7060): No heartbeat from core client for 30 sec - exiting 23:46:16 (7060): No heartbeat from core client for 30 sec - exiting 23:46:17 (7060): No heartbeat from core client for 30 sec - exiting 23:46:18 (7060): No heartbeat from core client for 30 sec - exiting 23:46:19 (7060): No heartbeat from core client for 30 sec - exiting 23:46:20 (7060): No heartbeat from core client for 30 sec - exiting 23:46:21 (7060): No heartbeat from core client for 30 sec - exiting 23:46:22 (7060): No heartbeat from core client for 30 sec - exiting 23:46:23 (7060): No heartbeat from core client for 30 sec - exiting 23:46:24 (7060): No heartbeat from core client for 30 sec - exiting 23:46:25 (7060): No heartbeat from core client for 30 sec - exiting 23:46:26 (7060): No heartbeat from core client for 30 sec - exiting 23:46:27 (7060): No heartbeat from core client for 30 sec - exiting 23:46:28 (7060): No heartbeat from core client for 30 sec - exiting 23:46:29 (7060): No heartbeat from core client for 30 sec - exiting 23:46:30 (7060): No heartbeat from core client for 30 sec - exiting 23:46:31 (7060): No heartbeat from core client for 30 sec - exiting 23:46:32 (7060): No heartbeat from core client for 30 sec - exiting 23:46:33 (7060): No heartbeat from core client for 30 sec - exiting 23:46:34 (7060): No heartbeat from core client for 30 sec - exiting 23:46:35 (7060): No heartbeat from core client for 30 sec - exiting 23:46:36 (7060): No 
heartbeat from core client for 30 sec - exiting 23:46:37 (7060): No heartbeat from core client for 30 sec - exiting 23:46:38 (7060): No heartbeat from core client for 30 sec - exiting 23:46:39 (7060): No heartbeat from core client for 30 sec - exiting 23:46:40 (7060): No heartbeat from core client for 30 sec - exiting 23:46:41 (7060): No heartbeat from core client for 30 sec - exiting 23:46:42 (7060): No heartbeat from core client for 30 sec - exiting 23:46:43 (7060): No heartbeat from core client for 30 sec - exiting 23:46:44 (7060): No heartbeat from core client for 30 sec - exiting 23:46:46 (7060): No heartbeat from core client for 30 sec - exiting 23:46:47 (7060): No heartbeat from core client for 30 sec - exiting 23:46:48 (7060): No heartbeat from core client for 30 sec - exiting 23:46:49 (7060): No heartbeat from core client for 30 sec - exiting 23:46:50 (7060): No heartbeat from core client for 30 sec - exiting 23:46:51 (7060): No heartbeat from core client for 30 sec - exiting 23:46:52 (7060): No heartbeat from core client for 30 sec - exiting 23:46:53 (7060): No heartbeat from core client for 30 sec - exiting 23:46:54 (7060): No heartbeat from core client for 30 sec - exiting 23:46:55 (7060): No heartbeat from core client for 30 sec - exiting 23:46:56 (7060): No heartbeat from core client for 30 sec - exiting 23:46:57 (7060): No heartbeat from core client for 30 sec - exiting 23:46:58 (7060): No heartbeat from core client for 30 sec - exiting 23:46:59 (7060): No heartbeat from core client for 30 sec - exiting 23:47:00 (7060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6620, selfPID=5804, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7120, selfPID=5980, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_n8e9_2013_1_008805377_0_3.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_n8e9_2013_1_008805377_0_4.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_n8e9_2013_1_008805377_0_5.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_n8e9_2013_1_008805377_0_6.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_n8e9_2013_1_008805377_0_7.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_n8e9_2013_1_008805377_0_8.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_n8e9_2013_1_008805377_0_9.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> 
<file_name>hadam3p_eu_n8e9_2013_1_008805377_0_10.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_n8e9_2013_1_008805377_0_11.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_n8e9_2013_1_008805377_0_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
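The `<message>` part of the stderr above lists one `<file_xfer_error>` entry per output zip that could not be uploaded. A minimal sketch of pulling those entries out of the flattened text follows. The names `SAMPLE`, `XFER_RE` and `parse_xfer_errors` are illustrative and not part of BOINC or CPDN; the regular expression assumes the entries always have the shape shown in this log.

```python
import re
from collections import Counter

# Two entries copied from the <message> section of the stderr above.
SAMPLE = (
    "<message> upload failure: "
    "<file_xfer_error> <file_name>hadam3p_eu_n8e9_2013_1_008805377_0_3.zip</file_name> "
    "<error_code>-161 (not found)</error_code> </file_xfer_error> "
    "<file_xfer_error> <file_name>hadam3p_eu_n8e9_2013_1_008805377_0_4.zip</file_name> "
    "<error_code>-161 (not found)</error_code> </file_xfer_error> </message>"
)

# Assumed entry layout: file name, then a numeric code with a reason in parentheses.
XFER_RE = re.compile(
    r"<file_xfer_error>\s*"
    r"<file_name>(?P<name>[^<]+)</file_name>\s*"
    r"<error_code>(?P<code>-?\d+)\s*\((?P<reason>[^)]*)\)</error_code>\s*"
    r"</file_xfer_error>"
)

def parse_xfer_errors(text):
    """Return (file_name, error_code, reason) tuples found in the text."""
    return [(m["name"], int(m["code"]), m["reason"]) for m in XFER_RE.finditer(text)]

errors = parse_xfer_errors(SAMPLE)
# Tally how often each (code, reason) pair occurs.
by_code = Counter((code, reason) for _, code, reason in errors)
for (code, reason), count in by_code.items():
    print(f"{count} upload(s) failed with {code} ({reason})")
```

Run against the full stderr, the same function would report the ten missing files numbered `_3.zip` through `_12.zip`, all with code -161.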
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
04 Aug 2014 19:49:29 | 1328006 | 16722831 | hadam3p_eu_n8e9_2013_1_008805377_0 | 23,136 | 47,419 | 2.0496 |
04 Aug 2014 12:22:43 | 1328006 | 16722831 | hadam3p_eu_n8e9_2013_1_008805377_0 | 11,616 | 23,898 | 2.0573 |
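The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep reached. That is an inference from the two rows above, not a documented CPDN formula. A short sketch that checks it is below; the `Trickle` class and function names are illustrative only.

```python
from dataclasses import dataclass

@dataclass
class Trickle:
    timestep: int    # model timestep reached when the trickle was sent
    cpu_time: int    # cumulative CPU time in seconds

# The two trickle rows from the table above, oldest first.
trickles = [
    Trickle(timestep=11_616, cpu_time=23_898),
    Trickle(timestep=23_136, cpu_time=47_419),
]

def average_sec_per_ts(t: Trickle) -> float:
    """Cumulative average: total CPU seconds per timestep so far."""
    return t.cpu_time / t.timestep

def interval_sec_per_ts(earlier: Trickle, later: Trickle) -> float:
    """Average over the interval between two trickles only."""
    return (later.cpu_time - earlier.cpu_time) / (later.timestep - earlier.timestep)

for t in trickles:
    print(t.timestep, round(average_sec_per_ts(t), 4))

print("between trickles:", round(interval_sec_per_ts(*trickles), 4))
```

The cumulative values round to the figures in the table (2.0573 and 2.0496). The interval figure shows the rate over the second stretch alone, which the table does not report.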
©2024 cpdn.org