Name | hadam3p_anz_f28j_2013_1_009729065_0 |
Workunit | 9800910 |
Created | 8 Apr 2015, 19:47:21 UTC |
Sent | 10 Apr 2015, 1:36:29 UTC |
Report deadline | 22 Mar 2016, 6:56:29 UTC |
Received | 13 Apr 2015, 3:59:32 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1318040 |
Run time | 2 days 13 hours 32 min 1 sec |
CPU time | 2 days 12 hours 50 min 49 sec |
Validate state | Invalid |
Credit | 2,000.18 |
Device peak FLOPS | 3.69 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.4.36</core_client_version> <![CDATA[ <stderr_txt> 21:47:29 (11944): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:05:01 (9668): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:48:46 (10740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:50:22 (12228): No heartbeat from core client for 30 sec - exiting 22:50:23 (12228): No heartbeat from core client for 30 sec - exiting 22:50:24 (12228): No heartbeat from core client for 30 sec - exiting 22:50:25 (12228): No heartbeat from core client for 30 sec - exiting 22:50:26 (12228): No heartbeat from core client for 30 sec - exiting 22:50:28 (12228): No heartbeat from core client for 30 sec - exiting 22:50:29 (12228): No heartbeat from core client for 30 sec - exiting 22:50:30 (12228): No heartbeat from core client for 30 sec - exiting 22:50:31 (12228): No heartbeat from core client for 30 sec - exiting 22:50:32 (12228): No heartbeat from core client for 30 sec - exiting 22:50:33 (12228): No heartbeat from core client for 30 sec - exiting 22:50:34 (12228): No heartbeat from core client for 30 sec - exiting 22:50:35 (12228): No heartbeat from core client for 30 sec - exiting 22:50:36 (12228): No heartbeat from core client for 30 sec - exiting 22:50:37 (12228): No heartbeat from core client for 30 sec - exiting 22:50:39 (12228): No heartbeat from core client for 30 sec - exiting 22:50:40 (12228): No heartbeat from core client for 30 sec - exiting 22:50:41 (12228): No heartbeat from core client for 30 sec - exiting 22:50:42 (12228): No heartbeat from core client for 30 sec - exiting 22:50:43 (12228): No heartbeat from core client for 30 sec - exiting 22:50:44 (12228): No heartbeat from core client for 30 sec - exiting 22:50:45 (12228): No heartbeat from core client for 30 sec - exiting 22:50:46 (12228): No heartbeat from core client for 30 sec - 
exiting 22:50:47 (12228): No heartbeat from core client for 30 sec - exiting 22:50:48 (12228): No heartbeat from core client for 30 sec - exiting 22:50:49 (12228): No heartbeat from core client for 30 sec - exiting 22:50:51 (12228): No heartbeat from core client for 30 sec - exiting 22:50:52 (12228): No heartbeat from core client for 30 sec - exiting 22:50:53 (12228): No heartbeat from core client for 30 sec - exiting 22:50:54 (12228): No heartbeat from core client for 30 sec - exiting 22:50:55 (12228): No heartbeat from core client for 30 sec - exiting 22:50:56 (12228): No heartbeat from core client for 30 sec - exiting 22:50:57 (12228): No heartbeat from core client for 30 sec - exiting 22:50:58 (12228): No heartbeat from core client for 30 sec - exiting 22:50:59 (12228): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 01:17:51 (7492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:19:10 (11092): No heartbeat from core client for 30 sec - exiting 01:19:11 (11092): No heartbeat from core client for 30 sec - exiting 01:19:12 (11092): No heartbeat from core client for 30 sec - exiting 01:19:13 (11092): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:20:02 (9836): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
02:24:49 (11356): No heartbeat from core client for 30 sec - exiting 02:24:50 (11356): No heartbeat from core client for 30 sec - exiting 02:24:51 (11356): No heartbeat from core client for 30 sec - exiting 02:24:52 (11356): No heartbeat from core client for 30 sec - exiting 02:24:53 (11356): No heartbeat from core client for 30 sec - exiting 02:24:54 (11356): No heartbeat from core client for 30 sec - exiting 02:24:55 (11356): No heartbeat from core client for 30 sec - exiting 02:24:56 (11356): No heartbeat from core client for 30 sec - exiting 02:24:57 (11356): No heartbeat from core client for 30 sec - exiting 02:24:59 (11356): No heartbeat from core client for 30 sec - exiting 02:25:00 (11356): No heartbeat from core client for 30 sec - exiting 02:25:01 (11356): No heartbeat from core client for 30 sec - exiting 02:25:02 (11356): No heartbeat from core client for 30 sec - exiting 02:25:03 (11356): No heartbeat from core client for 30 sec - exiting 02:25:04 (11356): No heartbeat from core client for 30 sec - exiting 02:25:05 (11356): No heartbeat from core client for 30 sec - exiting 02:25:06 (11356): No heartbeat from core client for 30 sec - exiting 02:25:07 (11356): No heartbeat from core client for 30 sec - exiting 02:25:08 (11356): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:03:10 (11872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
05:08:29 (9980): No heartbeat from core client for 30 sec - exiting 05:08:30 (9980): No heartbeat from core client for 30 sec - exiting 05:08:31 (9980): No heartbeat from core client for 30 sec - exiting 05:08:32 (9980): No heartbeat from core client for 30 sec - exiting 05:08:33 (9980): No heartbeat from core client for 30 sec - exiting 05:08:35 (9980): No heartbeat from core client for 30 sec - exiting 05:08:36 (9980): No heartbeat from core client for 30 sec - exiting 05:08:37 (9980): No heartbeat from core client for 30 sec - exiting 05:08:38 (9980): No heartbeat from core client for 30 sec - exiting 05:08:39 (9980): No heartbeat from core client for 30 sec - exiting 05:08:40 (9980): No heartbeat from core client for 30 sec - exiting 05:08:41 (9980): No heartbeat from core client for 30 sec - exiting 05:08:42 (9980): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:08:43 (9980): No heartbeat from core client for 30 sec - exiting 05:29:46 (8408): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11668, selfPID=11668, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 
11:56:11 (4808): No heartbeat from core client for 30 sec - exiting 11:56:12 (4808): No heartbeat from core client for 30 sec - exiting 11:56:13 (4808): No heartbeat from core client for 30 sec - exiting 11:56:14 (4808): No heartbeat from core client for 30 sec - exiting 11:56:15 (4808): No heartbeat from core client for 30 sec - exiting 11:56:16 (4808): No heartbeat from core client for 30 sec - exiting 11:56:17 (4808): No heartbeat from core client for 30 sec - exiting 11:56:18 (4808): No heartbeat from core client for 30 sec - exiting 11:56:19 (4808): No heartbeat from core client for 30 sec - exiting 11:56:20 (4808): No heartbeat from core client for 30 sec - exiting 11:56:21 (4808): No heartbeat from core client for 30 sec - exiting 11:56:22 (4808): No heartbeat from core client for 30 sec - exiting 11:56:23 (4808): No heartbeat from core client for 30 sec - exiting 11:56:24 (4808): No heartbeat from core client for 30 sec - exiting 11:56:25 (4808): No heartbeat from core client for 30 sec - exiting 11:56:26 (4808): No heartbeat from core client for 30 sec - exiting 11:56:27 (4808): No heartbeat from core client for 30 sec - exiting 11:56:28 (4808): No heartbeat from core client for 30 sec - exiting 11:56:29 (4808): No heartbeat from core client for 30 sec - exiting 11:56:30 (4808): No heartbeat from core client for 30 sec - exiting 11:56:31 (4808): No heartbeat from core client for 30 sec - exiting 11:56:32 (4808): No heartbeat from core client for 30 sec - exiting 11:56:33 (4808): No heartbeat from core client for 30 sec - exiting 11:56:34 (4808): No heartbeat from core client for 30 sec - exiting 11:56:35 (4808): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 
13:55:55 (11068): No heartbeat from core client for 30 sec - exiting 13:55:56 (11068): No heartbeat from core client for 30 sec - exiting 13:55:57 (11068): No heartbeat from core client for 30 sec - exiting 13:55:58 (11068): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7272, selfPID=7272, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2880, selfPID=10572, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_anz_f28j_2013_1_009729065_0_5.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_f28j_2013_1_009729065_0_6.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_f28j_2013_1_009729065_0_7.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_f28j_2013_1_009729065_0_8.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_f28j_2013_1_009729065_0_9.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_f28j_2013_1_009729065_0_10.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_f28j_2013_1_009729065_0_11.zip</file_name> 
<error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_f28j_2013_1_009729065_0_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
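The stderr output above repeats the same "No heartbeat from core client" message many times under a handful of process IDs. As a rough aid for reading it, the sketch below tallies those messages per PID. It is only an illustration: the regular expression, the function name `heartbeat_counts`, and the `EXCERPT` constant are assumptions made for this example, and the excerpt is a short fragment copied from the log rather than the full text.

```python
import re
from collections import Counter

# Matches entries such as:
#   "01:19:10 (11092): No heartbeat from core client for 30 sec - exiting"
# The pattern is an assumption based on the message format seen in this log.
HEARTBEAT = re.compile(
    r"(\d{2}:\d{2}:\d{2}) \((\d+)\): No heartbeat from core client"
)


def heartbeat_counts(stderr_text):
    """Count 'No heartbeat' messages per process ID (hypothetical helper)."""
    return Counter(int(pid) for _time, pid in HEARTBEAT.findall(stderr_text))


# Short fragment copied from the stderr output above, not the complete log.
EXCERPT = (
    "01:17:51 (7492): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
    "01:19:10 (11092): No heartbeat from core client for 30 sec - exiting "
    "01:19:11 (11092): No heartbeat from core client for 30 sec - exiting "
    "01:19:12 (11092): No heartbeat from core client for 30 sec - exiting "
    "01:19:13 (11092): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC..."
)

counts = heartbeat_counts(EXCERPT)
for pid, n in sorted(counts.items()):
    print(f"PID {pid}: {n} heartbeat message(s)")
```

Running the same function over the full stderr text would show which processes logged the message once and which repeated it for tens of seconds before exiting.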
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
12 Apr 2015 18:28:14 | 1318040 | 18282597 | hadam3p_anz_f28j_2013_1_009729065_0 | 46,379 | 193,472 | 4.1715 |
12 Apr 2015 04:51:13 | 1318040 | 18282597 | hadam3p_anz_f28j_2013_1_009729065_0 | 34,859 | 144,794 | 4.1537 |
11 Apr 2015 15:39:57 | 1318040 | 18282597 | hadam3p_anz_f28j_2013_1_009729065_0 | 23,339 | 97,483 | 4.1768 |
11 Apr 2015 02:27:27 | 1318040 | 18282597 | hadam3p_anz_f28j_2013_1_009729065_0 | 11,819 | 49,330 | 4.1738 |
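The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against the four trickle rows. The `TRICKLES` list and the helper name `seconds_per_timestep` are introduced here for illustration only, and the relationship itself is an inference from the numbers, not something the page states.

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle rows above.
TRICKLES = [
    (46379, 193472, 4.1715),
    (34859, 144794, 4.1537),
    (23339, 97483, 4.1768),
    (11819, 49330, 4.1738),
]


def seconds_per_timestep(cpu_time_sec, timestep):
    """Average CPU seconds per model timestep (assumed definition)."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in TRICKLES:
    computed = seconds_per_timestep(cpu_time_sec, timestep)
    # Compare against the reported value, which is shown to four decimals.
    print(f"timestep {timestep}: computed {computed:.4f}, reported {reported:.4f}")
```

Under this assumption, each computed value agrees with the reported figure to four decimal places.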
©2024 cpdn.org