Name | hadam3p_anz_d5j0_2013_1_009723799_0 |
Workunit | 9797096 |
Created | 8 Apr 2015, 17:49:45 UTC |
Sent | 13 Apr 2015, 5:30:04 UTC |
Report deadline | 25 Mar 2016, 10:50:04 UTC |
Received | 20 Apr 2015, 10:54:50 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1318040 |
Run time | 3 days 4 hours 39 min 12 sec |
CPU time | 3 days 3 hours 27 min 14 sec |
Validate state | Invalid |
Credit | 2,497.00 |
Device peak FLOPS | 3.69 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.4.36</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 16:47:54 (10300): No heartbeat from core client for 30 sec - exiting 16:47:55 (10300): No heartbeat from core client for 30 sec - exiting 16:47:57 (10300): No heartbeat from core client for 30 sec - exiting 16:47:58 (10300): No heartbeat from core client for 30 sec - exiting 16:47:59 (10300): No heartbeat from core client for 30 sec - exiting 16:48:00 (10300): No heartbeat from core client for 30 sec - exiting 16:48:01 (10300): No heartbeat from core client for 30 sec - exiting 16:48:02 (10300): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 05:10:11 (10712): No heartbeat from core client for 30 sec - exiting 05:10:12 (10712): No heartbeat from core client for 30 sec - exiting 05:10:13 (10712): No heartbeat from core client for 30 sec - exiting 05:10:14 (10712): No heartbeat from core client for 30 sec - exiting 05:10:15 (10712): No heartbeat from core client for 30 sec - exiting 05:10:16 (10712): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
11:04:06 (10296): No heartbeat from core client for 30 sec - exiting 11:04:07 (10296): No heartbeat from core client for 30 sec - exiting 11:04:08 (10296): No heartbeat from core client for 30 sec - exiting 11:04:09 (10296): No heartbeat from core client for 30 sec - exiting 11:04:10 (10296): No heartbeat from core client for 30 sec - exiting 11:04:11 (10296): No heartbeat from core client for 30 sec - exiting 11:04:12 (10296): No heartbeat from core client for 30 sec - exiting 11:04:13 (10296): No heartbeat from core client for 30 sec - exiting 11:04:14 (10296): No heartbeat from core client for 30 sec - exiting 11:04:15 (10296): No heartbeat from core client for 30 sec - exiting 11:04:16 (10296): No heartbeat from core client for 30 sec - exiting 11:04:17 (10296): No heartbeat from core client for 30 sec - exiting 11:04:18 (10296): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4368, selfPID=4368, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 14:25:43 (11856): No heartbeat from core client for 30 sec - exiting 14:25:44 (11856): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:25:45 (11856): No heartbeat from core client for 30 sec - exiting 14:25:46 (11856): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 16:47:09 (12168): No heartbeat from core client for 30 sec - exiting 16:47:10 (12168): No heartbeat from core client for 30 sec - exiting 16:47:12 (12168): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 04:43:21 (1848): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 
09:24:21 (10388): No heartbeat from core client for 30 sec - exiting 09:24:22 (10388): No heartbeat from core client for 30 sec - exiting 09:24:23 (10388): No heartbeat from core client for 30 sec - exiting 09:24:24 (10388): No heartbeat from core client for 30 sec - exiting 09:24:25 (10388): No heartbeat from core client for 30 sec - exiting 09:24:26 (10388): No heartbeat from core client for 30 sec - exiting 09:24:27 (10388): No heartbeat from core client for 30 sec - exiting 09:24:28 (10388): No heartbeat from core client for 30 sec - exiting 09:24:29 (10388): No heartbeat from core client for 30 sec - exiting 09:24:30 (10388): No heartbeat from core client for 30 sec - exiting 09:24:31 (10388): No heartbeat from core client for 30 sec - exiting 09:24:32 (10388): No heartbeat from core client for 30 sec - exiting 09:24:33 (10388): No heartbeat from core client for 30 sec - exiting 09:24:34 (10388): No heartbeat from core client for 30 sec - exiting 09:24:35 (10388): No heartbeat from core client for 30 sec - exiting 09:24:36 (10388): No heartbeat from core client for 30 sec - exiting 09:24:37 (10388): No heartbeat from core client for 30 sec - exiting 09:24:38 (10388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 
14:15:47 (5456): No heartbeat from core client for 30 sec - exiting 14:15:48 (5456): No heartbeat from core client for 30 sec - exiting 14:15:49 (5456): No heartbeat from core client for 30 sec - exiting 14:15:50 (5456): No heartbeat from core client for 30 sec - exiting 14:15:51 (5456): No heartbeat from core client for 30 sec - exiting 14:15:52 (5456): No heartbeat from core client for 30 sec - exiting 14:15:53 (5456): No heartbeat from core client for 30 sec - exiting 14:15:54 (5456): No heartbeat from core client for 30 sec - exiting 14:15:55 (5456): No heartbeat from core client for 30 sec - exiting 14:15:56 (5456): No heartbeat from core client for 30 sec - exiting 14:15:57 (5456): No heartbeat from core client for 30 sec - exiting 14:15:58 (5456): No heartbeat from core client for 30 sec - exiting 14:15:59 (5456): No heartbeat from core client for 30 sec - exiting 14:16:00 (5456): No heartbeat from core client for 30 sec - exiting 14:16:01 (5456): No heartbeat from core client for 30 sec - exiting 14:16:02 (5456): No heartbeat from core client for 30 sec - exiting 14:16:03 (5456): No heartbeat from core client for 30 sec - exiting 14:16:04 (5456): No heartbeat from core client for 30 sec - exiting 14:16:05 (5456): No heartbeat from core client for 30 sec - exiting 14:16:06 (5456): No heartbeat from core client for 30 sec - exiting 14:16:07 (5456): No heartbeat from core client for 30 sec - exiting 14:16:08 (5456): No heartbeat from core client for 30 sec - exiting 14:16:09 (5456): No heartbeat from core client for 30 sec - exiting 14:16:10 (5456): No heartbeat from core client for 30 sec - exiting 14:16:11 (5456): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 
16:50:47 (9328): No heartbeat from core client for 30 sec - exiting 16:50:48 (9328): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 17:38:38 (3040): No heartbeat from core client for 30 sec - exiting 17:38:39 (3040): No heartbeat from core client for 30 sec - exiting 17:38:40 (3040): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:15:00 (8444): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:17:13 (10452): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7812, selfPID=7812, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5708, selfPID=5708, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=8928, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4084, selfPID=4084, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4084, selfPID=8368, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_anz_d5j0_2013_1_009723799_0_6.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_d5j0_2013_1_009723799_0_7.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_d5j0_2013_1_009723799_0_8.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_d5j0_2013_1_009723799_0_9.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_d5j0_2013_1_009723799_0_10.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_d5j0_2013_1_009723799_0_11.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_d5j0_2013_1_009723799_0_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
19 Apr 2015 19:35:37 | 1318040 | 18278609 | hadam3p_anz_d5j0_2013_1_009723799_0 | 57,899 | 243,030 | 4.1975 |
19 Apr 2015 01:36:19 | 1318040 | 18278609 | hadam3p_anz_d5j0_2013_1_009723799_0 | 46,379 | 194,183 | 4.1869 |
18 Apr 2015 01:58:28 | 1318040 | 18278609 | hadam3p_anz_d5j0_2013_1_009723799_0 | 34,859 | 147,108 | 4.2201 |
15 Apr 2015 06:51:02 | 1318040 | 18278609 | hadam3p_anz_d5j0_2013_1_009723799_0 | 23,339 | 96,486 | 4.1341 |
14 Apr 2015 06:55:12 | 1318040 | 18278609 | hadam3p_anz_d5j0_2013_1_009723799_0 | 11,819 | 48,772 | 4.1266 |
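
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. That relationship is inferred from the figures in the table, not stated on the page. A minimal sketch that rechecks it, with the helper name `average_sec_per_timestep` chosen purely for illustration:

```python
# Rows copied from the trickle table:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS).
TRICKLES = [
    (57_899, 243_030, 4.1975),
    (46_379, 194_183, 4.1869),
    (34_859, 147_108, 4.2201),
    (23_339, 96_486, 4.1341),
    (11_819, 48_772, 4.1266),
]


def average_sec_per_timestep(cpu_seconds, timestep):
    """Assumed formula: CPU seconds per model timestep, rounded to 4 places."""
    return round(cpu_seconds / timestep, 4)


for timestep, cpu_seconds, reported in TRICKLES:
    computed = average_sec_per_timestep(cpu_seconds, timestep)
    # Print the recomputed value next to the value shown on the page.
    print(f"timestep {timestep}: computed {computed:.4f}, reported {reported:.4f}")
```

Under this assumption all five rows reproduce the reported averages, which suggests a steady rate of roughly 4.1 to 4.2 CPU seconds per timestep up to the last trickle.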