Name | hadam3p_anz_a5hd_2012_1_008613441_1 |
Workunit | 8759953 |
Created | 13 Apr 2015, 1:50:18 UTC |
Sent | 13 Apr 2015, 1:56:08 UTC |
Report deadline | 25 Mar 2016, 7:16:08 UTC |
Received | 6 May 2015, 12:19:22 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1291754 |
Run time | 3 days 23 hours 54 min 52 sec |
CPU time | 3 days 20 hours 35 min 20 sec |
Validate state | Invalid |
Credit | 4,484.28 |
Device peak FLOPS | 3.28 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.4.42</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3836, iMonCtr=2 CGlobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3676, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5704, selfPID=4296, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 13:36:45 (5072): No heartbeat from core client for 30 sec - exiting 13:36:46 (5072): No heartbeat from core client for 30 sec - exiting 13:36:47 (5072): No heartbeat from core client for 30 sec - exiting 13:36:48 (5072): No heartbeat from core client for 30 sec - exiting 13:36:49 (5072): No heartbeat from core client for 30 sec - exiting 13:36:50 (5072): No heartbeat from core client for 30 sec - exiting 13:36:51 (5072): No heartbeat from core client for 30 sec - exiting 13:36:52 (5072): No heartbeat from core client for 30 sec - exiting 13:36:53 (5072): No heartbeat from core client for 30 sec - exiting 13:36:54 (5072): No heartbeat from core client for 30 sec - exiting 13:36:55 (5072): No heartbeat from core client for 30 sec - exiting 13:36:56 (5072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 14:46:48 (6048): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:47:29 (5164): No heartbeat from core client for 30 sec - exiting 14:47:30 (5164): No heartbeat from core client for 30 sec - exiting 14:47:31 (5164): No heartbeat from core client for 30 sec - exiting 14:47:32 (5164): No heartbeat from core client for 30 sec - exiting 14:47:33 (5164): No heartbeat from core client for 30 sec - exiting 14:47:34 (5164): No heartbeat from core client for 30 sec - exiting 14:47:35 (5164): No heartbeat from core client for 30 sec - exiting 14:47:36 (5164): No heartbeat from core client for 30 sec - exiting 14:47:37 (5164): No heartbeat from core client for 30 sec - exiting 14:47:38 (5164): No heartbeat from core client for 30 sec - exiting 14:47:39 (5164): No heartbeat from core client for 30 sec - exiting 14:47:40 (5164): No heartbeat from core client for 30 sec - exiting 14:47:41 (5164): No heartbeat from core client for 30 sec - exiting 14:47:42 (5164): No heartbeat from core client for 30 sec - exiting 14:47:43 (5164): No heartbeat from core client for 30 sec - exiting 14:47:44 (5164): No heartbeat from core client for 30 sec - exiting 14:47:45 (5164): No heartbeat from core client for 30 sec - exiting 14:47:46 (5164): No heartbeat from core client for 30 sec - exiting 14:47:47 (5164): No heartbeat from core client for 30 sec - exiting 14:47:48 (5164): No heartbeat from core client for 30 sec - exiting 14:47:49 (5164): No heartbeat from core client for 30 sec - exiting 14:47:50 (5164): No heartbeat from core client for 30 sec - exiting 14:47:51 (5164): No heartbeat from core client for 30 sec - exiting 14:47:52 (5164): No heartbeat from core client for 30 sec - exiting 14:47:53 (5164): No heartbeat from core client for 30 sec - exiting 14:47:54 (5164): No heartbeat from core client for 30 sec - exiting 14:47:55 (5164): No heartbeat from core client for 30 sec - exiting 14:47:56 (5164): No heartbeat from core client for 30 sec - exiting 14:47:57 (5164): No heartbeat from core client for 30 sec - exiting 14:47:58 (5164): No heartbeat from core client for 30 sec - exiting 14:47:59 (5164): No heartbeat from core client for 30 sec - exiting 14:48:00 (5164): No heartbeat from core client for 30 sec - exiting 14:48:01 (5164): No heartbeat from core client for 30 sec - exiting 14:48:02 (5164): No heartbeat from core client for 30 sec - exiting 14:48:03 (5164): No heartbeat from core client for 30 sec - exiting 14:48:04 (5164): No heartbeat from core client for 30 sec - exiting 14:48:05 (5164): No heartbeat from core client for 30 sec - exiting 14:48:06 (5164): No heartbeat from core client for 30 sec - exiting 14:48:07 (5164): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:03:55 (3720): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:12:36 (4716): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 11:24:40 (10460): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4984, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2572, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5776, iMonCtr=2 Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:30:13 (4108): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:30:14 (4108): No heartbeat from core client for 30 sec - exiting 12:36:28 (7172): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7232, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6660, selfPID=6664, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=5284, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5660, selfPID=332, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_anz_a5hd_2012_1_008613441_1_10.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_a5hd_2012_1_008613441_1_11.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_a5hd_2012_1_008613441_1_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
08 May 2015 19:14:54 | 1291754 | 18305911 | hadam3p_anz_a5hd_2012_1_008613441_1 | 103,979 | 322,361 | 3.1003 |
28 Apr 2015 03:28:17 | 1291754 | 18305911 | hadam3p_anz_a5hd_2012_1_008613441_1 | 92,459 | 286,949 | 3.1035 |
27 Apr 2015 15:57:20 | 1291754 | 18305911 | hadam3p_anz_a5hd_2012_1_008613441_1 | 80,939 | 251,254 | 3.1042 |
26 Apr 2015 22:52:14 | 1291754 | 18305911 | hadam3p_anz_a5hd_2012_1_008613441_1 | 69,419 | 215,964 | 3.1110 |
26 Apr 2015 11:45:43 | 1291754 | 18305911 | hadam3p_anz_a5hd_2012_1_008613441_1 | 57,899 | 181,117 | 3.1282 |
23 Apr 2015 14:04:53 | 1291754 | 18305911 | hadam3p_anz_a5hd_2012_1_008613441_1 | 46,379 | 145,852 | 3.1448 |
22 Apr 2015 08:53:16 | 1291754 | 18305911 | hadam3p_anz_a5hd_2012_1_008613441_1 | 34,859 | 110,338 | 3.1653 |
20 Apr 2015 07:04:59 | 1291754 | 18305911 | hadam3p_anz_a5hd_2012_1_008613441_1 | 23,339 | 72,893 | 3.1232 |
15 Apr 2015 09:54:28 | 1291754 | 18305911 | hadam3p_anz_a5hd_2012_1_008613441_1 | 11,819 | 36,773 | 3.1113 |
©2024 cpdn.org