Name | hadam3p_saf_2ali_1960_1_007407446_0 |
Workunit | 7604876 |
Created | 15 Aug 2011, 11:16:43 UTC |
Sent | 15 Aug 2011, 19:22:33 UTC |
Report deadline | 28 Jul 2012, 0:42:33 UTC |
Received | 25 Oct 2011, 15:14:39 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1162205 |
Run time | 1 day 1 hour 58 min 16 sec |
CPU time | 21 hours 47 min 57 sec |
Validate state | Invalid |
Credit | 188.44 |
Device peak FLOPS | 1.44 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.09 windows_intelx86 |
Stderr | <core_client_version>6.12.33</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... 17:19:38 (5780): No heartbeat from core client for 30 sec - exiting 17:19:39 (5780): No heartbeat from core client for 30 sec - exiting 17:19:40 (5780): No heartbeat from core client for 30 sec - exiting 17:19:41 (5780): No heartbeat from core client for 30 sec - exiting 17:19:42 (5780): No heartbeat from core client for 30 sec - exiting 17:19:44 (5780): No heartbeat from core client for 30 sec - exiting 17:19:45 (5780): No heartbeat from core client for 30 sec - exiting 17:19:46 (5780): No heartbeat from core client for 30 sec - exiting 17:19:47 (5780): No heartbeat from core client for 30 sec - exiting 17:19:48 (5780): No heartbeat from core client for 30 sec - exiting 17:19:49 (5780): No heartbeat from core client for 30 sec - exiting 17:19:50 (5780): No heartbeat from core client for 30 sec - exiting 17:19:51 (5780): No heartbeat from core client for 30 sec - exiting 17:19:52 (5780): No heartbeat from core client for 30 sec - exiting 17:19:53 (5780): No heartbeat from core client for 30 sec - exiting 17:19:54 (5780): No heartbeat from core client for 30 sec - exiting 17:19:56 (5780): No heartbeat from core client for 30 sec - exiting 17:19:57 (5780): No heartbeat from core client for 30 sec - exiting 17:19:58 (5780): No heartbeat from core client for 30 sec - exiting 17:19:59 (5780): No heartbeat from core client for 30 sec - exiting 17:20:00 (5780): No heartbeat from core client for 30 sec - exiting 17:20:01 (5780): No heartbeat from core client for 30 sec - exiting 17:20:02 (5780): No heartbeat from core client for 30 sec - exiting 17:20:03 (5780): No heartbeat from core client for 30 sec - exiting 17:20:04 (5780): No heartbeat from core client for 30 sec - exiting 17:20:05 (5780): No heartbeat from core client for 30 sec - exiting 17:20:06 (5780): No heartbeat from core client for 30 sec - exiting 17:20:08 (5780): No heartbeat from 
core client for 30 sec - exiting 17:20:09 (5780): No heartbeat from core client for 30 sec - exiting 17:20:10 (5780): No heartbeat from core client for 30 sec - exiting 17:20:11 (5780): No heartbeat from core client for 30 sec - exiting 17:20:12 (5780): No heartbeat from core client for 30 sec - exiting 17:20:13 (5780): No heartbeat from core client for 30 sec - exiting 17:20:14 (5780): No heartbeat from core client for 30 sec - exiting 17:20:15 (5780): No heartbeat from core client for 30 sec - exiting 17:20:16 (5780): No heartbeat from core client for 30 sec - exiting 17:20:17 (5780): No heartbeat from core client for 30 sec - exiting 17:20:18 (5780): No heartbeat from core client for 30 sec - exiting 17:20:20 (5780): No heartbeat from core client for 30 sec - exiting 17:20:21 (5780): No heartbeat from core client for 30 sec - exiting 17:20:22 (5780): No heartbeat from core client for 30 sec - exiting 17:20:23 (5780): No heartbeat from core client for 30 sec - exiting 17:20:24 (5780): No heartbeat from core client for 30 sec - exiting 17:20:25 (5780): No heartbeat from core client for 30 sec - exiting 17:20:26 (5780): No heartbeat from core client for 30 sec - exiting 17:20:27 (5780): No heartbeat from core client for 30 sec - exiting 17:20:28 (5780): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4504, selfPID=4504, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
17:37:07 (5356): No heartbeat from core client for 30 sec - exiting 17:37:08 (5356): No heartbeat from core client for 30 sec - exiting 17:37:09 (5356): No heartbeat from core client for 30 sec - exiting 17:37:10 (5356): No heartbeat from core client for 30 sec - exiting 17:37:11 (5356): No heartbeat from core client for 30 sec - exiting 17:37:12 (5356): No heartbeat from core client for 30 sec - exiting 17:37:13 (5356): No heartbeat from core client for 30 sec - exiting 17:37:15 (5356): No heartbeat from core client for 30 sec - exiting 17:37:16 (5356): No heartbeat from core client for 30 sec - exiting 17:37:17 (5356): No heartbeat from core client for 30 sec - exiting 17:37:18 (5356): No heartbeat from core client for 30 sec - exiting 17:37:19 (5356): No heartbeat from core client for 30 sec - exiting 17:37:20 (5356): No heartbeat from core client for 30 sec - exiting 17:37:21 (5356): No heartbeat from core client for 30 sec - exiting 17:37:22 (5356): No heartbeat from core client for 30 sec - exiting 17:37:23 (5356): No heartbeat from core client for 30 sec - exiting 17:37:24 (5356): No heartbeat from core client for 30 sec - exiting 17:37:25 (5356): No heartbeat from core client for 30 sec - exiting 17:37:27 (5356): No heartbeat from core client for 30 sec - exiting 17:37:28 (5356): No heartbeat from core client for 30 sec - exiting 17:37:29 (5356): No heartbeat from core client for 30 sec - exiting 17:37:30 (5356): No heartbeat from core client for 30 sec - exiting 17:37:31 (5356): No heartbeat from core client for 30 sec - exiting 17:37:32 (5356): No heartbeat from core client for 30 sec - exiting 17:37:33 (5356): No heartbeat from core client for 30 sec - exiting 17:37:34 (5356): No heartbeat from core client for 30 sec - exiting 17:37:35 (5356): No heartbeat from core client for 30 sec - exiting 17:37:36 (5356): No heartbeat from core client for 30 sec - exiting 17:37:37 (5356): No heartbeat from core client for 30 sec - exiting 17:37:39 (5356): No 
heartbeat from core client for 30 sec - exiting 17:37:40 (5356): No heartbeat from core client for 30 sec - exiting 17:37:41 (5356): No heartbeat from core client for 30 sec - exiting 17:37:42 (5356): No heartbeat from core client for 30 sec - exiting 17:37:43 (5356): No heartbeat from core client for 30 sec - exiting 17:37:44 (5356): No heartbeat from core client for 30 sec - exiting 17:37:45 (5356): No heartbeat from core client for 30 sec - exiting 17:37:46 (5356): No heartbeat from core client for 30 sec - exiting 17:37:47 (5356): No heartbeat from core client for 30 sec - exiting 17:37:48 (5356): No heartbeat from core client for 30 sec - exiting 17:37:49 (5356): No heartbeat from core client for 30 sec - exiting 17:37:51 (5356): No heartbeat from core client for 30 sec - exiting 17:37:52 (5356): No heartbeat from core client for 30 sec - exiting 17:37:53 (5356): No heartbeat from core client for 30 sec - exiting 17:37:54 (5356): No heartbeat from core client for 30 sec - exiting 17:37:55 (5356): No heartbeat from core client for 30 sec - exiting 17:37:56 (5356): No heartbeat from core client for 30 sec - exiting 17:37:57 (5356): No heartbeat from core client for 30 sec - exiting 17:37:58 (5356): No heartbeat from core client for 30 sec - exiting 17:37:59 (5356): No heartbeat from core client for 30 sec - exiting 17:38:00 (5356): No heartbeat from core client for 30 sec - exiting 17:38:01 (5356): No heartbeat from core client for 30 sec - exiting 17:38:03 (5356): No heartbeat from core client for 30 sec - exiting 17:38:04 (5356): No heartbeat from core client for 30 sec - exiting 17:38:05 (5356): No heartbeat from core client for 30 sec - exiting 17:38:06 (5356): No heartbeat from core client for 30 sec - exiting 17:38:07 (5356): No heartbeat from core client for 30 sec - exiting 17:38:08 (5356): No heartbeat from core client for 30 sec - exiting 17:38:09 (5356): No heartbeat from core client for 30 sec - exiting 17:38:10 (5356): No heartbeat from core client 
for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:38:11 (5356): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6276, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3492, selfPID=3492, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=848, selfPID=848, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7020, selfPID=4948, iMonCtr=1 Model crash detected, will try to restart... 09:22:40 (6192): No heartbeat from core client for 30 sec - exiting 09:22:41 (6192): No heartbeat from core client for 30 sec - exiting 09:22:42 (6192): No heartbeat from core client for 30 sec - exiting 09:22:43 (6192): No heartbeat from core client for 30 sec - exiting 09:22:44 (6192): No heartbeat from core client for 30 sec - exiting 09:22:45 (6192): No heartbeat from core client for 30 sec - exiting 09:22:46 (6192): No heartbeat from core client for 30 sec - exiting 09:22:47 (6192): No heartbeat from core client for 30 sec - exiting 09:22:48 (6192): No heartbeat from core client for 30 sec - exiting 09:22:49 (6192): No heartbeat from core client for 30 sec - exiting 09:22:50 (6192): No heartbeat from core client for 30 sec - exiting 09:22:51 (6192): No heartbeat from core client for 30 sec - exiting 09:22:52 (6192): No heartbeat from core client for 30 sec - exiting 09:22:53 (6192): No heartbeat from core client for 30 sec - exiting 09:22:54 (6192): No heartbeat from core client for 30 sec - exiting 09:22:55 (6192): No heartbeat from core client for 30 sec - exiting 09:22:56 (6192): No heartbeat from core client for 30 sec - exiting 09:22:57 (6192): No heartbeat from core client for 30 sec - exiting 09:22:58 (6192): No heartbeat from core client for 30 sec - exiting 09:22:59 (6192): No heartbeat from core client for 30 sec - exiting 09:23:00 (6192): No heartbeat from core client for 30 sec - exiting 09:23:01 (6192): No heartbeat from core client for 30 sec - 
exiting 09:23:02 (6192): No heartbeat from core client for 30 sec - exiting 09:23:03 (6192): No heartbeat from core client for 30 sec - exiting 09:23:04 (6192): No heartbeat from core client for 30 sec - exiting 09:23:05 (6192): No heartbeat from core client for 30 sec - exiting 09:23:06 (6192): No heartbeat from core client for 30 sec - exiting 09:23:07 (6192): No heartbeat from core client for 30 sec - exiting 09:23:08 (6192): No heartbeat from core client for 30 sec - exiting 09:23:09 (6192): No heartbeat from core client for 30 sec - exiting 09:23:10 (6192): No heartbeat from core client for 30 sec - exiting 09:23:11 (6192): No heartbeat from core client for 30 sec - exiting 09:23:12 (6192): No heartbeat from core client for 30 sec - exiting 09:23:13 (6192): No heartbeat from core client for 30 sec - exiting 09:23:14 (6192): No heartbeat from core client for 30 sec - exiting 09:23:15 (6192): No heartbeat from core client for 30 sec - exiting 09:23:16 (6192): No heartbeat from core client for 30 sec - exiting 09:23:17 (6192): No heartbeat from core client for 30 sec - exiting 09:23:18 (6192): No heartbeat from core client for 30 sec - exiting 09:23:19 (6192): No heartbeat from core client for 30 sec - exiting 09:23:20 (6192): No heartbeat from core client for 30 sec - exiting 09:23:21 (6192): No heartbeat from core client for 30 sec - exiting 09:23:22 (6192): No heartbeat from core client for 30 sec - exiting 09:23:23 (6192): No heartbeat from core client for 30 sec - exiting 09:23:24 (6192): No heartbeat from core client for 30 sec - exiting 09:23:25 (6192): No heartbeat from core client for 30 sec - exiting 09:23:26 (6192): No heartbeat from core client for 30 sec - exiting 09:23:27 (6192): No heartbeat from core client for 30 sec - exiting 09:23:28 (6192): No heartbeat from core client for 30 sec - exiting 09:23:29 (6192): No heartbeat from core client for 30 sec - exiting 09:23:30 (6192): No heartbeat from core client for 30 sec - exiting 09:23:31 (6192): No 
heartbeat from core client for 30 sec - exiting 09:23:32 (6192): No heartbeat from core client for 30 sec - exiting 09:23:33 (6192): No heartbeat from core client for 30 sec - exiting 09:23:34 (6192): No heartbeat from core client for 30 sec - exiting 09:23:35 (6192): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6620, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6292, selfPID=5712, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_saf_2ali_1960_1_007407446_0_2.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_2ali_1960_1_007407446_0_3.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_2ali_1960_1_007407446_0_4.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_2ali_1960_1_007407446_0_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_2ali_1960_1_007407446_0_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_2ali_1960_1_007407446_0_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_2ali_1960_1_007407446_0_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_2ali_1960_1_007407446_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_2ali_1960_1_007407446_0_10.zip</file_name> <error_code>-161</error_code> 
</file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_2ali_1960_1_007407446_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_2ali_1960_1_007407446_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |

Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
17 Sep 2011 20:57:24 | 1162205 | 13255604 | hadam3p_saf_2ali_1960_1_007407446_0 | 11,616 | 46,993 | 4.0455 |
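
The "Average (sec/TS)" figure in the trickle row looks like CPU time divided by the number of timesteps completed. That reading is an assumption, since the page does not define the column. A minimal check using the two values from the row:

```python
# Values copied from the trickle row above.
timestep = 11_616        # "Timestep" column
cpu_time_sec = 46_993    # "CPU Time (sec)" column

# Assumption: "Average (sec/TS)" = CPU time / timesteps completed.
average_sec_per_ts = cpu_time_sec / timestep
print(f"{average_sec_per_ts:.4f}")
```

Under that assumption the result rounds to the 4.0455 shown in the table.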