Name | hadam3p_saf_v75v_1978_1_006682913_0 |
Workunit | 6886166 |
Created | 26 Aug 2010, 11:54:48 UTC |
Sent | 27 Aug 2010, 16:42:23 UTC |
Report deadline | 9 Aug 2011, 22:02:23 UTC |
Received | 23 Oct 2010, 16:02:37 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1097001 |
Run time | 5 days 16 hours 10 min 7 sec |
CPU time | 5 days 6 hours 50 min 12 sec |
Validate state | Invalid |
Credit | 1,496.58 |
Device peak FLOPS | 1.42 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.05 windows_intelx86 |
Stderr | <core_client_version>6.12.4</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 17:38:09 (9632): No heartbeat from core client for 30 sec - exiting 17:38:10 (9632): No heartbeat from core client for 30 sec - exiting 17:38:11 (9632): No heartbeat from core client for 30 sec - exiting 17:38:12 (9632): No heartbeat from core client for 30 sec - exiting 17:38:13 (9632): No heartbeat from core client for 30 sec - exiting 17:38:14 (9632): No heartbeat from core client for 30 sec - exiting 17:38:15 (9632): No heartbeat from core client for 30 sec - exiting 17:38:16 (9632): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6364, selfPID=6364, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=404, selfPID=404, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 02:37:59 (7716): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:38:00 (7716): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6344, selfPID=6344, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 23:52:04 (1116): No heartbeat from core client for 30 sec - exiting 23:52:06 (1116): No heartbeat from core client for 30 sec - exiting 23:52:07 (1116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 01:23:10 (10028): No heartbeat from core client for 30 sec - exiting 01:23:11 (10028): No heartbeat from core client for 30 sec - exiting 01:23:12 (10028): No heartbeat from core client for 30 sec - exiting 01:23:13 (10028): No heartbeat from core client for 30 sec - exiting 01:23:14 (10028): No heartbeat from core client for 30 sec - exiting 01:23:15 (10028): No heartbeat from core client for 30 sec - exiting 01:23:16 (10028): No heartbeat from core client for 30 sec - exiting 01:23:17 (10028): No heartbeat from core client for 30 sec - exiting 01:23:18 (10028): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 04:19:00 (10672): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 
14:02:28 (14892): No heartbeat from core client for 30 sec - exiting 14:02:30 (14892): No heartbeat from core client for 30 sec - exiting 14:02:31 (14892): No heartbeat from core client for 30 sec - exiting 14:02:32 (14892): No heartbeat from core client for 30 sec - exiting 14:02:33 (14892): No heartbeat from core client for 30 sec - exiting 14:02:34 (14892): No heartbeat from core client for 30 sec - exiting 14:02:35 (14892): No heartbeat from core client for 30 sec - exiting 14:02:36 (14892): No heartbeat from core client for 30 sec - exiting 14:02:37 (14892): No heartbeat from core client for 30 sec - exiting 14:02:38 (14892): No heartbeat from core client for 30 sec - exiting 14:02:39 (14892): No heartbeat from core client for 30 sec - exiting 14:02:40 (14892): No heartbeat from core client for 30 sec - exiting 14:02:41 (14892): No heartbeat from core client for 30 sec - exiting 14:02:42 (14892): No heartbeat from core client for 30 sec - exiting 14:02:43 (14892): No heartbeat from core client for 30 sec - exiting 14:02:44 (14892): No heartbeat from core client for 30 sec - exiting 14:02:45 (14892): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 05:07:50 (5536): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=4036, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5316, selfPID=3740, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 00:00:56 (3740): called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_saf_v75v_1978_1_006682913_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_v75v_1978_1_006682913_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_v75v_1978_1_006682913_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_v75v_1978_1_006682913_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |

Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
23 Oct 2010 15:53:49 | 1097001 | 11685683 | hadam3p_saf_v75v_1978_1_006682913_0 | 92,256 | 438,074 | 4.7485 |
18 Oct 2010 21:24:51 | 1097001 | 11685683 | hadam3p_saf_v75v_1978_1_006682913_0 | 80,736 | 384,647 | 4.7643 |
13 Oct 2010 19:40:42 | 1097001 | 11685683 | hadam3p_saf_v75v_1978_1_006682913_0 | 69,216 | 330,845 | 4.7799 |
11 Oct 2010 16:09:55 | 1097001 | 11685683 | hadam3p_saf_v75v_1978_1_006682913_0 | 57,696 | 278,668 | 4.8299 |
05 Oct 2010 20:34:56 | 1097001 | 11685683 | hadam3p_saf_v75v_1978_1_006682913_0 | 46,181 | 223,871 | 4.8477 |
05 Oct 2010 20:24:31 | 1097001 | 11685683 | hadam3p_saf_v75v_1978_1_006682913_0 | 46,176 | 223,165 | 4.8329 |
27 Sep 2010 20:05:52 | 1097001 | 11685683 | hadam3p_saf_v75v_1978_1_006682913_0 | 34,656 | 168,209 | 4.8537 |
15 Sep 2010 19:28:14 | 1097001 | 11685683 | hadam3p_saf_v75v_1978_1_006682913_0 | 23,136 | 113,230 | 4.8941 |
08 Sep 2010 22:17:04 | 1097001 | 11685683 | hadam3p_saf_v75v_1978_1_006682913_0 | 11,616 | 57,558 | 4.9551 |
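
The Average (sec/TS) column appears to be CPU Time (sec) divided by Timestep, rounded to four decimal places. The sketch below checks that assumption against the rows above. The function name `average_sec_per_timestep` and the list `trickles` are hypothetical and are not part of BOINC or the CPDN server code.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
trickles = [
    (92256, 438074, 4.7485),
    (80736, 384647, 4.7643),
    (69216, 330845, 4.7799),
    (57696, 278668, 4.8299),
    (46181, 223871, 4.8477),
    (46176, 223165, 4.8329),
    (34656, 168209, 4.8537),
    (23136, 113230, 4.8941),
    (11616, 57558, 4.9551),
]


def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Assumed formula: cumulative CPU seconds per completed model timestep."""
    return round(cpu_time_sec / timestep, 4)


def mismatches(rows, tolerance=1e-4):
    """Return the rows whose reported average differs from the assumed formula."""
    return [
        (timestep, cpu, reported)
        for timestep, cpu, reported in rows
        if abs(average_sec_per_timestep(cpu, timestep) - reported) >= tolerance
    ]


if __name__ == "__main__":
    bad = mismatches(trickles)
    print("all rows consistent" if not bad else f"inconsistent rows: {bad}")
```

Under this reading, the average falls slowly from the first trickle to the last, so the host was processing timesteps slightly faster as the run went on.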
©2025 cpdn.org