Name | hadam3p_saf_1c8b_1970_1_006934083_1 |
Workunit | 7137399 |
Created | 13 Mar 2011, 18:41:38 UTC |
Sent | 13 Mar 2011, 20:04:50 UTC |
Report deadline | 24 Feb 2012, 1:24:50 UTC |
Received | 25 Mar 2011, 23:55:06 UTC |
Server state | Over |
Outcome | No reply |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1138635 |
Run time | 3 days 12 hours 1 min 11 sec |
CPU time | 2 days 23 hours 29 min 5 sec |
Validate state | Invalid |
Credit | 1,309.70 |
Device peak FLOPS | 2.71 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.08 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:00:26 (3568): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:36:09 (3208): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:04:16 (7964): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 20:27:47 (6252): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5768, selfPID=5768, iMonCtr=2 22:04:52 (6116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:07:56 (7320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:59:35 (3548): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:03:35 (7320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:16:17 (6592): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7164, selfPID=7164, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6460, selfPID=6460, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5240, selfPID=5240, iMonCtr=2 16:22:51 (5064): No heartbeat from core client for 30 sec - exiting 16:22:52 (5064): No heartbeat from core client for 30 sec - exiting 16:22:53 (5064): No heartbeat from core client for 30 sec - exiting 16:22:54 (5064): No heartbeat from core client for 30 sec - exiting 16:22:56 (5064): No heartbeat from core client for 30 sec - exiting 16:22:57 (5064): No heartbeat from core client for 30 sec - exiting 16:22:58 (5064): No heartbeat from core client for 30 sec - exiting 16:22:59 (5064): No heartbeat from core client for 30 sec - exiting 16:23:00 (5064): No heartbeat from core client for 30 sec - exiting 16:23:01 (5064): No heartbeat from core client for 30 sec - exiting 16:23:02 (5064): No heartbeat from core client for 30 sec - exiting 16:23:03 (5064): No heartbeat from core client for 30 sec - exiting 16:23:04 (5064): No heartbeat from core client for 30 sec - exiting 16:23:05 (5064): No heartbeat from core client for 30 sec - exiting 16:23:06 (5064): No heartbeat from core client for 30 sec - exiting 16:23:08 (5064): No heartbeat from core client for 30 sec - exiting 16:23:09 (5064): No heartbeat from core client for 30 sec - exiting 16:23:10 (5064): No heartbeat from core client for 30 sec - exiting 16:23:11 (5064): No heartbeat from core client for 30 sec - exiting 16:23:12 (5064): No heartbeat from core client for 30 sec - exiting 16:23:13 (5064): No heartbeat from core client for 30 sec - exiting 16:23:14 (5064): No heartbeat from core client for 30 sec - exiting 16:23:15 (5064): No heartbeat from core client for 30 sec - exiting 16:23:16 (5064): No heartbeat from core client for 30 sec - exiting 16:23:17 (5064): No heartbeat from core client for 30 sec - exiting 16:23:18 (5064): No heartbeat from core client for 30 sec - exiting 16:23:20 (5064): No heartbeat from core client for 30 sec - exiting 16:23:21 (5064): No heartbeat from core client for 30 sec - exiting 16:23:22 (5064): No heartbeat from core client for 30 sec - exiting 16:23:23 (5064): No heartbeat from core client for 30 sec - exiting 16:23:24 (5064): No heartbeat from core client for 30 sec - exiting 16:23:25 (5064): No heartbeat from core client for 30 sec - exiting 16:23:26 (5064): No heartbeat from core client for 30 sec - exiting 16:23:27 (5064): No heartbeat from core client for 30 sec - exiting 16:23:28 (5064): No heartbeat from core client for 30 sec - exiting 16:23:29 (5064): No heartbeat from core client for 30 sec - exiting 16:23:31 (5064): No heartbeat from core client for 30 sec - exiting 16:23:32 (5064): No heartbeat from core client for 30 sec - exiting 16:23:33 (5064): No heartbeat from core client for 30 sec - exiting 16:23:34 (5064): No heartbeat from core client for 30 sec - exiting 16:23:35 (5064): No heartbeat from core client for 30 sec - exiting 16:23:36 (5064): No heartbeat from core client for 30 sec - exiting 16:23:37 (5064): No heartbeat from core client for 30 sec - exiting 16:23:38 (5064): No heartbeat from core client for 30 sec - exiting 16:23:39 (5064): No heartbeat from core client for 30 sec - exiting 16:23:40 (5064): No heartbeat from core client for 30 sec - exiting 16:23:41 (5064): No heartbeat from core client for 30 sec - exiting 16:23:43 (5064): No heartbeat from core client for 30 sec - exiting 16:23:44 (5064): No heartbeat from core client for 30 sec - exiting 16:23:45 (5064): No heartbeat from core client for 30 sec - exiting 16:23:46 (5064): No heartbeat from core client for 30 sec - exiting 16:23:47 (5064): No heartbeat from core client for 30 sec - exiting 16:23:48 (5064): No heartbeat from core client for 30 sec - exiting 16:23:49 (5064): No heartbeat from core client for 30 sec - exiting 16:23:50 (5064): No heartbeat from core client for 30 sec - exiting 16:23:51 (5064): No heartbeat from core client for 30 sec - exiting 16:23:52 (5064): No heartbeat from core client for 30 sec - exiting 16:23:53 (5064): No heartbeat from core client for 30 sec - exiting 16:23:55 (5064): No heartbeat from core client for 30 sec - exiting 16:23:56 (5064): No heartbeat from core client for 30 sec - exiting 16:23:57 (5064): No heartbeat from core client for 30 sec - exiting 16:23:58 (5064): No heartbeat from core client for 30 sec - exiting 16:23:59 (5064): No heartbeat from core client for 30 sec - exiting 16:24:00 (5064): No heartbeat from core client for 30 sec - exiting 16:24:01 (5064): No heartbeat from core client for 30 sec - exiting 16:24:02 (5064): No heartbeat from core client for 30 sec - exiting 16:24:03 (5064): No heartbeat from core client for 30 sec - exiting 16:24:04 (5064): No heartbeat from core client for 30 sec - exiting 16:24:05 (5064): No heartbeat from core client for 30 sec - exiting 16:24:07 (5064): No heartbeat from core client for 30 sec - exiting 16:24:08 (5064): No heartbeat from core client for 30 sec - exiting 16:24:09 (5064): No heartbeat from core client for 30 sec - exiting 16:24:10 (5064): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5664, selfPID=5664, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6696, selfPID=6696, iMonCtr=2 10:39:49 (5688): No heartbeat from core client for 30 sec - exiting 10:39:51 (5688): No heartbeat from core client for 30 sec - exiting 10:40:24 (5688): No heartbeat from core client for 30 sec - exiting 10:40:26 (5688): No heartbeat from core client for 30 sec - exiting 10:40:27 (5688): No heartbeat from core client for 30 sec - exiting 10:40:28 (5688): No heartbeat from core client for 30 sec - exiting 10:40:29 (5688): No heartbeat from core client for 30 sec - exiting 10:40:30 (5688): No heartbeat from core client for 30 sec - exiting 10:40:31 (5688): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1744, selfPID=1744, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1744, selfPID=3484, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 10:41:09 (3484): called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_saf_1c8b_1970_1_006934083_1_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1c8b_1970_1_006934083_1_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1c8b_1970_1_006934083_1_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1c8b_1970_1_006934083_1_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1c8b_1970_1_006934083_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
23 Mar 2011 19:45:32 | 1138635 | 12665016 | hadam3p_saf_1c8b_1970_1_006934083_1 | 80,736 | 227,066 | 2.8125 |
23 Mar 2011 03:15:01 | 1138635 | 12665016 | hadam3p_saf_1c8b_1970_1_006934083_1 | 69,216 | 195,434 | 2.8235 |
22 Mar 2011 17:12:13 | 1138635 | 12665016 | hadam3p_saf_1c8b_1970_1_006934083_1 | 57,696 | 162,648 | 2.8191 |
22 Mar 2011 07:13:54 | 1138635 | 12665016 | hadam3p_saf_1c8b_1970_1_006934083_1 | 46,176 | 130,046 | 2.8163 |
21 Mar 2011 21:24:10 | 1138635 | 12665016 | hadam3p_saf_1c8b_1970_1_006934083_1 | 34,656 | 97,608 | 2.8165 |
17 Mar 2011 19:24:39 | 1138635 | 12665016 | hadam3p_saf_1c8b_1970_1_006934083_1 | 23,136 | 64,867 | 2.8037 |
16 Mar 2011 09:13:04 | 1138635 | 12665016 | hadam3p_saf_1c8b_1970_1_006934083_1 | 11,616 | 32,398 | 2.7891 |
©2024 cpdn.org