Name | hadam3p_eu_wibq_1997_1_006838302_0 |
Workunit | 7041618 |
Created | 18 Nov 2010, 15:45:15 UTC |
Sent | 20 Mar 2011, 14:50:39 UTC |
Report deadline | 1 Mar 2012, 20:10:39 UTC |
Received | 27 Mar 2011, 15:05:09 UTC |
Server state | Over |
Outcome | Didn't need |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1105897 |
Run time | 3 days 18 hours 2 min 26 sec |
CPU time | 3 days 13 hours 56 min 46 sec |
Validate state | Invalid |
Credit | 1,988.94 |
Device peak FLOPS | 2.64 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.08 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> 14:58:34 (2040): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5436, selfPID=5436, iMonCtr=2 16:47:35 (896): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:07:41 (1928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3800, selfPID=5732, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 09:57:08 (4604): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2600, selfPID=2600, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 13:31:44 (1788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:32:21 (2508): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:33:21 (1052): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:35:46 (4244): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:41:55 (2148): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:50:17 (1156): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:39:39 (2672): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:40:47 (4176): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:42:05 (3024): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:45:14 (3352): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:48:46 (1524): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:54:49 (3332): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:13:55 (3988): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3156, selfPID=3156, iMonCtr=2 19:15:35 (1104): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker19:17:13 (4424): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:18:52 (3112): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3672, selfPID=3672, iMonCtr=2 19:20:31 (4752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2376, selfPID=2376, iMonCtr=2 19:22:10 (4796): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:23:49 (5016): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2124, selfPID=2124, iMonCtr=2 23:44:04 (3868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1204, selfPID=1204, iMonCtr=2 02:55:09 (4900): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:28:11 (3604): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:26:33 (2952): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:57:08 (4952): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 21:14:01 (3712): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4068, selfPID=4068, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 00:39:03 (3304): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3184, CPDN Monitor - Quit request from BOINC... 23:41:03 (2776): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4880, selfPID=4880, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4176, selfPID=4176, iMonCtr=2 09:39:05 (2580): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:58:15 (3528): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:14:40 (5008): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 14:01:19 (5020): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:02:14 (2228): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:03:30 (4080): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:04:17 (5060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:07:09 (1772): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:08:52 (4696): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:46:28 (4444): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:48:16 (2124): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=2052, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4340, selfPID=1940, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 14:48:58 (1940): called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_eu_wibq_1997_1_006838302_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_wibq_1997_1_006838302_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
27 Mar 2011 04:29:18 | 1105897 | 12107630 | hadam3p_eu_wibq_1997_1_006838302_0 | 115,296 | 294,594 | 2.5551 |
26 Mar 2011 08:18:09 | 1105897 | 12107630 | hadam3p_eu_wibq_1997_1_006838302_0 | 103,776 | 266,023 | 2.5634 |
25 Mar 2011 22:16:12 | 1105897 | 12107630 | hadam3p_eu_wibq_1997_1_006838302_0 | 92,256 | 236,477 | 2.5633 |
25 Mar 2011 11:47:57 | 1105897 | 12107630 | hadam3p_eu_wibq_1997_1_006838302_0 | 80,736 | 206,917 | 2.5629 |
25 Mar 2011 04:52:00 | 1105897 | 12107630 | hadam3p_eu_wibq_1997_1_006838302_0 | 69,216 | 177,898 | 2.5702 |
24 Mar 2011 17:59:53 | 1105897 | 12107630 | hadam3p_eu_wibq_1997_1_006838302_0 | 57,696 | 148,188 | 2.5684 |
24 Mar 2011 07:55:25 | 1105897 | 12107630 | hadam3p_eu_wibq_1997_1_006838302_0 | 46,176 | 119,429 | 2.5864 |
24 Mar 2011 07:55:25 | 1105897 | 12107630 | hadam3p_eu_wibq_1997_1_006838302_0 | 34,656 | 90,399 | 2.6085 |
24 Mar 2011 07:55:25 | 1105897 | 12107630 | hadam3p_eu_wibq_1997_1_006838302_0 | 23,136 | 60,450 | 2.6128 |
22 Mar 2011 21:23:04 | 1105897 | 12107630 | hadam3p_eu_wibq_1997_1_006838302_0 | 11,616 | 30,390 | 2.6162 |
©2024 cpdn.org