Name | hadam3p_saf_1tmi_2001_1_007004626_2 |
Workunit | 7207942 |
Created | 26 Jan 2011, 0:01:39 UTC |
Sent | 26 Jan 2011, 0:50:42 UTC |
Report deadline | 8 Jan 2012, 6:10:42 UTC |
Received | 31 Jan 2011, 20:08:34 UTC |
Server state | Over |
Outcome | No reply |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1019896 |
Run time | 2 days 3 hours 17 min 15 sec |
CPU time | 1 days 21 hours 46 min 27 sec |
Validate state | Invalid |
Credit | 935.95 |
Device peak FLOPS | 2.66 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.08 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5840, selfPID=5840, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7840, selfPID=7840, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 13:32:07 (2988): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:32:09 (2988): No heartbeat from core client for 30 sec - exiting 13:33:11 (7472): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:33:13 (7472): No heartbeat from core client for 30 sec - exiting 13:33:14 (7472): No heartbeat from core client for 30 sec - exiting 13:33:15 (7472): No heartbeat from core client for 30 sec - exiting 13:33:16 (7472): No heartbeat from core client for 30 sec - exiting 13:33:17 (7472): No heartbeat from core client for 30 sec - exiting 13:33:19 (7472): No heartbeat from core client for 30 sec - exiting 13:33:20 (7472): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 15:05:40 (5384): No heartbeat from core client for 30 sec - exiting 15:05:41 (5384): No heartbeat from core client for 30 sec - exiting 15:05:42 (5384): No heartbeat from core client for 30 sec - exiting 15:05:43 (5384): No heartbeat from core client for 30 sec - exiting 15:05:44 (5384): No heartbeat from core client for 30 sec - exiting 15:05:45 (5384): No heartbeat from core client for 30 sec - exiting 15:05:47 (5384): No heartbeat from core client for 30 sec - exiting 15:05:48 (5384): No heartbeat from core client for 30 sec - exiting 15:05:49 (5384): No heartbeat from core client for 30 sec - exiting 15:05:50 (5384): No heartbeat from core client for 30 sec - exiting 15:05:51 (5384): No heartbeat from core client for 30 sec - exiting 15:05:52 (5384): No heartbeat from core client for 30 sec - exiting 15:05:53 (5384): No heartbeat from core client for 30 sec - exiting 15:05:54 (5384): No heartbeat from core client for 30 sec - exiting 15:05:55 (5384): No heartbeat from core client for 30 sec - exiting 15:05:56 (5384): No heartbeat from core client for 30 sec - exiting 15:05:57 (5384): No heartbeat from core client for 30 sec - exiting 15:05:58 (5384): No heartbeat from core client for 30 sec - exiting 15:05:59 (5384): No heartbeat from core client for 30 sec - exiting 15:06:00 (5384): No heartbeat from core client for 30 sec - exiting 15:06:01 (5384): No heartbeat from core client for 30 sec - exiting 15:06:02 (5384): No heartbeat from core client for 30 sec - exiting 15:06:03 (5384): No heartbeat from core client for 30 sec - exiting 15:06:04 (5384): No heartbeat from core client for 30 sec - exiting 15:06:05 (5384): No heartbeat from core client for 30 sec - exiting 15:06:06 (5384): No heartbeat from core client for 30 sec - exiting 15:06:07 (5384): No heartbeat from core client for 30 sec - exiting 15:06:08 (5384): No heartbeat from core client for 30 sec - exiting 15:06:09 (5384): No heartbeat from core client for 30 sec - exiting 15:06:10 (5384): No heartbeat from core client for 30 sec - exiting 15:06:11 (5384): No heartbeat from core client for 30 sec - exiting 15:06:12 (5384): No heartbeat from core client for 30 sec - exiting 15:06:13 (5384): No heartbeat from core client for 30 sec - exiting 15:06:14 (5384): No heartbeat from core client for 30 sec - exiting 15:06:15 (5384): No heartbeat from core client for 30 sec - exiting 15:06:16 (5384): No heartbeat from core client for 30 sec - exiting 15:06:17 (5384): No heartbeat from core client for 30 sec - exiting 15:06:18 (5384): No heartbeat from core client for 30 sec - exiting 15:06:19 (5384): No heartbeat from core client for 30 sec - exiting 15:06:20 (5384): No heartbeat from core client for 30 sec - exiting 15:06:21 (5384): No heartbeat from core client for 30 sec - exiting 15:06:22 (5384): No heartbeat from core client for 30 sec - exiting 15:06:23 (5384): No heartbeat from core client for 30 sec - exiting 15:06:24 (5384): No heartbeat from core client for 30 sec - exiting 15:06:25 (5384): No heartbeat from core client for 30 sec - exiting 15:06:26 (5384): No heartbeat from core client for 30 sec - exiting 15:06:27 (5384): No heartbeat from core client for 30 sec - exiting 15:06:28 (5384): No heartbeat from core client for 30 sec - exiting 15:06:29 (5384): No heartbeat from core client for 30 sec - exiting 15:06:30 (5384): No heartbeat from core client for 30 sec - exiting 15:06:31 (5384): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6108, selfPID=6108, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 08:39:56 (3492): No heartbeat from core client for 30 sec - exiting 08:39:57 (3492): No heartbeat from core client for 30 sec - exiting 08:39:58 (3492): No heartbeat from core client for 30 sec - exiting 08:39:59 (3492): No heartbeat from core client for 30 sec - exiting 08:40:00 (3492): No heartbeat from core client for 30 sec - exiting 08:40:01 (3492): No heartbeat from core client for 30 sec - exiting 08:40:02 (3492): No heartbeat from core client for 30 sec - exiting 08:40:03 (3492): No heartbeat from core client for 30 sec - exiting 08:40:04 (3492): No heartbeat from core client for 30 sec - exiting 08:40:05 (3492): No heartbeat from core client for 30 sec - exiting 08:40:06 (3492): No heartbeat from core client for 30 sec - exiting 08:40:07 (3492): No heartbeat from core client for 30 sec - exiting 08:40:08 (3492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1144, selfPID=1144, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7208, selfPID=7208, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3412, selfPID=3412, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7852, selfPID=7852, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6352, selfPID=6352, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5668, selfPID=5668, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8388, selfPID=8388, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5900, selfPID=5900, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7668, selfPID=7668, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7324, selfPID=7324, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6476, selfPID=6476, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9128, selfPID=9128, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8252, selfPID=8252, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3592, selfPID=3592, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:16:24 (8276): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:18:45 (6084): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:20:16 (9208): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:21:34 (7500): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8396, selfPID=8396, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=9496, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5528, selfPID=8520, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 12:17:46 (8520): called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_saf_1tmi_2001_1_007004626_2_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1tmi_2001_1_007004626_2_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1tmi_2001_1_007004626_2_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1tmi_2001_1_007004626_2_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1tmi_2001_1_007004626_2_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1tmi_2001_1_007004626_2_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1tmi_2001_1_007004626_2_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
31 Jan 2011 13:26:38 | 1019896 | 12520472 | hadam3p_saf_1tmi_2001_1_007004626_2 | 57,696 | 146,244 | 2.5347 |
30 Jan 2011 08:30:58 | 1019896 | 12520472 | hadam3p_saf_1tmi_2001_1_007004626_2 | 46,176 | 118,700 | 2.5706 |
29 Jan 2011 16:05:20 | 1019896 | 12520472 | hadam3p_saf_1tmi_2001_1_007004626_2 | 34,656 | 89,088 | 2.5706 |
29 Jan 2011 12:35:08 | 1019896 | 12520472 | hadam3p_saf_1tmi_2001_1_007004626_2 | 23,136 | 59,358 | 2.5656 |
27 Jan 2011 14:01:25 | 1019896 | 12520472 | hadam3p_saf_1tmi_2001_1_007004626_2 | 11,616 | 29,471 | 2.5371 |
©2024 cpdn.org