Name | hadam3p_eu_9wdj_1976_1_008066562_2 |
Workunit | 8221676 |
Created | 24 Jul 2012, 12:08:07 UTC |
Sent | 24 Jul 2012, 12:19:52 UTC |
Report deadline | 6 Jul 2013, 17:39:52 UTC |
Received | 24 Sep 2012, 11:03:05 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1207414 |
Run time | 3 days 22 hours 50 min 33 sec |
CPU time | 3 days 4 hours 48 min 33 sec |
Validate state | Invalid |
Credit | 995.30 |
Device peak FLOPS | 1.06 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.25</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2128, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7372, selfPID=7372, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3064, selfPID=3064, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9000, selfPID=8092, iMonCtr=1 Model crash detected, will try to restart... GCPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7488, selfPID=2676, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6564, selfPID=6564, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2848, selfPID=2848, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8396, selfPID=8396, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7872, selfPID=7872, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 08:36:37 (7984): No heartbeat from core client for 30 sec - exiting 08:36:38 (7984): No heartbeat from core client for 30 sec - exiting 08:36:39 (7984): No heartbeat from core client for 30 sec - exiting 08:36:40 (7984): No heartbeat from core client for 30 sec - exiting 08:36:41 (7984): No heartbeat from core client for 30 sec - exiting 08:36:42 (7984): No heartbeat from core client for 30 sec - exiting 08:36:43 (7984): No heartbeat from core client for 30 sec - exiting 08:36:44 (7984): No heartbeat from core client for 30 sec - exiting 08:36:45 (7984): No heartbeat from core client for 30 sec - exiting 08:36:46 (7984): No heartbeat from core client for 30 sec - exiting 08:36:47 (7984): No heartbeat from core client for 30 sec - exiting 08:36:48 (7984): No heartbeat from core client for 30 sec - exiting 08:36:49 (7984): No heartbeat from core client for 30 sec - exiting 08:36:50 (7984): No heartbeat from core client for 30 sec - exiting 08:36:51 (7984): No heartbeat from core client for 30 sec - exiting 08:36:52 (7984): No heartbeat from core client for 30 sec - exiting 08:36:53 (7984): No heartbeat from core client for 30 sec - exiting 08:36:54 (7984): No heartbeat from core client for 30 sec - exiting 08:36:55 (7984): No heartbeat from core client for 30 sec - exiting 08:36:56 (7984): No heartbeat from core client for 30 sec - exiting 08:36:57 (7984): No heartbeat from core client for 30 sec - exiting 08:36:58 (7984): No heartbeat from core client for 30 sec - exiting 08:36:59 (7984): No heartbeat from core client for 30 sec - exiting 08:37:00 (7984): No heartbeat from core client for 30 sec - exiting 08:37:01 (7984): No heartbeat from core client for 30 sec - exiting 08:37:02 (7984): No heartbeat from core client for 30 sec - exiting 08:37:03 (7984): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3752, selfPID=3752, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10572, selfPID=10572, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 19:24:52 (12152): No heartbeat from core client for 30 sec - exiting 19:24:53 (12152): No heartbeat from core client for 30 sec - exiting 19:24:54 (12152): No heartbeat from core client for 30 sec - exiting 19:24:55 (12152): No heartbeat from core client for 30 sec - exiting 19:24:56 (12152): No heartbeat from core client for 30 sec - exiting 19:24:57 (12152): No heartbeat from core client for 30 sec - exiting 19:24:58 (12152): No heartbeat from core client for 30 sec - exiting 19:24:59 (12152): No heartbeat from core client for 30 sec - exiting 19:25:00 (12152): No heartbeat from core client for 30 sec - exiting 19:25:01 (12152): No heartbeat from core client for 30 sec - exiting 19:25:02 (12152): No heartbeat from core client for 30 sec - exiting 19:25:03 (12152): No heartbeat from core client for 30 sec - exiting 19:25:04 (12152): No heartbeat from core client for 30 sec - exiting 19:25:05 (12152): No heartbeat from core client for 30 sec - exiting 19:25:06 (12152): No heartbeat from core client for 30 sec - exiting 19:25:07 (12152): No heartbeat from core client for 30 sec - exiting 19:25:08 (12152): No heartbeat from core client for 30 sec - exiting 19:25:09 (12152): No heartbeat from core client for 30 sec - exiting 19:25:10 (12152): No heartbeat from core client for 30 sec - exiting 19:25:11 (12152): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:33:29 (10340): No heartbeat from core client for 30 sec - exiting 18:33:30 (10340): No heartbeat from core client for 30 sec - exiting 18:33:31 (10340): No heartbeat from core client for 30 sec - exiting 18:33:32 (10340): No heartbeat from core client for 30 sec - exiting 18:33:33 (10340): No heartbeat from core client for 30 sec - exiting 18:33:34 (10340): No heartbeat from core client for 30 sec - exiting 18:33:35 (10340): No heartbeat from core client for 30 sec - exiting 18:33:36 (10340): No heartbeat from core client for 30 sec - exiting 18:33:37 (10340): No heartbeat from core client for 30 sec - exiting 18:33:38 (10340): No heartbeat from core client for 30 sec - exiting 18:33:39 (10340): No heartbeat from core client for 30 sec - exiting 18:33:40 (10340): No heartbeat from core client for 30 sec - exiting 18:33:41 (10340): No heartbeat from core client for 30 sec - exiting 18:33:42 (10340): No heartbeat from core client for 30 sec - exiting 18:33:43 (10340): No heartbeat from core client for 30 sec - exiting 18:33:44 (10340): No heartbeat from core client for 30 sec - exiting 18:33:45 (10340): No heartbeat from core client for 30 sec - exiting 18:33:46 (10340): No heartbeat from core client for 30 sec - exiting 18:33:47 (10340): No heartbeat from core client for 30 sec - exiting 18:33:48 (10340): No heartbeat from core client for 30 sec - exiting 18:33:49 (10340): No heartbeat from core client for 30 sec - exiting 18:33:50 (10340): No heartbeat from core client for 30 sec - exiting 18:33:51 (10340): No heartbeat from core client for 30 sec - exiting 18:33:52 (10340): No heartbeat from core client for 30 sec - exiting 18:33:53 (10340): No heartbeat from core client for 30 sec - exiting 18:33:54 (10340): No heartbeat from core client for 30 sec - exiting 18:33:55 (10340): No heartbeat from core client for 30 sec - exiting 18:33:56 (10340): No heartbeat from core client for 30 sec - exiting 18:33:57 (10340): No heartbeat from core client for 30 sec - exiting 18:33:58 (10340): No heartbeat from core client for 30 sec - exiting 18:33:59 (10340): No heartbeat from core client for 30 sec - exiting 18:34:00 (10340): No heartbeat from core client for 30 sec - exiting 18:34:01 (10340): No heartbeat from core client for 30 sec - exiting 18:34:02 (10340): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12040, selfPID=12040, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 10:24:17 (12640): No heartbeat from core client for 30 sec - exiting 10:24:21 (12640): No heartbeat from core client for 30 sec - exiting 10:24:22 (12640): No heartbeat from core client for 30 sec - exiting 10:24:23 (12640): No heartbeat from core client for 30 sec - exiting 10:24:24 (12640): No heartbeat from core client for 30 sec - exiting 10:24:25 (12640): No heartbeat from core client for 30 sec - exiting 10:24:26 (12640): No heartbeat from core client for 30 sec - exiting 10:24:27 (12640): No heartbeat from core client for 30 sec - exiting 10:24:28 (12640): No heartbeat from core client for 30 sec - exiting 10:24:29 (12640): No heartbeat from core client for 30 sec - exiting 10:24:30 (12640): No heartbeat from core client for 30 sec - exiting 10:24:31 (12640): No heartbeat from core client for 30 sec - exiting 10:24:32 (12640): No heartbeat from core client for 30 sec - exiting 10:24:33 (12640): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 14:14:54 (4932): No heartbeat from core client for 30 sec - exiting 14:14:55 (4932): No heartbeat from core client for 30 sec - exiting 14:14:56 (4932): No heartbeat from core client for 30 sec - exiting 14:14:57 (4932): No heartbeat from core client for 30 sec - exiting 14:14:58 (4932): No heartbeat from core client for 30 sec - exiting 14:14:59 (4932): No heartbeat from core client for 30 sec - exiting 14:15:00 (4932): No heartbeat from core client for 30 sec - exiting 14:15:01 (4932): No heartbeat from core client for 30 sec - exiting 14:15:02 (4932): No heartbeat from core client for 30 sec - exiting 14:15:03 (4932): No heartbeat from core client for 30 sec - exiting 14:15:04 (4932): No heartbeat from core client for 30 sec - exiting 14:15:05 (4932): No heartbeat from core client for 30 sec - exiting 14:15:06 (4932): No heartbeat from core client for 30 sec - exiting 14:15:07 (4932): No heartbeat from core client for 30 sec - exiting 14:15:08 (4932): No heartbeat from core client for 30 sec - exiting 14:15:09 (4932): No heartbeat from core client for 30 sec - exiting 14:15:10 (4932): No heartbeat from core client for 30 sec - exiting 14:15:11 (4932): No heartbeat from core client for 30 sec - exiting 14:15:12 (4932): No heartbeat from core client for 30 sec - exiting 14:15:13 (4932): No heartbeat from core client for 30 sec - exiting 14:15:14 (4932): No heartbeat from core client for 30 sec - exiting 14:15:15 (4932): No heartbeat from core client for 30 sec - exiting 14:15:16 (4932): No heartbeat from core client for 30 sec - exiting 14:15:17 (4932): No heartbeat from core client for 30 sec - exiting 14:15:18 (4932): No heartbeat from core client for 30 sec - exiting 14:15:19 (4932): No heartbeat from core client for 30 sec - exiting 14:15:20 (4932): No heartbeat from core client for 30 sec - exiting 14:15:21 (4932): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8372, selfPID=8372, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4032, selfPID=7032, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 21:28:37 (8960): Can't acquire lockfile (32) - waiting 35s CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5348, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6908, selfPID=5008, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt><message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_9wdj_1976_1_008066562_2_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_9wdj_1976_1_008066562_2_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_9wdj_1976_1_008066562_2_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_9wdj_1976_1_008066562_2_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_9wdj_1976_1_008066562_2_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_9wdj_1976_1_008066562_2_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_9wdj_1976_1_008066562_2_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
22 Sep 2012 13:08:47 | 1207414 | 14988187 | hadam3p_eu_9wdj_1976_1_008066562_2 | 57,696 | 248,071 | 4.2996 |
08 Sep 2012 16:34:12 | 1207414 | 14988187 | hadam3p_eu_9wdj_1976_1_008066562_2 | 46,176 | 197,435 | 4.2757 |
24 Aug 2012 17:37:43 | 1207414 | 14988187 | hadam3p_eu_9wdj_1976_1_008066562_2 | 34,656 | 149,351 | 4.3095 |
19 Aug 2012 01:57:53 | 1207414 | 14988187 | hadam3p_eu_9wdj_1976_1_008066562_2 | 23,136 | 99,781 | 4.3128 |
30 Jul 2012 18:52:33 | 1207414 | 14988187 | hadam3p_eu_9wdj_1976_1_008066562_2 | 11,616 | 50,358 | 4.3352 |
©2024 cpdn.org