Name | hadam3p_eu_cngc_2002_1_008081075_0 |
Workunit | 8236189 |
Created | 21 Jul 2012, 22:32:29 UTC |
Sent | 29 Jul 2012, 5:20:40 UTC |
Report deadline | 11 Jul 2013, 10:40:40 UTC |
Received | 20 Aug 2012, 4:17:51 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1220817 |
Run time | 7 hours 59 min 3 sec |
CPU time | 6 hours 47 min 5 sec |
Validate state | Invalid |
Credit | 200.38 |
Device peak FLOPS | 2.90 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <stderr_txt> 07:59:03 (5204): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3560, selfPID=3560, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3572, selfPID=3572, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4052, selfPID=4052, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2416, selfPID=2416, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3104, selfPID=3104, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 18:38:52 (3928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2292, selfPID=2292, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2020, selfPID=4088, iMonCtr=1 Model crash detected, will try to restart... 15:48:01 (3924): No heartbeat from core client for 30 sec - exiting 15:48:03 (3924): No heartbeat from core client for 30 sec - exiting 15:48:04 (3924): No heartbeat from core client for 30 sec - exiting 15:48:05 (3924): No heartbeat from core client for 30 sec - exiting 15:48:06 (3924): No heartbeat from core client for 30 sec - exiting 15:48:07 (3924): No heartbeat from core client for 30 sec - exiting 15:48:08 (3924): No heartbeat from core client for 30 sec - exiting 15:48:09 (3924): No heartbeat from core client for 30 sec - exiting 15:48:10 (3924): No heartbeat from core client for 30 sec - exiting 15:48:11 (3924): No heartbeat from core client for 30 sec - exiting 15:48:12 (3924): No heartbeat from core client for 30 sec - exiting 15:48:13 (3924): No heartbeat from core client for 30 sec - exiting 15:48:15 (3924): No heartbeat from core client for 30 sec - exiting 15:48:16 (3924): No heartbeat from core client for 30 sec - exiting 15:48:17 (3924): No heartbeat from core client for 30 sec - exiting 15:48:18 (3924): No heartbeat from core client for 30 sec - exiting 15:48:19 (3924): No heartbeat from core client for 30 sec - exiting 15:48:20 (3924): No heartbeat from core client for 30 sec - exiting 15:48:21 (3924): No heartbeat from core client for 30 sec - exiting 18:28:36 (3548): No heartbeat from core client for 30 sec - exiting 18:28:37 (3548): No 
heartbeat from core client for 30 sec - exiting 18:28:39 (3548): No heartbeat from core client for 30 sec - exiting 18:28:40 (3548): No heartbeat from core client for 30 sec - exiting 18:28:41 (3548): No heartbeat from core client for 30 sec - exiting 18:28:42 (3548): No heartbeat from core client for 30 sec - exiting 18:28:43 (3548): No heartbeat from core client for 30 sec - exiting 18:28:44 (3548): No heartbeat from core client for 30 sec - exiting 18:28:45 (3548): No heartbeat from core client for 30 sec - exiting 18:28:46 (3548): No heartbeat from core client for 30 sec - exiting 18:28:47 (3548): No heartbeat from core client for 30 sec - exiting 18:28:48 (3548): No heartbeat from core client for 30 sec - exiting 18:28:49 (3548): No heartbeat from core client for 30 sec - exiting 18:28:51 (3548): No heartbeat from core client for 30 sec - exiting 18:28:52 (3548): No heartbeat from core client for 30 sec - exiting 18:28:53 (3548): No heartbeat from core client for 30 sec - exiting 18:28:54 (3548): No heartbeat from core client for 30 sec - exiting 18:28:55 (3548): No heartbeat from core client for 30 sec - exiting 18:28:56 (3548): No heartbeat from core client for 30 sec - exiting 18:28:57 (3548): No heartbeat from core client for 30 sec - exiting 18:28:58 (3548): No heartbeat from core client for 30 sec - exiting 18:28:59 (3548): No heartbeat from core client for 30 sec - exiting 18:29:00 (3548): No heartbeat from core client for 30 sec - exiting 18:29:01 (3548): No heartbeat from core client for 30 sec - exiting 18:29:03 (3548): No heartbeat from core client for 30 sec - exiting 18:29:04 (3548): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 
18:36:05 (3916): No heartbeat from core client for 30 sec - exiting 18:36:06 (3916): No heartbeat from core client for 30 sec - exiting 18:36:07 (3916): No heartbeat from core client for 30 sec - exiting 18:36:08 (3916): No heartbeat from core client for 30 sec - exiting 18:36:09 (3916): No heartbeat from core client for 30 sec - exiting 18:36:10 (3916): No heartbeat from core client for 30 sec - exiting 18:36:11 (3916): No heartbeat from core client for 30 sec - exiting 18:36:13 (3916): No heartbeat from core client for 30 sec - exiting 18:36:14 (3916): No heartbeat from core client for 30 sec - exiting 18:36:15 (3916): No heartbeat from core client for 30 sec - exiting 18:36:16 (3916): No heartbeat from core client for 30 sec - exiting 18:36:17 (3916): No heartbeat from core client for 30 sec - exiting 18:36:18 (3916): No heartbeat from core client for 30 sec - exiting 18:36:19 (3916): No heartbeat from core client for 30 sec - exiting 18:36:20 (3916): No heartbeat from core client for 30 sec - exiting 18:36:21 (3916): No heartbeat from core client for 30 sec - exiting 18:36:22 (3916): No heartbeat from core client for 30 sec - exiting 18:36:23 (3916): No heartbeat from core client for 30 sec - exiting 18:36:25 (3916): No heartbeat from core client for 30 sec - exiting 18:36:26 (3916): No heartbeat from core client for 30 sec - exiting 18:36:27 (3916): No heartbeat from core client for 30 sec - exiting 18:36:28 (3916): No heartbeat from core client for 30 sec - exiting 18:36:29 (3916): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 
19:05:48 (3076): No heartbeat from core client for 30 sec - exiting 19:05:49 (3076): No heartbeat from core client for 30 sec - exiting 19:05:51 (3076): No heartbeat from core client for 30 sec - exiting 19:05:52 (3076): No heartbeat from core client for 30 sec - exiting 19:05:53 (3076): No heartbeat from core client for 30 sec - exiting 19:05:54 (3076): No heartbeat from core client for 30 sec - exiting 19:05:55 (3076): No heartbeat from core client for 30 sec - exiting 19:05:56 (3076): No heartbeat from core client for 30 sec - exiting 19:05:57 (3076): No heartbeat from core client for 30 sec - exiting 19:05:58 (3076): No heartbeat from core client for 30 sec - exiting 19:05:59 (3076): No heartbeat from core client for 30 sec - exiting 19:06:00 (3076): No heartbeat from core client for 30 sec - exiting 19:06:01 (3076): No heartbeat from core client for 30 sec - exiting 19:06:03 (3076): No heartbeat from core client for 30 sec - exiting 19:06:04 (3076): No heartbeat from core client for 30 sec - exiting 19:06:05 (3076): No heartbeat from core client for 30 sec - exiting 19:06:06 (3076): No heartbeat from core client for 30 sec - exiting 19:06:07 (3076): No heartbeat from core client for 30 sec - exiting 19:06:08 (3076): No heartbeat from core client for 30 sec - exiting 19:06:09 (3076): No heartbeat from core client for 30 sec - exiting 19:06:10 (3076): No heartbeat from core client for 30 sec - exiting 19:06:11 (3076): No heartbeat from core client for 30 sec - exiting 19:06:12 (3076): No heartbeat from core client for 30 sec - exiting 19:06:13 (3076): No heartbeat from core client for 30 sec - exiting 17:47:17 (3508): No heartbeat from core client for 30 sec - exiting 17:47:18 (3508): No heartbeat from core client for 30 sec - exiting 17:47:19 (3508): No heartbeat from core client for 30 sec - exiting 17:47:20 (3508): No heartbeat from core client for 30 sec - exiting 17:47:21 (3508): No heartbeat from core client for 30 sec - exiting 17:47:22 (3508): No 
heartbeat from core client for 30 sec - exiting 17:47:23 (3508): No heartbeat from core client for 30 sec - exiting 17:47:24 (3508): No heartbeat from core client for 30 sec - exiting 17:47:25 (3508): No heartbeat from core client for 30 sec - exiting 17:47:26 (3508): No heartbeat from core client for 30 sec - exiting 17:47:28 (3508): No heartbeat from core client for 30 sec - exiting 17:47:29 (3508): No heartbeat from core client for 30 sec - exiting 17:47:30 (3508): No heartbeat from core client for 30 sec - exiting 17:47:31 (3508): No heartbeat from core client for 30 sec - exiting 17:47:32 (3508): No heartbeat from core client for 30 sec - exiting 17:47:33 (3508): No heartbeat from core client for 30 sec - exiting 17:47:34 (3508): No heartbeat from core client for 30 sec - exiting 17:47:35 (3508): No heartbeat from core client for 30 sec - exiting 17:47:36 (3508): No heartbeat from core client for 30 sec - exiting 17:47:37 (3508): No heartbeat from core client for 30 sec - exiting 17:47:38 (3508): No heartbeat from core client for 30 sec - exiting 17:47:40 (3508): No heartbeat from core client for 30 sec - exiting 17:47:41 (3508): No heartbeat from core client for 30 sec - exiting 17:47:42 (3508): No heartbeat from core client for 30 sec - exiting 17:47:43 (3508): No heartbeat from core client for 30 sec - exiting 17:47:44 (3508): No heartbeat from core client for 30 sec - exiting 17:47:45 (3508): No heartbeat from core client for 30 sec - exiting 17:47:46 (3508): No heartbeat from core client for 30 sec - exiting 17:47:47 (3508): No heartbeat from core client for 30 sec - exiting 17:47:48 (3508): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 
20:02:08 (2348): No heartbeat from core client for 30 sec - exiting 20:02:09 (2348): No heartbeat from core client for 30 sec - exiting 20:02:10 (2348): No heartbeat from core client for 30 sec - exiting 20:02:11 (2348): No heartbeat from core client for 30 sec - exiting 20:02:12 (2348): No heartbeat from core client for 30 sec - exiting 20:02:13 (2348): No heartbeat from core client for 30 sec - exiting 20:02:14 (2348): No heartbeat from core client for 30 sec - exiting 20:02:15 (2348): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Model crashed: REPLANCA: PP HEADERS ON ANCILLARY FILE DO NOT MATCH tmp/xaakm.pipe_dummy 2048 Leaving CPDN_Main::Monitor... 
Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_cngc_2002_1_008081075_0_2.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_cngc_2002_1_008081075_0_3.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_cngc_2002_1_008081075_0_4.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_cngc_2002_1_008081075_0_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_cngc_2002_1_008081075_0_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_cngc_2002_1_008081075_0_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_cngc_2002_1_008081075_0_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_cngc_2002_1_008081075_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_cngc_2002_1_008081075_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_cngc_2002_1_008081075_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_cngc_2002_1_008081075_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
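
The Stderr field above is easier to read as a tally of message types than as a raw dump. The following is a minimal sketch, assuming the stderr text has been copied into a Python string. The category names (`no_heartbeat`, `suspend_request`, and so on) and the function `tally_stderr` are hypothetical labels chosen for illustration, not part of BOINC or CPDN; only the matched message texts are taken from the log itself. Whitespace is normalised first because the dump wraps some messages across lines.

```python
import re
from collections import Counter

# Hypothetical category names; the message texts are copied from the Stderr field.
PATTERNS = {
    "no_heartbeat": re.compile(r"No heartbeat from core client for 30 sec - exiting"),
    "suspend_request": re.compile(r"CPDN Monitor - Suspend request from BOINC"),
    "quit_request": re.compile(r"CPDN Monitor - Quit request from BOINC"),
    "process_not_running": re.compile(r"CPDN process is not running"),
    "model_crash": re.compile(r"Model crash(?:ed| detected)"),
}


def tally_stderr(text: str) -> Counter:
    """Count how often each known message type occurs in a stderr dump."""
    # Collapse line breaks and repeated spaces so wrapped messages still match.
    normalised = " ".join(text.split())
    counts = Counter()
    for name, pattern in PATTERNS.items():
        counts[name] = len(pattern.findall(normalised))
    return counts


if __name__ == "__main__":
    # Short excerpt in the style of the log above, used only as sample input.
    sample = (
        "18:38:52 (3928): No heartbeat from core client for 30 sec - exiting "
        "CPDN Monitor - No 'heartbeat' from BOINC... "
        "Suspended CPDN Monitor - Suspend request from BOINC... "
        "CPDN Monitor - Quit request from BOINC... "
        "Model crash detected, will try to restart..."
    )
    for name, count in tally_stderr(sample).items():
        print(f"{name}: {count}")
```

Run against the full stderr text, a tally like this makes it quicker to see whether a task was dominated by lost heartbeats, suspend/quit cycles, or model crashes.
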
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
18 Aug 2012 18:16:27 | 1220817 | 14972309 | hadam3p_eu_cngc_2002_1_008081075_0 | 11,616 | 21,530 | 1.8535 |
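
The Average (sec/TS) column appears to be the CPU time divided by the timestep count. That is an inference from the numbers in the row above, not something the page states. A minimal sketch of the arithmetic, with the hypothetical helper name `seconds_per_timestep`:

```python
def seconds_per_timestep(cpu_time_sec: float, timesteps: int) -> float:
    """Average CPU seconds spent per model timestep."""
    if timesteps <= 0:
        raise ValueError("timesteps must be positive")
    return cpu_time_sec / timesteps


if __name__ == "__main__":
    # Values from the trickle row: 21,530 CPU seconds over 11,616 timesteps.
    avg = seconds_per_timestep(21_530, 11_616)
    print(f"{avg:.4f}")  # 1.8535
```

Rounded to four decimal places, the result matches the 1.8535 shown in the table.
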