Name | hadam3p_eu_2kce_1984_1_007159862_0 |
Workunit | 7344702 |
Created | 18 Feb 2011, 17:06:51 UTC |
Sent | 12 Mar 2011, 10:39:33 UTC |
Report deadline | 22 Feb 2012, 15:59:33 UTC |
Received | 23 Mar 2011, 21:28:05 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1129858 |
Run time | 18 hours 2 min 8 sec |
CPU time | 16 hours 47 min 33 sec |
Validate state | Invalid |
Credit | 796.57 |
Device peak FLOPS | 3.37 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.08 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=272764, selfPID=272764, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=287944, selfPID=287944, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 11:05:53 (288456): No heartbeat from core client for 30 sec - exiting 11:05:54 (288456): No heartbeat from core client for 30 sec - exiting 11:05:55 (288456): No heartbeat from core client for 30 sec - exiting 11:05:56 (288456): No heartbeat from core client for 30 sec - exiting 11:05:57 (288456): No heartbeat from core client for 30 sec - exiting 11:05:58 (288456): No heartbeat from core client for 30 sec - exiting 11:05:59 (288456): No heartbeat from core client for 30 sec - exiting 11:06:00 (288456): No heartbeat from core client for 30 sec - exiting 11:06:01 (288456): No heartbeat from core client for 30 sec - exiting 11:06:02 (288456): No heartbeat from core client for 30 sec - exiting 11:06:03 (288456): No heartbeat from core client for 30 sec - exiting 11:06:04 (288456): No heartbeat from core client for 30 sec - exiting 11:06:05 (288456): No heartbeat from core client for 30 sec - exiting 11:06:06 (288456): No heartbeat from core client for 30 sec - exiting 11:06:07 (288456): No heartbeat from core client for 30 sec - exiting 11:06:08 (288456): No heartbeat from core client for 30 sec - exiting 11:06:09 (288456): No heartbeat from core client for 30 sec - exiting 11:06:10 (288456): No heartbeat from core client for 30 sec - exiting 11:06:11 (288456): No heartbeat from core client for 30 sec - exiting 11:06:12 (288456): No heartbeat from core client for 30 sec - exiting 11:06:13 (288456): No heartbeat from 
core client for 30 sec - exiting 11:06:14 (288456): No heartbeat from core client for 30 sec - exiting 11:06:15 (288456): No heartbeat from core client for 30 sec - exiting 11:06:16 (288456): No heartbeat from core client for 30 sec - exiting 11:06:17 (288456): No heartbeat from core client for 30 sec - exiting 11:06:18 (288456): No heartbeat from core client for 30 sec - exiting 11:06:19 (288456): No heartbeat from core client for 30 sec - exiting 11:06:20 (288456): No heartbeat from core client for 30 sec - exiting 11:06:21 (288456): No heartbeat from core client for 30 sec - exiting 11:06:22 (288456): No heartbeat from core client for 30 sec - exiting 11:06:23 (288456): No heartbeat from core client for 30 sec - exiting 11:06:24 (288456): No heartbeat from core client for 30 sec - exiting 11:06:25 (288456): No heartbeat from core client for 30 sec - exiting 11:06:26 (288456): No heartbeat from core client for 30 sec - exiting 11:06:27 (288456): No heartbeat from core client for 30 sec - exiting 11:06:28 (288456): No heartbeat from core client for 30 sec - exiting 11:06:29 (288456): No heartbeat from core client for 30 sec - exiting 11:06:30 (288456): No heartbeat from core client for 30 sec - exiting 11:06:31 (288456): No heartbeat from core client for 30 sec - exiting 11:06:32 (288456): No heartbeat from core client for 30 sec - exiting 11:06:33 (288456): No heartbeat from core client for 30 sec - exiting 11:06:34 (288456): No heartbeat from core client for 30 sec - exiting 11:06:35 (288456): No heartbeat from core client for 30 sec - exiting 11:06:36 (288456): No heartbeat from core client for 30 sec - exiting 11:06:37 (288456): No heartbeat from core client for 30 sec - exiting 11:06:38 (288456): No heartbeat from core client for 30 sec - exiting 11:06:39 (288456): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:53:13 (288456): No heartbeat from core client for 30 sec - exiting 18:53:45 (288456): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 01:42:21 (290852): No heartbeat from core client for 30 sec - exiting 01:42:22 (290852): No heartbeat from core client for 30 sec - exiting 01:42:23 (290852): No heartbeat from core client for 30 sec - exiting 01:42:24 (290852): No heartbeat from core client for 30 sec - exiting 01:42:25 (290852): No heartbeat from core client for 30 sec - exiting 01:42:26 (290852): No heartbeat from core client for 30 sec - exiting 01:42:27 (290852): No heartbeat from core client for 30 sec - exiting 01:42:28 (290852): No heartbeat from core client for 30 sec - exiting 01:42:29 (290852): No heartbeat from core client for 30 sec - exiting 01:42:30 (290852): No heartbeat from core client for 30 sec - exiting 01:42:31 (290852): No heartbeat from core client for 30 sec - exiting 01:42:32 (290852): No heartbeat from core client for 30 sec - exiting 01:42:33 (290852): No heartbeat from core client for 30 sec - exiting 01:42:34 (290852): No heartbeat from core client for 30 sec - exiting 01:42:35 (290852): No heartbeat from core client for 30 sec - exiting 01:42:36 (290852): No heartbeat from core client for 30 sec - exiting 01:42:37 (290852): No heartbeat from core client for 30 sec - exiting 01:42:38 (290852): No heartbeat from core client for 30 sec - exiting 01:42:39 (290852): No heartbeat from core client for 30 sec - exiting 01:42:40 (290852): No heartbeat from core client for 30 sec - exiting 01:42:41 (290852): No heartbeat from core client for 30 sec - exiting 01:42:42 (290852): No heartbeat from core client for 30 sec - exiting 01:42:43 (290852): No 
heartbeat from core client for 30 sec - exiting 01:42:44 (290852): No heartbeat from core client for 30 sec - exiting 01:42:45 (290852): No heartbeat from core client for 30 sec - exiting 01:42:46 (290852): No heartbeat from core client for 30 sec - exiting 01:42:47 (290852): No heartbeat from core client for 30 sec - exiting 01:42:48 (290852): No heartbeat from core client for 30 sec - exiting 01:42:49 (290852): No heartbeat from core client for 30 sec - exiting 01:42:50 (290852): No heartbeat from core client for 30 sec - exiting 01:42:51 (290852): No heartbeat from core client for 30 sec - exiting 01:42:52 (290852): No heartbeat from core client for 30 sec - exiting 01:42:53 (290852): No heartbeat from core client for 30 sec - exiting 01:42:54 (290852): No heartbeat from core client for 30 sec - exiting 01:42:55 (290852): No heartbeat from core client for 30 sec - exiting 01:42:56 (290852): No heartbeat from core client for 30 sec - exiting 01:42:57 (290852): No heartbeat from core client for 30 sec - exiting 01:42:58 (290852): No heartbeat from core client for 30 sec - exiting 01:42:59 (290852): No heartbeat from core client for 30 sec - exiting 01:43:00 (290852): No heartbeat from core client for 30 sec - exiting 01:43:01 (290852): No heartbeat from core client for 30 sec - exiting 01:43:02 (290852): No heartbeat from core client for 30 sec - exiting 01:43:03 (290852): No heartbeat from core client for 30 sec - exiting 01:43:04 (290852): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=290908, selfPID=290908, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=290236, selfPID=290236, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=290748, selfPID=290748, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=273116, selfPID=273116, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=288496, selfPID=288496, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 21:46:58 (292904): No heartbeat from core client for 30 sec - exiting 21:46:59 (292904): No heartbeat from core client for 30 sec - exiting 21:47:00 (292904): No heartbeat from core client for 30 sec - exiting 21:47:01 (292904): No heartbeat from core client for 30 sec - exiting 21:47:02 (292904): No heartbeat from core client for 30 sec - exiting 21:47:03 (292904): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 
07:35:14 (293484): No heartbeat from core client for 30 sec - exiting 07:35:15 (293484): No heartbeat from core client for 30 sec - exiting 07:35:16 (293484): No heartbeat from core client for 30 sec - exiting 07:35:17 (293484): No heartbeat from core client for 30 sec - exiting 07:35:18 (293484): No heartbeat from core client for 30 sec - exiting 07:35:19 (293484): No heartbeat from core client for 30 sec - exiting 07:35:20 (293484): No heartbeat from core client for 30 sec - exiting 07:35:21 (293484): No heartbeat from core client for 30 sec - exiting 07:35:22 (293484): No heartbeat from core client for 30 sec - exiting 07:35:23 (293484): No heartbeat from core client for 30 sec - exiting 07:35:24 (293484): No heartbeat from core client for 30 sec - exiting 07:35:25 (293484): No heartbeat from core client for 30 sec - exiting 07:35:26 (293484): No heartbeat from core client for 30 sec - exiting 07:35:27 (293484): No heartbeat from core client for 30 sec - exiting 07:35:28 (293484): No heartbeat from core client for 30 sec - exiting 07:35:29 (293484): No heartbeat from core client for 30 sec - exiting 07:35:30 (293484): No heartbeat from core client for 30 sec - exiting 07:35:31 (293484): No heartbeat from core client for 30 sec - exiting 07:35:32 (293484): No heartbeat from core client for 30 sec - exiting 07:35:33 (293484): No heartbeat from core client for 30 sec - exiting 07:35:34 (293484): No heartbeat from core client for 30 sec - exiting 07:35:35 (293484): No heartbeat from core client for 30 sec - exiting 07:35:36 (293484): No heartbeat from core client for 30 sec - exiting 07:35:37 (293484): No heartbeat from core client for 30 sec - exiting 07:35:38 (293484): No heartbeat from core client for 30 sec - exiting 07:35:39 (293484): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=293480, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=294704, selfPID=294704, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=294704, selfPID=294584, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 07:36:48 (294584): called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_eu_2kce_1984_1_007159862_0_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2kce_1984_1_007159862_0_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2kce_1984_1_007159862_0_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2kce_1984_1_007159862_0_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2kce_1984_1_007159862_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2kce_1984_1_007159862_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2kce_1984_1_007159862_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2kce_1984_1_007159862_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
20 Mar 2011 15:15:32 | 1129858 | 12591980 | hadam3p_eu_2kce_1984_1_007159862_0 | 46,176 | 57,823 | 1.2522 |
20 Mar 2011 08:42:38 | 1129858 | 12591980 | hadam3p_eu_2kce_1984_1_007159862_0 | 34,656 | 43,299 | 1.2494 |
20 Mar 2011 02:20:01 | 1129858 | 12591980 | hadam3p_eu_2kce_1984_1_007159862_0 | 23,136 | 28,657 | 1.2386 |
16 Mar 2011 18:37:36 | 1129858 | 12591980 | hadam3p_eu_2kce_1984_1_007159862_0 | 11,616 | 14,281 | 1.2294 |
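
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count. That relationship is an inference from the four rows above, not something the page states. A minimal sketch that checks it, with the row values copied from the table and the helper name `sec_per_timestep` chosen only for illustration:

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_avg).
trickles = [
    (46176, 57823, 1.2522),
    (34656, 43299, 1.2494),
    (23136, 28657, 1.2386),
    (11616, 14281, 1.2294),
]


def sec_per_timestep(timestep, cpu_time_sec):
    """Cumulative CPU seconds per timestep, rounded to 4 decimal places.

    Assumption: this is how the Average (sec/TS) column is derived.
    """
    return round(cpu_time_sec / timestep, 4)


# Recompute the average for every row, newest trickle first.
computed = [sec_per_timestep(ts, cpu) for ts, cpu, _ in trickles]

for (ts, cpu, reported), value in zip(trickles, computed):
    print(f"{ts:>6} timesteps: computed {value:.4f}, reported {reported:.4f}")
```

Under that assumption the recomputed values agree with the reported column to four decimal places, and the average rises slightly with each successive trickle.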