Name | hadam3p_anz_l21c_2013_1_009735762_1 |
Workunit | 9807607 |
Created | 14 Apr 2015, 20:12:17 UTC |
Sent | 16 Apr 2015, 14:04:35 UTC |
Report deadline | 28 Mar 2016, 19:24:35 UTC |
Received | 21 Apr 2015, 12:33:20 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1326216 |
Run time | 15 hours 58 min 19 sec |
CPU time | 15 hours 7 min 58 sec |
Validate state | Invalid |
Credit | 509.72 |
Device peak FLOPS | 4.23 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.4.27</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7652, iMonCtr=2 Model crash detected, will try to restart... 07:56:33 (3652): No heartbeat from core client for 30 sec - exiting 07:56:35 (3652): No heartbeat from core client for 30 sec - exiting 07:56:36 (3652): No heartbeat from core client for 30 sec - exiting 07:56:37 (3652): No heartbeat from core client for 30 sec - exiting 07:56:38 (3652): No heartbeat from core client for 30 sec - exiting 07:56:39 (3652): No heartbeat from core client for 30 sec - exiting 07:56:40 (3652): No heartbeat from core client for 30 sec - exiting 07:56:41 (3652): No heartbeat from core client for 30 sec - exiting 07:56:42 (3652): No heartbeat from core client for 30 sec - exiting 07:56:43 (3652): No heartbeat from core client for 30 sec - exiting 07:56:44 (3652): No heartbeat from core client for 30 sec - exiting 07:56:45 (3652): No heartbeat from core client for 30 sec - exiting 07:56:46 (3652): No heartbeat from core client for 30 sec - exiting 07:56:47 (3652): No heartbeat from core client for 30 sec - exiting 07:56:48 (3652): No heartbeat from core client for 30 sec - exiting 07:56:49 (3652): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1428, selfPID=1428, iMonCtr=2 07:56:50 (3652): No heartbeat from core client for 30 sec - exiting 07:58:09 (6052): No heartbeat from core client for 30 sec - exiting 07:58:10 (6052): No heartbeat from core client for 30 sec - exiting 07:58:11 (6052): No heartbeat from core client for 30 sec - exiting 07:58:12 (6052): No heartbeat from core client for 30 sec - exiting 07:58:13 (6052): No heartbeat from core client for 30 sec - exiting 07:58:14 (6052): No heartbeat from core client for 30 sec - exiting 07:58:15 (6052): No heartbeat from core client for 30 sec - exiting 07:58:16 (6052): No heartbeat from core client for 30 sec - exiting 07:58:17 (6052): No heartbeat from core client for 30 sec - exiting 07:58:18 (6052): No heartbeat from core client for 30 sec - exiting 07:58:19 (6052): No heartbeat from core client for 30 sec - exiting 07:58:20 (6052): No heartbeat from core client for 30 sec - exiting 07:58:21 (6052): No heartbeat from core client for 30 sec - exiting 07:58:22 (6052): No heartbeat from core client for 30 sec - exiting 07:58:23 (6052): No heartbeat from core client for 30 sec - exiting 07:58:24 (6052): No heartbeat from core client for 30 sec - exiting 07:58:25 (6052): No heartbeat from core client for 30 sec - exiting 07:58:26 (6052): No heartbeat from core client for 30 sec - exiting 07:58:27 (6052): No heartbeat from core client for 30 sec - exiting 07:58:28 (6052): No heartbeat from core client for 30 sec - exiting 07:58:29 (6052): No heartbeat from core client for 30 sec - exiting 07:58:30 (6052): No heartbeat from core client for 30 sec - exiting 07:58:31 (6052): No heartbeat from core client for 30 sec - exiting 07:58:32 (6052): No heartbeat from core client for 30 sec - exiting 07:58:33 (6052): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:07:32 (5872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:07:54 (7840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:32:19 (6448): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:32:56 (11252): No heartbeat from core client for 30 sec - exiting 09:32:57 (11252): No heartbeat from core client for 30 sec - exiting 09:32:58 (11252): No heartbeat from core client for 30 sec - exiting 09:32:59 (11252): No heartbeat from core client for 30 sec - exiting 09:33:00 (11252): No heartbeat from core client for 30 sec - exiting 09:33:01 (11252): No heartbeat from core client for 30 sec - exiting 09:33:02 (11252): No heartbeat from core client for 30 sec - exiting 09:33:03 (11252): No heartbeat from core client for 30 sec - exiting 09:33:04 (11252): No heartbeat from core client for 30 sec - exiting 09:33:05 (11252): No heartbeat from core client for 30 sec - exiting 09:33:06 (11252): No heartbeat from core client for 30 sec - exiting 09:33:07 (11252): No heartbeat from core client for 30 sec - exiting 09:33:08 (11252): No heartbeat from core client for 30 sec - exiting 09:33:09 (11252): No heartbeat from core client for 30 sec - exiting 09:33:10 (11252): No heartbeat from core client for 30 sec - exiting 09:33:11 (11252): No heartbeat from core client for 30 sec - exiting 09:33:12 (11252): No heartbeat from core client for 30 sec - exiting 09:33:13 (11252): No heartbeat from core client for 30 sec - exiting 09:33:14 (11252): No heartbeat from core client for 30 sec - exiting 09:33:15 (11252): No heartbeat from core client for 30 sec - exiting 09:33:16 (11252): No heartbeat from core client for 30 sec - exiting 09:33:17 (11252): No heartbeat from core client for 30 sec - exiting 09:33:18 (11252): No heartbeat from core client for 30 sec - exiting 09:33:19 (11252): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:41:42 (4160): No heartbeat from core client for 30 sec - exiting 07:41:44 (4160): No heartbeat from core client for 30 sec - exiting 07:41:45 (4160): No heartbeat from core client for 30 sec - exiting 07:41:46 (4160): No heartbeat from core client for 30 sec - exiting 07:41:47 (4160): No heartbeat from core client for 30 sec - exiting 07:41:48 (4160): No heartbeat from core client for 30 sec - exiting 07:41:49 (4160): No heartbeat from core client for 30 sec - exiting 07:41:50 (4160): No heartbeat from core client for 30 sec - exiting 07:41:51 (4160): No heartbeat from core client for 30 sec - exiting 07:41:52 (4160): No heartbeat from core client for 30 sec - exiting 07:41:53 (4160): No heartbeat from core client for 30 sec - exiting 07:41:55 (4160): No heartbeat from core client for 30 sec - exiting 07:41:56 (4160): No heartbeat from core client for 30 sec - exiting 07:41:57 (4160): No heartbeat from core client for 30 sec - exiting 07:41:58 (4160): No heartbeat from core client for 30 sec - exiting 07:41:59 (4160): No heartbeat from core client for 30 sec - exiting 07:42:00 (4160): No heartbeat from core client for 30 sec - exiting 07:42:01 (4160): No heartbeat from core client for 30 sec - exiting 07:42:02 (4160): No heartbeat from core client for 30 sec - exiting 07:42:03 (4160): No heartbeat from core client for 30 sec - exiting 07:42:04 (4160): No heartbeat from core client for 30 sec - exiting 07:42:05 (4160): No heartbeat from core client for 30 sec - exiting 07:42:06 (4160): No heartbeat from core client for 30 sec - exiting 07:42:07 (4160): No heartbeat from core client for 30 sec - exiting 07:42:08 (4160): No heartbeat from core client for 30 sec - exiting 07:42:10 (4160): No heartbeat from core client for 30 sec - exiting 07:42:11 (4160): No heartbeat from core client for 30 sec - exiting 07:42:12 (4160): No heartbeat from core client for 30 sec - exiting 07:42:13 (4160): No heartbeat from core client for 30 sec - exiting 07:42:14 (4160): No heartbeat from core client for 30 sec - exiting 07:42:15 (4160): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:45:55 (5028): No heartbeat from core client for 30 sec - exiting 07:45:57 (5028): No heartbeat from core client for 30 sec - exiting 07:45:58 (5028): No heartbeat from core client for 30 sec - exiting 07:45:59 (5028): No heartbeat from core client for 30 sec - exiting 07:46:00 (5028): No heartbeat from core client for 30 sec - exiting 07:46:01 (5028): No heartbeat from core client for 30 sec - exiting 07:46:02 (5028): No heartbeat from core client for 30 sec - exiting 07:46:03 (5028): No heartbeat from core client for 30 sec - exiting 07:46:04 (5028): No heartbeat from core client for 30 sec - exiting 07:46:05 (5028): No heartbeat from core client for 30 sec - exiting 07:46:06 (5028): No heartbeat from core client for 30 sec - exiting 07:46:07 (5028): No heartbeat from core client for 30 sec - exiting 07:46:09 (5028): No heartbeat from core client for 30 sec - exiting 07:46:10 (5028): No heartbeat from core client for 30 sec - exiting 07:46:11 (5028): No heartbeat from core client for 30 sec - exiting 07:46:12 (5028): No heartbeat from core client for 30 sec - exiting 07:46:13 (5028): No heartbeat from core client for 30 sec - exiting 07:46:14 (5028): No heartbeat from core client for 30 sec - exiting 07:46:15 (5028): No heartbeat from core client for 30 sec - exiting 07:46:16 (5028): No heartbeat from core client for 30 sec - exiting 07:46:17 (5028): No heartbeat from core client for 30 sec - exiting 07:46:18 (5028): No heartbeat from core client for 30 sec - exiting 07:46:19 (5028): No heartbeat from core client for 30 sec - exiting 07:46:20 (5028): No heartbeat from core client for 30 sec - exiting 07:46:21 (5028): No heartbeat from core client for 30 sec - exiting 07:46:23 (5028): No heartbeat from core client for 30 sec - exiting 07:46:24 (5028): No heartbeat from core client for 30 sec - exiting 07:46:25 (5028): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:49:10 (3920): No heartbeat from core client for 30 sec - exiting 07:49:11 (3920): No heartbeat from core client for 30 sec - exiting 07:49:12 (3920): No heartbeat from core client for 30 sec - exiting 07:49:14 (3920): No heartbeat from core client for 30 sec - exiting 07:49:15 (3920): No heartbeat from core client for 30 sec - exiting 07:49:16 (3920): No heartbeat from core client for 30 sec - exiting 07:49:17 (3920): No heartbeat from core client for 30 sec - exiting 07:49:18 (3920): No heartbeat from core client for 30 sec - exiting 07:49:19 (3920): No heartbeat from core client for 30 sec - exiting 07:49:20 (3920): No heartbeat from core client for 30 sec - exiting 07:49:21 (3920): No heartbeat from core client for 30 sec - exiting 07:49:22 (3920): No heartbeat from core client for 30 sec - exiting 07:49:23 (3920): No heartbeat from core client for 30 sec - exiting 07:49:24 (3920): No heartbeat from core client for 30 sec - exiting 07:49:26 (3920): No heartbeat from core client for 30 sec - exiting 07:49:27 (3920): No heartbeat from core client for 30 sec - exiting 07:49:28 (3920): No heartbeat from core client for 30 sec - exiting 07:49:29 (3920): No heartbeat from core client for 30 sec - exiting 07:49:30 (3920): No heartbeat from core client for 30 sec - exiting 07:49:31 (3920): No heartbeat from core client for 30 sec - exiting 07:49:32 (3920): No heartbeat from core client for 30 sec - exiting 07:49:33 (3920): No heartbeat from core client for 30 sec - exiting 07:49:34 (3920): No heartbeat from core client for 30 sec - exiting 07:49:35 (3920): No heartbeat from core client for 30 sec - exiting 07:49:37 (3920): No heartbeat from core client for 30 sec - exiting 07:49:38 (3920): No heartbeat from core client for 30 sec - exiting 07:49:39 (3920): No heartbeat from core client for 30 sec - exiting 07:49:40 (3920): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:04:36 (2184): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:05:18 (4924): No heartbeat from core client for 30 sec - exiting 09:05:19 (4924): No heartbeat from core client for 30 sec - exiting 09:05:20 (4924): No heartbeat from core client for 30 sec - exiting 09:05:21 (4924): No heartbeat from core client for 30 sec - exiting 09:05:22 (4924): No heartbeat from core client for 30 sec - exiting 09:05:23 (4924): No heartbeat from core client for 30 sec - exiting 09:05:24 (4924): No heartbeat from core client for 30 sec - exiting 09:05:25 (4924): No heartbeat from core client for 30 sec - exiting 09:05:26 (4924): No heartbeat from core client for 30 sec - exiting 09:05:27 (4924): No heartbeat from core client for 30 sec - exiting 09:05:28 (4924): No heartbeat from core client for 30 sec - exiting 09:05:29 (4924): No heartbeat from core client for 30 sec - exiting 09:05:30 (4924): No heartbeat from core client for 30 sec - exiting 09:05:31 (4924): No heartbeat from core client for 30 sec - exiting 09:05:32 (4924): No heartbeat from core client for 30 sec - exiting 09:05:33 (4924): No heartbeat from core client for 30 sec - exiting 09:05:34 (4924): No heartbeat from core client for 30 sec - exiting 09:05:35 (4924): No heartbeat from core client for 30 sec - exiting 09:05:36 (4924): No heartbeat from core client for 30 sec - exiting 09:05:37 (4924): No heartbeat from core client for 30 sec - exiting 09:05:38 (4924): No heartbeat from core client for 30 sec - exiting 09:05:39 (4924): No heartbeat from core client for 30 sec - exiting 09:05:40 (4924): No heartbeat from core client for 30 sec - exiting 09:05:41 (4924): No heartbeat from core client for 30 sec - exiting 09:05:42 (4924): No heartbeat from core client for 30 sec - exiting 09:05:43 (4924): No heartbeat from core client for 30 sec - exiting 09:05:44 (4924): No heartbeat from core client for 30 sec - exiting 09:05:45 (4924): No heartbeat from core client for 30 sec - exiting 09:05:46 (4924): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 11:02:09 (6696): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:02:43 (4992): No heartbeat from core client for 30 sec - exiting 11:02:44 (4992): No heartbeat from core client for 30 sec - exiting 11:02:45 (4992): No heartbeat from core client for 30 sec - exiting 11:02:46 (4992): No heartbeat from core client for 30 sec - exiting 11:02:47 (4992): No heartbeat from core client for 30 sec - exiting 11:02:48 (4992): No heartbeat from core client for 30 sec - exiting 11:02:49 (4992): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:53:15 (8680): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:53:50 (7292): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4196, iMonCtr=2 Model crash detected, will try to restart... 08:25:05 (5168): No heartbeat from core client for 30 sec - exiting 08:25:06 (5168): No heartbeat from core client for 30 sec - exiting 08:25:07 (5168): No heartbeat from core client for 30 sec - exiting 08:25:08 (5168): No heartbeat from core client for 30 sec - exiting 08:25:09 (5168): No heartbeat from core client for 30 sec - exiting 08:25:10 (5168): No heartbeat from core client for 30 sec - exiting 08:25:12 (5168): No heartbeat from core client for 30 sec - exiting 08:25:13 (5168): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5464, selfPID=5464, iMonCtr=2 10:17:28 (5532): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:31:36 (4648): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Model crashed: INITDUMP: Wrong no of atmos prognostic fields tmp/xaakm.pipe_dummy 2048 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2 13:32:10 (4764): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Model crashed: INITDUMP: Wrong no of atmos prognostic fields tmp/xaakm.pipe_dummy 2048 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2800, selfPID=2808, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_anz_l21c_2013_1_009735762_1_2.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_l21c_2013_1_009735762_1_3.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_l21c_2013_1_009735762_1_4.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_l21c_2013_1_009735762_1_5.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_l21c_2013_1_009735762_1_6.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_l21c_2013_1_009735762_1_7.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_l21c_2013_1_009735762_1_8.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_l21c_2013_1_009735762_1_9.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_l21c_2013_1_009735762_1_10.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_l21c_2013_1_009735762_1_11.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_l21c_2013_1_009735762_1_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
21 Apr 2015 09:32:39 | 1326216 | 18308743 | hadam3p_anz_l21c_2013_1_009735762_1 | 11,819 | 48,376 | 4.0931 |
©2024 cpdn.org