Name | hadam3p_eu_j5pr_2013_1_008786883_0 |
Workunit | 8932861 |
Created | 4 Jul 2014, 19:31:01 UTC |
Sent | 4 Jul 2014, 19:31:19 UTC |
Report deadline | 17 Jun 2015, 0:51:19 UTC |
Received | 22 Jul 2014, 16:25:34 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1255898 |
Run time | 2 days 10 hours 45 min 41 sec |
CPU time | 2 days 8 hours 58 min 3 sec |
Validate state | Invalid |
Credit | 1,790.21 |
Device peak FLOPS | 2.83 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6188, selfPID=1576, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2492, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4628, selfPID=4748, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4276, selfPID=4276, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2796, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6728, iMonCtr=2 07:30:45 (5708): No heartbeat from core client for 30 sec - exiting 07:30:46 (5708): No heartbeat from core client for 30 sec - exiting 07:30:47 (5708): No heartbeat from core client for 30 sec - exiting 07:30:48 (5708): No heartbeat from core client for 30 sec - exiting 07:30:49 (5708): No heartbeat from core client for 30 sec - exiting 07:30:50 (5708): No heartbeat from core client for 30 sec - exiting 07:30:51 (5708): No heartbeat from core client for 30 sec - exiting 07:30:52 (5708): No heartbeat from core client for 30 sec - exiting 07:30:53 (5708): No heartbeat from core client for 30 sec - exiting 07:30:54 (5708): No heartbeat from core client for 30 sec - exiting 07:30:55 (5708): No heartbeat from core client for 30 sec - exiting 07:30:56 (5708): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:48:11 (4888): No heartbeat from core client for 30 sec - exiting 19:48:12 (4888): No heartbeat from core client for 30 sec - exiting 19:48:13 (4888): No heartbeat from core client for 30 sec - exiting 19:48:14 (4888): No heartbeat from core client for 30 sec - exiting 19:48:15 (4888): No heartbeat from core client for 30 sec - exiting 19:48:16 (4888): No heartbeat from core client for 30 sec - exiting 19:48:17 (4888): No heartbeat from core client for 30 sec - exiting 19:48:18 (4888): No heartbeat from core client for 30 sec - exiting 19:48:19 (4888): No heartbeat from core client for 30 sec - exiting 19:49:44 (4888): No heartbeat from core client for 30 sec - exiting 19:49:45 (4888): No heartbeat from core client for 30 sec - exiting 19:49:46 (4888): No heartbeat from core client for 30 sec - exiting 19:49:47 (4888): No heartbeat from core client for 30 sec - exiting 19:49:48 (4888): No heartbeat from core client for 30 sec - exiting 19:49:49 (4888): No heartbeat from core client for 30 sec - exiting 19:49:50 (4888): No heartbeat from core client for 30 sec - exiting 19:49:51 (4888): No heartbeat from core client for 30 sec - exiting 19:49:52 (4888): No heartbeat from core client for 30 sec - exiting 19:49:53 (4888): No heartbeat from core client for 30 sec - exiting 19:49:54 (4888): No heartbeat from core client for 30 sec - exiting 19:49:55 (4888): No heartbeat from core client for 30 sec - exiting 19:49:56 (4888): No heartbeat from core client for 30 sec - exiting 19:49:57 (4888): No heartbeat from core client for 30 sec - exiting 19:49:58 (4888): No heartbeat from core client for 30 sec - exiting 19:49:59 (4888): No heartbeat from core client for 30 sec - exiting 19:50:00 (4888): No heartbeat from core client for 30 sec - exiting 19:50:01 (4888): No heartbeat from core client for 30 sec - exiting 19:50:02 (4888): No heartbeat from core client for 30 sec - exiting 19:50:03 (4888): No heartbeat from core client for 30 sec - exiting 19:50:04 (4888): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:11:41 (5796): No heartbeat from core client for 30 sec - exiting 12:11:42 (5796): No heartbeat from core client for 30 sec - exiting 12:11:43 (5796): No heartbeat from core client for 30 sec - exiting 12:11:44 (5796): No heartbeat from core client for 30 sec - exiting 12:11:45 (5796): No heartbeat from core client for 30 sec - exiting 12:11:46 (5796): No heartbeat from core client for 30 sec - exiting 12:11:47 (5796): No heartbeat from core client for 30 sec - exiting 12:11:48 (5796): No heartbeat from core client for 30 sec - exiting 12:11:49 (5796): No heartbeat from core client for 30 sec - exiting 12:11:50 (5796): No heartbeat from core client for 30 sec - exiting 12:11:51 (5796): No heartbeat from core client for 30 sec - exiting 12:11:52 (5796): No heartbeat from core client for 30 sec - exiting 12:11:53 (5796): No heartbeat from core client for 30 sec - exiting 12:11:54 (5796): No heartbeat from core client for 30 sec - exiting 12:11:55 (5796): No heartbeat from core client for 30 sec - exiting 12:11:56 (5796): No heartbeat from core client for 30 sec - exiting 12:11:57 (5796): No heartbeat from core client for 30 sec - exiting 12:11:58 (5796): No heartbeat from core client for 30 sec - exiting 12:11:59 (5796): No heartbeat from core client for 30 sec - exiting 12:12:00 (5796): No heartbeat from core client for 30 sec - exiting 12:12:01 (5796): No heartbeat from core client for 30 sec - exiting 12:12:02 (5796): No heartbeat from core client for 30 sec - exiting 12:12:03 (5796): No heartbeat from core client for 30 sec - exiting 12:12:04 (5796): No heartbeat from core client for 30 sec - exiting 12:12:05 (5796): No heartbeat from core client for 30 sec - exiting 12:12:06 (5796): No heartbeat from core client for 30 sec - exiting 12:12:07 (5796): No heartbeat from core client for 30 sec - exiting 12:12:08 (5796): No heartbeat from core client for 30 sec - exiting 12:12:09 (5796): No heartbeat from core client for 30 sec - exiting 12:12:10 (5796): No heartbeat from core client for 30 sec - exiting 12:12:11 (5796): No heartbeat from core client for 30 sec - exiting 12:12:12 (5796): No heartbeat from core client for 30 sec - exiting 12:13:14 (5796): No heartbeat from core client for 30 sec - exiting 12:13:15 (5796): No heartbeat from core client for 30 sec - exiting 12:13:16 (5796): No heartbeat from core client for 30 sec - exiting 12:13:17 (5796): No heartbeat from core client for 30 sec - exiting 12:13:18 (5796): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:13:19 (5796): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7208, selfPID=7208, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2888, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1932, selfPID=3684, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 00:11:29 (5888): No heartbeat from core client for 30 sec - exiting 00:11:31 (5888): No heartbeat from core client for 30 sec - exiting 00:11:32 (5888): No heartbeat from core client for 30 sec - exiting 00:11:33 (5888): No heartbeat from core client for 30 sec - exiting 00:11:34 (5888): No heartbeat from core client for 30 sec - exiting 00:11:35 (5888): No heartbeat from core client for 30 sec - exiting 00:11:36 (5888): No heartbeat from core client for 30 sec - exiting 00:11:37 (5888): No heartbeat from core client for 30 sec - exiting 00:11:38 (5888): No heartbeat from core client for 30 sec - exiting 00:11:39 (5888): No heartbeat from core client for 30 sec - exiting 00:11:40 (5888): No heartbeat from core client for 30 sec - exiting 00:11:41 (5888): No heartbeat from core client for 30 sec - exiting 00:11:42 (5888): No heartbeat from core client for 30 sec - exiting 00:11:43 (5888): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:27:59 (5820): No heartbeat from core client for 30 sec - exiting 00:28:00 (5820): No heartbeat from core client for 30 sec - exiting 00:28:01 (5820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:28:53 (1524): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:29:21 (5940): No heartbeat from core client for 30 sec - exiting 16:29:22 (5940): No heartbeat from core client for 30 sec - exiting 16:29:23 (5940): No heartbeat from core client for 30 sec - exiting 16:29:24 (5940): No heartbeat from core client for 30 sec - exiting 16:29:25 (5940): No heartbeat from core client for 30 sec - exiting 16:29:26 (5940): No heartbeat from core client for 30 sec - exiting 16:29:27 (5940): No heartbeat from core client for 30 sec - exiting 16:29:28 (5940): No heartbeat from core client for 30 sec - exiting 16:29:29 (5940): No heartbeat from core client for 30 sec - exiting 16:29:30 (5940): No heartbeat from core client for 30 sec - exiting 16:29:31 (5940): No heartbeat from core client for 30 sec - exiting 16:29:32 (5940): No heartbeat from core client for 30 sec - exiting 16:29:33 (5940): No heartbeat from core client for 30 sec - exiting 16:29:34 (5940): No heartbeat from core client for 30 sec - exiting 16:29:35 (5940): No heartbeat from core client for 30 sec - exiting 16:29:36 (5940): No heartbeat from core client for 30 sec - exiting 16:29:37 (5940): No heartbeat from core client for 30 sec - exiting 16:29:38 (5940): No heartbeat from core client for 30 sec - exiting 16:29:39 (5940): No heartbeat from core client for 30 sec - exiting 16:29:40 (5940): No heartbeat from core client for 30 sec - exiting 16:29:41 (5940): No heartbeat from core client for 30 sec - exiting 16:29:42 (5940): No heartbeat from core client for 30 sec - exiting 16:29:43 (5940): No heartbeat from core client for 30 sec - exiting 16:29:44 (5940): No heartbeat from core client for 30 sec - exiting 16:29:45 (5940): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:29:46 (5940): No heartbeat from core client for 30 sec - exiting CGntrolleral Workerpro CPDN process is not running, exiting, bRetVal = 1, checkPID=0, sel0fPID=6132, i Model cra sh detected, will try to restart... 21:59:06 (5684): No heartbeat from core client for 30 sec - exiting 21:59:07 (5684): No heartbeat from core client for 30 sec - exiting 21:59:08 (5684): No heartbeat from core client for 30 sec - exiting 21:59:09 (5684): No heartbeat from core client for 30 sec - exiting 21:59:10 (5684): No heartbeat from core client for 30 sec - exiting 21:59:11 (5684): No heartbeat from core client for 30 sec - exiting 21:59:12 (5684): No heartbeat from core client for 30 sec - exiting 21:59:13 (5684): No heartbeat from core client for 30 sec - exiting 21:59:14 (5684): No heartbeat from core client for 30 sec - exiting 21:59:15 (5684): No heartbeat from core client for 30 sec - exiting 21:59:16 (5684): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5384, iMonCtr=2 Model crash detected, will try to restart... 11:24:08 (5820): No heartbeat from core client for 30 sec - exiting 11:24:09 (5820): No heartbeat from core client for 30 sec - exiting 11:24:10 (5820): No heartbeat from core client for 30 sec - exiting 11:24:11 (5820): No heartbeat from core client for 30 sec - exiting 11:25:24 (5820): No heartbeat from core client for 30 sec - exiting 11:25:25 (5820): No heartbeat from core client for 30 sec - exiting 11:25:26 (5820): No heartbeat from core client for 30 sec - exiting 11:25:27 (5820): No heartbeat from core client for 30 sec - exiting 11:25:28 (5820): No heartbeat from core client for 30 sec - exiting 11:25:29 (5820): No heartbeat from core client for 30 sec - exiting 11:25:30 (5820): No heartbeat from core client for 30 sec - exiting 11:25:31 (5820): No heartbeat from core client for 30 sec - exiting 11:25:32 (5820): No heartbeat from core client for 30 sec - exiting 11:25:33 (5820): No heartbeat from core client for 30 sec - exiting 11:25:34 (5820): No heartbeat from core client for 30 sec - exiting 11:25:35 (5820): No heartbeat from core client for 30 sec - exiting 11:25:36 (5820): No heartbeat from core client for 30 sec - exiting 11:25:37 (5820): No heartbeat from core client for 30 sec - exiting 11:25:38 (5820): No heartbeat from core client for 30 sec - exiting 11:25:39 (5820): No heartbeat from core client for 30 sec - exiting 11:25:40 (5820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=4116, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2248, selfPID=3296, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_j5pr_2013_1_008786883_0_10.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_j5pr_2013_1_008786883_0_11.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_j5pr_2013_1_008786883_0_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
20 Jul 2014 15:41:23 | 1255898 | 16701882 | hadam3p_eu_j5pr_2013_1_008786883_0 | 103,776 | 202,501 | 1.9513 |
19 Jul 2014 20:46:21 | 1255898 | 16701882 | hadam3p_eu_j5pr_2013_1_008786883_0 | 92,256 | 180,238 | 1.9537 |
17 Jul 2014 19:17:22 | 1255898 | 16701882 | hadam3p_eu_j5pr_2013_1_008786883_0 | 80,736 | 158,217 | 1.9597 |
15 Jul 2014 19:41:26 | 1255898 | 16701882 | hadam3p_eu_j5pr_2013_1_008786883_0 | 69,216 | 135,474 | 1.9573 |
13 Jul 2014 19:52:08 | 1255898 | 16701882 | hadam3p_eu_j5pr_2013_1_008786883_0 | 57,696 | 112,671 | 1.9528 |
12 Jul 2014 12:54:53 | 1255898 | 16701882 | hadam3p_eu_j5pr_2013_1_008786883_0 | 46,176 | 90,333 | 1.9563 |
11 Jul 2014 19:22:57 | 1255898 | 16701882 | hadam3p_eu_j5pr_2013_1_008786883_0 | 34,656 | 67,815 | 1.9568 |
10 Jul 2014 17:02:22 | 1255898 | 16701882 | hadam3p_eu_j5pr_2013_1_008786883_0 | 23,136 | 44,929 | 1.9420 |
07 Jul 2014 19:23:05 | 1255898 | 16701882 | hadam3p_eu_j5pr_2013_1_008786883_0 | 11,616 | 22,462 | 1.9337 |
©2024 cpdn.org