Name | hadam3p_eu_l31r_2013_1_008815176_0 |
Workunit | 8961105 |
Created | 8 Jul 2014, 9:02:50 UTC |
Sent | 31 Jul 2014, 11:27:56 UTC |
Report deadline | 13 Jul 2015, 16:47:56 UTC |
Received | 6 Aug 2014, 12:02:44 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1177123 |
Run time | 2 days 19 hours 55 min 21 sec |
CPU time | 2 days 0 hours 30 min 5 sec |
Validate state | Invalid |
Credit | 1,194.02 |
Device peak FLOPS | 3.20 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6832, selfPID=9952, iMonCtr=1 Model crash detected, will try to restart... 19:46:07 (8072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:46:08 (8072): No heartbeat from core client for 30 sec - exiting 19:46:09 (8072): No heartbeat from core client for 30 sec - exiting 19:46:10 (8072): No heartbeat from core client for 30 sec - exiting 19:46:11 (8072): No heartbeat from core client for 30 sec - exiting 19:46:12 (8072): No heartbeat from core client for 30 sec - exiting 19:46:13 (8072): No heartbeat from core client for 30 sec - exiting 19:46:14 (8072): No heartbeat from core client for 30 sec - exiting 19:46:15 (8072): No heartbeat from core client for 30 sec - exiting 19:46:16 (8072): No heartbeat from core client for 30 sec - exiting GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5552, iMonCtr=2 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 06:31:44 (7872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:31:45 (7872): No heartbeat from core client for 30 sec - exiting 06:31:46 (7872): No heartbeat from core client for 30 sec - exiting 06:31:47 (7872): No heartbeat from core client for 30 sec - exiting 06:31:48 (7872): No heartbeat from core client for 30 sec - exiting 06:31:49 (7872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7440, selfPID=7440, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1436, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4280, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4964, selfPID=4964, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5876, iMonCtr=2 02:59:13 (3340): No heartbeat from core client for 30 sec - exiting 02:59:14 (3340): No heartbeat from core client for 30 sec - exiting 02:59:15 (3340): No heartbeat from core client for 30 sec - exiting 02:59:16 (3340): No heartbeat from core client for 30 sec - exiting 02:59:17 (3340): No heartbeat from core client for 30 sec - exiting 02:59:18 (3340): No heartbeat from core client for 30 sec - exiting 02:59:19 (3340): No heartbeat from core client for 30 sec - exiting 02:59:20 (3340): No heartbeat from core client for 30 sec - exiting 02:59:21 (3340): No heartbeat from core client for 30 sec - exiting 02:59:22 (3340): No heartbeat from core client for 30 sec - exiting 02:59:23 (3340): No heartbeat from core client for 30 sec - exiting 02:59:24 (3340): No heartbeat from core client for 30 sec - exiting 02:59:25 (3340): No heartbeat from core client for 30 sec - exiting 02:59:27 (3340): No heartbeat from core client for 30 sec - exiting 02:59:28 (3340): No heartbeat from core client for 30 sec - exiting 02:59:29 (3340): No heartbeat from core client for 30 sec - exiting 02:59:30 (3340): No heartbeat from core client for 30 sec - exiting 02:59:31 (3340): No heartbeat from core client for 30 sec - exiting 02:59:32 (3340): No heartbeat from core client for 30 sec - exiting 02:59:33 (3340): No heartbeat from core client for 30 sec - exiting 02:59:34 (3340): No heartbeat 
from core client for 30 sec - exiting 02:59:35 (3340): No heartbeat from core client for 30 sec - exiting 02:59:36 (3340): No heartbeat from core client for 30 sec - exiting 02:59:37 (3340): No heartbeat from core client for 30 sec - exiting 02:59:38 (3340): No heartbeat from core client for 30 sec - exiting 02:59:39 (3340): No heartbeat from core client for 30 sec - exiting 02:59:40 (3340): No heartbeat from core client for 30 sec - exiting 02:59:41 (3340): No heartbeat from core client for 30 sec - exiting 02:59:42 (3340): No heartbeat from core client for 30 sec - exiting 02:59:43 (3340): No heartbeat from core client for 30 sec - exiting 02:59:44 (3340): No heartbeat from core client for 30 sec - exiting 02:59:45 (3340): No heartbeat from core client for 30 sec - exiting 02:59:46 (3340): No heartbeat from core client for 30 sec - exiting 02:59:47 (3340): No heartbeat from core client for 30 sec - exiting 02:59:48 (3340): No heartbeat from core client for 30 sec - exiting 02:59:49 (3340): No heartbeat from core client for 30 sec - exiting 02:59:50 (3340): No heartbeat from core client for 30 sec - exiting 02:59:51 (3340): No heartbeat from core client for 30 sec - exiting 02:59:52 (3340): No heartbeat from core client for 30 sec - exiting 02:59:53 (3340): No heartbeat from core client for 30 sec - exiting 02:59:54 (3340): No heartbeat from core client for 30 sec - exiting 02:59:55 (3340): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 05:20:23 (7924): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
05:20:24 (7924): No heartbeat from core client for 30 sec - exiting 05:20:25 (7924): No heartbeat from core client for 30 sec - exiting 05:20:26 (7924): No heartbeat from core client for 30 sec - exiting 05:20:27 (7924): No heartbeat from core client for 30 sec - exiting 05:20:28 (7924): No heartbeat from core client for 30 sec - exiting 05:20:29 (7924): No heartbeat from core client for 30 sec - exiting 05:20:30 (7924): No heartbeat from core client for 30 sec - exiting 05:20:31 (7924): No heartbeat from core client for 30 sec - exiting 05:20:32 (7924): No heartbeat from core client for 30 sec - exiting 05:20:33 (7924): No heartbeat from core client for 30 sec - exiting 05:20:34 (7924): No heartbeat from core client for 30 sec - exiting 05:20:35 (7924): No heartbeat from core client for 30 sec - exiting 05:20:36 (7924): No heartbeat from core client for 30 sec - exiting Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7540, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5800, iMonCtr=2 Model crash detected, will try to restart... 08:10:52 (3424): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
08:10:53 (3424): No heartbeat from core client for 30 sec - exiting 08:10:54 (3424): No heartbeat from core client for 30 sec - exiting 08:10:55 (3424): No heartbeat from core client for 30 sec - exiting 08:10:56 (3424): No heartbeat from core client for 30 sec - exiting 08:10:57 (3424): No heartbeat from core client for 30 sec - exiting 08:10:58 (3424): No heartbeat from core client for 30 sec - exiting 08:10:59 (3424): No heartbeat from core client for 30 sec - exiting 08:11:00 (3424): No heartbeat from core client for 30 sec - exiting 08:11:01 (3424): No heartbeat from core client for 30 sec - exiting 08:11:02 (3424): No heartbeat from core client for 30 sec - exiting 08:11:03 (3424): No heartbeat from core client for 30 sec - exiting 08:11:04 (3424): No heartbeat from core client for 30 sec - exiting 08:11:05 (3424): No heartbeat from core client for 30 sec - exiting 08:11:06 (3424): No heartbeat from core client for 30 sec - exiting 08:11:07 (3424): No heartbeat from core client for 30 sec - exiting 08:11:08 (3424): No heartbeat from core client for 30 sec - exiting 08:11:09 (3424): No heartbeat from core client for 30 sec - exiting 08:11:10 (3424): No heartbeat from core client for 30 sec - exiting 08:11:11 (3424): No heartbeat from core client for 30 sec - exiting 08:11:12 (3424): No heartbeat from core client for 30 sec - exiting 08:11:13 (3424): No heartbeat from core client for 30 sec - exiting 08:11:14 (3424): No heartbeat from core client for 30 sec - exiting 08:11:15 (3424): No heartbeat from core client for 30 sec - exiting 08:11:16 (3424): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 04:08:49 (7280): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
04:08:50 (7280): No heartbeat from core client for 30 sec - exiting 04:08:51 (7280): No heartbeat from core client for 30 sec - exiting 04:08:52 (7280): No heartbeat from core client for 30 sec - exiting 04:08:53 (7280): No heartbeat from core client for 30 sec - exiting 04:08:54 (7280): No heartbeat from core client for 30 sec - exiting 04:08:55 (7280): No heartbeat from core client for 30 sec - exiting 04:33:15 (8320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:33:16 (8320): No heartbeat from core client for 30 sec - exiting 04:33:17 (8320): No heartbeat from core client for 30 sec - exiting 04:33:18 (8320): No heartbeat from core client for 30 sec - exiting 04:33:19 (8320): No heartbeat from core client for 30 sec - exiting 04:33:20 (8320): No heartbeat from core client for 30 sec - exiting 08:07:11 (7828): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:07:44 (7828): No heartbeat from core client for 30 sec - exiting 08:07:45 (7828): No heartbeat from core client for 30 sec - exiting 08:07:46 (7828): No heartbeat from core client for 30 sec - exiting 08:07:47 (7828): No heartbeat from core client for 30 sec - exiting 08:07:48 (7828): No heartbeat from core client for 30 sec - exiting 08:07:49 (7828): No heartbeat from core client for 30 sec - exiting 08:07:50 (7828): No heartbeat from core client for 30 sec - exiting 08:07:51 (7828): No heartbeat from core client for 30 sec - exiting 08:07:52 (7828): No heartbeat from core client for 30 sec - exiting 08:07:53 (7828): No heartbeat from core client for 30 sec - exiting 08:07:54 (7828): No heartbeat from core client for 30 sec - exiting 08:07:55 (7828): No heartbeat from core client for 30 sec - exiting 08:07:56 (7828): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1556, selfPID=1556, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=752, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6744, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 03:47:31 (6240): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:47:32 (6240): No heartbeat from core client for 30 sec - exiting 03:47:33 (6240): No heartbeat from core client for 30 sec - exiting 03:47:34 (6240): No heartbeat from core client for 30 sec - exiting 03:47:35 (6240): No heartbeat from core client for 30 sec - exiting 03:47:36 (6240): No heartbeat from core client for 30 sec - exiting 07:15:55 (3384): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:15:56 (3384): No heartbeat from core client for 30 sec - exiting 07:15:57 (3384): No heartbeat from core client for 30 sec - exiting 07:15:58 (3384): No heartbeat from core client for 30 sec - exiting 07:15:59 (3384): No heartbeat from core client for 30 sec - exiting 07:16:00 (3384): No heartbeat from core client for 30 sec - exiting 07:16:01 (3384): No heartbeat from core client for 30 sec - exiting 07:16:02 (3384): No heartbeat from core client for 30 sec - exiting 07:16:03 (3384): No heartbeat from core client for 30 sec - exiting 07:16:04 (3384): No heartbeat from core client for 30 sec - exiting 07:17:54 (4028): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
07:17:55 (4028): No heartbeat from core client for 30 sec - exiting 07:17:56 (4028): No heartbeat from core client for 30 sec - exiting 07:17:57 (4028): No heartbeat from core client for 30 sec - exiting 07:17:58 (4028): No heartbeat from core client for 30 sec - exiting 07:22:16 (8156): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:22:17 (8156): No heartbeat from core client for 30 sec - exiting 07:22:18 (8156): No heartbeat from core client for 30 sec - exiting 07:22:19 (8156): No heartbeat from core client for 30 sec - exiting 07:22:20 (8156): No heartbeat from core client for 30 sec - exiting 07:22:21 (8156): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakm.pipe_dummy 2048 Leaving CPDN_Main::Monitor... 
Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_l31r_2013_1_008815176_0_7.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_l31r_2013_1_008815176_0_8.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_l31r_2013_1_008815176_0_9.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_l31r_2013_1_008815176_0_10.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_l31r_2013_1_008815176_0_11.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_l31r_2013_1_008815176_0_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
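The stderr output above repeats the same "No heartbeat from core client" message many times, each tagged with a wall-clock time and a process ID in parentheses. To see how many of these messages each process emitted, one option is to tally them per PID. The following is a minimal sketch, not part of BOINC or CPDN: the function name `count_heartbeat_losses` and the short `sample` string (copied from the log above) are illustrative assumptions, and the pattern tolerates arbitrary whitespace because the scraped log wraps in the middle of messages.

```python
import re
from collections import Counter

# Matches e.g. "19:46:07 (8072): No heartbeat from core client".
# \s+ is used between words because the scraped log may wrap mid-message.
HEARTBEAT_RE = re.compile(
    r"(\d{2}:\d{2}:\d{2})\s+\((\d+)\):\s+No\s+heartbeat\s+from\s+core\s+client"
)


def count_heartbeat_losses(stderr_text):
    """Return a Counter mapping PID (as a string) to its 'No heartbeat' line count."""
    return Counter(pid for _time, pid in HEARTBEAT_RE.findall(stderr_text))


# Short excerpt taken from the stderr text above (hypothetical test input).
sample = (
    "19:46:07 (8072): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
    "19:46:08 (8072): No heartbeat from core client for 30 sec - exiting "
    "06:31:44 (7872): No heartbeat\nfrom core client for 30 sec - exiting"
)

counts = count_heartbeat_losses(sample)
for pid, n in counts.most_common():
    print(f"PID {pid}: {n} heartbeat-loss message(s)")
```

Passing the full stderr text instead of `sample` would give the per-process totals for this task; the "CPDN Monitor - No 'heartbeat' from BOINC..." lines carry no PID and are deliberately not counted.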
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
05 Aug 2014 04:56:38 | 1177123 | 16733780 | hadam3p_eu_l31r_2013_1_008815176_0 | 69,216 | 151,533 | 2.1893 |
04 Aug 2014 08:21:29 | 1177123 | 16733780 | hadam3p_eu_l31r_2013_1_008815176_0 | 57,696 | 126,372 | 2.1903 |
03 Aug 2014 20:34:30 | 1177123 | 16733780 | hadam3p_eu_l31r_2013_1_008815176_0 | 46,176 | 101,428 | 2.1966 |
03 Aug 2014 07:05:44 | 1177123 | 16733780 | hadam3p_eu_l31r_2013_1_008815176_0 | 34,656 | 76,533 | 2.2084 |
02 Aug 2014 20:11:42 | 1177123 | 16733780 | hadam3p_eu_l31r_2013_1_008815176_0 | 23,136 | 49,670 | 2.1469 |
01 Aug 2014 15:00:16 | 1177123 | 16733780 | hadam3p_eu_l31r_2013_1_008815176_0 | 11,616 | 24,467 | 2.1063 |
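The "Average (sec/TS)" column in the table above appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The sketch below checks that reading against the six rows shown; the names `TRICKLES` and `seconds_per_timestep` are illustrative, and the formula is an inference from the table values rather than something the page states.

```python
# (timestep, cpu_time_sec, published_average) copied from the trickle table,
# listed oldest first.
TRICKLES = [
    (11_616, 24_467, 2.1063),
    (23_136, 49_670, 2.1469),
    (34_656, 76_533, 2.2084),
    (46_176, 101_428, 2.1966),
    (57_696, 126_372, 2.1903),
    (69_216, 151_533, 2.1893),
]


def seconds_per_timestep(timestep, cpu_seconds):
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals."""
    return round(cpu_seconds / timestep, 4)


computed = [seconds_per_timestep(ts, cpu) for ts, cpu, _ in TRICKLES]

for (ts, cpu, published), value in zip(TRICKLES, computed):
    print(f"timestep {ts:>6}: computed {value:.4f}, published {published:.4f}")
```

Under this assumption, every computed value agrees with the published column to four decimal places.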
©2024 cpdn.org