Name | hadam3p_saf_2agm_1980_1_007403272_2 |
Workunit | 7600702 |
Created | 18 Aug 2011, 15:01:04 UTC |
Sent | 18 Aug 2011, 15:37:24 UTC |
Report deadline | 30 Jul 2012, 20:57:24 UTC |
Received | 1 Dec 2011, 16:30:31 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1107482 |
Run time | 1 days 5 hours 55 min 43 sec |
CPU time | 4 hours 18 min 50 sec |
Validate state | Invalid |
Credit | 562.19 |
Device peak FLOPS | 2.94 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.09 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4324, selfPID=4324, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3292, selfPID=6704, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2992, selfPID=4980, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 09:33:26 (2964): No heartbeat from core client for 30 sec - exiting 09:33:27 (2964): No heartbeat from core client for 30 sec - exiting 09:33:28 (2964): No heartbeat from core client for 30 sec - exiting 09:33:29 (2964): No heartbeat from core client for 30 sec - exiting 15:56:27 (3904): No heartbeat from core client for 30 sec - exiting 15:56:28 (3904): No heartbeat from core client for 30 sec - exiting 15:56:29 (3904): No heartbeat from core client for 30 sec - exiting 15:56:30 (3904): No heartbeat from core client for 30 sec - exiting 15:56:31 (3904): No heartbeat from core client for 30 sec - exiting 15:56:32 (3904): No heartbeat from core client for 30 sec - exiting 15:56:33 (3904): No heartbeat from core client for 30 sec - exiting 15:56:34 (3904): No heartbeat from core client for 30 sec - exiting 15:56:36 (3904): No heartbeat from core client for 30 sec - exiting 15:56:37 (3904): No heartbeat from core client for 30 sec - exiting 15:56:38 (3904): No heartbeat from core client for 30 sec - exiting 15:56:39 (3904): No heartbeat from core client for 30 sec - exiting 15:56:40 (3904): No heartbeat from core client for 30 sec - exiting 15:56:41 (3904): No heartbeat from core client for 30 sec - exiting 15:56:42 (3904): No heartbeat from core client for 30 sec - exiting 15:56:43 (3904): No heartbeat from core client for 30 sec - exiting 15:56:44 (3904): No heartbeat from core client for 30 sec - exiting 15:56:45 (3904): No heartbeat from core client for 30 sec - exiting 15:56:46 (3904): No heartbeat from core client for 30 sec - exiting 15:56:47 (3904): No heartbeat from core client for 30 sec - exiting 15:56:48 (3904): No heartbeat from core client for 30 sec - exiting 15:56:49 (3904): No heartbeat from core client for 30 sec - exiting 15:56:50 (3904): No heartbeat from core client for 30 sec - exiting 15:56:51 (3904): No heartbeat from core client for 30 sec - exiting 15:56:52 (3904): No heartbeat from core client for 30 sec - exiting 15:56:53 (3904): No heartbeat from core client for 30 sec - exiting 15:56:54 (3904): No heartbeat from core client for 30 sec - exiting 15:56:55 (3904): No heartbeat from core client for 30 sec - exiting 15:56:56 (3904): No heartbeat from core client for 30 sec - exiting 15:56:57 (3904): No heartbeat from core client for 30 sec - exiting 15:56:58 (3904): No heartbeat from core client for 30 sec - exiting 15:56:59 (3904): No heartbeat from core client for 30 sec - exiting 15:57:00 (3904): No heartbeat from core client for 30 sec - exiting 15:57:01 (3904): No heartbeat from core client for 30 sec - exiting 15:57:02 (3904): No heartbeat from core client for 30 sec - exiting 15:57:03 (3904): No heartbeat from core client for 30 sec - exiting 15:57:04 (3904): No heartbeat from core client for 30 sec - exiting 15:57:05 (3904): No heartbeat from core client for 30 sec - exiting 15:57:06 (3904): No heartbeat from core client for 30 sec - exiting 15:57:07 (3904): No heartbeat from core client for 30 sec - exiting 15:57:08 (3904): No heartbeat from core client for 30 sec - exiting 15:57:09 (3904): No heartbeat from core client for 30 sec - exiting 15:57:10 (3904): No heartbeat from core client for 30 sec - exiting 15:57:11 (3904): No heartbeat from core client for 30 sec - exiting 15:57:12 (3904): No heartbeat from core client for 30 sec - exiting 15:57:13 (3904): No heartbeat from core client for 30 sec - exiting 15:57:14 (3904): No heartbeat from core client for 30 sec - exiting 15:57:15 (3904): No heartbeat from core client for 30 sec - exiting 15:57:16 (3904): No heartbeat from core client for 30 sec - exiting 15:57:17 (3904): No heartbeat from core client for 30 sec - exiting 15:57:18 (3904): No heartbeat from core client for 30 sec - exiting 15:57:19 (3904): No heartbeat from core client for 30 sec - exiting 15:57:20 (3904): No heartbeat from core client for 30 sec - exiting 15:57:21 (3904): No heartbeat from core client for 30 sec - exiting 15:57:22 (3904): No heartbeat from core client for 30 sec - exiting 15:57:23 (3904): No heartbeat from core client for 30 sec - exiting 15:57:24 (3904): No heartbeat from core client for 30 sec - exiting 15:57:26 (3904): No heartbeat from core client for 30 sec - exiting 15:57:27 (3904): No heartbeat from core client for 30 sec - exiting 15:57:28 (3904): No heartbeat from core client for 30 sec - exiting 15:57:29 (3904): No heartbeat from core client for 30 sec - exiting 15:57:30 (3904): No heartbeat from core client for 30 sec - exiting 15:57:31 (3904): No heartbeat from core client for 30 sec - exiting 15:57:32 (3904): No heartbeat from core client for 30 sec - exiting 15:57:33 (3904): No heartbeat from core client for 30 sec - exiting 15:57:34 (3904): No heartbeat from core client for 30 sec - exiting 15:57:35 (3904): No heartbeat from core client for 30 sec - exiting 15:57:36 (3904): No heartbeat from core client for 30 sec - exiting 15:57:37 (3904): No heartbeat from core client for 30 sec - exiting 15:57:38 (3904): No heartbeat from core client for 30 sec - exiting 15:57:39 (3904): No heartbeat from core client for 30 sec - exiting 15:57:40 (3904): No heartbeat from core client for 30 sec - exiting 15:57:41 (3904): No heartbeat from core client for 30 sec - exiting 15:57:42 (3904): No heartbeat from core client for 30 sec - exiting 15:57:43 (3904): No heartbeat from core client for 30 sec - exiting 15:57:44 (3904): No heartbeat from core client for 30 sec - exiting 15:57:45 (3904): No heartbeat from core client for 30 sec - exiting 15:57:46 (3904): No heartbeat from core client for 30 sec - exiting 15:57:47 (3904): No heartbeat from core client for 30 sec - exiting 15:57:48 (3904): No heartbeat from core client for 30 sec - exiting 15:57:49 (3904): No heartbeat from core client for 30 sec - exiting 15:57:50 (3904): No heartbeat from core client for 30 sec - exiting 15:57:51 (3904): No heartbeat from core client for 30 sec - exiting 15:57:52 (3904): No heartbeat from core client for 30 sec - exiting 15:57:53 (3904): No heartbeat from core client for 30 sec - exiting 15:57:54 (3904): No heartbeat from core client for 30 sec - exiting 15:57:55 (3904): No heartbeat from core client for 30 sec - exiting 15:57:56 (3904): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:57:57 (3904): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 13:15:07 (4284): No heartbeat from core client for 30 sec - exiting 13:15:08 (4284): No heartbeat from core client for 30 sec - exiting 13:15:09 (4284): No heartbeat from core client for 30 sec - exiting 13:15:10 (4284): No heartbeat from core client for 30 sec - exiting 13:15:11 (4284): No heartbeat from core client for 30 sec - exiting 13:15:12 (4284): No heartbeat from core client for 30 sec - exiting 13:15:13 (4284): No heartbeat from core client for 30 sec - exiting 13:15:14 (4284): No heartbeat from core client for 30 sec - exiting 13:15:15 (4284): No heartbeat from core client for 30 sec - exiting 13:15:16 (4284): No heartbeat from core client for 30 sec - exiting 13:15:17 (4284): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:15:18 (4284): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4444, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3424, selfPID=4940, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1160, selfPID=5596, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2716, selfPID=464, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3688, selfPID=3688, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1132, selfPID=1132, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6960, selfPID=6960, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6112, selfPID=6112, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4552, selfPID=4552, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1556, selfPID=1556, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5552, selfPID=5552, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4092, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=968, selfPID=968, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_saf_2agm_1980_1_007403272_2_4.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_2agm_1980_1_007403272_2_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_2agm_1980_1_007403272_2_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_2agm_1980_1_007403272_2_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_2agm_1980_1_007403272_2_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_2agm_1980_1_007403272_2_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_2agm_1980_1_007403272_2_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_2agm_1980_1_007403272_2_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_2agm_1980_1_007403272_2_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
25 Nov 2011 15:12:35 | 1107482 | 13274941 | hadam3p_saf_2agm_1980_1_007403272_2 | 34,656 | 76,497 | 2.2073 |
24 Nov 2011 10:52:12 | 1107482 | 13274941 | hadam3p_saf_2agm_1980_1_007403272_2 | 23,136 | 54,754 | 2.3666 |
21 Nov 2011 14:45:05 | 1107482 | 13274941 | hadam3p_saf_2agm_1980_1_007403272_2 | 11,616 | 33,106 | 2.8500 |
©2024 cpdn.org