Name | hadam3p_eu_a3mw_1987_1_007787336_0 |
Workunit | 7942445 |
Created | 20 Feb 2012, 21:42:06 UTC |
Sent | 3 Mar 2012, 17:56:47 UTC |
Report deadline | 13 Feb 2013, 23:16:47 UTC |
Received | 26 Mar 2012, 20:02:15 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1038991 |
Run time | 3 days 5 hours 33 min 50 sec |
CPU time | 2 days 17 hours 40 min 24 sec |
Validate state | Invalid |
Credit | 1,194.02 |
Device peak FLOPS | 2.05 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7156, selfPID=7156, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5868, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3280, selfPID=3280, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4132, selfPID=4132, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1336, selfPID=1336, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 17:29:53 (6184): No heartbeat from core client for 30 sec - exiting 17:29:54 (6184): No heartbeat from core client for 30 sec - exiting 17:29:55 (6184): No heartbeat from core client for 30 sec - exiting 17:29:56 (6184): No heartbeat from core client for 30 sec - exiting 17:29:57 (6184): No heartbeat from core client for 30 sec - exiting 17:29:58 (6184): No heartbeat from core client for 30 sec - exiting 17:29:59 (6184): No heartbeat from core client for 30 sec - exiting 17:30:00 (6184): No heartbeat from core client for 30 sec - exiting 17:30:02 (6184): No heartbeat from core client for 30 sec - exiting 17:30:03 (6184): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 14:46:18 (2176): No heartbeat from core client for 30 sec - exiting 14:46:19 (2176): No heartbeat from core client for 30 sec - exiting 14:46:20 (2176): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7616, selfPID=7616, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3308, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6080, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4236, selfPID=3104, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... 23:42:24 (5784): No heartbeat from core client for 30 sec - exiting 23:42:25 (5784): No heartbeat from core client for 30 sec - exiting 23:42:26 (5784): No heartbeat from core client for 30 sec - exiting 23:42:27 (5784): No heartbeat from core client for 30 sec - exiting 23:42:28 (5784): No heartbeat from core client for 30 sec - exiting 23:42:29 (5784): No heartbeat from core client for 30 sec - exiting 23:42:30 (5784): No heartbeat from core client for 30 sec - exiting 23:42:31 (5784): No heartbeat from core client for 30 sec - exiting 23:42:32 (5784): No heartbeat from core client for 30 sec - exiting 23:42:33 (5784): No heartbeat from core client for 30 sec - exiting 23:42:34 (5784): No heartbeat from core client for 30 sec - exiting 23:42:35 (5784): No heartbeat from core client for 30 sec - exiting 23:42:36 (5784): No heartbeat from core client for 30 sec - exiting 23:42:37 (5784): No heartbeat from core client for 30 sec - exiting 23:42:38 (5784): No heartbeat from core client for 30 sec - exiting 23:42:39 (5784): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:58:48 (1388): No heartbeat from core client for 30 sec - exiting 15:58:49 (1388): No heartbeat from core client for 30 sec - exiting 15:58:50 (1388): No heartbeat from core client for 30 sec - exiting 15:58:51 (1388): No heartbeat from core client for 30 sec - exiting 15:58:52 (1388): No heartbeat from core client for 30 sec - exiting 15:58:53 (1388): No heartbeat from core client for 30 sec - exiting 15:58:54 (1388): No heartbeat from core client for 30 sec - exiting 15:58:55 (1388): No heartbeat from core client for 30 sec - exiting 15:58:56 (1388): No heartbeat from core client for 30 sec - exiting 15:58:57 (1388): No heartbeat from core client for 30 sec - exiting 15:58:58 (1388): No heartbeat from core client for 30 sec - exiting 15:58:59 (1388): No heartbeat from core client for 30 sec - exiting 15:59:00 (1388): No heartbeat from core client for 30 sec - exiting 15:59:01 (1388): No heartbeat from core client for 30 sec - exiting 15:59:02 (1388): No heartbeat from core client for 30 sec - exiting 15:59:03 (1388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... G Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakm.pipe_dummy 2048 Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_eu_a3mw_1987_1_007787336_0_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_a3mw_1987_1_007787336_0_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_a3mw_1987_1_007787336_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_a3mw_1987_1_007787336_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_a3mw_1987_1_007787336_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_a3mw_1987_1_007787336_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
26 Mar 2012 03:30:15 | 1038991 | 14146139 | hadam3p_eu_a3mw_1987_1_007787336_0 | 69,216 | 219,933 | 3.1775 |
23 Mar 2012 05:21:57 | 1038991 | 14146139 | hadam3p_eu_a3mw_1987_1_007787336_0 | 57,696 | 183,732 | 3.1845 |
20 Mar 2012 00:44:57 | 1038991 | 14146139 | hadam3p_eu_a3mw_1987_1_007787336_0 | 46,176 | 147,061 | 3.1848 |
17 Mar 2012 23:01:16 | 1038991 | 14146139 | hadam3p_eu_a3mw_1987_1_007787336_0 | 34,656 | 111,245 | 3.2100 |
16 Mar 2012 01:47:11 | 1038991 | 14146139 | hadam3p_eu_a3mw_1987_1_007787336_0 | 23,141 | 73,740 | 3.1866 |
16 Mar 2012 00:46:02 | 1038991 | 14146139 | hadam3p_eu_a3mw_1987_1_007787336_0 | 23,136 | 73,257 | 3.1664 |
11 Mar 2012 00:46:10 | 1038991 | 14146139 | hadam3p_eu_a3mw_1987_1_007787336_0 | 11,616 | 37,218 | 3.2040 |
©2024 cpdn.org