Name | hadam3p_eu_9j0r_1979_1_007760619_0 |
Workunit | 7915728 |
Created | 20 Feb 2012, 17:05:22 UTC |
Sent | 14 Mar 2012, 21:42:05 UTC |
Report deadline | 25 Feb 2013, 3:02:05 UTC |
Received | 24 Mar 2012, 18:45:17 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 981389 |
Run time | 4 days 18 hours 49 min 32 sec |
CPU time | 3 days 12 hours 3 min 48 sec |
Validate state | Invalid |
Credit | 1,790.21 |
Device peak FLOPS | 2.61 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2712, selfPID=2712, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5764, selfPID=5764, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
02:34:01 (5260): No heartbeat from core client for 30 sec - exiting 02:34:02 (5260): No heartbeat from core client for 30 sec - exiting 02:34:03 (5260): No heartbeat from core client for 30 sec - exiting 02:34:04 (5260): No heartbeat from core client for 30 sec - exiting 02:34:05 (5260): No heartbeat from core client for 30 sec - exiting 02:34:06 (5260): No heartbeat from core client for 30 sec - exiting 02:34:07 (5260): No heartbeat from core client for 30 sec - exiting 02:34:08 (5260): No heartbeat from core client for 30 sec - exiting 02:34:09 (5260): No heartbeat from core client for 30 sec - exiting 02:34:10 (5260): No heartbeat from core client for 30 sec - exiting 02:34:11 (5260): No heartbeat from core client for 30 sec - exiting 02:34:12 (5260): No heartbeat from core client for 30 sec - exiting 02:34:13 (5260): No heartbeat from core client for 30 sec - exiting 02:34:14 (5260): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... No Process Handle Regional Worker:: CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5080, selfPID=5080, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5216, selfPID=5216, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 14:03:45 (4292): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
14:04:41 (4396): No heartbeat from core client for 30 sec - exiting 14:04:42 (4396): No heartbeat from core client for 30 sec - exiting 14:04:43 (4396): No heartbeat from core client for 30 sec - exiting 14:04:44 (4396): No heartbeat from core client for 30 sec - exiting 14:04:45 (4396): No heartbeat from core client for 30 sec - exiting 14:04:46 (4396): No heartbeat from core client for 30 sec - exiting 14:04:47 (4396): No heartbeat from core client for 30 sec - exiting 14:04:48 (4396): No heartbeat from core client for 30 sec - exiting 14:04:49 (4396): No heartbeat from core client for 30 sec - exiting 14:04:50 (4396): No heartbeat from core client for 30 sec - exiting 14:04:51 (4396): No heartbeat from core client for 30 sec - exiting 14:04:52 (4396): No heartbeat from core client for 30 sec - exiting 14:04:53 (4396): No heartbeat from core client for 30 sec - exiting 14:04:54 (4396): No heartbeat from core client for 30 sec - exiting 14:04:55 (4396): No heartbeat from core client for 30 sec - exiting 14:04:56 (4396): No heartbeat from core client for 30 sec - exiting 14:04:57 (4396): No heartbeat from core client for 30 sec - exiting 14:04:58 (4396): No heartbeat from core client for 30 sec - exiting 14:04:59 (4396): No heartbeat from core client for 30 sec - exiting 14:05:00 (4396): No heartbeat from core client for 30 sec - exiting 14:05:01 (4396): No heartbeat from core client for 30 sec - exiting 14:05:02 (4396): No heartbeat from core client for 30 sec - exiting 14:05:03 (4396): No heartbeat from core client for 30 sec - exiting 14:05:04 (4396): No heartbeat from core client for 30 sec - exiting 14:05:05 (4396): No heartbeat from core client for 30 sec - exiting 14:05:06 (4396): No heartbeat from core client for 30 sec - exiting 14:05:07 (4396): No heartbeat from core client for 30 sec - exiting 14:05:08 (4396): No heartbeat from core client for 30 sec - exiting 14:05:09 (4396): No heartbeat from core client for 30 sec - exiting 14:05:10 (4396): No 
heartbeat from core client for 30 sec - exiting 14:05:11 (4396): No heartbeat from core client for 30 sec - exiting 14:05:12 (4396): No heartbeat from core client for 30 sec - exiting 14:05:13 (4396): No heartbeat from core client for 30 sec - exiting 14:05:14 (4396): No heartbeat from core client for 30 sec - exiting 14:05:15 (4396): No heartbeat from core client for 30 sec - exiting 14:05:16 (4396): No heartbeat from core client for 30 sec - exiting 14:05:17 (4396): No heartbeat from core client for 30 sec - exiting 14:05:18 (4396): No heartbeat from core client for 30 sec - exiting 14:05:19 (4396): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4100, selfPID=4100, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3480, selfPID=3480, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4192, selfPID=4192, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3668, selfPID=3668, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process isCPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... No Process Handle Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2556, selfPID=3504, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2556, selfPID=2556, iMonCtr=1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4568, selfPID=4568, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2544, selfPID=2544, iMonCtr=2 09:42:00 (3544): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 11 received, exiting... Called boinc_finish Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4708, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2332, selfPID=5248, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_eu_9j0r_1979_1_007760619_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_9j0r_1979_1_007760619_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_9j0r_1979_1_007760619_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
24 Mar 2012 11:34:11 | 981389 | 14118860 | hadam3p_eu_9j0r_1979_1_007760619_0 | 103,776 | 286,425 | 2.7600 |
23 Mar 2012 23:00:50 | 981389 | 14118860 | hadam3p_eu_9j0r_1979_1_007760619_0 | 92,256 | 254,323 | 2.7567 |
23 Mar 2012 08:09:57 | 981389 | 14118860 | hadam3p_eu_9j0r_1979_1_007760619_0 | 80,736 | 223,694 | 2.7707 |
22 Mar 2012 19:34:19 | 981389 | 14118860 | hadam3p_eu_9j0r_1979_1_007760619_0 | 69,216 | 193,221 | 2.7916 |
21 Mar 2012 10:06:20 | 981389 | 14118860 | hadam3p_eu_9j0r_1979_1_007760619_0 | 57,696 | 162,296 | 2.8130 |
19 Mar 2012 08:24:41 | 981389 | 14118860 | hadam3p_eu_9j0r_1979_1_007760619_0 | 46,176 | 130,158 | 2.8187 |
18 Mar 2012 09:32:12 | 981389 | 14118860 | hadam3p_eu_9j0r_1979_1_007760619_0 | 34,656 | 97,364 | 2.8094 |
17 Mar 2012 02:07:14 | 981389 | 14118860 | hadam3p_eu_9j0r_1979_1_007760619_0 | 23,136 | 64,136 | 2.7721 |
15 Mar 2012 17:17:23 | 981389 | 14118860 | hadam3p_eu_9j0r_1979_1_007760619_0 | 11,616 | 32,645 | 2.8103 |
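
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count reached at that trickle. This is inferred from the figures themselves and is not a documented formula. A minimal sketch that checks it against the rows above (the function name `average_sec_per_timestep` is hypothetical):

```python
# Rows copied from the trickle table:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS).
TRICKLES = [
    (103_776, 286_425, 2.7600),
    (92_256, 254_323, 2.7567),
    (80_736, 223_694, 2.7707),
    (69_216, 193_221, 2.7916),
    (57_696, 162_296, 2.8130),
    (46_176, 130_158, 2.8187),
    (34_656, 97_364, 2.8094),
    (23_136, 64_136, 2.7721),
    (11_616, 32_645, 2.8103),
]


def average_sec_per_timestep(cpu_seconds: float, timestep: int) -> float:
    """Assumed formula: cumulative CPU seconds / timesteps completed."""
    return cpu_seconds / timestep


for timestep, cpu_seconds, reported in TRICKLES:
    computed = average_sec_per_timestep(cpu_seconds, timestep)
    # Print the computed value next to the value shown in the table.
    print(f"{timestep:>7,}  computed {computed:.4f}  reported {reported:.4f}")
```

Each computed value agrees with the reported column to four decimal places, which supports reading the column as a running average over the whole run and not a per-trickle rate.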
©2024 cpdn.org