Name | hadam3p_eu_2loj_1965_1_007235876_1 |
Workunit | 7434116 |
Created | 12 May 2011, 14:47:15 UTC |
Sent | 12 May 2011, 14:53:20 UTC |
Report deadline | 23 Apr 2012, 20:13:20 UTC |
Received | 7 Jun 2011, 16:14:35 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1141360 |
Run time | 3 days 5 hours 50 min 29 sec |
CPU time | 3 days 1 hours 4 min 23 sec |
Validate state | Invalid |
Credit | 1,591.48 |
Device peak FLOPS | 2.32 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4776, selfPID=4776, iMonCtr=2 00:41:14 (3840): Can't acquire lockfile (32) - waiting 35s CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4468, selfPID=3680, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=700, selfPID=2660, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4412, selfPID=4108, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4540, selfPID=2820, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5024, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4012, selfPID=4240, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CCPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3988, iMonCtr=2 Model crash detected, will try to restart... 15:50:07 (3956): No heartbeat from core client for 30 sec - exiting 15:50:08 (3956): No heartbeat from core client for 30 sec - exiting 15:50:09 (3956): No heartbeat from core client for 30 sec - exiting 15:50:10 (3956): No heartbeat from core client for 30 sec - exiting 15:50:12 (3956): No heartbeat from core client for 30 sec - exiting 15:50:13 (3956): No heartbeat from core client for 30 sec - exiting 15:50:14 (3956): No heartbeat from core client for 30 sec - exiting 15:50:15 (3956): No heartbeat from core client for 30 sec - exiting 15:50:16 (3956): No heartbeat from core client for 30 sec - exiting 15:50:17 (3956): No heartbeat from core client for 30 sec - exiting 15:50:18 (3956): No heartbeat from core client for 30 sec - exiting 15:50:19 (3956): No heartbeat from core client for 30 sec - exiting 15:50:20 (3956): No heartbeat from core client for 30 sec - exiting 15:50:21 (3956): No heartbeat from core client for 30 sec - exiting 15:50:22 (3956): No heartbeat from core client for 30 sec - exiting 15:50:24 (3956): No heartbeat from core client for 30 sec - exiting 15:50:25 (3956): No heartbeat from core client for 30 sec - exiting 15:50:26 (3956): No heartbeat from core client for 30 sec - exiting 15:50:27 (3956): No heartbeat from core client for 30 sec - exiting 15:50:28 (3956): No heartbeat from core client for 30 sec - exiting 15:50:29 (3956): No heartbeat from core client for 30 sec - exiting 15:50:30 (3956): No heartbeat from core client for 30 sec - exiting 15:50:31 (3956): No heartbeat from core client for 30 sec - exiting 15:50:32 (3956): No heartbeat from core client for 30 sec - exiting 15:50:33 (3956): No heartbeat from core client for 30 sec - exiting 15:50:34 (3956): No heartbeat from core client for 30 sec - exiting 15:50:36 (3956): No heartbeat from core client for 30 sec - exiting 15:50:37 (3956): No heartbeat from core client for 30 sec - exiting 15:50:38 (3956): No heartbeat from core client for 30 sec - exiting 15:50:39 (3956): No heartbeat from core client for 30 sec - exiting 15:50:40 (3956): No heartbeat from core client for 30 sec - exiting 15:50:41 (3956): No heartbeat from core client for 30 sec - exiting 15:50:42 (3956): No heartbeat from core client for 30 sec - exiting 15:50:43 (3956): No heartbeat from core client for 30 sec - exiting 15:50:44 (3956): No heartbeat from core client for 30 sec - exiting 15:50:45 (3956): No heartbeat from core client for 30 sec - exiting 15:50:47 (3956): No heartbeat from core client for 30 sec - exiting 15:50:48 (3956): No heartbeat from core client for 30 sec - exiting 15:50:49 (3956): No heartbeat from core client for 30 sec - exiting 15:50:50 (3956): No heartbeat from core client for 30 sec - exiting 15:50:51 (3956): No heartbeat from core client for 30 sec - exiting 15:50:52 (3956): No heartbeat from core client for 30 sec - exiting 15:50:53 (3956): No heartbeat from core client for 30 sec - exiting 15:50:54 (3956): No heartbeat from core client for 30 sec - exiting 15:50:55 (3956): No heartbeat from core client for 30 sec - exiting 15:50:56 (3956): No heartbeat from core client for 30 sec - exiting 15:50:57 (3956): No heartbeat from core client for 30 sec - exiting 15:50:59 (3956): No heartbeat from core client for 30 sec - exiting 15:51:00 (3956): No heartbeat from core client for 30 sec - exiting 15:51:01 (3956): No heartbeat from core client for 30 sec - exiting 15:51:02 (3956): No heartbeat from core client for 30 sec - exiting 15:51:03 (3956): No heartbeat from core client for 30 sec - exiting 15:51:04 (3956): No heartbeat from core client for 30 sec - exiting 15:51:05 (3956): No heartbeat from core client for 30 sec - exiting 15:51:06 (3956): No heartbeat from core client for 30 sec - exiting 15:51:07 (3956): No heartbeat from core client for 30 sec - exiting 15:51:08 (3956): No heartbeat from core client for 30 sec - exiting 15:51:09 (3956): No heartbeat from core client for 30 sec - exiting 15:51:11 (3956): No heartbeat from core client for 30 sec - exiting 15:51:12 (3956): No heartbeat from core client for 30 sec - exiting 15:51:13 (3956): No heartbeat from core client for 30 sec - exiting 15:51:14 (3956): No heartbeat from core client for 30 sec - exiting 15:51:15 (3956): No heartbeat from core client for 30 sec - exiting 15:51:16 (3956): No heartbeat from core client for 30 sec - exiting 15:51:17 (3956): No heartbeat from core client for 30 sec - exiting 15:51:18 (3956): No heartbeat from core client for 30 sec - exiting 15:51:19 (3956): No heartbeat from core client for 30 sec - exiting 15:51:20 (3956): No heartbeat from core client for 30 sec - exiting 15:51:21 (3956): No heartbeat from core client for 30 sec - exiting 15:51:23 (3956): No heartbeat from core client for 30 sec - exiting 15:51:24 (3956): No heartbeat from core client for 30 sec - exiting 15:51:25 (3956): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:51:26 (3956): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:25:08 (5072): No heartbeat from core client for 30 sec - exiting 09:25:10 (5072): No heartbeat from core client for 30 sec - exiting 09:25:11 (5072): No heartbeat from core client for 30 sec - exiting 09:25:12 (5072): No heartbeat from core client for 30 sec - exiting 09:25:13 (5072): No heartbeat from core client for 30 sec - exiting 09:25:14 (5072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakm.pipe_dummy 2048 Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_eu_2loj_1965_1_007235876_1_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2loj_1965_1_007235876_1_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2loj_1965_1_007235876_1_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2loj_1965_1_007235876_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
07 Jun 2011 10:50:41 | 1141360 | 12882782 | hadam3p_eu_2loj_1965_1_007235876_1 | 92,256 | 254,362 | 2.7571 |
06 Jun 2011 04:33:14 | 1141360 | 12882782 | hadam3p_eu_2loj_1965_1_007235876_1 | 80,736 | 225,682 | 2.7953 |
05 Jun 2011 20:24:30 | 1141360 | 12882782 | hadam3p_eu_2loj_1965_1_007235876_1 | 69,216 | 196,526 | 2.8393 |
02 Jun 2011 18:28:41 | 1141360 | 12882782 | hadam3p_eu_2loj_1965_1_007235876_1 | 57,696 | 167,585 | 2.9046 |
26 May 2011 13:51:43 | 1141360 | 12882782 | hadam3p_eu_2loj_1965_1_007235876_1 | 46,176 | 134,572 | 2.9143 |
21 May 2011 21:11:41 | 1141360 | 12882782 | hadam3p_eu_2loj_1965_1_007235876_1 | 34,656 | 101,266 | 2.9220 |
15 May 2011 21:11:10 | 1141360 | 12882782 | hadam3p_eu_2loj_1965_1_007235876_1 | 23,136 | 67,870 | 2.9335 |
14 May 2011 11:16:48 | 1141360 | 12882782 | hadam3p_eu_2loj_1965_1_007235876_1 | 11,621 | 34,493 | 2.9682 |
13 May 2011 11:21:30 | 1141360 | 12882782 | hadam3p_eu_2loj_1965_1_007235876_1 | 11,616 | 34,093 | 2.9350 |
©2024 cpdn.org