Name | hadam3p_eu_x450_1995_1_006924172_1 |
Workunit | 7127488 |
Created | 8 Feb 2011, 16:50:52 UTC |
Sent | 8 Feb 2011, 17:28:09 UTC |
Report deadline | 21 Jan 2012, 22:48:09 UTC |
Received | 13 Feb 2011, 20:54:56 UTC |
Server state | Over |
Outcome | No reply |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1131576 |
Run time | 2 days 8 hours 43 min 44 sec |
CPU time | 2 days 7 hours 37 min 6 sec |
Validate state | Invalid |
Credit | 1,194.02 |
Device peak FLOPS | 2.84 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.08 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... No Process Handle Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5964, selfPID=5036, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5964, selfPID=5964, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3180, selfPID=324, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3180, selfPID=3180, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4148, selfPID=5484, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4148, selfPID=4148, iMonCtr=1 07:57:17 (4836): No heartbeat from core client for 30 sec - exiting 07:57:19 (4836): No heartbeat from core client for 30 sec - exiting 07:57:20 (4836): No heartbeat from core client for 30 sec - exiting 07:57:21 (4836): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
07:57:22 (4836): No heartbeat from core client for 30 sec - exiting 07:57:23 (4836): No heartbeat from core client for 30 sec - exiting 07:57:25 (4836): No heartbeat from core client for 30 sec - exiting 07:57:26 (4836): No heartbeat from core client for 30 sec - exiting 07:57:27 (4836): No heartbeat from core client for 30 sec - exiting 07:57:28 (4836): No heartbeat from core client for 30 sec - exiting 07:57:29 (4836): No heartbeat from core client for 30 sec - exiting 07:57:30 (4836): No heartbeat from core client for 30 sec - exiting 07:57:31 (4836): No heartbeat from core client for 30 sec - exiting Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5324, selfPID=5408, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 CPDN Monitor - Quit request from BOINC... No Process Handle Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5812, selfPID=5840, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5812, selfPID=5812, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5932, selfPID=5324, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5932, selfPID=5932, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2628, selfPID=5580, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2628, selfPID=2628, iMonCtr=1 CPDN Monitor - Quit request from BOINC... 
No Process Handle Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5436, selfPID=4444, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5436, selfPID=5436, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5792, selfPID=4512, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5792, selfPID=5792, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=300, selfPID=3992, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=300, selfPID=300, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5764, selfPID=2320, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5764, selfPID=5764, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2084, selfPID=5412, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2084, selfPID=2084, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4316, selfPID=3756, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4316, selfPID=4316, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6020, selfPID=4600, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6020, selfPID=6020, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... 
CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6128, selfPID=5108, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6128, selfPID=6128, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2760, selfPID=1948, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2760, selfPID=2760, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2600, selfPID=2640, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2600, selfPID=2600, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5212, selfPID=5440, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5212, selfPID=5212, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6016, selfPID=5316, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6016, selfPID=6016, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4436, selfPID=4948, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4436, selfPID=4436, iMonCtr=1 CPDN Monitor - Quit request from BOINC... 
No Process Handle Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5776, selfPID=2440, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5776, selfPID=5776, iMonCtr=1 G20:43:10 (4536): No heartbeat from core client for 30 sec - exiting 20:43:11 (4536): No heartbeat from core client for 30 sec - exiting 20:43:12 (4536): No heartbeat from core client for 30 sec - exiting 20:43:13 (4536): No heartbeat from core client for 30 sec - exiting 20:43:14 (4536): No heartbeat from core client for 30 sec - exiting 20:43:15 (4536): No heartbeat from core client for 30 sec - exiting 20:43:16 (4536): No heartbeat from core client for 30 sec - exiting 20:43:17 (4536): No heartbeat from core client for 30 sec - exiting 20:43:18 (4536): No heartbeat from core client for 30 sec - exiting 20:43:19 (4536): No heartbeat from core client for 30 sec - exiting 20:43:20 (4536): No heartbeat from core client for 30 sec - exiting 20:43:21 (4536): No heartbeat from core client for 30 sec - exiting 20:43:22 (4536): No heartbeat from core client for 30 sec - exiting 20:43:23 (4536): No heartbeat from core client for 30 sec - exiting 20:43:24 (4536): No heartbeat from core client for 30 sec - exiting 20:43:26 (4536): No heartbeat from core client for 30 sec - exiting 20:43:27 (4536): No heartbeat from core client for 30 sec - exiting 20:43:28 (4536): No heartbeat from core client for 30 sec - exiting 20:43:29 (4536): No heartbeat from core client for 30 sec - exiting 20:43:30 (4536): No heartbeat from core client for 30 sec - exiting 20:43:31 (4536): No heartbeat from core client for 30 sec - exiting 20:43:32 (4536): No heartbeat from core client for 30 sec - exiting 20:43:33 (4536): No heartbeat from core client for 30 sec - exiting 20:43:34 (4536): No heartbeat from core client for 30 sec - exiting 20:43:35 (4536): No heartbeat from core client for 30 sec - exiting 20:43:36 (4536): No heartbeat from core client 
for 30 sec - exiting 20:43:37 (4536): No heartbeat from core client for 30 sec - exiting 20:43:38 (4536): No heartbeat from core client for 30 sec - exiting 20:43:39 (4536): No heartbeat from core client for 30 sec - exiting 20:43:40 (4536): No heartbeat from core client for 30 sec - exiting 20:43:41 (4536): No heartbeat from core client for 30 sec - exiting 20:43:42 (4536): No heartbeat from core client for 30 sec - exiting 20:43:43 (4536): No heartbeat from core client for 30 sec - exiting 20:43:44 (4536): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2728, selfPID=908, iMonCtr=1 forrtl: Access is denied. Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3360, selfPID=3360, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3360, selfPID=3736, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 20:53:31 (3736): called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_eu_x450_1995_1_006924172_1_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_x450_1995_1_006924172_1_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_x450_1995_1_006924172_1_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_x450_1995_1_006924172_1_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_x450_1995_1_006924172_1_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_x450_1995_1_006924172_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
13 Feb 2011 16:35:14 | 1131576 | 12564223 | hadam3p_eu_x450_1995_1_006924172_1 | 69,216 | 186,272 | 2.6912 |
13 Feb 2011 00:40:54 | 1131576 | 12564223 | hadam3p_eu_x450_1995_1_006924172_1 | 57,696 | 155,267 | 2.6911 |
12 Feb 2011 13:32:01 | 1131576 | 12564223 | hadam3p_eu_x450_1995_1_006924172_1 | 46,176 | 124,393 | 2.6939 |
12 Feb 2011 03:03:48 | 1131576 | 12564223 | hadam3p_eu_x450_1995_1_006924172_1 | 34,656 | 93,276 | 2.6915 |
11 Feb 2011 13:36:55 | 1131576 | 12564223 | hadam3p_eu_x450_1995_1_006924172_1 | 23,136 | 62,261 | 2.6911 |
10 Feb 2011 17:14:01 | 1131576 | 12564223 | hadam3p_eu_x450_1995_1_006924172_1 | 11,616 | 31,192 | 2.6853 |
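
The page does not say how the Average (sec/TS) column is derived. Assuming it is simply CPU Time (sec) divided by Timestep, the minimal sketch below recomputes it from the six trickle rows and compares the result with the reported figures. The names `TRICKLES`, `sec_per_timestep`, and `check_rows` are hypothetical helpers introduced for illustration and are not part of BOINC or CPDN.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_avg).
TRICKLES = [
    (69216, 186272, 2.6912),
    (57696, 155267, 2.6911),
    (46176, 124393, 2.6939),
    (34656, 93276, 2.6915),
    (23136, 62261, 2.6911),
    (11616, 31192, 2.6853),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep


def check_rows(rows, tol=1e-4):
    """Return one bool per row: does the recomputed average match the table?"""
    return [
        abs(sec_per_timestep(cpu, ts) - reported) <= tol
        for ts, cpu, reported in rows
    ]


if __name__ == "__main__":
    for ts, cpu, reported in TRICKLES:
        print(ts, round(sec_per_timestep(cpu, ts), 4), reported)
```

Under that assumption every row agrees with the reported value to four decimal places, which suggests the column is a running average over the whole task rather than a per-interval rate.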