Name | hadam3p_saf_1600_1959_1_006916408_1 |
Workunit | 7119724 |
Created | 22 Mar 2011, 20:10:48 UTC |
Sent | 22 Mar 2011, 20:31:03 UTC |
Report deadline | 4 Mar 2012, 1:51:03 UTC |
Received | 16 May 2011, 19:23:05 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 917076 |
Run time | 17 hours 17 min 33 sec |
CPU time | 14 hours 27 min 2 sec |
Validate state | Invalid |
Credit | 188.44 |
Device peak FLOPS | 2.32 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.08 windows_intelx86 |
Stderr | <core_client_version>6.12.10</core_client_version> <![CDATA[ <stderr_txt> 21:27:12 (5780): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:21:36 (4552): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:34:52 (5048): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:41:25 (6812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:43:05 (7524): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7280, selfPID=7280, iMonCtr=2 03:45:35 (6828): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:42:34 (7900): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:00:00 (7624): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:07:03 (6524): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:10:40 (6000): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:52:07 (6864): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:28:26 (6492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:41:49 (6648): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:56:13 (2988): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:35:56 (4392): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:39:26 (6744): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:41:01 (3408): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:03:38 (180): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:48:56 (2812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5840, selfPID=5840, iMonCtr=2 16:57:32 (7708): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:11:39 (6004): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:10:46 (4928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:23:32 (1464): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:36:11 (5692): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:59:31 (7052): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:17:05 (7340): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:29:08 (5788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:24:37 (4500): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 20:24:39 (4500): No heartbeat from core client for 30 sec - exiting 20:25:52 (5896): No heartbeat from core client for 30 sec - exiting 20:25:53 (5896): No heartbeat from core client for 30 sec - exiting 20:25:54 (5896): No heartbeat from core client for 30 sec - exiting 20:25:55 (5896): No heartbeat from core client for 30 sec - exiting 20:25:56 (5896): No heartbeat from core client for 30 sec - exiting 20:25:57 (5896): No heartbeat from core client for 30 sec - exiting 20:25:58 (5896): No heartbeat from core client for 30 sec - exiting 20:25:59 (5896): No heartbeat from core client for 30 sec - exiting 20:26:01 (5896): No heartbeat from core client for 30 sec - exiting 20:26:02 (5896): No heartbeat from core client for 30 sec - exiting 20:26:03 (5896): No heartbeat from core client for 30 sec - exiting 20:26:04 (5896): No heartbeat from core client for 30 sec - exiting 20:26:05 (5896): No heartbeat from core client for 30 sec - exiting 20:26:06 (5896): No heartbeat from core client for 30 sec - exiting 20:26:07 (5896): No heartbeat from core client for 30 sec - exiting 20:26:08 (5896): No heartbeat from core client for 30 sec - exiting 20:26:09 (5896): No heartbeat from core client for 30 sec - exiting 20:26:10 (5896): No heartbeat from core client for 30 sec - exiting 20:26:11 (5896): No heartbeat from core client for 30 sec - exiting 20:26:13 (5896): No heartbeat from core client for 30 sec - exiting 20:26:14 (5896): No heartbeat from core client for 30 sec - exiting 20:26:15 (5896): No heartbeat from core client for 30 sec - exiting 20:26:16 (5896): No heartbeat from core client for 30 sec - exiting 20:26:17 (5896): No heartbeat from core client for 30 sec - exiting 20:26:18 (5896): No heartbeat from core client for 30 sec - exiting 20:26:19 (5896): No heartbeat from core client for 30 sec - exiting 20:26:20 (5896): No heartbeat from core client for 30 sec - exiting 20:26:21 (5896): No heartbeat from core client for 30 sec - exiting 20:26:22 (5896): No heartbeat from core client for 30 sec - exiting 20:26:23 (5896): No heartbeat from core client for 30 sec - exiting 20:26:25 (5896): No heartbeat from core client for 30 sec - exiting 20:26:26 (5896): No heartbeat from core client for 30 sec - exiting 20:26:27 (5896): No heartbeat from core client for 30 sec - exiting 20:26:28 (5896): No heartbeat from core client for 30 sec - exiting 20:26:29 (5896): No heartbeat from core client for 30 sec - exiting 20:26:30 (5896): No heartbeat from core client for 30 sec - exiting 20:26:31 (5896): No heartbeat from core client for 30 sec - exiting 20:26:32 (5896): No heartbeat from core client for 30 sec - exiting 20:26:33 (5896): No heartbeat from core client for 30 sec - exiting 20:26:34 (5896): No heartbeat from core client for 30 sec - exiting 20:26:35 (5896): No heartbeat from core client for 30 sec - exiting 20:26:37 (5896): No heartbeat from core client for 30 sec - exiting 20:26:38 (5896): No heartbeat from core client for 30 sec - exiting 20:26:39 (5896): No heartbeat from core client for 30 sec - exiting 20:26:40 (5896): No heartbeat from core client for 30 sec - exiting 20:26:41 (5896): No heartbeat from core client for 30 sec - exiting 20:26:42 (5896): No heartbeat from core client for 30 sec - exiting 20:26:43 (5896): No heartbeat from core client for 30 sec - exiting 20:26:44 (5896): No heartbeat from core client for 30 sec - exiting 20:26:45 (5896): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:18:18 (6224): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:26:14 (5396): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7528, selfPID=7528, iMonCtr=2 22:58:07 (5812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:15:30 (4944): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:17:10 (7372): No heartbeat from core client for 30 sec - exiting 23:17:56 (7372): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:20:38 (3044): Can't acquire lockfile (32) - waiting 35s 23:56:48 (3044): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=2104, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1408, selfPID=6084, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 23:59:24 (6084): called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_saf_1600_1959_1_006916408_1_2.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1600_1959_1_006916408_1_3.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1600_1959_1_006916408_1_4.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1600_1959_1_006916408_1_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1600_1959_1_006916408_1_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1600_1959_1_006916408_1_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1600_1959_1_006916408_1_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1600_1959_1_006916408_1_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1600_1959_1_006916408_1_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1600_1959_1_006916408_1_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1600_1959_1_006916408_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
16 May 2011 19:27:32 | 917076 | 12684136 | hadam3p_saf_1600_1959_1_006916408_1 | 11,616 | 35,735 | 3.0764 |
©2024 cpdn.org