Name | hadam3p_pnw_yt72_1973_1_006883302_2 |
Workunit | 7086618 |
Created | 24 Apr 2012, 15:34:37 UTC |
Sent | 24 Apr 2012, 15:40:37 UTC |
Report deadline | 6 Apr 2013, 21:00:37 UTC |
Received | 28 Apr 2012, 17:14:03 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 962831 |
Run time | 2 days 6 hours 8 min 47 sec |
CPU time | 2 days 5 hours 14 min 6 sec |
Validate state | Invalid |
Credit | 1,503.98 |
Device peak FLOPS | 3.09 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
06:00:02 (16784): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
05:39:34 (17040): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:40:47 (10108): No heartbeat from core client for 30 sec - exiting
05:40:48 (10108): No heartbeat from core client for 30 sec - exiting
05:41:19 (10108): No heartbeat from core client for 30 sec - exiting
05:41:20 (10108): No heartbeat from core client for 30 sec - exiting
05:41:21 (10108): No heartbeat from core client for 30 sec - exiting
05:41:22 (10108): No heartbeat from core client for 30 sec - exiting
05:41:23 (10108): No heartbeat from core client for 30 sec - exiting
05:41:24 (10108): No heartbeat from core client for 30 sec - exiting
05:41:25 (10108): No heartbeat from core client for 30 sec - exiting
05:41:26 (10108): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=20092, iMonCtr=1
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=19868, iMonCtr=1
05:42:29 (17948): No heartbeat from core client for 30 sec - exiting
05:42:30 (17948): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
16:40:11 (15820): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:41:37 (27284): No heartbeat from core client for 30 sec - exiting
16:41:38 (27284): No heartbeat from core client for 30 sec - exiting
16:41:39 (27284): No heartbeat from core client for 30 sec - exiting
16:41:40 (27284): No heartbeat from core client for 30 sec - exiting
16:41:41 (27284): No heartbeat from core client for 30 sec - exiting
16:41:42 (27284): No heartbeat from core client for 30 sec - exiting
16:41:43 (27284): No heartbeat from core client for 30 sec - exiting
16:41:44 (27284): No heartbeat from core client for 30 sec - exiting
16:41:45 (27284): No heartbeat from core client for 30 sec - exiting
16:41:46 (27284): No heartbeat from core client for 30 sec - exiting
16:41:47 (27284): No heartbeat from core client for 30 sec - exiting
16:41:48 (27284): No heartbeat from core client for 30 sec - exiting
16:41:49 (27284): No heartbeat from core client for 30 sec - exiting
16:41:50 (27284): No heartbeat from core client for 30 sec - exiting
16:41:51 (27284): No heartbeat from core client for 30 sec - exiting
16:41:52 (27284): No heartbeat from core client for 30 sec - exiting
16:41:53 (27284): No heartbeat from core client for 30 sec - exiting
16:41:54 (27284): No heartbeat from core client for 30 sec - exiting
16:41:55 (27284): No heartbeat from core client for 30 sec - exiting
16:41:56 (27284): No heartbeat from core client for 30 sec - exiting
16:41:57 (27284): No heartbeat from core client for 30 sec - exiting
16:41:58 (27284): No heartbeat from core client for 30 sec - exiting
16:41:59 (27284): No heartbeat from core client for 30 sec - exiting
16:42:00 (27284): No heartbeat from core client for 30 sec - exiting
16:42:01 (27284): No heartbeat from core client for 30 sec - exiting
16:42:02 (27284): No heartbeat from core client for 30 sec - exiting
16:42:03 (27284): No heartbeat from core client for 30 sec - exiting
16:42:04 (27284): No heartbeat from core client for 30 sec - exiting
16:42:05 (27284): No heartbeat from core client for 30 sec - exiting
16:42:06 (27284): No heartbeat from core client for 30 sec - exiting
16:42:07 (27284): No heartbeat from core client for 30 sec - exiting
16:42:08 (27284): No heartbeat from core client for 30 sec - exiting
16:42:09 (27284): No heartbeat from core client for 30 sec - exiting
16:42:10 (27284): No heartbeat from core client for 30 sec - exiting
16:42:11 (27284): No heartbeat from core client for 30 sec - exiting
16:42:12 (27284): No heartbeat from core client for 30 sec - exiting
16:42:13 (27284): No heartbeat from core client for 30 sec - exiting
16:42:14 (27284): No heartbeat from core client for 30 sec - exiting
16:42:15 (27284): No heartbeat from core client for 30 sec - exiting
16:42:16 (27284): No heartbeat from core client for 30 sec - exiting
16:42:17 (27284): No heartbeat from core client for 30 sec - exiting
16:42:18 (27284): No heartbeat from core client for 30 sec - exiting
16:42:19 (27284): No heartbeat from core client for 30 sec - exiting
16:42:20 (27284): No heartbeat from core client for 30 sec - exiting
16:42:21 (27284): No heartbeat from core client for 30 sec - exiting
16:42:22 (27284): No heartbeat from core client for 30 sec - exiting
16:42:23 (27284): No heartbeat from core client for 30 sec - exiting
16:42:24 (27284): No heartbeat from core client for 30 sec - exiting
16:42:25 (27284): No heartbeat from core client for 30 sec - exiting
16:42:26 (27284): No heartbeat from core client for 30 sec - exiting
16:42:27 (27284): No heartbeat from core client for 30 sec - exiting
16:42:28 (27284): No heartbeat from core client for 30 sec - exiting
16:42:29 (27284): No heartbeat from core client for 30 sec - exiting
16:42:30 (27284): No heartbeat from core client for 30 sec - exiting
16:42:31 (27284): No heartbeat from core client for 30 sec - exiting
16:42:32 (27284): No heartbeat from core client for 30 sec - exiting
16:42:33 (27284): No heartbeat from core client for 30 sec - exiting
16:42:34 (27284): No heartbeat from core client for 30 sec - exiting
16:42:35 (27284): No heartbeat from core client for 30 sec - exiting
16:42:36 (27284): No heartbeat from core client for 30 sec - exiting
16:42:37 (27284): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:44:17 (22252): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=21280, iMonCtr=1
CPDN Monitor - No 'heartbeat' from BOINC...
16:44:48 (22252): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=12028, iMonCtr=1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=28260, selfPID=19636, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 6
Called boinc_finish
</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_yt72_1973_1_006883302_2_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_yt72_1973_1_006883302_2_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_yt72_1973_1_006883302_2_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_yt72_1973_1_006883302_2_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_yt72_1973_1_006883302_2_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_yt72_1973_1_006883302_2_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
</message>
]]> |

Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
28 Apr 2012 05:12:13 | 962831 | 14585296 | hadam3p_pnw_yt72_1973_1_006883302_2 | 69,216 | 167,248 | 2.4163 |
27 Apr 2012 13:17:53 | 962831 | 14585296 | hadam3p_pnw_yt72_1973_1_006883302_2 | 57,696 | 139,849 | 2.4239 |
27 Apr 2012 05:07:35 | 962831 | 14585296 | hadam3p_pnw_yt72_1973_1_006883302_2 | 46,176 | 112,235 | 2.4306 |
26 Apr 2012 13:32:47 | 962831 | 14585296 | hadam3p_pnw_yt72_1973_1_006883302_2 | 34,656 | 84,797 | 2.4468 |
26 Apr 2012 04:30:17 | 962831 | 14585296 | hadam3p_pnw_yt72_1973_1_006883302_2 | 23,136 | 56,694 | 2.4505 |
25 Apr 2012 11:10:55 | 962831 | 14585296 | hadam3p_pnw_yt72_1973_1_006883302_2 | 11,616 | 28,721 | 2.4725 |
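
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. The following is a minimal sketch under that assumption, using the values copied from the rows above; the names `trickles` and `avg_sec_per_ts` are illustrative only and are not part of any CPDN or BOINC tooling.

```python
# (timestep, cpu_time_sec) pairs copied from the trickle table, newest first.
trickles = [
    (69216, 167248),
    (57696, 139849),
    (46176, 112235),
    (34656, 84797),
    (23136, 56694),
    (11616, 28721),
]


def avg_sec_per_ts(timestep, cpu_time_sec):
    """Cumulative CPU seconds per model timestep, rounded to 4 decimal places.

    Assumption: this ratio is how the Average (sec/TS) column is derived.
    """
    return round(cpu_time_sec / timestep, 4)


averages = [avg_sec_per_ts(ts, cpu) for ts, cpu in trickles]

for (ts, cpu), avg in zip(trickles, averages):
    print(f"{ts:>6} timesteps  {cpu:>7} s  {avg:.4f} sec/TS")
```

Under this assumption the computed ratios reproduce the reported column, which falls from 2.4725 at the first trickle to 2.4163 at the last one received.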
©2024 cpdn.org