Name | hadam3p_pnw_2ymu_1978_1_008238867_0 |
Workunit | 8393991 |
Created | 24 Oct 2012, 17:08:09 UTC |
Sent | 24 Oct 2012, 17:08:12 UTC |
Report deadline | 6 Oct 2013, 22:28:12 UTC |
Received | 3 Nov 2012, 12:49:37 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1036109 |
Run time | 1 days 18 hours 48 min 37 sec |
CPU time | 1 days 18 hours 33 min 46 sec |
Validate state | Invalid |
Credit | 1,503.98 |
Device peak FLOPS | 2.87 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... 10:24:37 (2572): No heartbeat from core client for 30 sec - exiting 10:24:38 (2572): No heartbeat from core client for 30 sec - exiting 10:24:39 (2572): No heartbeat from core client for 30 sec - exiting 10:24:41 (2572): No heartbeat from core client for 30 sec - exiting 10:24:42 (2572): No heartbeat from core client for 30 sec - exiting 10:24:43 (2572): No heartbeat from core client for 30 sec - exiting 10:24:44 (2572): No heartbeat from core client for 30 sec - exiting 10:24:45 (2572): No heartbeat from core client for 30 sec - exiting 10:24:46 (2572): No heartbeat from core client for 30 sec - exiting 10:24:47 (2572): No heartbeat from core client for 30 sec - exiting 10:24:48 (2572): No heartbeat from core client for 30 sec - exiting 10:24:49 (2572): No heartbeat from core client for 30 sec - exiting 10:24:50 (2572): No heartbeat from core client for 30 sec - exiting 10:24:51 (2572): No heartbeat from core client for 30 sec - exiting 10:24:53 (2572): No heartbeat from core client for 30 sec - exiting 10:24:54 (2572): No heartbeat from core client for 30 sec - exiting 10:24:55 (2572): No heartbeat from core client for 30 sec - exiting 10:24:56 (2572): No heartbeat from core client for 30 sec - exiting 10:24:57 (2572): No heartbeat from core client for 30 sec - exiting 10:24:58 (2572): No heartbeat from core client for 30 sec - exiting 10:24:59 (2572): No heartbeat from core client for 30 sec - exiting 10:25:00 (2572): No heartbeat from core client for 30 sec - exiting 10:25:01 (2572): No heartbeat from core client for 30 sec - exiting 10:25:02 (2572): No heartbeat from core client for 30 sec - exiting 10:25:03 (2572): No heartbeat from core client for 30 sec - exiting 10:25:05 (2572): No heartbeat from core client for 30 sec - exiting 10:25:06 (2572): No heartbeat from core client for 30 sec - exiting 10:25:07 (2572): No heartbeat from core client for 30 sec - exiting 10:25:08 (2572): No heartbeat from core client for 30 sec - exiting 10:25:09 (2572): No heartbeat from core client for 30 sec - exiting 10:25:10 (2572): No heartbeat from core client for 30 sec - exiting 10:25:11 (2572): No heartbeat from core client for 30 sec - exiting 10:25:12 (2572): No heartbeat from core client for 30 sec - exiting 10:25:13 (2572): No heartbeat from core client for 30 sec - exiting 10:25:14 (2572): No heartbeat from core client for 30 sec - exiting 10:25:15 (2572): No heartbeat from core client for 30 sec - exiting 10:25:17 (2572): No heartbeat from core client for 30 sec - exiting 10:25:18 (2572): No heartbeat from core client for 30 sec - exiting 10:25:19 (2572): No heartbeat from core client for 30 sec - exiting 10:25:20 (2572): No heartbeat from core client for 30 sec - exiting 10:25:21 (2572): No heartbeat from core client for 30 sec - exiting 10:25:22 (2572): No heartbeat from core client for 30 sec - exiting 10:25:23 (2572): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2092, selfPID=4608, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6128, selfPID=4664, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1912, selfPID=5068, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6568, selfPID=4784, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6996, selfPID=1600, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 6 Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_pnw_2ymu_1978_1_008238867_0_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_2ymu_1978_1_008238867_0_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_2ymu_1978_1_008238867_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_2ymu_1978_1_008238867_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_2ymu_1978_1_008238867_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_2ymu_1978_1_008238867_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
03 Nov 2012 11:30:09 | 1036109 | 15397724 | hadam3p_pnw_2ymu_1978_1_008238867_0 | 69,216 | 149,039 | 2.1532 |
03 Nov 2012 04:19:11 | 1036109 | 15397724 | hadam3p_pnw_2ymu_1978_1_008238867_0 | 57,696 | 123,244 | 2.1361 |
02 Nov 2012 21:12:53 | 1036109 | 15397724 | hadam3p_pnw_2ymu_1978_1_008238867_0 | 46,176 | 97,749 | 2.1169 |
01 Nov 2012 20:21:27 | 1036109 | 15397724 | hadam3p_pnw_2ymu_1978_1_008238867_0 | 34,656 | 72,581 | 2.0943 |
31 Oct 2012 19:27:03 | 1036109 | 15397724 | hadam3p_pnw_2ymu_1978_1_008238867_0 | 23,136 | 48,598 | 2.1005 |
29 Oct 2012 20:43:36 | 1036109 | 15397724 | hadam3p_pnw_2ymu_1978_1_008238867_0 | 11,616 | 25,082 | 2.1593 |
©2024 cpdn.org