Name | hadam3p_eu_wjfr_1999_1_007216196_1 |
Workunit | 7414476 |
Created | 22 Sep 2012, 11:36:18 UTC |
Sent | 22 Sep 2012, 18:36:02 UTC |
Report deadline | 4 Sep 2013, 23:56:02 UTC |
Received | 30 Sep 2012, 7:54:02 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1238239 |
Run time | 4 days 14 hours 42 min 8 sec |
CPU time | 4 days 9 hours 30 min 44 sec |
Validate state | Invalid |
Credit | 2,187.67 |
Device peak FLOPS | 2.95 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 15:22:49 (4840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6500, selfPID=7920, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6220, selfPID=6088, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6332, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6340, selfPID=1996, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_wjfr_1999_1_007216196_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |

Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
30 Sep 2012 06:51:12 | 1238239 | 15297666 | hadam3p_eu_wjfr_1999_1_007216196_1 | 126,816 | 378,913 | 2.9879 |
29 Sep 2012 19:36:58 | 1238239 | 15297666 | hadam3p_eu_wjfr_1999_1_007216196_1 | 115,296 | 343,808 | 2.9820 |
29 Sep 2012 08:10:45 | 1238239 | 15297666 | hadam3p_eu_wjfr_1999_1_007216196_1 | 103,776 | 309,435 | 2.9818 |
28 Sep 2012 19:41:25 | 1238239 | 15297666 | hadam3p_eu_wjfr_1999_1_007216196_1 | 92,256 | 275,764 | 2.9891 |
28 Sep 2012 07:15:59 | 1238239 | 15297666 | hadam3p_eu_wjfr_1999_1_007216196_1 | 80,736 | 242,025 | 2.9977 |
27 Sep 2012 17:47:03 | 1238239 | 15297666 | hadam3p_eu_wjfr_1999_1_007216196_1 | 69,216 | 208,579 | 3.0135 |
27 Sep 2012 03:17:56 | 1238239 | 15297666 | hadam3p_eu_wjfr_1999_1_007216196_1 | 57,696 | 173,914 | 3.0143 |
26 Sep 2012 12:49:37 | 1238239 | 15297666 | hadam3p_eu_wjfr_1999_1_007216196_1 | 46,176 | 139,062 | 3.0116 |
24 Sep 2012 14:31:16 | 1238239 | 15297666 | hadam3p_eu_wjfr_1999_1_007216196_1 | 34,656 | 104,234 | 3.0077 |
24 Sep 2012 03:23:43 | 1238239 | 15297666 | hadam3p_eu_wjfr_1999_1_007216196_1 | 23,136 | 70,314 | 3.0392 |
23 Sep 2012 17:08:32 | 1238239 | 15297666 | hadam3p_eu_wjfr_1999_1_007216196_1 | 11,616 | 35,592 | 3.0640 |
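
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count. That relationship is inferred from the reported figures, not taken from CPDN documentation, so the following is only a minimal sketch that recomputes the column for three of the rows. The function name `average_sec_per_timestep` and the `rows` list are hypothetical names introduced for illustration.

```python
# Rows copied from the trickle table: (timestep, CPU time in seconds, reported average).
rows = [
    (126_816, 378_913, 2.9879),  # 30 Sep 2012 06:51:12
    (115_296, 343_808, 2.9820),  # 29 Sep 2012 19:36:58
    (11_616, 35_592, 3.0640),    # 23 Sep 2012 17:08:32
]


def average_sec_per_timestep(cpu_seconds: float, timesteps: int) -> float:
    """Assumed definition: cumulative CPU seconds per cumulative model timestep."""
    return cpu_seconds / timesteps


for timesteps, cpu_seconds, reported in rows:
    computed = average_sec_per_timestep(cpu_seconds, timesteps)
    # Print the recomputed value next to the reported one for comparison.
    print(f"timestep {timesteps:>7}: computed {computed:.4f}, reported {reported:.4f}")
```

If the assumption holds, each computed value should agree with the reported one to four decimal places, which suggests the model ran at a steady rate of roughly three CPU seconds per timestep up to the last trickle.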