Name | hadam3p_eu_xv6w_1967_1_007045632_1 |
Workunit | 7248948 |
Created | 23 Sep 2012, 10:18:12 UTC |
Sent | 23 Sep 2012, 10:47:03 UTC |
Report deadline | 5 Sep 2013, 16:07:03 UTC |
Received | 10 Oct 2012, 7:34:13 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1060851 |
Run time | 2 days 8 hours 14 min 20 sec |
CPU time | 2 days 0 hours 7 min 47 sec |
Validate state | Invalid |
Credit | 1,194.02 |
Device peak FLOPS | 2.53 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3328, selfPID=3664, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4600, selfPID=2780, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3440, selfPID=1724, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4856, selfPID=3264, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3172, selfPID=4024, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2668, selfPID=2448, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4348, selfPID=4336, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4644, selfPID=3312, iMonCtr=1 Model crash detected, will try to restart... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4744, selfPID=3820, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4816, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4172, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4508, selfPID=3976, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4984, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2212, selfPID=3836, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4440, selfPID=2632, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... CSuspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=4872, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4124, selfPID=4584, iMonCtr=1 Model crash detected, will try to restart... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4124, selfPID=4124, iMonCtr=2 Leaving CPDN_Main::Monitor... 
Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_xv6w_1967_1_007045632_1_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_xv6w_1967_1_007045632_1_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_xv6w_1967_1_007045632_1_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_xv6w_1967_1_007045632_1_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_xv6w_1967_1_007045632_1_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_xv6w_1967_1_007045632_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
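
The Stderr field above is a long run of repeated monitor messages followed by a list of failed uploads, which is hard to tally by eye. As a rough sketch (not part of the CPDN or BOINC tooling), the following Python counts the restart and suspend messages and extracts the `<file_xfer_error>` entries. The function name `summarize_stderr` and the `SAMPLE` string are hypothetical; `SAMPLE` is a short excerpt of the log above, not the full text.

```python
import re


def summarize_stderr(text):
    """Count restart/suspend messages and collect failed uploads from a stderr dump."""
    crashes = text.count("Model crash detected")
    suspends = text.count("Suspend request from BOINC")
    # Each failed upload appears as <file_name>...</file_name> <error_code>N</error_code>.
    failed = re.findall(
        r"<file_name>([^<]+)</file_name>\s*<error_code>(-?\d+)</error_code>",
        text,
    )
    return {
        "crashes": crashes,
        "suspends": suspends,
        "failed_uploads": [(name, int(code)) for name, code in failed],
    }


# Hypothetical excerpt assembled from lines of the Stderr field above.
SAMPLE = (
    "Suspended CPDN Monitor - Suspend request from BOINC... "
    "Controller:: CPDN process is not running, exiting, bRetVal = 1, "
    "checkPID=3328, selfPID=3664, iMonCtr=1 "
    "Model crash detected, will try to restart... "
    "<file_xfer_error> "
    "<file_name>hadam3p_eu_xv6w_1967_1_007045632_1_7.zip</file_name> "
    "<error_code>-161</error_code> </file_xfer_error>"
)

if __name__ == "__main__":
    print(summarize_stderr(SAMPLE))
```

Passing the complete Stderr text instead of `SAMPLE` would give totals for the whole task.
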
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
07 Oct 2012 16:39:06 | 1060851 | 15304867 | hadam3p_eu_xv6w_1967_1_007045632_1 | 69,216 | 167,469 | 2.4195 |
04 Oct 2012 15:59:45 | 1060851 | 15304867 | hadam3p_eu_xv6w_1967_1_007045632_1 | 57,696 | 139,373 | 2.4156 |
03 Oct 2012 11:26:59 | 1060851 | 15304867 | hadam3p_eu_xv6w_1967_1_007045632_1 | 46,177 | 111,700 | 2.4190 |
03 Oct 2012 10:26:48 | 1060851 | 15304867 | hadam3p_eu_xv6w_1967_1_007045632_1 | 46,176 | 111,340 | 2.4112 |
01 Oct 2012 17:07:59 | 1060851 | 15304867 | hadam3p_eu_xv6w_1967_1_007045632_1 | 34,656 | 83,616 | 2.4127 |
29 Sep 2012 10:06:07 | 1060851 | 15304867 | hadam3p_eu_xv6w_1967_1_007045632_1 | 23,136 | 54,988 | 2.3767 |
27 Sep 2012 10:19:29 | 1060851 | 15304867 | hadam3p_eu_xv6w_1967_1_007045632_1 | 11,616 | 28,080 | 2.4174 |
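
The Average (sec/TS) column appears to be the CPU Time divided by the Timestep for each trickle. The short Python sketch below checks that assumption against the rows in the table; the names `TRICKLES`, `sec_per_timestep`, and `matches_reported` are illustrative only, and the tolerance assumes the reported averages are rounded to four decimal places.

```python
# (timestep, cpu_time_sec, reported_avg_sec_per_ts) copied from the trickle table.
TRICKLES = [
    (69216, 167469, 2.4195),
    (57696, 139373, 2.4156),
    (46177, 111700, 2.4190),
    (46176, 111340, 2.4112),
    (34656, 83616, 2.4127),
    (23136, 54988, 2.3767),
    (11616, 28080, 2.4174),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Average CPU seconds spent per model timestep."""
    return cpu_time_sec / timestep


def matches_reported(trickles, tol=5e-5):
    """True if every reported average agrees with cpu_time / timestep within tol."""
    return all(
        abs(sec_per_timestep(cpu, ts) - reported) <= tol
        for ts, cpu, reported in trickles
    )


if __name__ == "__main__":
    for ts, cpu, reported in TRICKLES:
        print(f"{ts:>6} {cpu:>7} {sec_per_timestep(cpu, ts):.4f} (reported {reported:.4f})")
    print("all rows consistent:", matches_reported(TRICKLES))
```
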