Name | hadam3p_eu_g1c7_201412_13_379_010442107_0 |
Workunit | 10442107 |
Created | 24 Mar 2016, 20:29:14 UTC |
Sent | 28 Mar 2016, 19:59:23 UTC |
Report deadline | 11 Mar 2017, 1:19:23 UTC |
Received | 21 Apr 2016, 19:33:18 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1484973 |
Run time | 2 days 19 hours 18 min 20 sec |
CPU time | 2 hours 50 min 45 sec |
Validate state | Invalid |
Credit | 2,191.17 |
Device peak FLOPS | 3.39 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v7.28 windows_intelx86 |
Stderr | <core_client_version>7.6.29</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7468, selfPID=6248, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7424, selfPID=6868, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6736, selfPID=7400, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8444, selfPID=7124, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7448, selfPID=6572, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5772, selfPID=6240, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7524, selfPID=6836, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8044, selfPID=6628, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2524, selfPID=7104, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4712, selfPID=5452, iMonCtr=1 Model crash detected, will try to restart... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6084, selfPID=6436, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7940, selfPID=6984, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8128, selfPID=8128, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7924, selfPID=7924, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2968, selfPID=2228, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 20:58:39 (2228): called boinc_finish(0) CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6740, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... cpdnmonitor: cannot open input file D:\BOINC-Daten/projects/climateprediction.net/hadam3p_eu_g1c7_201412_13_379_010442107/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file D:\BOINC-Daten/projects/climateprediction.net/hadam3p_eu_g1c7_201412_13_379_010442107/dataout/region_restart.day after 11 attempts Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakm.pipe_dummy 2048 Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakg.pipe_dummy 2048 Leaving CPDN_Main::Monitor... 
18:54:04 (7756): called boinc_finish(0) </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_g1c7_201412_13_379_010442107_0_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_g1c7_201412_13_379_010442107_0_13.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
18 Apr 2016 16:36:06 | 1289156 | 19454428 | hadam3p_eu_g1c7_201412_13_379_010442107_0 | 127,019 | 208,362 | 1.6404 |
17 Apr 2016 13:52:28 | 1289156 | 19454428 | hadam3p_eu_g1c7_201412_13_379_010442107_0 | 115,499 | 190,444 | 1.6489 |
16 Apr 2016 07:29:17 | 1289156 | 19454428 | hadam3p_eu_g1c7_201412_13_379_010442107_0 | 103,979 | 172,062 | 1.6548 |
15 Apr 2016 12:13:51 | 1289156 | 19454428 | hadam3p_eu_g1c7_201412_13_379_010442107_0 | 92,459 | 154,590 | 1.6720 |
15 Apr 2016 12:05:53 | 1289156 | 19454428 | hadam3p_eu_g1c7_201412_13_379_010442107_0 | 80,939 | 137,325 | 1.6966 |
10 Apr 2016 08:34:35 | 1289156 | 19454428 | hadam3p_eu_g1c7_201412_13_379_010442107_0 | 69,419 | 118,367 | 1.7051 |
07 Apr 2016 19:57:30 | 1289156 | 19454428 | hadam3p_eu_g1c7_201412_13_379_010442107_0 | 57,899 | 99,154 | 1.7125 |
05 Apr 2016 17:39:44 | 1289156 | 19454428 | hadam3p_eu_g1c7_201412_13_379_010442107_0 | 46,379 | 79,553 | 1.7153 |
05 Apr 2016 00:16:40 | 1289156 | 19454428 | hadam3p_eu_g1c7_201412_13_379_010442107_0 | 34,859 | 60,530 | 1.7364 |
02 Apr 2016 11:06:26 | 1289156 | 19454428 | hadam3p_eu_g1c7_201412_13_379_010442107_0 | 23,339 | 41,041 | 1.7585 |
01 Apr 2016 15:34:09 | 1289156 | 19454428 | hadam3p_eu_g1c7_201412_13_379_010442107_0 | 11,819 | 20,718 | 1.7529 |
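
The "Average (sec/TS)" column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count. The minimal sketch below checks that reading against three rows copied from the table. The helper names `parse_count` and `seconds_per_timestep` are hypothetical and introduced only for illustration; the formula itself is an assumption inferred from the figures, not something the page states.

```python
def parse_count(text):
    """Convert a comma-grouped integer such as '127,019' to an int."""
    return int(text.replace(",", ""))


def seconds_per_timestep(cpu_time_sec, timestep):
    """Assumed definition: cumulative CPU seconds per cumulative timestep."""
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return cpu_time_sec / timestep


# (Timestep, CPU Time (sec)) pairs copied from the trickle table:
# the 18 Apr, 17 Apr and 01 Apr 2016 rows.
trickles = [
    ("127,019", "208,362"),
    ("115,499", "190,444"),
    ("11,819", "20,718"),
]

averages = [
    round(seconds_per_timestep(parse_count(cpu), parse_count(ts)), 4)
    for ts, cpu in trickles
]

for (ts, cpu), avg in zip(trickles, averages):
    print(f"timestep={ts:>8}  cpu_sec={cpu:>8}  sec/TS={avg:.4f}")
```

Rounded to four decimal places, the computed values can be compared directly with the "Average (sec/TS)" entries for the same rows.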