Name | hadam3p_anz_k2b1_201212_12_306_010265838_1 |
Workunit | 10265838 |
Created | 29 Jan 2016, 22:43:58 UTC |
Sent | 30 Jan 2016, 11:27:48 UTC |
Report deadline | 11 Jan 2017, 16:47:48 UTC |
Received | 26 Feb 2016, 11:41:10 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1353778 |
Run time | 5 days 19 hours 26 min 58 sec |
CPU time | 4 days 23 hours 48 min 24 sec |
Validate state | Invalid |
Credit | 4,981.10 |
Device peak FLOPS | 3.52 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.6.22</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3696, selfPID=208, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4616, selfPID=2676, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2364, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2172, selfPID=3508, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4976, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4996, selfPID=4012, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=608, selfPID=3024, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5544, selfPID=1172, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 19:28:32 (3252): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2660, selfPID=1524, iMonCtr=1 Model crash detected, will try to restart... 15:09:24 (3624): No heartbeat from core client for 30 sec - exiting 15:09:25 (3624): No heartbeat from core client for 30 sec - exiting 15:09:26 (3624): No heartbeat from core client for 30 sec - exiting 15:09:27 (3624): No heartbeat from core client for 30 sec - exiting 15:09:28 (3624): No heartbeat from core client for 30 sec - exiting 15:09:30 (3624): No heartbeat from core client for 30 sec - exiting 15:09:31 (3624): No heartbeat from core client for 30 sec - exiting 15:09:32 (3624): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3624, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4148, selfPID=3484, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4040, selfPID=2556, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4432, selfPID=2148, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4712, selfPID=1048, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4220, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1588, selfPID=3540, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1740, selfPID=3172, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1136, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3504, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2204, selfPID=1024, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2080, selfPID=3152, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_anz_k2b1_201212_12_306_010265838/dataout/atmos_restart.day after 11 attempts Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3972, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_anz_k2b1_201212_12_306_010265838_1_11.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_k2b1_201212_12_306_010265838_1_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
24 Feb 2016 17:13:46 | 1353778 | 19241612 | hadam3p_anz_k2b1_201212_12_306_010265838_1 | 115,499 | 395,611 | 3.4252 |
23 Feb 2016 10:50:18 | 1353778 | 19241612 | hadam3p_anz_k2b1_201212_12_306_010265838_1 | 103,979 | 359,004 | 3.4527 |
21 Feb 2016 20:05:11 | 1353778 | 19241612 | hadam3p_anz_k2b1_201212_12_306_010265838_1 | 92,459 | 321,712 | 3.4795 |
21 Feb 2016 08:15:20 | 1353778 | 19241612 | hadam3p_anz_k2b1_201212_12_306_010265838_1 | 80,939 | 285,618 | 3.5288 |
19 Feb 2016 15:47:02 | 1353778 | 19241612 | hadam3p_anz_k2b1_201212_12_306_010265838_1 | 69,419 | 248,737 | 3.5831 |
16 Feb 2016 19:34:20 | 1353778 | 19241612 | hadam3p_anz_k2b1_201212_12_306_010265838_1 | 57,899 | 212,161 | 3.6643 |
14 Feb 2016 17:15:30 | 1353778 | 19241612 | hadam3p_anz_k2b1_201212_12_306_010265838_1 | 46,379 | 174,133 | 3.7546 |
13 Feb 2016 14:39:56 | 1353778 | 19241612 | hadam3p_anz_k2b1_201212_12_306_010265838_1 | 34,859 | 132,749 | 3.8082 |
12 Feb 2016 10:45:42 | 1353778 | 19241612 | hadam3p_anz_k2b1_201212_12_306_010265838_1 | 23,339 | 88,550 | 3.7941 |
11 Feb 2016 11:06:48 | 1353778 | 19241612 | hadam3p_anz_k2b1_201212_12_306_010265838_1 | 11,819 | 42,708 | 3.6135 |
©2024 cpdn.org