Name | hadam3p_eu_82ex_2005_1_008210512_1 |
Workunit | 8365636 |
Created | 6 Oct 2012, 8:37:52 UTC |
Sent | 6 Oct 2012, 8:38:06 UTC |
Report deadline | 18 Sep 2013, 13:58:06 UTC |
Received | 20 Oct 2012, 17:15:20 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1170519 |
Run time | 2 days 1 hour 21 min 29 sec |
CPU time | 1 day 22 hours 0 min 58 sec |
Validate state | Invalid |
Credit | 993.71 |
Device peak FLOPS | 2.50 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <stderr_txt> 03:03:14 (2792): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:03:16 (2792): No heartbeat from core client for 30 sec - exiting 06:24:25 (1436): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:24:26 (1436): No heartbeat from core client for 30 sec - exiting 06:24:27 (1436): No heartbeat from core client for 30 sec - exiting 06:24:28 (1436): No heartbeat from core client for 30 sec - exiting 07:07:54 (4372): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6284, selfPID=6284, iMonCtr=2 07:07:55 (4372): No heartbeat from core client for 30 sec - exiting 07:07:56 (4372): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6468, selfPID=5352, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4648, selfPID=3224, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2104, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3768, selfPID=2872, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Colobal Worerer:: CPDDN pNoceprocess is not ninnning, exiting, bRetVal1, 1, checkPID=0, selfPI3712, iM iMonCtr= 2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4756, selfPID=1692, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3596, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1108, selfPID=4556, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4200, selfPID=2648, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2540, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=128, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... GRegional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1336, selfPID=4804, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4416, selfPID=3188, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6008, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3968, selfPID=5220, iMonCtr=1 Model crash detected, will try to restart... 15:39:14 (4688): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:40:56 (5180): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5400, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4904, selfPID=4904, iMonCtr=1 CPDN Monitor - Quit request from BOINC... 
GCPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2676, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2828, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4568, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3060, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 19:04:14 (2904): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4640, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1856, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4404, selfPID=4404, iMonCtr=2 10:41:06 (2020): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:41:08 (2020): No heartbeat from core client for 30 sec - exiting 11:42:14 (124): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:42:15 (124): No heartbeat from core client for 30 sec - exiting Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3460, selfPID=3460, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1652, selfPID=1652, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2852, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2880, iMonCtr=2 Model crash detected, will try to restart... 
CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 13:08:25 (3500): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:08:30 (3500): No heartbeat from core client for 30 sec - exiting 13:08:31 (3500): No heartbeat from core client for 30 sec - exiting 13:08:32 (3500): No heartbeat from core client for 30 sec - exiting 07:54:12 (4436): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2552, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_82ex_2005_1_008210512\tmp\xaakg.namelists Image PC Routine Line Source hadrm3p_eu_um_6.0 00E0C52A Unknown Unknown Unknown hadrm3p_eu_um_6.0 00DB4460 Unknown Unknown Unknown hadrm3p_eu_um_6.0 00DB362A Unknown Unknown Unknown hadrm3p_eu_um_6.0 00D92469 Unknown Unknown Unknown hadrm3p_eu_um_6.0 00C966EB Unknown Unknown Unknown hadrm3p_eu_um_6.0 00D32AE2 Unknown Unknown Unknown hadrm3p_eu_um_6.0 00D335AF Unknown Unknown Unknown hadrm3p_eu_um_6.0 00AD9860 Unknown Unknown Unknown hadrm3p_eu_um_6.0 00DF0893 Unknown Unknown Unknown kernel32.dll 75F033AA Unknown Unknown Unknown ntdll.dll 77369EF2 Unknown Unknown Unknown ntdll.dll 77369EC5 Unknown Unknown Unknownforrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_82ex_2005_1_008210512\tmp\xaakm.namelists Image PC Routine Line Source hadam3p_eu_um_6.0 0150A39A Unknown Unknown Unknown hadam3p_eu_um_6.0 014B2CD0 Unknown Unknown Unknown hadam3p_eu_um_6.0 014B1E9A Unknown Unknown Unknown hadam3p_eu_um_6.0 01492819 Unknown Unknown Unknown hadam3p_eu_um_6.0 01392287 Unknown Unknown Unknown hadam3p_eu_um_6.0 0142E7B2 Unknown Unknown Unknown hadam3p_eu_um_6.0 
0142F2DA Unknown Unknown Unknown hadam3p_eu_um_6.0 011A9BD2 Unknown Unknown Unknown hadam3p_eu_um_6.0 014EE638 Unknown Unknown Unknown kernel32.dll 75F033AA Unknown Unknown Unknown ntdll.dll 77369EF2 Unknown Unknown Unknown ntdll.dll 77369EC5 Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2388, selfPID=916, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_82ex_2005_1_008210512_1_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_82ex_2005_1_008210512_1_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_82ex_2005_1_008210512_1_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_82ex_2005_1_008210512_1_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_82ex_2005_1_008210512_1_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_82ex_2005_1_008210512_1_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_82ex_2005_1_008210512_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |

Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
20 Oct 2012 10:46:49 | 1170519 | 15342225 | hadam3p_eu_82ex_2005_1_008210512_1 | 57,604 | 159,854 | 2.7751 |
20 Oct 2012 09:46:37 | 1170519 | 15342225 | hadam3p_eu_82ex_2005_1_008210512_1 | 57,600 | 159,386 | 2.7671 |
17 Oct 2012 16:26:53 | 1170519 | 15342225 | hadam3p_eu_82ex_2005_1_008210512_1 | 46,080 | 127,428 | 2.7654 |
14 Oct 2012 10:01:31 | 1170519 | 15342225 | hadam3p_eu_82ex_2005_1_008210512_1 | 34,560 | 95,897 | 2.7748 |
10 Oct 2012 19:10:59 | 1170519 | 15342225 | hadam3p_eu_82ex_2005_1_008210512_1 | 23,040 | 64,606 | 2.8041 |
08 Oct 2012 02:39:34 | 1170519 | 15342225 | hadam3p_eu_82ex_2005_1_008210512_1 | 11,616 | 33,337 | 2.8699 |
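
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. That relationship is inferred from the numbers in the trickle table, not stated on the page, so treat the following as a minimal sketch under that assumption. The names `TRICKLES`, `sec_per_timestep`, and `rows_consistent` are hypothetical, introduced only for illustration.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average)
TRICKLES = [
    (57604, 159854, 2.7751),
    (57600, 159386, 2.7671),
    (46080, 127428, 2.7654),
    (34560, 95897, 2.7748),
    (23040, 64606, 2.8041),
    (11616, 33337, 2.8699),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep


def rows_consistent(rows, tol=1e-4):
    """Return True if every reported average matches CPU time / timestep within tol."""
    return all(
        abs(sec_per_timestep(cpu, ts) - reported) < tol
        for ts, cpu, reported in rows
    )


if __name__ == "__main__":
    for ts, cpu, reported in TRICKLES:
        print(ts, cpu, reported, round(sec_per_timestep(cpu, ts), 4))
    print("all rows consistent:", rows_consistent(TRICKLES))
```

Under this reading, the per-timestep cost stayed between roughly 2.77 and 2.87 seconds across all six trickles, so the table itself shows no slowdown before the task failed.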