Name | hadam3p_eu_l24a_2013_1_008555230_0 |
Workunit | 8702742 |
Created | 5 Mar 2014, 21:02:52 UTC |
Sent | 5 Mar 2014, 23:35:57 UTC |
Report deadline | 16 Feb 2015, 4:55:57 UTC |
Received | 22 Mar 2014, 22:57:54 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1241825 |
Run time | 2 days 12 hours 22 min 2 sec |
CPU time | 2 days 0 hours 11 min 8 sec |
Validate state | Invalid |
Credit | 995.30 |
Device peak FLOPS | 2.63 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>7.2.39</core_client_version> <![CDATA[ <stderr_txt> 00:54:45 (10416): start_timer_thread(): CreateThread() failed, errno 0 00:54:45 (8828): start_timer_thread(): CreateThread() failed, errno 0 diagnostics_init_unhandled_exception_monitor(): Creating hExceptionMonitorThread failed, errno 12 WARNING: BOINC2W3indows Runtime Debugger has beenread(bled. ateThread() failed, errno 0 23:03:32 (9344): start_timer_thread(): CreateThread() failed, errno 0 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1268, selfPID=1268, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10404, selfPID=10404, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11148, selfPID=11148, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=23124, selfPID=23124, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=23056, selfPID=23056, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... 03:25:58 (14232): start_timer_thread(): CreateThread() failed, errno 0 CPDN Monitor - Quit request from BOINC... 04:34:54 (13636): start_timer_thread(): CreateThread() failed, errno 0 CPDN Monitor - Quit request from BOINC... 05:37:02 (15520): start_timer_thread(): CreateThread() failed, errno 0 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14460, selfPID=14460, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 23:30:52 (2872): start_timer_thread(): CreateThread() failed, errno 0 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6136, selfPID=6136, iMonCtr=2 01:32:24 (7816): start_timer_thread(): CreateThread() failed, errno 0 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2624, selfPID=2624, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... 22:52:19 (13040): No heartbeat from core client for 30 sec - exiting 22:52:20 (13040): No heartbeat from core client for 30 sec - exiting 22:52:21 (13040): No heartbeat from core client for 30 sec - exiting 22:52:22 (13040): No heartbeat from core client for 30 sec - exiting 22:52:23 (13040): No heartbeat from core client for 30 sec - exiting 22:52:24 (13040): No heartbeat from core client for 30 sec - exiting 22:52:25 (13040): No heartbeat from core client for 30 sec - exiting 22:52:26 (13040): No heartbeat from core client for 30 sec - exiting 22:52:27 (13040): No heartbeat from core client for 30 sec - exiting 22:52:28 (13040): No heartbeat from core client for 30 sec - exiting 22:52:29 (13040): No heartbeat from core client for 30 sec - exiting 22:52:30 (13040): No heartbeat from core client for 30 sec - exiting 22:52:32 (13040): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:52:33 (13040): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11260, selfPID=11260, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3340, selfPID=3340, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 01:09:42 (6740): start_timer_thread(): CreateThread() failed, errno 0 CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10948, selfPID=10948, iMonCtr=2 15:40:08 (6024): No heartbeat from core client for 30 sec - exiting 15:40:09 (6024): No heartbeat from core client for 30 sec - exiting 15:40:10 (6024): No heartbeat from core client for 30 sec - exiting 15:40:11 (6024): No heartbeat from core client for 30 sec - exiting 15:40:12 (6024): No heartbeat from core client for 30 sec - exiting 15:40:13 (6024): No heartbeat from core client for 30 sec - exiting 15:40:14 (6024): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12216, selfPID=12352, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 19:49:23 (5460): start_timer_thread(): CreateThread() failed, errno 0 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10624, selfPID=10624, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9116, selfPID=9116, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 
02:11:20 (1480): No heartbeat from core client for 30 sec - exiting 02:11:21 (1480): No heartbeat from core client for 30 sec - exiting 02:11:22 (1480): No heartbeat from core client for 30 sec - exiting 02:11:23 (1480): No heartbeat from core client for 30 sec - exiting 02:11:24 (1480): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:11:25 (1480): No heartbeat from core client for 30 sec - exiting 02:11:27 (1480): No heartbeat from core client for 30 sec - exiting 02:11:41 (1172): start_timer_thread(): CreateThread() failed, errno 0 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetValCPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9636, selfPID=9636, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 09:53:41 (2696): No heartbeat from core client for 30 sec - exiting 09:53:42 (2696): No heartbeat from core client for 30 sec - exiting 09:53:43 (2696): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 14:28:42 (3772): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:29:57 (7060): start_timer_thread(): CreateThread() failed, errno 0 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7228, selfPID=7228, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8420, selfPID=8420, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8740, selfPID=8884, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_eu_l24a_2013_1_008555230/dataout/atmos_restart.day after 11 attempts forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_l24a_2013_1_008555230\tmp\xaakg.namelists Image PC Routine Line Source hadrm3p_eu_um_6.0 014BC52A Unknown Unknown Unknown hadrm3p_eu_um_6.0 01464460 Unknown Unknown Unknown hadrm3p_eu_um_6.0 0146362A Unknown Unknown Unknown hadrm3p_eu_um_6.0 01442469 Unknown Unknown Unknown hadrm3p_eu_um_6.0 013466EB Unknown Unknown Unknown hadrm3p_eu_um_6.0 013E2AE2 Unknown Unknown Unknown hadrm3p_eu_um_6.0 013E35AF Unknown Unknown Unknown hadrm3p_eu_um_6.0 01189860 Unknown Unknown Unknown hadrm3p_eu_um_6.0 014A0893 Unknown Unknown Unknown kernel32.dll 7632336A Unknown Unknown Unknown ntdll.dll 772A9F72 Unknown Unknown Unknown ntdll.dll 772A9F45 Unknown Unknown Unknown forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_l24a_2013_1_008555230\tmp\xaakm.namelists Image PC Routine Line Source hadam3p_eu_um_6.0 005BA39A Unknown Unknown Unknown hadam3p_eu_um_6.0 00562CD0 Unknown Unknown Unknown hadam3p_eu_um_6.0 00561E9A Unknown Unknown Unknown hadam3p_eu_um_6.0 00542819 Unknown Unknown Unknown hadam3p_eu_um_6.0 00442287 Unknown Unknown Unknown hadam3p_eu_um_6.0 004DE7B2 Unknown Unknown Unknown hadam3p_eu_um_6.0 004DF2DA Unknown Unknown Unknown hadam3p_eu_um_6.0 00259BD2 Unknown Unknown Unknown hadam3p_eu_um_6.0 0059E638 Unknown Unknown Unknown kernel32.dll 7632336A Unknown Unknown Unknown ntdll.dll 772A9F72 Unknown Unknown Unknown ntdll.dll 772A9F45 
Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7672, selfPID=8560, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_l24a_2013_1_008555230_0_6.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_l24a_2013_1_008555230_0_7.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_l24a_2013_1_008555230_0_8.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_l24a_2013_1_008555230_0_9.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_l24a_2013_1_008555230_0_10.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_l24a_2013_1_008555230_0_11.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_l24a_2013_1_008555230_0_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
22 Mar 2014 11:00:25 | 1241825 | 16345466 | hadam3p_eu_l24a_2013_1_008555230_0 | 57,696 | 158,060 | 2.7395 |
20 Mar 2014 21:18:14 | 1241825 | 16345466 | hadam3p_eu_l24a_2013_1_008555230_0 | 46,176 | 125,544 | 2.7188 |
16 Mar 2014 22:27:48 | 1241825 | 16345466 | hadam3p_eu_l24a_2013_1_008555230_0 | 34,656 | 93,402 | 2.6951 |
15 Mar 2014 11:06:11 | 1241825 | 16345466 | hadam3p_eu_l24a_2013_1_008555230_0 | 23,136 | 62,557 | 2.7039 |
10 Mar 2014 22:46:30 | 1241825 | 16345466 | hadam3p_eu_l24a_2013_1_008555230_0 | 11,616 | 31,035 | 2.6717 |
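
The Average (sec/TS) column in the trickle table looks like CPU Time (sec) divided by Timestep. The sketch below checks that assumption against the five rows above. The helper name `avg_sec_per_timestep` and the `TRICKLES` list are illustrative only and are not part of BOINC or the CPDN software.

```python
# Trickle rows copied from the table above:
# (timestep, cpu_time_sec, reported_average_sec_per_ts)
TRICKLES = [
    (57_696, 158_060, 2.7395),
    (46_176, 125_544, 2.7188),
    (34_656, 93_402, 2.6951),
    (23_136, 62_557, 2.7039),
    (11_616, 31_035, 2.6717),
]


def avg_sec_per_timestep(cpu_time_sec: float, timestep: int) -> float:
    """Hypothetical helper: CPU seconds spent per model timestep."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in TRICKLES:
    computed = avg_sec_per_timestep(cpu_time_sec, timestep)
    # Assumption: the page rounds the ratio to four decimal places.
    print(f"{timestep:>6} {cpu_time_sec:>7} {computed:.4f} (reported {reported})")
```

Under this assumption the computed ratios agree with the reported column to four decimal places. The same rows also show that each trickle arrived 11,520 timesteps after the previous one.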