Name | hadam3p_pnw_dfx5_2042_1_008276997_1 |
Workunit | 8428132 |
Created | 20 Feb 2013, 16:33:07 UTC |
Sent | 20 Feb 2013, 16:33:13 UTC |
Report deadline | 2 Feb 2014, 21:53:13 UTC |
Received | 1 Mar 2013, 16:01:25 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1225775 |
Run time | 3 days 7 hours 7 min 28 sec |
CPU time | 2 days 15 hours 47 min 16 sec |
Validate state | Invalid |
Credit | 2,004.61 |
Device peak FLOPS | 3.13 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.31</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5432, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4964, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6088, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5588, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 2
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=604, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 4
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:17:25 (1008): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
18:17:26 (1008): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4348, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1000, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 8
08:22:11 (2484): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:22:12 (2484): No heartbeat from core client for 30 sec - exiting
08:22:13 (2484): No heartbeat from core client for 30 sec - exiting
09:49:28 (5728): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:49:29 (5728): No heartbeat from core client for 30 sec - exiting
09:56:23 (6104): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:56:25 (6104): No heartbeat from core client for 30 sec - exiting
09:56:26 (6104): No heartbeat from core client for 30 sec - exiting
10:03:37 (4172): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:59:09 (7180): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:59:15 (7180): No heartbeat from core client for 30 sec - exiting
11:59:16 (7180): No heartbeat from core client for 30 sec - exiting
11:59:17 (7180): No heartbeat from core client for 30 sec - exiting
11:59:18 (7180): No heartbeat from core client for 30 sec - exiting
11:59:19 (7180): No heartbeat from core client for 30 sec - exiting
11:59:22 (7180): No heartbeat from core client for 30 sec - exiting
11:59:23 (7180): No heartbeat from core client for 30 sec - exiting
11:59:24 (7180): No heartbeat from core client for 30 sec - exiting
11:59:25 (7180): No heartbeat from core client for 30 sec - exiting
11:59:26 (7180): No heartbeat from core client for 30 sec - exiting
11:59:27 (7180): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=648, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8184, selfPID=6980, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 8
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_pnw_dfx5_2042_1_008276997/dataout/atmos_restart.day after 11 attempts
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_pnw_dfx5_2042_1_008276997\tmp\xaakg.namelists
Image PC Routine Line Source
hadrm3p_pnw_um_6. 00C8C52A Unknown Unknown Unknown
hadrm3p_pnw_um_6. 00C34460 Unknown Unknown Unknown
hadrm3p_pnw_um_6. 00C3362A Unknown Unknown Unknown
hadrm3p_pnw_um_6. 00C12469 Unknown Unknown Unknown
hadrm3p_pnw_um_6. 00B166EB Unknown Unknown Unknown
hadrm3p_pnw_um_6. 00BB2AE2 Unknown Unknown Unknown
hadrm3p_pnw_um_6. 00BB35AF Unknown Unknown Unknown
hadrm3p_pnw_um_6. 00959860 Unknown Unknown Unknown
hadrm3p_pnw_um_6. 00C70893 Unknown Unknown Unknown
kernel32.dll 7667D2E9 Unknown Unknown Unknown
ntdll.dll 771E1603 Unknown Unknown Unknown
ntdll.dll 771E15D6 Unknown Unknown Unknown
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_pnw_dfx5_2042_1_008276997\tmp\xaakm.namelists
Image PC Routine Line Source
hadam3p_pnw_um_6. 011FA39A Unknown Unknown Unknown
hadam3p_pnw_um_6. 011A2CD0 Unknown Unknown Unknown
hadam3p_pnw_um_6. 011A1E9A Unknown Unknown Unknown
hadam3p_pnw_um_6. 01182819 Unknown Unknown Unknown
hadam3p_pnw_um_6. 01082287 Unknown Unknown Unknown
hadam3p_pnw_um_6. 0111E7B2 Unknown Unknown Unknown
hadam3p_pnw_um_6. 0111F2DA Unknown Unknown Unknown
hadam3p_pnw_um_6. 00E99BD2 Unknown Unknown Unknown
hadam3p_pnw_um_6. 011DE638 Unknown Unknown Unknown
kernel32.dll 7667D2E9 Unknown Unknown Unknown
ntdll.dll 771E1603 Unknown Unknown Unknown
ntdll.dll 771E15D6 Unknown Unknown Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4892, selfPID=5672, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 8
Called boinc_finish
</stderr_txt>
<message>
upload failure:
<file_xfer_error> <file_name>hadam3p_pnw_dfx5_2042_1_008276997_1_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error>
<file_xfer_error> <file_name>hadam3p_pnw_dfx5_2042_1_008276997_1_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error>
<file_xfer_error> <file_name>hadam3p_pnw_dfx5_2042_1_008276997_1_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error>
<file_xfer_error> <file_name>hadam3p_pnw_dfx5_2042_1_008276997_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error>
</message>
]]> |
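The stderr output above mixes several kinds of messages (suspend requests, missed heartbeats, model restarts, Fortran read errors, upload failures). A minimal sketch of how one might tally them is shown here. The category names and the `tally_events` helper are hypothetical, chosen only for illustration; the search strings and the excerpt are copied from the log above.

```python
from collections import Counter

# Hypothetical category names; the substrings are copied from the stderr above.
PATTERNS = {
    "suspend_request": "Suspend request from BOINC",
    "no_heartbeat": "No heartbeat from core client",
    "model_crash": "Model crash detected",
    "fortran_eof": "forrtl: severe (24)",
    "upload_failure": "<error_code>-161</error_code>",
}


def tally_events(stderr_text: str) -> Counter:
    """Count non-overlapping occurrences of each known message substring."""
    counts = Counter()
    for name, needle in PATTERNS.items():
        counts[name] = stderr_text.count(needle)
    return counts


# A short verbatim excerpt from the log above, used only to exercise the helper.
EXCERPT = (
    "Suspended CPDN Monitor - Suspend request from BOINC... "
    "Controller:: CPDN process is not running, exiting, bRetVal = 1, "
    "checkPID=0, selfPID=604, iMonCtr=2 "
    "Model crash detected, will try to restart... "
    "Leaving CPDN_Main::Monitor... "
    "Regional yearly means requires 12 input files got 4 "
    "Suspended CPDN Monitor - Suspend request from BOINC..."
)

if __name__ == "__main__":
    for name, count in tally_events(EXCERPT).items():
        print(f"{name}: {count}")
```

Running the same helper over the full stderr text would give per-category totals for the whole task. Substring counting is used rather than line splitting because the log may arrive with its line breaks collapsed.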
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
27 Feb 2013 20:28:03 | 1225775 | 15615413 | hadam3p_pnw_dfx5_2042_1_008276997_1 | 92,256 | 207,845 | 2.2529 |
27 Feb 2013 12:22:38 | 1225775 | 15615413 | hadam3p_pnw_dfx5_2042_1_008276997_1 | 80,736 | 182,564 | 2.2612 |
26 Feb 2013 18:36:57 | 1225775 | 15615413 | hadam3p_pnw_dfx5_2042_1_008276997_1 | 69,216 | 157,196 | 2.2711 |
25 Feb 2013 12:50:59 | 1225775 | 15615413 | hadam3p_pnw_dfx5_2042_1_008276997_1 | 57,696 | 130,901 | 2.2688 |
23 Feb 2013 22:56:18 | 1225775 | 15615413 | hadam3p_pnw_dfx5_2042_1_008276997_1 | 46,176 | 105,290 | 2.2802 |
23 Feb 2013 14:40:31 | 1225775 | 15615413 | hadam3p_pnw_dfx5_2042_1_008276997_1 | 34,656 | 78,743 | 2.2721 |
22 Feb 2013 17:26:09 | 1225775 | 15615413 | hadam3p_pnw_dfx5_2042_1_008276997_1 | 23,136 | 52,331 | 2.2619 |
21 Feb 2013 14:55:38 | 1225775 | 15615413 | hadam3p_pnw_dfx5_2042_1_008276997_1 | 11,616 | 26,939 | 2.3191 |
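The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. This is an inference from the figures, not something the page states. A small sketch that checks it against the rows above follows; the `ROWS` list and the `sec_per_timestep` helper are names introduced here for illustration.

```python
# (timestep, cpu_time_sec, reported_avg_sec_per_ts) copied from the trickle table.
ROWS = [
    (92_256, 207_845, 2.2529),
    (80_736, 182_564, 2.2612),
    (69_216, 157_196, 2.2711),
    (57_696, 130_901, 2.2688),
    (46_176, 105_290, 2.2802),
    (34_656, 78_743, 2.2721),
    (23_136, 52_331, 2.2619),
    (11_616, 26_939, 2.3191),
]


def sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Assumed definition: cumulative CPU seconds per cumulative timestep."""
    return cpu_time_sec / timestep


if __name__ == "__main__":
    for timestep, cpu_time, reported in ROWS:
        computed = sec_per_timestep(cpu_time, timestep)
        # Compare at the table's four-decimal precision.
        matches = abs(computed - reported) < 5e-5
        print(f"{timestep:>6}  computed={computed:.4f}  reported={reported:.4f}  match={matches}")
```

Under that assumption every row agrees with the reported value to four decimal places.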
©2024 cpdn.org