Task 15615413

Name hadam3p_pnw_dfx5_2042_1_008276997_1
Workunit 8428132
Created 20 Feb 2013, 16:33:07 UTC
Sent 20 Feb 2013, 16:33:13 UTC
Report deadline 2 Feb 2014, 21:53:13 UTC
Received 1 Mar 2013, 16:01:25 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1225775
Run time 3 days 7 hours 7 min 28 sec
CPU time 2 days 15 hours 47 min 16 sec
Validate state Invalid
Credit 2,004.61
Device peak FLOPS 3.13 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86
Stderr
<core_client_version>7.0.31</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5432, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4964, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6088, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5588, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 2
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=604, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 4
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:17:25 (1008): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
18:17:26 (1008): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4348, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1000, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 8
08:22:11 (2484): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:22:12 (2484): No heartbeat from core client for 30 sec - exiting
08:22:13 (2484): No heartbeat from core client for 30 sec - exiting
09:49:28 (5728): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:49:29 (5728): No heartbeat from core client for 30 sec - exiting
09:56:23 (6104): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:56:25 (6104): No heartbeat from core client for 30 sec - exiting
09:56:26 (6104): No heartbeat from core client for 30 sec - exiting
10:03:37 (4172): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:59:09 (7180): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:59:15 (7180): No heartbeat from core client for 30 sec - exiting
11:59:16 (7180): No heartbeat from core client for 30 sec - exiting
11:59:17 (7180): No heartbeat from core client for 30 sec - exiting
11:59:18 (7180): No heartbeat from core client for 30 sec - exiting
11:59:19 (7180): No heartbeat from core client for 30 sec - exiting
11:59:22 (7180): No heartbeat from core client for 30 sec - exiting
11:59:23 (7180): No heartbeat from core client for 30 sec - exiting
11:59:24 (7180): No heartbeat from core client for 30 sec - exiting
11:59:25 (7180): No heartbeat from core client for 30 sec - exiting
11:59:26 (7180): No heartbeat from core client for 30 sec - exiting
11:59:27 (7180): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=648, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8184, selfPID=6980, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 8
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_pnw_dfx5_2042_1_008276997/dataout/atmos_restart.day after 11 attempts
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_pnw_dfx5_2042_1_008276997\tmp\xaakg.namelists

Image              PC        Routine            Line        Source             
hadrm3p_pnw_um_6.  00C8C52A  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  00C34460  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  00C3362A  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  00C12469  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  00B166EB  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  00BB2AE2  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  00BB35AF  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  00959860  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  00C70893  Unknown               Unknown  Unknown
kernel32.dll       7667D2E9  Unknown               Unknown  Unknown
ntdll.dll          771E1603  Unknown               Unknown  Unknown
ntdll.dll          771E15D6  Unknown               Unknown  Unknown
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_pnw_dfx5_2042_1_008276997\tmp\xaakm.namelists

Image              PC        Routine            Line        Source             
hadam3p_pnw_um_6.  011FA39A  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  011A2CD0  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  011A1E9A  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  01182819  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  01082287  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  0111E7B2  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  0111F2DA  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  00E99BD2  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  011DE638  Unknown               Unknown  Unknown
kernel32.dll       7667D2E9  Unknown               Unknown  Unknown
ntdll.dll          771E1603  Unknown               Unknown  Unknown
ntdll.dll          771E15D6  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4892, selfPID=5672, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 8
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_dfx5_2042_1_008276997_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_dfx5_2042_1_008276997_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_dfx5_2042_1_008276997_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_dfx5_2042_1_008276997_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
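The stderr output above repeats a small number of message types many times. As a minimal sketch for summarizing such a log, the Python below counts lines by category. The category labels and the `classify` helper are hypothetical names chosen for illustration, and the sample holds only a few lines copied from the log, not the full output.

```python
import re
from collections import Counter

# Hypothetical category labels; the patterns match message text seen in the log above.
PATTERNS = {
    "suspend_request": re.compile(r"Suspend request from BOINC"),
    "no_heartbeat": re.compile(r"No heartbeat from core client"),
    "model_crash": re.compile(r"Model crash detected"),
    "fortran_eof": re.compile(r"forrtl: severe \(24\)"),
}

def classify(lines):
    """Count lines per category; lines matching no pattern are counted as 'other'."""
    counts = Counter()
    for line in lines:
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[label] += 1
                break
        else:
            counts["other"] += 1
    return counts

# A few lines copied verbatim from the stderr output (raw string keeps the backslashes).
sample = [
    "Suspended CPDN Monitor - Suspend request from BOINC...",
    "Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5432, iMonCtr=2",
    "Model crash detected, will try to restart...",
    "18:17:25 (1008): No heartbeat from core client for 30 sec - exiting",
    "Suspended CPDN Monitor - No 'heartbeat' from BOINC...",
    r"forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_pnw_dfx5_2042_1_008276997\tmp\xaakg.namelists",
    "Called boinc_finish",
]

counts = classify(sample)
for label, n in sorted(counts.items()):
    print(f"{label}: {n}")
```

To summarize the whole task, the same function could be fed the full text between the stderr_txt tags, split into lines.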
Latest Trickles Received
Time Sent (UTC)        Host ID   Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
27 Feb 2013 20:28:03   1225775   15615413   hadam3p_pnw_dfx5_2042_1_008276997_1  92,256    207,845         2.2529
27 Feb 2013 12:22:38   1225775   15615413   hadam3p_pnw_dfx5_2042_1_008276997_1  80,736    182,564         2.2612
26 Feb 2013 18:36:57   1225775   15615413   hadam3p_pnw_dfx5_2042_1_008276997_1  69,216    157,196         2.2711
25 Feb 2013 12:50:59   1225775   15615413   hadam3p_pnw_dfx5_2042_1_008276997_1  57,696    130,901         2.2688
23 Feb 2013 22:56:18   1225775   15615413   hadam3p_pnw_dfx5_2042_1_008276997_1  46,176    105,290         2.2802
23 Feb 2013 14:40:31   1225775   15615413   hadam3p_pnw_dfx5_2042_1_008276997_1  34,656    78,743          2.2721
22 Feb 2013 17:26:09   1225775   15615413   hadam3p_pnw_dfx5_2042_1_008276997_1  23,136    52,331          2.2619
21 Feb 2013 14:55:38   1225775   15615413   hadam3p_pnw_dfx5_2042_1_008276997_1  11,616    26,939          2.3191
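The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the timestep count. That reading is an assumption inferred from the numbers, not something the page states. A minimal Python sketch that checks it against the eight rows (the function name `average_sec_per_timestep` is hypothetical):

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average)
trickles = [
    (92256, 207845, 2.2529),
    (80736, 182564, 2.2612),
    (69216, 157196, 2.2711),
    (57696, 130901, 2.2688),
    (46176, 105290, 2.2802),
    (34656, 78743, 2.2721),
    (23136, 52331, 2.2619),
    (11616, 26939, 2.3191),
]

def average_sec_per_timestep(timestep, cpu_time_sec):
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep

for timestep, cpu_time_sec, reported in trickles:
    computed = average_sec_per_timestep(timestep, cpu_time_sec)
    print(f"timestep {timestep:>6}: computed {computed:.4f}, reported {reported:.4f}")

# Gap in timesteps between consecutive trickles (rows are listed newest first).
intervals = [newer[0] - older[0] for newer, older in zip(trickles, trickles[1:])]
print("timestep intervals:", intervals)
```

For these rows, every computed value agrees with the reported average to four decimal places, and consecutive trickles are 11,520 timesteps apart.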