climateprediction.net home page
Task 12559764

Task 12559764

Name hadam3p_pnw_zgoq_1985_1_006985746_1
Workunit 7189062
Created 6 Feb 2011, 2:17:56 UTC
Sent 6 Feb 2011, 3:06:09 UTC
Report deadline 19 Jan 2012, 8:26:09 UTC
Received 21 Apr 2011, 11:46:27 UTC
Server state Over
Outcome No reply
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1042360
Run time 4 days 16 hours 23 min 47 sec
CPU time 3 days 17 hours 50 min 5 sec
Validate state Invalid
Credit 2,254.93
Device peak FLOPS 2.51 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.08
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<stderr_txt>
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5452, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=824, selfPID=5348, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6880, selfPID=5676, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4532, selfPID=4532, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6680, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5640, selfPID=6612, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6260, selfPID=5712, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 2
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8008, selfPID=4196, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3052, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7128, selfPID=5392, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 5
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6236, selfPID=4784, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3344, selfPID=4340, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 6
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2480, selfPID=5840, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3888, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5420, selfPID=5332, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 6
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7068, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4304, selfPID=5044, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 6
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1388, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6256, selfPID=2580, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 6
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4836, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5716, selfPID=2860, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7128, selfPID=7128, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CGlobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6872, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4692, selfPID=3180, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 8
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8160, selfPID=7464, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6692, selfPID=5128, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 9
21:26:09 (5128): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_pnw_zgoq_1985_1_006985746_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zgoq_1985_1_006985746_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zgoq_1985_1_006985746_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
10 Apr 2011 11:08:15 1042360 12559764 hadam3p_pnw_zgoq_1985_1_006985746_1 103,776 314,965 3.0350
04 Apr 2011 10:06:07 1042360 12559764 hadam3p_pnw_zgoq_1985_1_006985746_1 92,256 273,930 2.9692
03 Apr 2011 09:01:07 1042360 12559764 hadam3p_pnw_zgoq_1985_1_006985746_1 80,736 239,391 2.9651
23 Mar 2011 10:26:18 1042360 12559764 hadam3p_pnw_zgoq_1985_1_006985746_1 69,216 205,396 2.9675
23 Mar 2011 10:26:18 1042360 12559764 hadam3p_pnw_zgoq_1985_1_006985746_1 57,696 168,939 2.9281
26 Feb 2011 12:13:49 1042360 12559764 hadam3p_pnw_zgoq_1985_1_006985746_1 46,176 136,436 2.9547
25 Feb 2011 14:43:47 1042360 12559764 hadam3p_pnw_zgoq_1985_1_006985746_1 34,656 102,065 2.9451
12 Feb 2011 22:43:18 1042360 12559764 hadam3p_pnw_zgoq_1985_1_006985746_1 23,138 68,709 2.9695
12 Feb 2011 12:53:42 1042360 12559764 hadam3p_pnw_zgoq_1985_1_006985746_1 23,136 68,328 2.9533
06 Feb 2011 14:32:11 1042360 12559764 hadam3p_pnw_zgoq_1985_1_006985746_1 11,616 34,855 3.0006


©2024 cpdn.org