climateprediction.net home page
Task 12795159

Task 12795159

Name hadam3p_eu_w1x3_1998_1_006769039_2
Workunit 6972355
Created 12 Apr 2011, 14:28:18 UTC
Sent 12 Apr 2011, 14:41:18 UTC
Report deadline 24 Mar 2012, 20:01:18 UTC
Received 24 Apr 2011, 10:20:46 UTC
Server state In progress
Outcome ---
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 922918
Run time 3 days 2 hours 51 min 45 sec
CPU time 3 days 1 hours 8 min 29 sec
Validate state Invalid
Credit 1,392.75
Device peak FLOPS 2.34 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=664, iMonCtr=2
Model crash detected, will try to restart...
08:12:14 (4208): Can't acquire lockfile (32) - waiting 35s
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5172, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5180, selfPID=4208, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2268, selfPID=2268, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4152, selfPID=4152, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5824, selfPID=5824, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=604, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5340, iMonCtr=2
Model crash detected, will try to restart...
16:41:53 (5724): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5432, selfPID=5432, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1012, selfPID=1012, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4948, selfPID=3648, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1512, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish
11:19:22 (5516): Can't acquire lockfile (32) - waiting 35s

GCM: BUFFIN : Read Failed: No such file or directory
GCM : BUFFIN: C I/O Error feof - Unit 116 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 116 - Return code = 16


Model crashed: REPLANCA :I/O ERROR                                                                                                                                                                                                                                             tmp/xaakm.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_eu_w1x3_1998_1_006769039_2_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_w1x3_1998_1_006769039_2_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_w1x3_1998_1_006769039_2_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_w1x3_1998_1_006769039_2_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_w1x3_1998_1_006769039_2_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
22 Apr 2011 13:03:16 922918 12795159 hadam3p_eu_w1x3_1998_1_006769039_2 80,736 245,061 3.0353
20 Apr 2011 19:41:25 922918 12795159 hadam3p_eu_w1x3_1998_1_006769039_2 69,216 204,755 2.9582
20 Apr 2011 19:41:25 922918 12795159 hadam3p_eu_w1x3_1998_1_006769039_2 57,696 173,294 3.0036
20 Apr 2011 19:41:25 922918 12795159 hadam3p_eu_w1x3_1998_1_006769039_2 46,176 141,599 3.0665
20 Apr 2011 19:41:25 922918 12795159 hadam3p_eu_w1x3_1998_1_006769039_2 34,656 110,113 3.1773
20 Apr 2011 19:41:25 922918 12795159 hadam3p_eu_w1x3_1998_1_006769039_2 23,136 78,072 3.3745
20 Apr 2011 19:41:25 922918 12795159 hadam3p_eu_w1x3_1998_1_006769039_2 11,616 44,239 3.8085


©2024 climateprediction.net