climateprediction.net home page
Task 21464038

Task 21464038

Name wah2_safr50_c1e5_199212_16_777_011696069_2
Workunit 11696069
Created 1 Jan 2019, 13:33:48 UTC
Sent 1 Jan 2019, 13:37:55 UTC
Report deadline 14 Dec 2019, 18:57:55 UTC
Received 27 Mar 2019, 7:57:13 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1450228
Run time 6 days 17 hours 35 min 55 sec
CPU time 11 hours 19 min 59 sec
Validate state Invalid
Credit 2,299.53
Device peak FLOPS 2.71 GFLOPS
Application version Weather At Home 2 (wah2) v8.24
windows_intelx86
Stderr
<core_client_version>7.8.3</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=964, selfPID=6828, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1032, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4392, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4488, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4272, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4576, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4520, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4660, selfPID=1436, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4652, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4776, selfPID=4356, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4236, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4020, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4532, selfPID=4108, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2812, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3924, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7848, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7768, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
23:49:05 (5072): called boinc_finish(0)
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6696, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6752, selfPID=6632, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6388, selfPID=6912, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4756, selfPID=4156, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4576, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4564, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4320, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4476, selfPID=3768, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4584, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4680, selfPID=3164, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3620, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4228, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6352, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4036, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4496, selfPID=4012, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4072, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4400, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4476, selfPID=4148, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5536, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4348, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4444, selfPID=4100, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
21:40:09 (4100): called boinc_finish(0)
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1332, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4276, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4356, selfPID=4032, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4688, selfPID=3536, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4612, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4680, selfPID=4420, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5096, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5012, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5152, selfPICPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5024, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5100, selfPID=4584, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
17:06:24 (4584): called boinc_finish(0)
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6264, selfPID=4324, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
23:20:13 (4324): called boinc_finish(0)
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3920, selfPID=4824, iMonCtr=1
Model crash detected, will try to restart...

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xacxf.pipe_dummy                                                            2048    
Leaving CPDN_ain::Monitor...
15:55:17 (4264): called boinc_finish(0)

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>wah2_safr50_c1e5_199212_16_777_011696069_2_r931319777_4.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_c1e5_199212_16_777_011696069_2_r931319777_5.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_c1e5_199212_16_777_011696069_2_r931319777_6.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_c1e5_199212_16_777_011696069_2_r931319777_7.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_c1e5_199212_16_777_011696069_2_r931319777_8.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_c1e5_199212_16_777_011696069_2_r931319777_9.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_c1e5_199212_16_777_011696069_2_r931319777_10.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_c1e5_199212_16_777_011696069_2_r931319777_11.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_c1e5_199212_16_777_011696069_2_r931319777_12.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_c1e5_199212_16_777_011696069_2_r931319777_13.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_c1e5_199212_16_777_011696069_2_r931319777_14.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_c1e5_199212_16_777_011696069_2_r931319777_15.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_c1e5_199212_16_777_011696069_2_r931319777_16.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_c1e5_199212_16_777_011696069_2_r931319777_restart.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
23 Jan 2019 08:48:29 1450228 21464038 wah2_safr50_c1e5_199212_16_777_011696069_2 34,859 127,980 3.6714
14 Jan 2019 08:24:28 1450228 21464038 wah2_safr50_c1e5_199212_16_777_011696069_2 23,339 87,481 3.7483
04 Jan 2019 22:56:26 1450228 21464038 wah2_safr50_c1e5_199212_16_777_011696069_2 11,819 46,161 3.9057


©2024 cpdn.org