climateprediction.net home page
Task 21569365

Task 21569365

Name wah2_safr50_a0t1_201612_24_790_011752301_1
Workunit 11752301
Created 15 Mar 2019, 0:42:04 UTC
Sent 17 Mar 2019, 8:31:34 UTC
Report deadline 27 Feb 2020, 13:51:34 UTC
Received 22 Mar 2019, 9:53:51 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1273646
Run time 2 days 17 hours 4 min 44 sec
CPU time 2 days 14 hours 43 min 23 sec
Validate state Invalid
Credit 5,339.28
Device peak FLOPS 4.09 GFLOPS
Application version Weather At Home 2 (wah2) v8.24
windows_intelx86
Peak working set size 297.77 MB
Peak swap size 221.64 MB
Peak disk usage 89.96 MB
Stderr
<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1680, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3384, selfPID=7608, iMonCtr=1
Model crash detected, will try to restart...
17:10:53 (1336): Can't acquire lockfile (32) - waiting 35s
17:10:54 (3052): BOINC client no longer exists - exiting
17:10:54 (3052): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:11:04 (3052): BOINC client no longer exists - exiting
17:11:14 (3052): timer handler: client dead, exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8440, selfPID=10424, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8812, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5756, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4708, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9896, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8584, selfPID=6656, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7524, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8800, selfPID=8836, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
cpdnmonitor: cannot open input file D:\User\BOINC/projects/climateprediction.net/wah2_safr50_a0t1_201612_24_790_011752301/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file D:\User\BOINC/projects/climateprediction.net/wah2_safr50_a0t1_201612_24_790_011752301/dataout/region_restart.day after 11 attempts

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xadae.pipe_dummy                                                            

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xacxf.pipe_dummy                                                            2048    
Leaving CPDN_ain::Monitor...
08:52:44 (9540): called boinc_finish(0)

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>wah2_safr50_a0t1_201612_24_790_011752301_1_r376789761_8.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a0t1_201612_24_790_011752301_1_r376789761_9.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a0t1_201612_24_790_011752301_1_r376789761_10.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a0t1_201612_24_790_011752301_1_r376789761_11.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a0t1_201612_24_790_011752301_1_r376789761_12.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a0t1_201612_24_790_011752301_1_r376789761_13.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a0t1_201612_24_790_011752301_1_r376789761_14.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a0t1_201612_24_790_011752301_1_r376789761_15.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a0t1_201612_24_790_011752301_1_r376789761_16.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a0t1_201612_24_790_011752301_1_r376789761_17.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a0t1_201612_24_790_011752301_1_r376789761_18.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a0t1_201612_24_790_011752301_1_r376789761_19.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a0t1_201612_24_790_011752301_1_r376789761_20.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a0t1_201612_24_790_011752301_1_r376789761_21.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a0t1_201612_24_790_011752301_1_r376789761_22.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a0t1_201612_24_790_011752301_1_r376789761_23.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a0t1_201612_24_790_011752301_1_r376789761_24.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_a0t1_201612_24_790_011752301_1_r376789761_restart.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
21 Mar 2019 21:21:24 1273646 21569365 wah2_safr50_a0t1_201612_24_790_011752301_1 80,939 216,184 2.6709
21 Mar 2019 14:09:22 1273646 21569365 wah2_safr50_a0t1_201612_24_790_011752301_1 69,419 187,310 2.6983
20 Mar 2019 21:22:52 1273646 21569365 wah2_safr50_a0t1_201612_24_790_011752301_1 57,899 158,480 2.7372
20 Mar 2019 12:41:26 1273646 21569365 wah2_safr50_a0t1_201612_24_790_011752301_1 46,379 129,421 2.7905
19 Mar 2019 20:06:48 1273646 21569365 wah2_safr50_a0t1_201612_24_790_011752301_1 34,859 99,260 2.8475
19 Mar 2019 10:26:20 1273646 21569365 wah2_safr50_a0t1_201612_24_790_011752301_1 23,339 67,942 2.9111
18 Mar 2019 18:20:08 1273646 21569365 wah2_safr50_a0t1_201612_24_790_011752301_1 11,819 37,342 3.1595


©2024 cpdn.org