Task 21638498

Name: wah2_sam50_a1nt_201612_25_810_011827825_1
Workunit: 11827825
Created: 24 Apr 2019, 6:44:23 UTC
Sent: 28 Apr 2019, 0:28:23 UTC
Report deadline: 9 Apr 2020, 5:48:23 UTC
Received: 21 Jun 2019, 2:26:41 UTC
Server state: Over
Outcome: Computation error
Client state: Compute error
Exit status: 0 (0x00000000)
Computer ID: 1422796
Run time: 6 days 9 hours 31 min 25 sec
CPU time: 3 days 7 hours 22 min 34 sec
Validate state: Invalid
Credit: 8,379.03
Device peak FLOPS: 4.10 GFLOPS
Application version: Weather At Home 2 (wah2) v8.24 windows_intelx86
Peak working set size: 226.36 MB
Peak swap size: 189.44 MB
Peak disk usage: 218.55 MB
Stderr
<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=21048, selfPID=8520, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2232, selfPID=4340, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
23:29:48 (4340): called boinc_finish(0)
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=20644, selfPID=20644, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=20644, selfPID=8772, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3052, selfPID=15064, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16

CPDN Monitor - Quit request from BOINC...
07:49:44 (10864): BOINC client no longer exists - exiting
07:49:44 (10864): timer handler: client dead, exiting
07:49:54 (10864): BOINC client no longer exists - exiting
07:49:54 (10864): timer handler: client dead, exiting
07:50:04 (10864): BOINC client no longer exists - exiting
07:50:04 (10864): timer handler: client dead, exiting
07:50:14 (10864): BOINC client no longer exists - exiting
07:50:14 (10864): timer handler: client dead, exiting
07:50:24 (10864): BOINC client no longer exists - exiting
07:50:24 (10864): timer handler: client dead, exiting
07:50:34 (10864): BOINC client no longer exists - exiting
07:50:34 (10864): timer handler: client dead, exiting
07:50:40 (18020): Can't acquire lockfile (32) - waiting 35s
07:50:44 (10864): BOINC client no longer exists - exiting
07:50:44 (10864): timer handler: client dead, exiting
07:50:54 (10864): BOINC client no longer exists - exiting
07:50:54 (10864): timer handler: client dead, exiting
07:51:04 (10864): BOINC client no longer exists - exiting
07:51:04 (10864): timer handler: client dead, exiting
07:51:14 (10864): BOINC client no longer exists - exiting
07:51:14 (10864): timer handler: client dead, exiting
07:51:15 (18020): Can't acquire lockfile (32) - exiting
07:51:15 (18020): Error: The process cannot access the file because it is being used by another process.

 (0x20)
07:51:25 (10864): BOINC client no longer exists - exiting
07:51:25 (10864): timer handler: client dead, exiting
07:51:35 (10864): BOINC client no longer exists - exiting
07:51:35 (10864): timer handler: client dead, exiting
07:51:45 (10864): BOINC client no longer exists - exiting
07:51:45 (10864): timer handler: client dead, exiting
07:51:55 (10864): BOINC client no longer exists - exiting
07:51:55 (10864): timer handler: client dead, exiting
07:52:05 (10864): BOINC client no longer exists - exiting
07:52:05 (10864): timer handler: client dead, exiting
07:52:15 (10864): BOINC client no longer exists - exiting
07:52:15 (10864): timer handler: client dead, exiting
07:52:25 (10864): BOINC client no longer exists - exiting
07:52:25 (10864): timer handler: client dead, exiting
07:52:35 (10864): BOINC client no longer exists - exiting
07:52:35 (10864): timer handler: client dead, exiting
07:52:45 (10864): BOINC client no longer exists - exiting
07:52:45 (10864): timer handler: client dead, exiting
07:52:55 (10864): BOINC client no longer exists - exiting
07:52:55 (10864): timer handler: client dead, exiting
07:53:05 (10864): BOINC client no longer exists - exiting
07:53:05 (10864): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16

Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=17268, selfPID=20636, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13344, selfPID=144, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
cpdnmonitor: cannot open input file D:\boinc\data/projects/climateprediction.net/wah2_sam50_a1nt_201612_25_810_011827825/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file D:\boinc\data/projects/climateprediction.net/wah2_sam50_a1nt_201612_25_810_011827825/dataout/region_restart.day after 11 attempts

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xadae.pipe_dummy                                                            

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xacxf.pipe_dummy                                                            2048    
Leaving CPDN_ain::Monitor...
21:26:16 (16304): called boinc_finish(0)

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>wah2_sam50_a1nt_201612_25_810_011827825_1_r735738583_12.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a1nt_201612_25_810_011827825_1_r735738583_13.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a1nt_201612_25_810_011827825_1_r735738583_14.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a1nt_201612_25_810_011827825_1_r735738583_15.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a1nt_201612_25_810_011827825_1_r735738583_16.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a1nt_201612_25_810_011827825_1_r735738583_17.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a1nt_201612_25_810_011827825_1_r735738583_18.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a1nt_201612_25_810_011827825_1_r735738583_19.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a1nt_201612_25_810_011827825_1_r735738583_20.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a1nt_201612_25_810_011827825_1_r735738583_21.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a1nt_201612_25_810_011827825_1_r735738583_22.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a1nt_201612_25_810_011827825_1_r735738583_23.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a1nt_201612_25_810_011827825_1_r735738583_24.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a1nt_201612_25_810_011827825_1_r735738583_25.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a1nt_201612_25_810_011827825_1_r735738583_restart.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                                Timestep  CPU Time (sec)  Average (sec/TS)
19 Jun 2019 03:48:40  1422796  21638498   wah2_sam50_a1nt_201612_25_810_011827825_1  127,019   278,290         2.1909
17 Jun 2019 17:51:57  1422796  21638498   wah2_sam50_a1nt_201612_25_810_011827825_1  115,499   254,789         2.2060
10 Jun 2019 11:09:47  1422796  21638498   wah2_sam50_a1nt_201612_25_810_011827825_1  103,979   228,822         2.2007
10 Jun 2019 01:45:54  1422796  21638498   wah2_sam50_a1nt_201612_25_810_011827825_1  92,459    205,652         2.2243
10 Jun 2019 01:45:54  1422796  21638498   wah2_sam50_a1nt_201612_25_810_011827825_1  80,939    181,192         2.2386
10 Jun 2019 01:45:54  1422796  21638498   wah2_sam50_a1nt_201612_25_810_011827825_1  69,419    156,690         2.2572
10 Jun 2019 01:45:54  1422796  21638498   wah2_sam50_a1nt_201612_25_810_011827825_1  57,899    130,719         2.2577
01 May 2019 10:13:43  1422796  21638498   wah2_sam50_a1nt_201612_25_810_011827825_1  46,379    97,207          2.0959
29 Apr 2019 05:00:19  1422796  21638498   wah2_sam50_a1nt_201612_25_810_011827825_1  34,859    72,980          2.0936
28 Apr 2019 18:18:07  1422796  21638498   wah2_sam50_a1nt_201612_25_810_011827825_1  23,339    48,384          2.0731
28 Apr 2019 09:08:06  1422796  21638498   wah2_sam50_a1nt_201612_25_810_011827825_1  11,819    23,508          1.9890
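
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimals. The sketch below checks that reading against three rows copied from the table. The function name `average_sec_per_timestep` and the `rows` list are illustrative only and are not part of any CPDN or BOINC tooling.

```python
# Three (timestep, cpu_seconds) pairs copied from the trickle table above.
rows = [
    (127_019, 278_290),  # 19 Jun 2019 03:48:40
    (115_499, 254_789),  # 17 Jun 2019 17:51:57
    (11_819, 23_508),    # 28 Apr 2019 09:08:06
]


def average_sec_per_timestep(timestep: int, cpu_seconds: int) -> float:
    """Assumed formula: cumulative CPU seconds / cumulative timesteps."""
    return round(cpu_seconds / timestep, 4)


averages = [average_sec_per_timestep(ts, cpu) for ts, cpu in rows]

for (ts, cpu), avg in zip(rows, averages):
    print(f"timestep={ts:>7}  cpu_sec={cpu:>7}  avg={avg:.4f}")
```

Under this assumption the computed values match the listed averages for these rows (2.1909, 2.2060, 1.9890). The same table also shows consecutive trickles spaced 11,520 timesteps apart.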

