Task 22328939

Name wah2_eas25_a0j8_199011_25_994_012216130_1
Workunit 12216130
Created 25 Jun 2023, 1:46:22 UTC
Sent 25 Jun 2023, 1:57:48 UTC
Report deadline 6 Jul 2024, 7:17:48 UTC
Received 3 Aug 2023, 13:59:39 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1542065
Run time 14 days 18 hours 24 min 1 sec
CPU time 11 days 15 hours 20 min 35 sec
Validate state Invalid
Credit 5,339.28
Device peak FLOPS 1.00 GFLOPS
Application version Weather At Home 2 (wah2) v8.24 windows_intelx86
Peak working set size 430.61 MB
Peak swap size 394.71 MB
Peak disk usage 874.21 MB
Stderr
<core_client_version>7.22.2</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10404, selfPID=23444, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
04:19:49 (23444): called boinc_finish(0)
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12900, selfPID=12900, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13488, selfPID=13712, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4124, selfPID=4124, iMonCtr=1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12860, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
10:40:35 (12860): called boinc_finish(0)
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=32444, selfPID=32444, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13204, selfPID=13204, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14056, selfPID=14056, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13888, selfPID=13888, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13888, selfPID=14328, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9964, selfPID=9964, iMonCtr=1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=24664, selfPID=22552, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14716, selfPID=14716, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15864, selfPID=16228, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14596, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14184, selfPID=14184, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=31408, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=32556, selfPID=15212, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12168, selfPID=26352, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15504, selfPID=15724, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13784, selfPID=13784, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13348, selfPID=13348, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11908, selfPID=11908, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15408, selfPID=15408, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12752, selfPID=12752, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8844, selfPID=14340, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8844, selfPID=8844, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9592, selfPID=9592, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9592, selfPID=13736, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13492, selfPID=14164, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13492, selfPID=13492, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7660, selfPID=7660, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14180, selfPID=14180, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15404, selfPID=15404, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14012, selfPID=14260, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14012, selfPID=14012, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15352, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14496, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
09:49:59 (14496): called boinc_finish(0)

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_8.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_9.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_10.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_11.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_12.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_13.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_14.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_15.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_16.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_17.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_18.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_19.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_20.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_21.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_22.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_23.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_24.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_25.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_restart.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
</message>
]]>
Latest Trickles Received
Time Sent (UTC)        Host ID   Result ID   Result Name                                 Timestep   CPU Time (sec)   Average (sec/TS)
02 Aug 2023 00:02:48   1542065   22328939    wah2_eas25_a0j8_199011_25_994_012216130_1   80,939     956,106          11.8127
28 Jul 2023 05:41:43   1542065   22328939    wah2_eas25_a0j8_199011_25_994_012216130_1   69,419     816,428          11.7609
21 Jul 2023 17:03:46   1542065   22328939    wah2_eas25_a0j8_199011_25_994_012216130_1   57,899     636,627          10.9955
13 Jul 2023 20:32:02   1542065   22328939    wah2_eas25_a0j8_199011_25_994_012216130_1   46,379     496,096          10.6966
11 Jul 2023 00:42:49   1542065   22328939    wah2_eas25_a0j8_199011_25_994_012216130_1   34,859     388,995          11.1591
06 Jul 2023 02:36:21   1542065   22328939    wah2_eas25_a0j8_199011_25_994_012216130_1   23,339     281,485          12.0607
03 Jul 2023 00:47:15   1542065   22328939    wah2_eas25_a0j8_199011_25_994_012216130_1   11,819     146,138          12.3647
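
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. The page does not say so; this is inferred from the figures. A minimal Python sketch that checks this assumption against the rows in the trickle table follows. The function name `average_sec_per_timestep` and the tolerance are illustrative choices, not part of the CPDN or BOINC software.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
trickles = [
    (80939, 956106, 11.8127),
    (69419, 816428, 11.7609),
    (57899, 636627, 10.9955),
    (46379, 496096, 10.6966),
    (34859, 388995, 11.1591),
    (23339, 281485, 12.0607),
    (11819, 146138, 12.3647),
]


def average_sec_per_timestep(cpu_time_sec, timestep):
    """Assumed relationship: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep


# Compare the computed value with the figure reported on the page.
# The tolerance covers the four-decimal rounding of the reported column.
TOLERANCE = 1e-4
results = []
for timestep, cpu_time_sec, reported in trickles:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    matches = abs(computed - reported) < TOLERANCE
    results.append(matches)
    print(f"timestep={timestep:>6}  computed={computed:.4f}  "
          f"reported={reported:.4f}  match={matches}")
```

If every row matches, the reported average is consistent with a simple ratio of the two preceding columns. That would make it a running average over the whole task so far, not a rate for the most recent interval.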


©2024 climateprediction.net