Task 22328935

Name: wah2_eas25_a22p_200211_25_994_012218127_1
Workunit: 12218127
Created: 25 Jun 2023, 1:37:21 UTC
Sent: 25 Jun 2023, 1:57:48 UTC
Report deadline: 6 Jul 2024, 7:17:48 UTC
Received: 10 Aug 2023, 15:43:49 UTC
Server state: Over
Outcome: Computation error
Client state: Compute error
Exit status: 0 (0x00000000)
Computer ID: 1542065
Run time: 19 days 14 hours 32 min 58 sec
CPU time: 3 days 15 hours 58 min 20 sec
Validate state: Invalid
Credit: 3,059.47
Device peak FLOPS: 1.00 GFLOPS
Application version: Weather At Home 2 (wah2) v8.24 windows_intelx86
Peak working set size: 430.79 MB
Peak swap size: 393.89 MB
Peak disk usage: 880.23 MB
Stderr
<core_client_version>7.22.2</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=24132, selfPID=28300, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5080, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
10:21:00 (5080): called boinc_finish(0)
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13744, selfPID=13744, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16500, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13496, selfPID=13744, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=768, selfPID=768, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=768, selfPID=2412, iMonCtr=1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12728, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
10:40:35 (12728): called boinc_finish(0)
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=40328, selfPID=40328, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15084, selfPID=15084, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14016, selfPID=14016, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13944, selfPID=13944, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13944, selfPID=13656, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9960, selfPID=9960, iMonCtr=1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=24260, selfPID=22120, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11320, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14708, selfPID=14708, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15940, selfPID=16368, iMonCtr=1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9024, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=25084, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14152, selfPID=14152, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10696, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=30012, selfPID=30012, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=30012, selfPID=7032, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15472, selfPID=15472, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14156, selfPID=14156, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10180, selfPID=10180, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11036, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=24696, selfPID=15100, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14636, selfPID=14636, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14636, selfPID=15576, iMonCtr=1
GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16

Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8872, selfPID=8872, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8872, selfPID=14444, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13380, selfPID=14128, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13380, selfPID=13380, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13336, selfPID=13336, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14112, selfPID=14112, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15376, selfPID=15376, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13980, selfPID=14220, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13980, selfPID=13980, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15336, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
13:20:39 (15336): called boinc_finish(0)
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14472, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15048, selfPID=15048, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15048, selfPID=15304, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13972, selfPID=13972, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=16136, selfPID=16136, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=16136, selfPID=12816, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15036, selfPID=15228, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15036, selfPID=15036, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=19376, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=18508, selfPID=11336, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
02:03:42 (11336): called boinc_finish(0)

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_8.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_9.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_10.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_11.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_12.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_13.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_14.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_15.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_16.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_17.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_18.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_19.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_20.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_21.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_22.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_23.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_24.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_25.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_restart.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                                Timestep  CPU Time (sec)  Average (sec/TS)
30 Jul 2023 23:17:39  1542065  22328935   wah2_eas25_a22p_200211_25_994_012218127_1  80,939    955,268         11.8023
26 Jul 2023 10:36:37  1542065  22328935   wah2_eas25_a22p_200211_25_994_012218127_1  69,419    799,648         11.5192
20 Jul 2023 17:23:55  1542065  22328935   wah2_eas25_a22p_200211_25_994_012218127_1  57,899    621,696         10.7376
20 Jul 2023 10:40:15  1542065  22328935   wah2_eas25_a22p_200211_25_994_012218127_1  57,899    617,937         10.6727
10 Aug 2023 02:55:44  1542065  22328935   wah2_eas25_a22p_200211_25_994_012218127_1  46,379    308,921         6.6608
13 Jul 2023 18:30:35  1542065  22328935   wah2_eas25_a22p_200211_25_994_012218127_1  46,379    491,971         10.6076
09 Jul 2023 23:01:19  1542065  22328935   wah2_eas25_a22p_200211_25_994_012218127_1  34,859    382,956         10.9859
08 Aug 2023 19:58:42  1542065  22328935   wah2_eas25_a22p_200211_25_994_012218127_1  34,859    238,628         6.8455
05 Jul 2023 21:18:50  1542065  22328935   wah2_eas25_a22p_200211_25_994_012218127_1  23,339    275,304         11.7959
06 Aug 2023 07:01:07  1542065  22328935   wah2_eas25_a22p_200211_25_994_012218127_1  23,339    168,151         7.2047
04 Aug 2023 17:59:07  1542065  22328935   wah2_eas25_a22p_200211_25_994_012218127_1  11,819    96,819          8.1918
02 Jul 2023 20:44:23  1542065  22328935   wah2_eas25_a22p_200211_25_994_012218127_1  11,819    140,145         11.8576


©2024 climateprediction.net