climateprediction.net home page
Task 20636232

Task 20636232

Name wah2_sas50_s004_209112_13_635_011187169_0
Workunit 11187169
Created 9 Aug 2017, 19:06:37 UTC
Sent 14 Aug 2017, 6:58:54 UTC
Report deadline 27 Jul 2018, 12:18:54 UTC
Received 2 Sep 2017, 23:57:44 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1440167
Run time 5 days 3 hours 8 min 57 sec
CPU time 5 days 3 hours 8 min 57 sec
Validate state Invalid
Credit 8,379.03
Device peak FLOPS 3.55 GFLOPS
Application version Weather At Home 2 (wah2) v8.24
windows_intelx86
Peak working set size 233.35 MB
Peak swap size 199.64 MB
Peak disk usage 37.65 MB
Stderr
<core_client_version>7.6.33</core_client_version>
<![CDATA[
<stderr_txt>
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6000, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5940, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1136, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5956, selfPID=5108, iMonCtr=1
Model crash detected, will try to restart...
14:17:29 (3400): Can't acquire lockfile (32) - waiting 35s
14:18:04 (3400): Can't acquire lockfile (32) - exiting
14:18:04 (3400): Error: Der Prozess kann nicht auf die Datei zugreifen, da sie von einem anderen Prozess verwendet wird.

 (0x20)
20:03:03 (6636): Can't acquire lockfile (32) - waiting 35s
20:03:38 (6636): Can't acquire lockfile (32) - exiting
20:03:38 (6636): Error: Der Prozess kann nicht auf die Datei zugreifen, da sie von einem anderen Prozess verwendet wird.

 (0x20)
23:33:55 (6656): Can't acquire lockfile (32) - waiting 35s
23:34:30 (6656): Can't acquire lockfile (32) - exiting
23:34:30 (6656): Error: Der Prozess kann nicht auf die Datei zugreifen, da sie von einem anderen Prozess verwendet wird.

 (0x20)
00:11:46 (4200): Can't acquire lockfile (32) - waiting 35s
00:12:21 (4200): Can't acquire lockfile (32) - exiting
00:12:21 (4200): Error: Der Prozess kann nicht auf die Datei zugreifen, da sie von einem anderen Prozess verwendet wird.

 (0x20)
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3756, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5472, iMonCtr=2
Leaving CPDN_ain::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 11 received: Segment violation
Signal 11 received: Software termination signal from kill 
Signal 11 received: Abnormal termination triggered by abort call
Signal 11 received, exiting...
00:57:13 (1436): called boinc_finish(193)
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1276, selfPID=1276, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1276, selfPID=1864, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
00:57:18 (1864): called boinc_finish(0)

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>wah2_sas50_s004_209112_13_635_011187169_0_r1739765391_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sas50_s004_209112_13_635_011187169_0_r1739765391_13.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sas50_s004_209112_13_635_011187169_0_r1739765391_restart.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
02 Sep 2017 11:55:33 1440167 20636232 wah2_sas50_s004_209112_13_635_011187169_0 127,019 407,980 3.2120
01 Sep 2017 20:51:19 1440167 20636232 wah2_sas50_s004_209112_13_635_011187169_0 115,499 373,943 3.2376
01 Sep 2017 01:18:57 1440167 20636232 wah2_sas50_s004_209112_13_635_011187169_0 103,979 339,429 3.2644
31 Aug 2017 09:12:05 1440167 20636232 wah2_sas50_s004_209112_13_635_011187169_0 92,459 296,395 3.2057
30 Aug 2017 03:59:52 1440167 20636232 wah2_sas50_s004_209112_13_635_011187169_0 69,419 217,369 3.1313
29 Aug 2017 16:20:29 1440167 20636232 wah2_sas50_s004_209112_13_635_011187169_0 57,899 176,044 3.0405
29 Aug 2017 05:55:50 1440167 20636232 wah2_sas50_s004_209112_13_635_011187169_0 46,379 140,898 3.0380
28 Aug 2017 20:11:35 1440167 20636232 wah2_sas50_s004_209112_13_635_011187169_0 34,859 106,326 3.0502
15 Aug 2017 06:01:45 1440167 20636232 wah2_sas50_s004_209112_13_635_011187169_0 23,339 70,718 3.0300
14 Aug 2017 18:20:47 1440167 20636232 wah2_sas50_s004_209112_13_635_011187169_0 11,819 35,573 3.0098


©2024 cpdn.org