climateprediction.net home page
Task 21959746

Task 21959746

Name wah2_sam50_a05t_201312_25_881_012034217_0
Workunit 12034217
Created 2 Nov 2020, 12:08:34 UTC
Sent 2 Nov 2020, 12:22:35 UTC
Report deadline 15 Oct 2021, 17:42:35 UTC
Received 24 Nov 2020, 19:13:36 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1510585
Run time 4 days 9 hours 14 min 36 sec
CPU time 4 days 4 hours 28 min 51 sec
Validate state Invalid
Credit 9,138.96
Device peak FLOPS 4.36 GFLOPS
Application version Weather At Home 2 (wah2) v8.24
windows_intelx86
Peak working set size 226.84 MB
Peak swap size 188.07 MB
Peak disk usage 153.80 MB
Stderr
<core_client_version>7.16.5</core_client_version>
<![CDATA[
<stderr_txt>
06:28:26 (21760): start_timer_thread(): CreateThread() failed, errno 0
06:28:28 (18824): start_timer_thread(): CreateThread() failed, errno 0
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:06:10 (21212): Can't acquire lockfile (32) - waiting 35s
20:06:45 (21212): Can't acquire lockfile (32) - exiting
20:06:45 (21212): Error: The process cannot access the file because it is being used by another process.

 (0x20)
20:16:51 (20792): Can't acquire lockfile (32) - waiting 35s
20:17:26 (20792): Can't acquire lockfile (32) - exiting
20:17:26 (20792): Error: The process cannot access the file because it is being used by another process.

 (0x20)
20:27:39 (16448): Can't acquire lockfile (32) - waiting 35s
20:28:14 (16448): Can't acquire lockfile (32) - exiting
20:28:14 (16448): Error: The process cannot access the file because it is being used by another process.

 (0x20)
20:38:31 (11552): Can't acquire lockfile (32) - waiting 35s
20:39:06 (11552): Can't acquire lockfile (32) - exiting
20:39:06 (11552): Error: The process cannot access the file because it is being used by another process.

 (0x20)
20:49:14 (20428): Can't acquire lockfile (32) - waiting 35s
20:49:49 (20428): Can't acquire lockfile (32) - exiting
20:49:49 (20428): Error: The process cannot access the file because it is being used by another process.

 (0x20)
21:00:00 (12316): Can't acquire lockfile (32) - waiting 35s
21:00:35 (12316): Can't acquire lockfile (32) - exiting
21:00:35 (12316): Error: The process cannot access the file because it is being used by another process.

 (0x20)
21:10:57 (18796): Can't acquire lockfile (32) - waiting 35s
21:11:32 (18796): Can't acquire lockfile (32) - exiting
21:11:32 (18796): Error: The process cannot access the file because it is being used by another process.

 (0x20)
21:21:37 (22464): Can't acquire lockfile (32) - waiting 35s
21:22:12 (22464): Can't acquire lockfile (32) - exiting
21:22:12 (22464): Error: The process cannot access the file because it is being used by another process.

 (0x20)
22:42:20 (16176): Can't acquire lockfile (32) - waiting 35s
22:42:55 (16176): Can't acquire lockfile (32) - exiting
22:42:55 (16176): Error: The process cannot access the file because it is being used by another process.

 (0x20)
00:01:54 (24780): Can't acquire lockfile (32) - waiting 35s
00:02:29 (24780): Can't acquire lockfile (32) - exiting
00:02:29 (24780): Error: The process cannot access the file because it is being used by another process.

 (0x20)
00:17:16 (23120): Can't acquire lockfile (32) - waiting 35s
00:17:51 (23120): Can't acquire lockfile (32) - exiting
00:17:51 (23120): Error: The process cannot access the file because it is being used by another process.

 (0x20)
00:27:53 (11928): Can't acquire lockfile (32) - waiting 35s
00:28:28 (11928): Can't acquire lockfile (32) - exiting
00:28:28 (11928): Error: The process cannot access the file because it is being used by another process.

 (0x20)
00:54:25 (20384): Can't acquire lockfile (32) - waiting 35s
00:55:00 (20384): Can't acquire lockfile (32) - exiting
00:55:00 (20384): Error: The process cannot access the file because it is being used by another process.

 (0x20)
06:49:42 (19032): Can't acquire lockfile (32) - waiting 35s
06:50:17 (19032): Can't acquire lockfile (32) - exiting
06:50:17 (19032): Error: The process cannot access the file because it is being used by another process.

 (0x20)
07:04:26 (25684): Can't acquire lockfile (32) - waiting 35s
07:05:01 (25684): Can't acquire lockfile (32) - exiting
07:05:01 (25684): Error: The process cannot access the file because it is being used by another process.

 (0x20)
07:46:23 (10704): Can't acquire lockfile (32) - waiting 35s
07:46:58 (10704): Can't acquire lockfile (32) - exiting
07:46:58 (10704): Error: The process cannot access the file because it is being used by another process.

 (0x20)
08:34:15 (24724): Can't acquire lockfile (32) - waiting 35s
08:34:50 (24724): Can't acquire lockfile (32) - exiting
08:34:50 (24724): Error: The process cannot access the file because it is being used by another process.

 (0x20)
08:54:45 (16936): Can't acquire lockfile (32) - waiting 35s
08:55:20 (16936): Can't acquire lockfile (32) - exiting
08:55:20 (16936): Error: The process cannot access the file because it is being used by another process.

 (0x20)
11:28:09 (26172): Can't acquire lockfile (32) - waiting 35s
11:28:44 (26172): Can't acquire lockfile (32) - exiting
11:28:44 (26172): Error: The process cannot access the file because it is being used by another process.

 (0x20)
12:02:33 (19332): Can't acquire lockfile (32) - waiting 35s
12:03:08 (19332): Can't acquire lockfile (32) - exiting
12:03:08 (19332): Error: The process cannot access the file because it is being used by another process.

 (0x20)
13:06:03 (12536): Can't acquire lockfile (32) - waiting 35s
13:06:38 (12536): Can't acquire lockfile (32) - exiting
13:06:38 (12536): Error: The process cannot access the file because it is being used by another process.

 (0x20)
13:32:18 (19040): Can't acquire lockfile (32) - waiting 35s
13:32:53 (19040): Can't acquire lockfile (32) - exiting
13:32:53 (19040): Error: The process cannot access the file because it is being used by another process.

 (0x20)
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3808, selfPID=20340, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
cpdnmonitor: cannot open input file Y:\BOINC/projects/climateprediction.net/wah2_sam50_a05t_201312_25_881_012034217/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file Y:\BOINC/projects/climateprediction.net/wah2_sam50_a05t_201312_25_881_012034217/dataout/region_restart.day after 11 attempts
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5336, selfPID=18240, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout10.zip
12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout11.zip
12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout12.zip
12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout2.zip
12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout3.zip
12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout4.zip
12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout5.zip
12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout6.zip
12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout7.zip
12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout8.zip
12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout9.zip
12:14:00 (18240): handle_file_upload_status: can't open boinc_ufs_cpdnout_out.zip
12:14:01 (18240): called boinc_finish(0)

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_13.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_14.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_15.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_16.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_17.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_18.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_19.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_20.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_21.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_22.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_23.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_24.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_25.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sam50_a05t_201312_25_881_012034217_0_r1779626272_restart.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
20 Nov 2020 18:59:33 1510585 21959746 wah2_sam50_a05t_201312_25_881_012034217_0 138,539 333,061 2.4041
20 Nov 2020 10:28:57 1510585 21959746 wah2_sam50_a05t_201312_25_881_012034217_0 127,019 306,056 2.4095
16 Nov 2020 10:24:34 1510585 21959746 wah2_sam50_a05t_201312_25_881_012034217_0 115,499 273,703 2.3697
15 Nov 2020 12:55:19 1510585 21959746 wah2_sam50_a05t_201312_25_881_012034217_0 103,979 245,686 2.3628
14 Nov 2020 12:15:04 1510585 21959746 wah2_sam50_a05t_201312_25_881_012034217_0 92,459 215,367 2.3293
12 Nov 2020 13:58:25 1510585 21959746 wah2_sam50_a05t_201312_25_881_012034217_0 57,899 134,996 2.3316
12 Nov 2020 04:55:36 1510585 21959746 wah2_sam50_a05t_201312_25_881_012034217_0 46,379 106,175 2.2893
12 Nov 2020 02:53:09 1510585 21959746 wah2_sam50_a05t_201312_25_881_012034217_0 34,859 80,284 2.3031
12 Nov 2020 02:53:09 1510585 21959746 wah2_sam50_a05t_201312_25_881_012034217_0 23,339 58,618 2.5116
11 Nov 2020 08:24:37 1510585 21959746 wah2_sam50_a05t_201312_25_881_012034217_0 11,819 33,132 2.8033


©2024 climateprediction.net