Task 22411177

Name wah2_eas25_a3ro_200912_24_1007_012269740_0
Workunit 12269740
Created 21 Feb 2024, 9:55:54 UTC
Sent 22 Feb 2024, 11:48:32 UTC
Report deadline 21 Jun 2024, 11:48:32 UTC
Received 23 Feb 2024, 12:49:50 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1534057
Run time 22 hours 2 min 34 sec
CPU time 22 hours 2 min 34 sec
Validate state Invalid
Credit 849.83
Device peak FLOPS 5.68 GFLOPS
Application version Weather At Home 2 (wah2) (region independent) v8.29 windows_intelx86
Peak working set size 339.77 MB
Peak swap size 307.19 MB
Peak disk usage 0.03 MB
Stderr
<core_client_version>7.24.1</core_client_version>
<![CDATA[
<stderr_txt>
modelGetExecutables: check control files, strTemp0 & 1 : 
E:\BOINC/projects/climateprediction.net/wah2_eas25_a3ro_200912_24_1007_012269740/jobs/xadae.namelists
E:\BOINC/projects/climateprediction.net/wah2_eas25_a3ro_200912_24_1007_012269740/jobs/xacxf.namelists
modelGetExecutables: unzipping control files : strInput & strTmp 
wah2_eas25_a3ro_200912_24_1007_012269740.zip
wah2_eas25_a3ro_200912_24_1007_012269740/jobs
gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka
gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka
global model: command string: "E:\BOINC/projects/climateprediction.net/wah2am3m2_um_8.29_windows_intelx86.exe" wah2_eas25_a3ro_200912_24_1007_012269740 generic_phase1_spinup_eas25_global_aabaka ic19610221_10_N96 ALLclim_ancil_146months_OSTIA_sst_2004-12-01_2017-01-30 ALLclim_ancil_146months_OSTIA_ice_2004-12-01_2017-01-30 SO2DMS_N96_cmip6hist-ssp245_2009-2020 oxi.addfa ozone_cmip6hist-ssp245_N96_1979_2031
06:55:19 (13656): start_timer_thread(): CreateThread() failed, errno 0
regional model: command string: "E:\BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.29_windows_intelx86.exe" wah2_eas25_a3ro_200912_24_1007_012269740
 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. 
 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. 
executeModelProcess: MonID=18108, GCM_PID=13656, RCM_PID=12848
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:22:34 (2704): Can't acquire lockfile (32) - waiting 35s
09:23:09 (2704): Can't acquire lockfile (32) - exiting
09:23:09 (2704): Error: The process cannot access the file because it is being used by another process.

 (0x20)
09:33:22 (752): Can't acquire lockfile (32) - waiting 35s
09:33:57 (752): Can't acquire lockfile (32) - exiting
09:33:57 (752): Error: The process cannot access the file because it is being used by another process.

 (0x20)
09:43:59 (4224): Can't acquire lockfile (32) - waiting 35s
09:44:34 (4224): Can't acquire lockfile (32) - exiting
09:44:34 (4224): Error: The process cannot access the file because it is being used by another process.

 (0x20)
09:55:17 (4676): Can't acquire lockfile (32) - waiting 35s
09:55:52 (4676): Can't acquire lockfile (32) - exiting
09:55:52 (4676): Error: The process cannot access the file because it is being used by another process.

 (0x20)
10:05:54 (1964): Can't acquire lockfile (32) - waiting 35s
10:06:29 (1964): Can't acquire lockfile (32) - exiting
10:06:29 (1964): Error: The process cannot access the file because it is being used by another process.

 (0x20)
10:16:33 (7692): Can't acquire lockfile (32) - waiting 35s
10:17:08 (7692): Can't acquire lockfile (32) - exiting
10:17:08 (7692): Error: The process cannot access the file because it is being used by another process.

 (0x20)
10:27:52 (7324): Can't acquire lockfile (32) - waiting 35s
10:28:27 (7324): Can't acquire lockfile (32) - exiting
10:28:27 (7324): Error: The process cannot access the file because it is being used by another process.

 (0x20)
10:39:04 (15036): Can't acquire lockfile (32) - waiting 35s
10:39:39 (15036): Can't acquire lockfile (32) - exiting
10:39:39 (15036): Error: The process cannot access the file because it is being used by another process.

 (0x20)
10:49:40 (748): Can't acquire lockfile (32) - waiting 35s
10:50:15 (748): Can't acquire lockfile (32) - exiting
10:50:15 (748): Error: The process cannot access the file because it is being used by another process.

 (0x20)
11:00:19 (4416): Can't acquire lockfile (32) - waiting 35s
11:00:54 (4416): Can't acquire lockfile (32) - exiting
11:00:54 (4416): Error: The process cannot access the file because it is being used by another process.

 (0x20)
11:10:59 (6440): Can't acquire lockfile (32) - waiting 35s
11:11:34 (6440): Can't acquire lockfile (32) - exiting
11:11:34 (6440): Error: The process cannot access the file because it is being used by another process.

 (0x20)
11:21:39 (12944): Can't acquire lockfile (32) - waiting 35s
11:22:14 (12944): Can't acquire lockfile (32) - exiting
11:22:14 (12944): Error: The process cannot access the file because it is being used by another process.

 (0x20)
11:33:01 (5468): Can't acquire lockfile (32) - waiting 35s
11:33:36 (5468): Can't acquire lockfile (32) - exiting
11:33:36 (5468): Error: The process cannot access the file because it is being used by another process.

 (0x20)
11:43:55 (13612): Can't acquire lockfile (32) - waiting 35s
11:44:30 (13612): Can't acquire lockfile (32) - exiting
11:44:30 (13612): Error: The process cannot access the file because it is being used by another process.

 (0x20)
11:54:32 (13116): Can't acquire lockfile (32) - waiting 35s
11:55:07 (13116): Can't acquire lockfile (32) - exiting
11:55:07 (13116): Error: The process cannot access the file because it is being used by another process.

 (0x20)
12:06:01 (12432): Can't acquire lockfile (32) - waiting 35s
12:06:36 (12432): Can't acquire lockfile (32) - exiting
12:06:36 (12432): Error: The process cannot access the file because it is being used by another process.

 (0x20)
12:16:47 (2308): Can't acquire lockfile (32) - waiting 35s
12:17:22 (2308): Can't acquire lockfile (32) - exiting
12:17:22 (2308): Error: The process cannot access the file because it is being used by another process.

 (0x20)
12:28:07 (3856): Can't acquire lockfile (32) - waiting 35s
12:28:42 (3856): Can't acquire lockfile (32) - exiting
12:28:42 (3856): Error: The process cannot access the file because it is being used by another process.

 (0x20)
12:38:56 (10532): Can't acquire lockfile (32) - waiting 35s
12:39:31 (10532): Can't acquire lockfile (32) - exiting
12:39:31 (10532): Error: The process cannot access the file because it is being used by another process.

 (0x20)
12:50:06 (2676): Can't acquire lockfile (32) - waiting 35s
12:50:41 (2676): Can't acquire lockfile (32) - exiting
12:50:41 (2676): Error: The process cannot access the file because it is being used by another process.

 (0x20)
13:01:20 (16068): Can't acquire lockfile (32) - waiting 35s
13:01:55 (16068): Can't acquire lockfile (32) - exiting
13:01:55 (16068): Error: The process cannot access the file because it is being used by another process.

 (0x20)
13:12:14 (14812): Can't acquire lockfile (32) - waiting 35s
13:12:49 (14812): Can't acquire lockfile (32) - exiting
13:12:49 (14812): Error: The process cannot access the file because it is being used by another process.

 (0x20)
13:23:41 (17424): Can't acquire lockfile (32) - waiting 35s
13:24:16 (17424): Can't acquire lockfile (32) - exiting
13:24:16 (17424): Error: The process cannot access the file because it is being used by another process.

 (0x20)
13:34:23 (11500): Can't acquire lockfile (32) - waiting 35s
13:34:58 (11500): Can't acquire lockfile (32) - exiting
13:34:58 (11500): Error: The process cannot access the file because it is being used by another process.

 (0x20)
13:45:22 (5504): Can't acquire lockfile (32) - waiting 35s
13:45:57 (5504): Can't acquire lockfile (32) - exiting
13:45:57 (5504): Error: The process cannot access the file because it is being used by another process.

 (0x20)
13:55:59 (14152): Can't acquire lockfile (32) - waiting 35s
13:56:34 (14152): Can't acquire lockfile (32) - exiting
13:56:34 (14152): Error: The process cannot access the file because it is being used by another process.

 (0x20)
14:07:06 (17364): Can't acquire lockfile (32) - waiting 35s
14:07:41 (17364): Can't acquire lockfile (32) - exiting
14:07:41 (17364): Error: The process cannot access the file because it is being used by another process.

 (0x20)
14:18:25 (4444): Can't acquire lockfile (32) - waiting 35s
14:19:00 (4444): Can't acquire lockfile (32) - exiting
14:19:00 (4444): Error: The process cannot access the file because it is being used by another process.

 (0x20)
14:29:50 (12944): Can't acquire lockfile (32) - waiting 35s
14:30:25 (12944): Can't acquire lockfile (32) - exiting
14:30:25 (12944): Error: The process cannot access the file because it is being used by another process.

 (0x20)
14:41:10 (1296): Can't acquire lockfile (32) - waiting 35s
14:41:45 (1296): Can't acquire lockfile (32) - exiting
14:41:45 (1296): Error: The process cannot access the file because it is being used by another process.

 (0x20)
14:52:26 (10844): Can't acquire lockfile (32) - waiting 35s
14:53:01 (10844): Can't acquire lockfile (32) - exiting
14:53:01 (10844): Error: The process cannot access the file because it is being used by another process.

 (0x20)
15:03:45 (16100): Can't acquire lockfile (32) - waiting 35s
15:04:20 (16100): Can't acquire lockfile (32) - exiting
15:04:20 (16100): Error: The process cannot access the file because it is being used by another process.

 (0x20)
15:15:15 (11584): Can't acquire lockfile (32) - waiting 35s
15:15:50 (11584): Can't acquire lockfile (32) - exiting
15:15:50 (11584): Error: The process cannot access the file because it is being used by another process.

 (0x20)
15:26:30 (9128): Can't acquire lockfile (32) - waiting 35s
15:27:05 (9128): Can't acquire lockfile (32) - exiting
15:27:05 (9128): Error: The process cannot access the file because it is being used by another process.

 (0x20)
15:37:11 (8324): Can't acquire lockfile (32) - waiting 35s
15:37:46 (8324): Can't acquire lockfile (32) - exiting
15:37:46 (8324): Error: The process cannot access the file because it is being used by another process.

 (0x20)
15:48:08 (14396): Can't acquire lockfile (32) - waiting 35s
15:48:43 (14396): Can't acquire lockfile (32) - exiting
15:48:43 (14396): Error: The process cannot access the file because it is being used by another process.

 (0x20)
15:58:56 (16792): Can't acquire lockfile (32) - waiting 35s
15:59:31 (16792): Can't acquire lockfile (32) - exiting
15:59:31 (16792): Error: The process cannot access the file because it is being used by another process.

 (0x20)
16:09:31 (8672): Can't acquire lockfile (32) - waiting 35s
16:10:06 (8672): Can't acquire lockfile (32) - exiting
16:10:06 (8672): Error: The process cannot access the file because it is being used by another process.

 (0x20)
16:20:09 (7976): Can't acquire lockfile (32) - waiting 35s
16:20:44 (7976): Can't acquire lockfile (32) - exiting
16:20:44 (7976): Error: The process cannot access the file because it is being used by another process.

 (0x20)
16:31:28 (12648): Can't acquire lockfile (32) - waiting 35s
16:32:03 (12648): Can't acquire lockfile (32) - exiting
16:32:03 (12648): Error: The process cannot access the file because it is being used by another process.

 (0x20)
16:42:45 (14160): Can't acquire lockfile (32) - waiting 35s
16:43:20 (14160): Can't acquire lockfile (32) - exiting
16:43:20 (14160): Error: The process cannot access the file because it is being used by another process.

 (0x20)
16:53:35 (15732): Can't acquire lockfile (32) - waiting 35s
16:54:10 (15732): Can't acquire lockfile (32) - exiting
16:54:10 (15732): Error: The process cannot access the file because it is being used by another process.

 (0x20)
17:04:54 (3220): Can't acquire lockfile (32) - waiting 35s
17:05:29 (3220): Can't acquire lockfile (32) - exiting
17:05:29 (3220): Error: The process cannot access the file because it is being used by another process.

 (0x20)
17:15:59 (17432): Can't acquire lockfile (32) - waiting 35s
17:16:34 (17432): Can't acquire lockfile (32) - exiting
17:16:34 (17432): Error: The process cannot access the file because it is being used by another process.

 (0x20)
17:16:57 (18108): BOINC client no longer exists - exiting
17:16:57 (18108): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
modelGetExecutables: check control files, strTemp0 & 1 : 
E:\BOINC/projects/climateprediction.net/wah2_eas25_a3ro_200912_24_1007_012269740/jobs/xadae.namelists
E:\BOINC/projects/climateprediction.net/wah2_eas25_a3ro_200912_24_1007_012269740/jobs/xacxf.namelists
gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka
gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka
global model: command string: "E:\BOINC/projects/climateprediction.net/wah2am3m2_um_8.29_windows_intelx86.exe" wah2_eas25_a3ro_200912_24_1007_012269740 generic_phase1_spinup_eas25_global_aabaka ic19610221_10_N96 ALLclim_ancil_146months_OSTIA_sst_2004-12-01_2017-01-30 ALLclim_ancil_146months_OSTIA_ice_2004-12-01_2017-01-30 SO2DMS_N96_cmip6hist-ssp245_2009-2020 oxi.addfa ozone_cmip6hist-ssp245_N96_1979_2031
17:19:17 (5508): start_timer_thread(): CreateThread() failed, errno 0
regional model: command string: "E:\BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.29_windows_intelx86.exe" wah2_eas25_a3ro_200912_24_1007_012269740
 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. 
 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. 
executeModelProcess: MonID=17704, GCM_PID=5508, RCM_PID=2532
CPDN Monitor - Quit request from BOINC...
modelGetExecutables: check control files, strTemp0 & 1 : 
E:\BOINC/projects/climateprediction.net/wah2_eas25_a3ro_200912_24_1007_012269740/jobs/xadae.namelists
E:\BOINC/projects/climateprediction.net/wah2_eas25_a3ro_200912_24_1007_012269740/jobs/xacxf.namelists
gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka
gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka
global model: command string: "E:\BOINC/projects/climateprediction.net/wah2am3m2_um_8.29_windows_intelx86.exe" wah2_eas25_a3ro_200912_24_1007_012269740 generic_phase1_spinup_eas25_global_aabaka ic19610221_10_N96 ALLclim_ancil_146months_OSTIA_sst_2004-12-01_2017-01-30 ALLclim_ancil_146months_OSTIA_ice_2004-12-01_2017-01-30 SO2DMS_N96_cmip6hist-ssp245_2009-2020 oxi.addfa ozone_cmip6hist-ssp245_N96_1979_2031
06:49:17 (16724): start_timer_thread(): CreateThread() failed, errno 0
regional model: command string: "E:\BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.29_windows_intelx86.exe" wah2_eas25_a3ro_200912_24_1007_012269740
 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. 
 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. 
executeModelProcess: MonID=17040, GCM_PID=16724, RCM_PID=15768
06:53:09 (15768): called boinc_finish(193)
Global Worker:: CPDN process is not running, exiting, bRetVal = T, checkPID = 15768, selfPID = 16724, iMonCtr = 2
Controller:: CPDN process is not running, exiting, bRetVal = T, checkPID = 16724, selfPID = 17040, iMonCtr = 1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
monitor:finished called ... tidying up.
monitor:finished: Uploading out files...
monitor:finished: calling modelResultFiles ...
modelResultFiles : Cleaning up : wah2_eas25_a3ro_200912_24_1007_012269740 in E:\BOINC/projects/climateprediction.net
monitor:finished: Closed output file : stdout_mon.txt
monitor:finished: handing over to boinc_finish(RetVal=0)
06:53:18 (17040): called boinc_finish(0)

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_2.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_3.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_4.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_5.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_6.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_7.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_8.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_9.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_10.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_11.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_12.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_13.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_14.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_15.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_16.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_17.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_18.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_19.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_20.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_21.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_22.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_23.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_24.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eas25_a3ro_200912_24_1007_012269740_0_r1557902831_restart.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
</message>
]]>
Latest Trickles Received
Time Sent (UTC) 23 Feb 2024 11:29:28
Host ID 1534057
Result ID 22411177
Result Name wah2_eas25_a3ro_200912_24_1007_012269740_0
Timestep 11,819
CPU Time (sec) 78,487
Average (sec/TS) 6.6407


©2024 climateprediction.net