climateprediction.net home page
Posts by Jean-David Beyer

Posts by Jean-David Beyer

21) Message boards : Number crunching : New Work Announcements 2024 (Message 70694)
Posted 2 Apr 2024 by Jean-David Beyer
Post:
Next Weather@Home batches going out today. Same region, EAS25, as previous 1006 & 1007, so expect the same runtimes etc.


My pipsqueak Windows 10 machine got one, Batch 1008. Predicting about 17 1/2 days, but it will probably finish in about half that time. It has about a half hour on it now. McAfee had no trouble with it.
22) Message boards : Number crunching : Should full credit be given for time on non successful tasks? (Message 70692)
Posted 2 Apr 2024 by Jean-David Beyer
Post:
Note besides: Glenn, when will the announced RAM heavy CPDN WUs really come? 2024? 2025? I increased RAM for these WUs in October last year and still have not seen any… ;-)


My big Linux box came with 32 GBytes of RAM with two memory modules. There are 8 memory slots in my box. As RAM prices came down, I bought two more memory modules and put them in, raising the RAM to 64 GBytes. Then I started getting a few 8GByte CPDN tasks with more to come. So when RAM prices dropped again, I got four more memory modules so I have 128GBytes in there now. DDR4 modules.

My machine wouild take up to 512 GBytes of RAM were I to replace all the modules with the largest size. my processor chip has only 16896 KBytes of processor cache, so I do not think it would make sense to enable running more tasks like this at the same time. 16896 KBytes is fairly large for this kind of processor chip, but I run only 13 Boinc tasks at a time in winter and 8-12 in the summer. I have no AC.

Actually, I have gotten no new work for CPDN since last June. ;=(

Computer 1511241

CPU type 	GenuineIntel
Intel(R) Xeon(R) W-2245 CPU @ 3.90GHz [Family 6 Model 85 Stepping 7]
Number of processors 	16

Operating System 	Linux Red Hat Enterprise Linux
Red Hat Enterprise Linux 8.9 (Ootpa) [4.18.0-513.18.1.el8_9.x86_64|libc 2.28]
BOINC version 	7.20.2
Memory 	125.07 GB
Cache 	16896 KB
23) Message boards : News : BOINC Needs Votes at a UN Upcoming Forum (Message 70673)
Posted 27 Mar 2024 by Jean-David Beyer
Post:
Done.
Of course, my main Linux machine has gotten no CPDN work since last June, IIRC.
24) Message boards : Number crunching : Weather At Home 2 (wah2) (region independent) v8.29 crashed (Message 70654)
Posted 16 Mar 2024 by Jean-David Beyer
Post:
One reason I like Linux better than Microsoft Windows is that when my Linux system has updates, there is a little pop-up informing me of that fact.
It does not install the updates. I can get a list of the updates it wants to download and install.

I can tell ahead of time if installing the updates will require a reboot or not.
If not, i just install them. The main reason to do a reboot is if a new kernel needs to be installed. (Even if I install it, I can defer rebooting until a more opportune time.)

So what I do is when I am going to reboot I stop fetching new work. I then suspend all the tasks that are ready to start.
I wait until all running tasks have completed, reboot the system, resume the suspended processes, and resume the fetching new work. This keeps the problems down, but does not solve them completely.
25) Message boards : Number crunching : Weather At Home 2 (wah2) (region independent) v8.29 crashed (Message 70652)
Posted 13 Mar 2024 by Jean-David Beyer
Post:
My Windows machine crashed this task (and two others. What happened is an update sneaked in and rebooted the computerl The other two tasks were v8.24 tasks, so that is what I would have expected, but it appears that v8.29 are not immune to this either.

Task 22414291
Name wah2_eas25_a3wb_201012_24_1007_012269907_1
Workunit 12269907
Computer ID 1512658
Application version Weather At Home 2 (wah2) (region independent) v8.29
windows_intelx86

The beginning of the stderr is like this.

Stderr

<core_client_version>7.24.1</core_client_version>
<![CDATA[
<message>
The storage control block address is invalid.
(0x9) - exit code 9 (0x9)</message>
26) Message boards : Number crunching : processors, memory, performance and heat. (Message 70622)
Posted 6 Mar 2024 by Jean-David Beyer
Post:
Do Linux users know about this interesting tool?

#
 perf stat -e cache-references,cache-misses,cycles,instructions,branches,faults
^C
 Performance counter stats for 'system wide':

     4,751,265,017      cache-references                                            
     1,957,008,106      cache-misses              #   41.189 % of all cache refs    
 1,416,865,456,289      cycles                                                      
 1,984,715,137,591      instructions              #    1.40  insn per cycle         
   273,726,331,297      branches                                                    
            50,751      faults                                                      

      25.357650625 seconds time elapsed

You start the perf program with the first line. When you think it has run long enough, you hit Control C. It then prints the results.
The machine was doing this; i.e., mostly Boinc work -- 13 boinc tasks
top - 17:56:27 up 11 days,  4:21,  2 users,  load average: 13.58, 13.52, 13.51
Tasks: 483 total,  14 running, 469 sleeping,   0 stopped,   0 zombie
%Cpu(s):  0.4 us,  0.1 sy, 80.6 ni, 18.6 id,  0.0 wa,  0.2 hi,  0.0 si,  0.0 st
MiB Mem : 128074.1 total,   2100.0 free,   6385.8 used, 119588.3 buff/cache
MiB Swap:  15992.0 total,  15947.2 free,     44.8 used. 118485.6 avail Mem 


My actual results here are probably of no interest to readers here because none of the Boinc tasks were running any CPDN tasks. But if I ever get more, I will be able to see how they do.

With that work load on my machine, a little over half the memory references were satisfied by the cache.
27) Message boards : Number crunching : processors, memory, performance and heat. (Message 70620)
Posted 6 Mar 2024 by Jean-David Beyer
Post:
I do wonder if faster RAM might help. Potentially I might need more than 32GB for some testing with OIFS even if on main site they are rationed to avoid problems with machines that don't have enough for multiple tasks.


My machine has this memory at the moment.
CPU type Genuine Intel - Intel(R) Xeon(R) W-2245 CPU @ 3.90GHz [Family 6 Model 85 Stepping 7]
Number of processors 	16
Operating System    Red Hat Enterprise Linux 8.9 (Ootpa) [4.18.0-513.18.1.el8_9.x86_64|libc 2.28]
BOINC version 	7.20.2
Memory 	125.07 GB [2933MHz DDR4]
Cache 	 16896 KB

It came with 32 GBytes but I doubled it a couple of times as prices for RAM came down.
I guess it is no longer state-of-the-art (if it ever was), but it is several years old now, so there must surely be faster machines out there now.
I cannot put faster RAM in there, but I could run it up to 512 GBytes if someone would send me the money to do it. I doubt there is much point to doing that, since my L3 cache is 16384 Kbytes, which is pretty good for that kind of processor chip, I got all that RAM to run all those OIFS tasks that I have not received since last June, IIRC.
28) Questions and Answers : Unix/Linux : New Work Coming? (Message 70589)
Posted 2 Mar 2024 by Jean-David Beyer
Post:
Testing showed up some problems, that's why the Linux work has been delayed. There is alot in the pipeline, but I can't give you a date just yet.


I am glad there is a lot in the pipeline for Linux work.
The last work unit I got ended last June.
22318648 	12138603 	30 May 2023, 3:38:46 UTC 	9 Jun 2023, 1:20:39 UTC 	Completed 	
29) Message boards : Number crunching : processors, memory, performance and heat. (Message 70588)
Posted 2 Mar 2024 by Jean-David Beyer
Post:
With 5 cores using 130W it takes around 7 days for wah2 8.29 or 4.37 kWh/task
Running 2-3-4 cores is faster but not much.


My main (Linux) machine is consuming 275 watts and running 13 Boinc processes. (None of them ClimatePrediction).
The 275 watts includes the computer, the router, and the monitor.
ID: 1511241
Number of processors 	16
Memory 	 125.07 GB
Cache 	  16896 KB
Swap space 	15.62 GB
Total disk space 	488.04 GB
Free Disk Space 	480.47 GB
Measured floating point speed 	5.92 billion ops/sec
Measured integer speed 	       23.22 billion ops/sec
Average upload rate 	  194.32 KB/sec
Average download rate 	15613.09 KB/sec
Average turnaround time 	7.96 days

Every 11.0s: sensors  localhost.localdomain: Sat Mar  2 13:33:14 2024

coretemp-isa-0000
Adapter: ISA adapter
Package id 0:  +75.0°C  (high = +88.0°C, crit = +98.0°C)
Core 8:        +68.0°C  (high = +88.0°C, crit = +98.0°C)
Core 2:        +66.0°C  (high = +88.0°C, crit = +98.0°C)
Core 3:        +71.0°C  (high = +88.0°C, crit = +98.0°C)
Core 5:        +70.0°C  (high = +88.0°C, crit = +98.0°C)
Core 1:        +75.0°C  (high = +88.0°C, crit = +98.0°C)
Core 9:        +74.0°C  (high = +88.0°C, crit = +98.0°C)
Core 11:       +67.0°C  (high = +88.0°C, crit = +98.0°C)
Core 12:       +65.0°C  (high = +88.0°C, crit = +98.0°C)
30) Message boards : Number crunching : couldn't start app: CreateProcess() failed. Check your antivirus. (Message 70567)
Posted 28 Feb 2024 by Jean-David Beyer
Post:
Well, I got one last night, and it now has a little over five hours on it, so I guess McAfee did whatever it needed to do. aoss 1007
I did not do anything to make it work.
31) Message boards : Number crunching : couldn't start app: CreateProcess() failed. Check your antivirus. (Message 70523)
Posted 22 Feb 2024 by Jean-David Beyer
Post:
Yes, as I said in my earlier message. For some reason I sometimes have to try 'Add File' twice on the same file before it appears on the page. Probably a bug or McAfee being slow. Try it again, it does work.

That is probably the case.

Should I put the old 'wah2_8.29_windows_intelx86.exe' file back and try again? I am keeping it in the trash bin.

Sure. Now McAfee have emailed to confirm it's a false positive it's a safe thing to do.


OK: I put it back and tried ten times to exclude it from scanning, but it never appeared in the list.
32) Message boards : Number crunching : couldn't start app: CreateProcess() failed. Check your antivirus. (Message 70518)
Posted 22 Feb 2024 by Jean-David Beyer
Post:
I had a response from McAfee and they have accepted the binary file wah2_8.29_windows_intelx86.exe for whitelisting as a false positive. Hopefully this will soon propagate down to users and stop being a problem.


I just updated my McAfee stuff but all seems to be the same. E.g., Version 1.14.212, etc.
33) Message boards : Number crunching : couldn't start app: CreateProcess() failed. Check your antivirus. (Message 70516)
Posted 22 Feb 2024 by Jean-David Beyer
Post:
On my version of McAfee, the list of 'Excluded Files' is on the same settings page as 'Real time scanning'. The file list appears above the 'Add file' button. You probably can't see it because there's none added?

When I click Add file, a pop-up menu of files in the right directory appears. If I click on the file to be excluded, nothing happens. I can select either open or cancel. If I try open, it does not really respond. And I do not really want to open it anyway. There feels to be something wrong there.

Yes, as I said in my earlier message. For some reason I sometimes have to try 'Add File' twice on the same file before it appears on the page. Probably a bug or McAfee being slow. Try it again, it does work.


That is probably the case.

Should I put the old 'wah2_8.29_windows_intelx86.exe' file back and try again? I am keeping it in the trash bin.
34) Message boards : Number crunching : couldn't start app: CreateProcess() failed. Check your antivirus. (Message 70493)
Posted 21 Feb 2024 by Jean-David Beyer
Post:
Hi Jean-David, I've got this working with McAfee. For reference I have: McAfee LiveSafe 1.14.212.

I have what they call McAfee Advantage Individual
Version 1.14.212
Build 1.1.2569.0
So it is probably like yours.

Starting from scratch, that is, no wah-ri 8.29 tasks and no wah 8.29 executables in the ...../boinc/projects/climateprediction.net folder.


Right now, my Windows 10 machine has No New Tasks set for climateprediction.
There are also some other files in that directory with 8.29 in their names. .zip files. there was the 8.29 ,exe file, so I deleted it. There are no files in quarantine.
There are four 8.24 tasks running.

I do not recall that they are called wah-ri. IIRC, they are called wah2_8.29_windows_intel86.....

Set 'Allow new tasks' for climateprediction.net in boincmgr.
Starts download for a new wah-ri 8.29 task. 'Suspend' the climateprediction.net project in boincmgr -- this will allow the download to complete but not start.

I hesitate do do this now because if I allow new CPDN tasks, I may have to wait a week or so until at least one of those 8.24 tasks finishes. Maybe if I suspend one?

Open McAfee and from Home page go to 'My Protection => Real-time scanning' in the left hand bar. Make sure 'Real time scanning' is ON. It's not possible to exclude files without it on.


It is ON,

At the bottom of the real-time scanning window, click on 'Add file'. Navigate to the ../boinc/projects/climateprediction.net folder, select 'wah2_8.29_windows_intelx86.exe' and add it. It should then appear in the list of 'Excluded files'.


I have never been able to find a list of Excluded files. Where is it?

When I click Add file, a pop-up menu of files in the right directory appears. If I click on the file to be excluded, nothing happens. I can select either open or cancel. If I try open, it does not really respond. And I do not really want to open it anyway. There feels to be something wrong there.
If it doesn't, do it again (for some odd reason, I find I sometimes have to do this twice). Then in boincmgr, Resume the climateprediction.net project and the task should then run fine.


If boinc has already downloaded the exe file, you might find it under 'My Protection => Quarantined Files'. In which case, select it, click 'Restore'. Then as above, go to 'Real-time scanning' and 'Add' the file to the excluded list.


If the boinc data directory is hidden from you to get to the exe in File explorer, turn on 'View - Show - Hidden folders'.

Not a problem.

Hope that gets things going for you.?
35) Message boards : Number crunching : Uploads not working (Message 70453)
Posted 20 Feb 2024 by Jean-David Beyer
Post:
I've been getting really slow uploads today, as in 3.6KBps slow. Dial up slow.

I did not catch my Windows machine during transfers, but the web site reveals pretty good upload and download speeds. It is on a fiber-optic link that can get up to 1000 Megabit/second speeds. I just measured my speeds and got these:\
DOWNLOAD 851.24 Mbps
UPLOAD 887.80 Mbps
LATENCY 3 ms
JITTER 2 ms

Computer 1512658
Computer information

Domain name 	DESKTOP-K1UQGC4
Local Standard Time 	UTC -5 hours
Created 	19 Dec 2020, 22:21:58 UTC
Total credit 	355,587
Average credit 	4,478.43
Cross project credit 	BOINCstats.com Free-DC
CPU type 	GenuineIntel
11th Gen Intel(R) Core(TM) i7-1165G7 @ 2.80GHz [Family 6 Model 140 Stepping 1]
Number of processors 	8

Operating System 	Microsoft Windows 10
Core x64 Edition, (10.00.19045.00)
BOINC version 	7.24.1
Memory 	15.64 GB
Cache 	256 KB
Swap space 	18.02 GB
Total disk space 	460.73 GB
Free Disk Space 	303.49 GB
Measured floating point speed 	3.92 billion ops/sec
Measured integer speed 	29.31 billion ops/sec
Average upload rate 	996.93 KB/sec
Average download rate 	9770.82 KB/sec
Average turnaround time 	10.74 days
36) Message boards : Number crunching : couldn't start app: CreateProcess() failed. Check your antivirus. (Message 70434)
Posted 19 Feb 2024 by Jean-David Beyer
Post:
Ok, hopefully that'll fix it. Otherwise I'm not sure what else to do.

Failed again same way.

Task 22410446
Name 	wah2_eas25_a288_199912_24_1007_012267744_1
Workunit 	12267744
Created 	18 Feb 2024, 15:28:47 UTC
Sent 	18 Feb 2024, 15:41:36 UTC
Report deadline 	17 Jun 2024, 15:41:36 UTC
Received 	19 Feb 2024, 2:23:46 UTC
Server state 	Over
Outcome 	Computation error
Client state 	Compute error
Exit status 	-185 (0xFFFFFF47) ERR_RESULT_START
Computer ID 	1512658
Run time 	
CPU time 	
Validate state 	Invalid
Credit 	0.00
Device peak FLOPS 	3.92 GFLOPS
Application version 	Weather At Home 2 (wah2) (region independent) v8.29
windows_intelx86
Stderr 	

<core_client_version>7.24.1</core_client_version>
<![CDATA[
<message>
couldn't start app: CreateProcess() failed - The system cannot find the file specified.
 (0x2)</message>
]]>


Workunit 12267744
name 	wah2_eas25_a288_199912_24_1007_012267744
application 	Weather At Home 2 (wah2) (region independent)
created 	16 Feb 2024, 0:03:21 UTC
minimum quorum 	1
initial replication 	1
max # of error/total/success tasks 	3, 3, 1
Task
click for details	Computer	Sent	Time reported
or deadline
explain	Status	Run time
(sec)	CPU time
(sec)	Credit	Application
22410593 	--- 	--- 	--- 	Unsent 	--- 	--- 	--- 	---
22410446 	1512658 	18 Feb 2024, 15:41:36 UTC 	19 Feb 2024, 2:23:46 UTC 	Error while computing 	0.00 	0.00 	--- 	Weather At Home 2 (wah2) (region independent) v8.29
windows_intelx86
22407803 	1541197 	18 Feb 2024, 14:54:42 UTC 	18 Feb 2024, 15:28:45 UTC 	Abandoned 	0.00 	0.00 	--- 	Weather At Home 2 (wah2) (region independent) v8.29
windows_intelx86
37) Message boards : Number crunching : couldn't start app: CreateProcess() failed. Check your antivirus. (Message 70431)
Posted 18 Feb 2024 by Jean-David Beyer
Post:
With my McAfee, once I've restored the file, it does not quarantine it again. I think that's how you tell McAfee the file is trusted. That was all I had to do.

OK: I told McAfee to restore the file from the quarantine. I checked and it is there where it should be, right after the 8.24 one.
I then told Boinc-client to resume the process (it had never started so it is now in ready-to-start state).

However, if the file is replaced again, even if it's exactly the same file, McAfee will quarantine it again. So the key is not have a task start unless the exe is restored.


I have two 8.24 tasks running and one waiting to run. The 8.29 one is now waiting to start. app-config allows four climateprediction.net processes to run at a time, but my machine is busy with other Boinc tasks, so it can be a while.
Boinc is allowed only 7 of my 8 cores. This is a machine that has little to do other than Boinc-client although I will be running TaxAct soon.
38) Message boards : Number crunching : couldn't start app: CreateProcess() failed. Check your antivirus. (Message 70429)
Posted 18 Feb 2024 by Jean-David Beyer
Post:
Download a new task, then suspend either the task or all computation before it gets a chance to run. Look to see if the .exe was quarantined immediately on download. If so, restore it and use whatever tools the AV program provides to say "I trust this file".

One was downloaded but not started, so I suspended it. It was not yet quarantined.
There is a way to tell it not to scan a file, but I do not think it works. I can go through the motions, but it never acknowledges that it has done it.

If it wasn't flagged immediately on download, look to see if the AV provides an option to scan a single file on request (most do). Scan it, and respond to any warnings/options it generates. Again, the idea it to get to the point where you can say "I trust this file".


I could scan the single file, and it quarantined it right away.
There is one file in the list of quarntied files-- this one. It says

Item: wah2--8.29-windows-intelx8
Threat:RealProtect-L S6e9fdca5e68a
Dated: 15/02/2024 12:56 PM
C:\ProgramData\BOINC\projects\climateprediction.net/
RealProtect-L S6e9fdca5e68a


I have a choice of Restore or Delete. No option to say I trust that file.,
If I select Restore, I can then go to where it allow me to say to not scan it, but it has no way to display a list of non-scanned files. Furthermore after I tell McAfee to not scan the file, it pretends, but does not do it; i,e,, if I scan it again, it quarantines it again.

Only then allow the task to start running.

I did not do this yet.
39) Message boards : Number crunching : couldn't start app: CreateProcess() failed. Check your antivirus. (Message 70426)
Posted 18 Feb 2024 by Jean-David Beyer
Post:
Failed anyway. File is quarantined.

Task 22406406
Name 	wah2_eas25_a15g_199212_24_1007_012266348_0
Workunit 	12266348
Created 	15 Feb 2024, 23:48:40 UTC
Sent 	18 Feb 2024, 1:53:27 UTC
Report deadline 	17 Jun 2024, 1:53:27 UTC
Received 	18 Feb 2024, 7:12:21 UTC
Server state 	Over
Outcome 	Computation error
Client state 	Compute error
Exit status 	-185 (0xFFFFFF47) ERR_RESULT_START
Computer ID 	1512658
Run time 	
CPU time 	
Validate state 	Invalid
Credit 	0.00
Device peak FLOPS 	3.92 GFLOPS
Application version 	Weather At Home 2 (wah2) (region independent) v8.29
windows_intelx86
Stderr 	

<core_client_version>7.24.1</core_client_version>
<![CDATA[
<message>
couldn't start app: CreateProcess() failed - The system cannot find the file specified.
 (0x2)</message>
]]>


Computer 1512658
Computer information
IP address 	Show IP address
Domain name 	DESKTOP-K1UQGC4
Local Standard Time 	UTC -5 hours
Created 	19 Dec 2020, 22:21:58 UTC
Total credit 	346,432
Average credit 	4,313.15
Cross project credit 	BOINCstats.com Free-DC
CPU type 	GenuineIntel
11th Gen Intel(R) Core(TM) i7-1165G7 @ 2.80GHz [Family 6 Model 140 Stepping 1]
Number of processors 	8
Coprocessors 	---
Virtualization 	None
Operating System 	Microsoft Windows 10
Core x64 Edition, (10.00.19045.00)
BOINC version 	7.24.1
Memory 	15.64 GB
Cache 	256 KB
Swap space 	18.02 GB
Total disk space 	460.73 GB
Free Disk Space 	301.09 GB
Measured floating point speed 	3.92 billion ops/sec
Measured integer speed 	29.31 billion ops/sec
Average upload rate 	179.73 KB/sec
Average download rate 	7909.33 KB/sec
Average turnaround time 	10.74 days
Application details 	Show
Tasks 	50
Number of times client has contacted server 	2153
Last time contacted server 	18 Feb 2024, 9:13:43 UTC
Fraction of time BOINC is running 	99.99%
While BOINC is running, fraction of time computer has an Internet connection 	100.00%
While BOINC is running, fraction of time computing is allowed 	100.00%
While is BOINC running, fraction of time GPU computing is allowed 	100.00%
Task duration correction factor 	1
40) Message boards : Number crunching : couldn't start app: CreateProcess() failed. Check your antivirus. (Message 70422)
Posted 17 Feb 2024 by Jean-David Beyer
Post:
To get around it, I suggest setting "no new tasks" for CPDN.

Done.

Suspend any current 8.29 wah,

There were none. (Two 8.24 are there with time on them. One running, one waiting to run)

check McAfee quarantine and restore the wah_8.29_windows_intelx86.exe file if it's been quarantined.

It is there; nothing in quarantine. I probably restored it last night, time stamp is 23:56 yesterday.

Then resume the tasks.

Done.

Check McAfee doesn't quarantine the exe again, it shouldn't do, this is how I worked around the problem when I was developing the new app.


It has not quarantined it yet. We will see next time I get an 8.29 task.


Previous 20 · Next 20

©2024 climateprediction.net