climateprediction.net home page
Posts by Honza

Posts by Honza

31) Message boards : Number crunching : Time remaining wrong (Message 19140)
Posted 10 Jan 2006 by Profile Honza
Post:
The new version is better at estimating, but it takes time for it to learn. Possibly several models.
Which is OK with the other projects, but a bit of a bore with cpdn.

Yes, I\'m using 5.3.6 from trux and running on a Pentium D CPDN exclusive - time estimates are settled down are conrespondents with command sense/math.
32) Message boards : Number crunching : Server State Over, but wu is in progress! (Message 19139)
Posted 10 Jan 2006 by Profile Honza
Post:
So are there any implications for continued crunching of these wu\'s? [...] I\'d rather be crunching for science needs and not just the credit (which pains me to say since I am a credit whore).
I think it is still valuable to run those runs.
http://www.climateprediction.net/board/viewtopic.php?p=32429#32429

But running whole model is more (most) valuable and even makes more sense to those looking at results of all phases.
33) Message boards : Number crunching : Server State Over, but wu is in progress! (Message 19097)
Posted 9 Jan 2006 by Profile Honza
Post:
See Carl\'s reply here - http://climateapps2.oucs.ox.ac.uk/cpdnboinc/forum_thread.php?id=3746#18635
btw, the link doesn\'t work; should have been provided to your host like
http://climateapps2.oucs.ox.ac.uk/cpdnboinc/results.php?hostid=214525
34) Message boards : Number crunching : @Thyme Lawn (Message 19096)
Posted 9 Jan 2006 by Profile Honza
Post:
IIRC, he has posted on SpinUp forum that he will be absent for some time...so not answering to PM right away can be bacaouse of it.
35) Message boards : Number crunching : Classic forum gone? (Message 19095)
Posted 9 Jan 2006 by Profile Honza
Post:
@ geophi - the message you have seen are definitely symptoms of upgrade installation process.
I\'m not sure why it took so long: both to realize there is a new veriso of phpBB and long process of installation (takes couple of minutes with standard/unmodified version).
Anyway, it is good to known that we are updated and running.
36) Message boards : Number crunching : Domino effect leads to unrecoverable errors (Message 19074)
Posted 6 Jan 2006 by Profile Honza
Post:
Marc, error -161 is a known problem cause by a buggy WU batch
http://www.climateprediction.net/board/viewtopic.php?p=32429#32429

Checkpoints on CPDN are saved every 144 timestep (3 model days).

I have never seen CPDN \"come to a screeching halt\" but seen Outlook doing this several times (sic).

I\'m running boxes on 24/7 basis and their are not only CPDN exclusive - on servers as an e-mail/file server, another for accounting, my main box runs Photoshop, Quark and other 2D/3D graphics application, printing hundredes of pages, PDF files with 100+MB size - never got CPDN halting screen/keyboard whatever.

Thrue is, that I don\'t run Outlook here (but the office one does), don\'t run AV from Symantec (but all boxes a NOD powered) so can\'t say much about Outlook/Symantec vs CPDN.

I would exclude \\BOINC from AV scanning/shield.
37) Message boards : Number crunching : Problems with sulphur_4.22_windows_intel.exe? (Message 19063)
Posted 5 Jan 2006 by Profile Honza
Post:
I had the same report from 2 members of Czech Nation team...
Another batch of corrupted WUs?
38) Questions and Answers : Windows : Screensaver freezes PC (Message 18980)
Posted 4 Jan 2006 by Profile Honza
Post:
On a desktop, via it\'s properties, go to screensave tab and choose None (or Blank screen).
It seems that you set BOINC as a default screen-saver during installation process.
39) Questions and Answers : Windows : BOINC halts processing when network not available??? (Message 18979)
Posted 4 Jan 2006 by Profile Honza
Post:
Hi there,

What version of BOINC are you running?
Do you have \"Leave apps in memory\" set on in general preferences?
40) Message boards : Number crunching : Domino effect leads to unrecoverable errors (Message 18954)
Posted 3 Jan 2006 by Profile Honza
Post:
Still, if this is indeed a resource starvation problem like I think it is (\"There are no child processes to wait for\"), the client should have waited for resources to be free, not errored out. Right?
Problem well described.
At what priority is Outlook running? Lucky me - I never used Outlook on my system but found a lot of problems with it over office computers: eating-up all CPU resources as a common one.

How long should the BOINC core (client) waited for child processes (CPDN aplication)? If 30 minutes is not enough, should it wait even more in every case (e.g. Windows shutdown/restart)? I definitely would not waited half an hour until Windows restart. There must be a timeout limit...which was propably exeeded due to Outlook resource demands.

The other problem on Windows and single-CPU/core machine is that when one process (application) with high-priority demands all available CPU resources, there are no available resources to manage the problem.
Solution might be: (i) put both applications demanding resources on equal priority so they can share resources, (ii) having more CPU unit (dual-core, dual-CPU machine) so that each CPU unit handle each CPU resource demanding application (beware of processes CPU affinity), (iii) solution on OS level.

I would first check out Outlook priority, do some maintining of Outlook (compact database, degraf disk as such application tends to fragments large files which results in slower running) etc.
What AV solution are you using? There can be a connection with e-mail client.
41) Message boards : Number crunching : Two Tickles (Message 18949)
Posted 3 Jan 2006 by Profile Honza
Post:
No all \"Reason: To send trickle-up message\" are due to sending trickles.
I remember some models sending trickle even when not needed - but I believe this is due to wrong BOINC interpretation of connecting CPDN server.
I would not bother with it as far as model is doing fine and real trickles are listed on server on WU result page.
42) Message boards : Number crunching : Zip files (Message 18925)
Posted 2 Jan 2006 by Profile Honza
Post:
Actually, a completed sulphur model is zipped down to a little less than 1 GB at the end, but it can get up to 2.5 GB right before that.
Correct,
@ Ray and others: you may try to re-RAR the completed model package.
Slab went down from 340MB Zip files to 288MB RAR (best compression) so 85% of the original.
Sulphur use gZIP compression and went down from 1000MB gz to 882 MB RAR. Still will not fit to a single CD...but one may put 5 re-RARed SC models to a DVD-R instead of 4 original ZIPped.
43) Message boards : Number crunching : Old Unit (Message 18916)
Posted 1 Jan 2006 by Profile Honza
Post:
It is in my climate folder and is titled sulphur_data 4.22 windows_intelx86.zip. It includes jobs and datain\\anci\\ctidata\\datasets. Hope this is of some use.
This is a part of CPDN application, which is used with every Sulphur cycle model (WU). It is not generated during error nor model post-processing, but downloaded with the rest of the application package (sulphur_um_4.22_windows_intelx86.zip and sulphur_se_4.22_windows_intelx86.zip with some .exe files inside).
Basically, you need to keep all files in \\climateprediction.net.
Files in subfolder of \\climateprediction.net are dedicated to each of CPDN WU and those can be backuped/deleted after particular WU is completed.

This leads me to a \'backup issue\' which has been already discussed, for example here: http://www.climateprediction.net/board/viewtopic.php?t=2361

Hope it help.
44) Message boards : Number crunching : Old Unit (Message 18913)
Posted 1 Jan 2006 by Profile Honza
Post:
What is the name and location of the file?
Anythink interesting inside?
45) Message boards : Number crunching : Old Unit (Message 18910)
Posted 1 Jan 2006 by Profile Honza
Post:
Once WU is uploaded, you can backup/delete it\'s folder ( \\ sulphur_46vb_000295799_1 in this case).
You have another 3 WUs errored out just at the beginning...something now good is happening with your machine.
46) Message boards : Number crunching : Database Sluggish -- Moved Trickle Info \'Down\' (Message 18794)
Posted 28 Dec 2005 by Profile Honza
Post:
Perhaps a script of some kind can be added so as to update credit displays (from Trickles) every, say, 8 hours, so as to better handle the server load.
As it is , my trickles RAC is riseing but on the stats display pages it is falling.
As I understand, millions of trickles are to \'blame\' databse sluggish so recounting them several times a day is not an option (for the moment).
RAC never did good job for CPDN...
47) Message boards : Number crunching : slab / sulphur models, and where we are heading (Message 18791)
Posted 28 Dec 2005 by Profile Honza
Post:
...the big future forecast runs (the 1920 to 2080 hindcast/forecast runs that will be the \"grand finale\" of CPDN over the next year or two).

Ok, now I find this depressing... does this mean I can only look forward to at most 2 more years of CPDN?
Time goes different in Oxford so it\'s not 2 years of Earth time :-)
Atmospheric models, High resolution models, regional forecast...there is still plenty to go.
And, as Carl sayd, \"we will regenerate workunits based on past results\". There will be re-runs, re-checkings.

Carl, thanks for the update and keeping us informed. Many things got clear, some are still foggy to me (separated \'pool\', web etc.).
48) Message boards : Number crunching : Current timestep (Message 18774)
Posted 27 Dec 2005 by Profile Honza
Post:
It is in the \\projects\\climateprediction.net WU description file (updated every checkpoint I believe), e.g. sulphur_abpl_000481737.xml
<N>sulphur_abpl_000481737</N>
<PH>5</PH>
<TS>62544</TS>
<DAY>14</DAY>
<MTH>7</MTH>
<YR>1874</YR>

[BOINC forum doesn\'t allow to post xml tags; click reply button to see original tags].
49) Message boards : Number crunching : latest trickles info stopped (Message 18546)
Posted 21 Dec 2005 by Profile Honza
Post:
I see your point. Perhaps I\'m not as concerned with trickles (hence credit for some users). I do have only dual-cored machines.
Mainly i monitor progress of each model in % - BoincView does very good job over LAN/multi-cpu/cored machines. I watch for final upload and model complete then - it is the case where i dig into the model: final graph etc.

There is still a chance that Carl/Tolu will optimize db so that this feature will be ther again.
50) Message boards : Number crunching : Boinc on Dual Boot Machine (Message 18540)
Posted 21 Dec 2005 by Profile Honza
Post:
Hi there,

BOINC does not allow having a separate folder/partition for data; all goes in one folder and it\'s subfolders.
Even when you edit appropriate .xml files etc, new WU will go to the default folder.
I suggest to simply put BOINC folder to a place where both OSes have access.


Previous 20 · Next 20

©2024 climateprediction.net