Posts by old_user27607

1) Message boards : Number crunching : Missing Credits from Host Total (Message 43732)
Posted 2 Feb 2012 by Profile old_user27607
Post:
I have been seeing the same kind of problem on some, but not all, of my 10 systems. The score graph for each system shows either a slowly rising line or a level to slightly falling line.

Some of the increasing lines have leveled off, but none of the level lines have increased.

Tentatively, I can suggest that once a system gets dropped from the calculations, it stays dropped, or at least appears to. Alternatively, the final step of the calculations may have failed to complete.

Based on the recent history of computer problems, failures and equipment relocation, I can suggest that some communications link may have become overloaded, or one or more support programs are missing or not functioning.

If operations need specifics, please ask.

BillN
billntoo@fairpoint.net
2) Message boards : Number crunching : HadCM3 Full Resolution model low credits (Message 42167)
Posted 12 May 2011 by Profile old_user27607
Post:
Update: Thursday the 12th

Some progress has been made. Two systems are still offline, five systems are up, and five systems are still dropping. The totals are still changing, though by smaller amounts than before, which indicates that the scoring fix is not in yet.

But now I am optimistic that progress is being made. Soon, I hope, the scoring fix will get us out of this strange situation. I suspect more problems than just scoring popped up, or they decided to rearrange/redesign some of the systems. Clearly the problem(s) were more complex than we suspected.

BillN
3) Message boards : Number crunching : HadCM3 Full Resolution model low credits (Message 42130)
Posted 6 May 2011 by Profile old_user27607
Post:
Well, here it is Friday and it appears that the scoring fix is not in yet, at least judging from my systems' results.

Of 12 systems, two are offline. Of the working systems, three are up slightly and *seven* are down by small to medium amounts. I did see the PHKUP machine was down for a day or two, so that may be part of the trouble. It's back up as I just checked.

Looks like we have a little longer to wait. :-{

BillN
4) Message boards : Number crunching : HadCM3 Full Resolution model low credits (Message 42090)
Posted 2 May 2011 by Profile old_user27607
Post:
Thanks again for the response, your followup is appreciated.
I have also seen variable results in what appear to be good working systems, but until Wednesday or so, I'll wait for results and hope the problems vanish silently in the night. :-}

BillN
5) Message boards : Number crunching : HadCM3 Full Resolution model low credits (Message 42083)
Posted 1 May 2011 by Profile old_user27607
Post:
I am looking at the reported stats from CPDN for my systems as sent to
[http://climateapps2.oucs.ox.ac.uk/cpdnboinc/hosts_user.php?sort=name&rev=0&show_all=0&userid=27607]

Six day list:
date  S0   S1   S2    S3    S4   S5    S6   S7    S8    S9    S10  S11
24th  331  394  1541  1007  588  976   603  1573  1632  1318  639
25th  272  324  1507  985   482  1004  506  1566  1656  1541  524
26th  246  293  1517  975   437  1025  489  1592  1711  1631  475
27th  <no rec>
28th  202  240  1537  957   371  470   286  1120  1856  1653  389  1835
29th  183  218  1487  949   361  426   283  1130  1874  1702  353  1855
30th  166  197  1432  941   383  386   281  1150  1901  1665  319  1825

I can't see any consistent positive pattern except for S8. Maybe the credits are just not delivered yet, though I check almost every day to make a record.
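The same check can be sketched in a few lines of Python. This is only an illustration: the readings are copied from the six-day list above, and it assumes that the rows with eleven values cover S0 through S10 (so S11 has no reading before the 28th). The names `READINGS`, `column`, and `is_steadily_rising` are made up for this sketch.

```python
# Daily readings copied from the six-day list; None marks a value that was not recorded.
# Assumption: rows with only eleven values cover S0..S10, leaving S11 empty.
READINGS = {
    "24th": [331, 394, 1541, 1007, 588, 976, 603, 1573, 1632, 1318, 639, None],
    "25th": [272, 324, 1507, 985, 482, 1004, 506, 1566, 1656, 1541, 524, None],
    "26th": [246, 293, 1517, 975, 437, 1025, 489, 1592, 1711, 1631, 475, None],
    # 27th: no record taken
    "28th": [202, 240, 1537, 957, 371, 470, 286, 1120, 1856, 1653, 389, 1835],
    "29th": [183, 218, 1487, 949, 361, 426, 283, 1130, 1874, 1702, 353, 1855],
    "30th": [166, 197, 1432, 941, 383, 386, 281, 1150, 1901, 1665, 319, 1825],
}


def column(index):
    """Return the recorded values for one system, in date order."""
    return [row[index] for row in READINGS.values() if row[index] is not None]


def is_steadily_rising(values):
    """True when every reading is higher than the one before it."""
    return all(later > earlier for earlier, later in zip(values, values[1:]))


# Systems whose score went up at every recorded step.
rising = [f"S{i}" for i in range(12) if is_steadily_rising(column(i))]

# Net change from the first to the last recorded reading for each system.
net_change = {f"S{i}": column(i)[-1] - column(i)[0] for i in range(12)}

print("Steadily rising:", rising)
for name, change in net_change.items():
    print(f"{name}: {change:+d}")
```

Under that column assumption, the only system that rises at every step is S8, which matches the reading by eye.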

If you have a clue, please let me know. :-}

BillN
6) Message boards : Number crunching : HadCM3 Full Resolution model low credits (Message 42075)
Posted 30 Apr 2011 by Profile old_user27607
Post:
Well, I spoke too soon. A few of the 12 systems I run jumped up, and I expected the others to follow. They did not.

Based on my records, taken each day on all systems, from Apr 24th thru 30th, all systems have reverted back to the previous LOW numbers and despite most systems running 24/7, the totals are now stepping down slowly, with two exceptions not running the new codes.

Apparently the code fix for tiny credits has been pulled or reverted without any notice that I have seen. Once again I request that this problem be officially placed on the fix list, hopefully this time fixed for good.

With all the work and changes going on, I'm not completely surprised given the excessively small staff and the very large job and set of responsibilities that they carry. But I do hope for a complete and permanent fix this time around.

<sigh>
Bill Nicholls
7) Message boards : Number crunching : HadCM3 Full Resolution model low credits (Message 41979)
Posted 12 Apr 2011 by Profile old_user27607
Post:
These folks are as good as their word, and fast too. As of 12Apr2011, I have jumped up about 2500 points since yesterday.

My compliments on the fast work. Thanks for the extra effort.

BillN
8) Message boards : Number crunching : HadCM3 Full Resolution model low credits (Message 41944)
Posted 9 Apr 2011 by Profile old_user27607
Post:
The phrase 'Low Credits' isn't strong enough by a mile. I'm getting numbers more than an order of magnitude lower.

Example: Dual core 3GHz cpu running 24/7: After 9 days I have 47 credits!
That particular CPU has been with me a while - my records show it first appeared in Jan 2009. Two months later it was at 1200 credits and has been in that area since.

Currently, after 9 days it was up to a stunning 47, now at 12 days it is *down* to 35 despite running constantly. I have several other systems with the same syndrome - crunching out numbers as fast as usual and losing ground while doing it.

I've been patient through the transition to no models and back up, but this is an insult to everyone who pays for extra electricity to run these models and provide computation that CPDN could not afford otherwise. Production and position are the *only* rewards we get, and now both of these are in the negative column.

I'm going to hang in for another two weeks, then start shifting my systems unless there is significant progress in turning this negative feedback into positive. Anyone who thinks that credit doesn't matter will be surprised by the fallout if this problem continues much longer.

Ignore this issue at CPDN's risk.

BillN
9) Message boards : Number crunching : Current model AMD Processors "Not Available for your Type.." (Message 41634)
Posted 14 Feb 2011 by Profile old_user27607
Post:
Thanks for the quick response. Please consider my systems as a testing base when the new models come in.

BillN
10) Message boards : Number crunching : Current model AMD Processors "Not Available for your Type.." (Message 41630)
Posted 13 Feb 2011 by Profile old_user27607
Post:
I've been getting these messages on a number of my systems with late generation AMD processors indicating that something in my processor won't work with all of the new work units (except Famous) being sent out.
Is this correct? What is the cause - use of AVX instructions? When will WUs be available for AMD processors?

FYI, I am an all AMD install and my workload on 12 systems is rapidly falling off. Please advise ASAP.
Thanks,
BillN
11) Message boards : Number crunching : No credits in last 3 days??? (Message 41251)
Posted 11 Dec 2010 by Profile old_user27607
Post:
I've got the same problem - no updates for three days - and this is not the first time! Two weeks ago (approx), same thing. My refresh requests simply show the same data even though the ten systems are running.

Please fix asap!

BillN
12) Questions and Answers : Windows : CPU Benchmarks Do Not Exit! (Message 37506)
Posted 19 Jul 2009 by Profile old_user27607
Post:
Has anything, hardware or software, changed on that PC since the last time that things were working well?


Nope. System has run in current config for about a year.

The only recent thing was a run of IOLO Technologies System Mechanic. It is possible the automatic cleanup may have removed something essential to CPDN. If that is the case, I should be able to stop BOINC, DL and install the latest vers, and restart.

I'll do this and report shortly.

BillN


Well, that was er, interesting.
I could not kill the BOINC app with the process manager. Despite showing the icon gone, the CPU remained at 100%. So I rebooted, waited while the system ran Chkdsk on C:, unfortunately a large disk. It finished before I got too hungry. :-}
Upon reboot, BOINC came up completely normally!

Apparently, the reboot required by IOLO SM was somehow holding the CPU test from completing. Darned if I have a clue how. But all's well that ends.

BillN
13) Questions and Answers : Windows : CPU Benchmarks Do Not Exit! (Message 37505)
Posted 19 Jul 2009 by Profile old_user27607
Post:
Has anything, hardware or software, changed on that PC since the last time that things were working well?


Nope. System has run in current config for about a year.

The only recent thing was a run of IOLO Technologies System Mechanic. It is possible the automatic cleanup may have removed something essential to CPDN. If that is the case, I should be able to stop BOINC, DL and install the latest vers, and restart.

I'll do this and report shortly.

BillN
14) Questions and Answers : Windows : CPU Benchmarks Do Not Exit! (Message 37499)
Posted 18 Jul 2009 by Profile old_user27607
Post:
XP x86 on an AMD X2 5200 with 4 GB RAM, 3.25 GB accessible.

Shutdown and restart leads to same hang. HELP!

BillN

7/18/2009 11:12:52 AM Starting BOINC client version 6.6.28 for windows_intelx86
7/18/2009 11:12:52 AM log flags: task, file_xfer, sched_ops
7/18/2009 11:12:52 AM Libraries: libcurl/7.19.4 OpenSSL/0.9.8j zlib/1.2.3
7/18/2009 11:12:52 AM Running as a daemon
7/18/2009 11:12:52 AM Data directory: E:\boinc\data
7/18/2009 11:12:52 AM Running under account boinc_master
7/18/2009 11:12:52 AM Processor: 2 AuthenticAMD AMD Athlon(tm) 64 X2 Dual Core Processor 5200+ [x86 Family 15 Model 67 Stepping 2]
7/18/2009 11:12:52 AM Processor features: fpu tsc pae nx sse sse2 3dnow mmx
7/18/2009 11:12:52 AM OS: Microsoft Windows XP: Professional x86 Edition, Service Pack 3, (05.01.2600.00)
7/18/2009 11:12:52 AM Memory: 3.25 GB physical, 5.09 GB virtual
7/18/2009 11:12:52 AM Disk: 279.47 GB total, 105.63 GB free
7/18/2009 11:12:52 AM Local time is UTC -7 hours
7/18/2009 11:12:53 AM No CUDA devices found
7/18/2009 11:12:53 AM No coprocessors
7/18/2009 11:12:53 AM Not using a proxy
7/18/2009 11:12:53 AM climateprediction.net URL: http://climateprediction.net/; Computer ID: 976633; location: home; project prefs: home
7/18/2009 11:12:53 AM climateprediction.net General prefs: from climateprediction.net (last modified 30-Sep-2008 19:54:20)
7/18/2009 11:12:53 AM climateprediction.net Computer location: home
7/18/2009 11:12:53 AM General prefs: using separate prefs for home
7/18/2009 11:12:53 AM Reading preferences override file
7/18/2009 11:12:53 AM Preferences limit memory usage when active to 2994.73MB
7/18/2009 11:12:53 AM Preferences limit memory usage when idle to 3327.48MB
7/18/2009 11:12:53 AM Preferences limit disk usage to 10.00GB
7/18/2009 11:12:54 AM climateprediction.net Restarting task hadcm3istd_cswa_1920_160_06021582_5 using hadcm3i version 604
7/18/2009 11:12:54 AM climateprediction.net Restarting task hadcm3istd_csw7_1920_160_06021579_4 using hadcm3i version 604
7/19/2009 11:20:35 AM Running CPU benchmarks
7/19/2009 11:20:35 AM Suspending computation - running CPU benchmarks
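For anyone who wants to check where a client log like this stops, here is a minimal sketch that parses the timestamps and reports the last logged event. The three lines are copied from the log above. The timestamp format string and the helper name `parse_line` are assumptions for illustration and are not part of BOINC.

```python
from datetime import datetime

# Three lines copied from the log above: the last task restart and the final two entries.
LOG_EXCERPT = """\
7/18/2009 11:12:54 AM climateprediction.net Restarting task hadcm3istd_csw7_1920_160_06021579_4 using hadcm3i version 604
7/19/2009 11:20:35 AM Running CPU benchmarks
7/19/2009 11:20:35 AM Suspending computation - running CPU benchmarks
"""


def parse_line(line):
    """Split a log line into (timestamp, message).

    Assumes the 'M/D/YYYY H:MM:SS AM|PM' layout seen in this log.
    """
    date_part, time_part, meridiem, message = line.split(" ", 3)
    stamp = datetime.strptime(
        f"{date_part} {time_part} {meridiem}", "%m/%d/%Y %I:%M:%S %p"
    )
    return stamp, message


entries = [parse_line(line) for line in LOG_EXCERPT.splitlines() if line.strip()]

# The last entry shows what the client was doing when the log went quiet.
last_stamp, last_message = entries[-1]

# Time between the task restart and the start of the benchmarks.
gap = entries[1][0] - entries[0][0]

print("Last logged event:", last_stamp, "-", last_message)
print("Gap before benchmarks started:", gap)
```

If nothing follows the "Suspending computation" line, the benchmarks never reported completion, which is consistent with the hang described in the post.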
15) Message boards : Number crunching : HADSM3-MH Crash and Re-Set? (Message 36666)
Posted 9 Apr 2009 by Profile old_user27607
Post:
"Bill,

Looks like http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=7782677 stopped trickling on April 4th after the end of the first phase, however, it didn't get reported as an error until April 7th, when a new model was downloaded. So three days without credit on one core there. The other model downloaded on March 31st and the new model appear to be moving along fine."

Thanks. The new numbers show it up at 421. But it was very mysterious, atypical for my systems.
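A rough sketch of why a few days without granted credit can pull the displayed figure down. It assumes the numbers in question are BOINC's recent average credit (RAC) and that RAC decays with a one-week half-life, which is BOINC's usual default. The project's actual setting is not stated in this thread. The function name `decayed_rac` and the starting value are hypothetical.

```python
import math

# Assumption: one-week half-life, BOINC's usual default for recent average credit.
HALF_LIFE_DAYS = 7.0


def decayed_rac(rac, idle_days, half_life_days=HALF_LIFE_DAYS):
    """Return the RAC left after `idle_days` with no new credit granted.

    Models only the exponential decay term; it ignores any credit
    that arrives during the idle period.
    """
    return rac * math.exp(-idle_days * math.log(2) / half_life_days)


# Hypothetical example: one core contributing 350 RAC goes three days without credit.
per_core_rac = 350.0
after_gap = decayed_rac(per_core_rac, idle_days=3)

print(f"RAC after a 3-day gap: {after_gap:.0f}")
print(f"RAC after a full week: {decayed_rac(per_core_rac, idle_days=7):.0f}")
```

With these assumptions a three-day gap costs roughly a quarter of that core's contribution, and a full week halves it, so the total can fall even while the other core keeps earning credit.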

BillN
16) Message boards : Number crunching : HADSM3-MH Crash and Re-Set? (Message 36663)
Posted 9 Apr 2009 by Profile old_user27607
Post:
I have a pair of HADSM3fub_kc?? running in an AMD X2 4600. In past use, it peaks out at approx 700 RAC.

With these runs, it has peaked at about 350, half the usual. I checked, both copies are running, both cores active.

What's happening? Any suggestions?

BillN
17) Message boards : Number crunching : SPEED DIFFERENCES BETWEEN AMD & INTEL ON HADAM3P (Message 36640)
Posted 7 Apr 2009 by Profile old_user27607
Post:
Well, Intel's older Fortran compilers tested for "Genuine Intel" in the chip, and would only generate SSE2/SSE3 code if it were there.

However, probably due to AMD's antitrust suit, Intel belatedly removed the test. There is still a difference between SSE4a and b as I recall, AMD only showing the A version last time I checked.

YMMV

BillN
18) Questions and Answers : Wish list : Using GPUs for number crunching (Message 36633)
Posted 6 Apr 2009 by Profile old_user27607
Post:
Thanks to all who provided the explanations of the bottleneck.

It appears that it is a search performance problem: Massive DB plus complex searches can take a long time. It is also obvious (I think) that the staff has already considered using a Postgres (or other) DB with indexing on the search parameters.

Either it was not the right fit, or perhaps they are just too short of staff. From my limited POV, this might solve some problems, but unless they get critical, staff seem already oversubscribed. :-}

Unless a DB expert can do the design and setup, and the staff can install and run it, there doesn't seem to be an early solution, unfortunately. :-{

The good news is that the compute side is working well. Hope the data side can catch up. I'll be available to help if needed.

BillN
19) Questions and Answers : Wish list : Using GPUs for number crunching (Message 36618)
Posted 3 Apr 2009 by Profile old_user27607
Post:
Yikes! We're doing more work than you can process!

Well, maybe it's time for some volunteer processor time for results analysis. Maybe the code can be easily converted, maybe not, but it must be easier than converting CPDN by a lot.

And if they are using a commercial analysis tool, it may already be available for micros.

I'm ready to volunteer one or two machines for this work. What would it take?
PS. Physics degree, long time programmer, starting with Fortran.

BillN
20) Message boards : Number crunching : Per Computer Results Missing Names (Message 35636)
Posted 3 Dec 2008 by Profile old_user27607
Post:
I am amazed to find that the web display of computer results has dropped the names without consideration for users tracking their work.

I have over *three years* of daily performance in a spreadsheet, organized by system name. Now I'm Screwed!

Please return the *option* to list by name instead of number, which I cannot match to my current systems, since some are duplicate configurations.

Gadzooks guys, let's not drop useful stuff!
BillN

