Posts by Dusty

21) Message boards : Number crunching : Strange problem with dual Xeon machine (Message 52282)
Posted 5 Apr 2008 by Dusty
Post:
I verified they are in DIMM_00, etc. I've been changing BIOS settings one at a time, rebooting, then re-running Sandra. Branch Interleaving gives the best performance regarding bandwidth efficiency, and sparing is disabled. As for Rank Interleaving, I tried all three choices and 4:1 was the best, but even the worst didn't lower my bandwidth efficiency by more than 10%. As of right now, it's showing 52% efficient and that's the best I've been able to reach.

It would be interesting swapping out all 4 4-gig DIMMS and replacing them with 1-gig DIMMS, but that would be an expensive test... I think I'll wait for the video card to arrive and see how that improves bandwidth.

I've been rebooting the machine so often this morning that I question the validity of the last two results posted for that machine. Rosetta says the claimed credit was approx 15, but the granted credit was 32. I wonder if that's because I boosted the Ratio CMOS Setting from 6 to 8, and now the CPU is performing much better even though memory bandwidth is still suffering. The Rosetta benchmarks were based on the CPU before I made the changes...

Now I'm back to running 6 processes. I noticed that whenever I change the number of CPUs available, Rosetta re-runs the benchmark test. We'll see what this next batch shows in a few hours. I've just about changed every BIOS setting I can find, so I think I've run out of options there to improve memory bandwidth efficiency.

Interestingly enough, I just finished running Sandra on my two other 2.4 Ghz Quad machines. They both have 4 gigs of DDR800 RAM installed (so no excess RAM for XP Pro32 to be confused over), and both have PCI Express V1 video cards (Intel D975XBX2KR Mobos). They both show memory bandwidth efficiencies of 56 and 58%. So, it seems that my memory bandwidth deficiencies are not limited to the Dual Xeon machine alone. Granted credits for those two machines are in the low 50's.

My head hurts.... ;o)


Well, if system has only 1 stick, the memory-bandwidth should drop to half (or possibly 1/4th) of currently, meaning much worse than currently. If it's not dropping, it atleast indicates win32 is only using 1 memory-stick...

Taking a look on the manual, wow, 16 memory-slots, 4 channels, so should atleast in theory have a ton of memory-bandwidth...

Hmm, with 4 sticks, the optimal is to put one in each "channel". Make sure it's in "DIMM_00", "DIMM_10", "DIMM_20", "DIMM_30", it's likely easy to put it wrong... Hmm, would guess the BIOS "System Memory Information" will show there each stick is placed?

In BIOS, on "NorthBridge Chipset Configuration", would guess the optimal is:

"MCH Branch Mode" - Interleaving

"*** sparing" - disabled

"Branc 0/1" - enabled

"Rank Interleaving" - good question... Hmm, not sure if 1:1 is best here or not, with 2 sticks in each "branch"... You'll have to test this...



22) Message boards : Number crunching : Strange problem with dual Xeon machine (Message 52275)
Posted 5 Apr 2008 by Dusty
Post:
I just noticed that my memory Bandwidth Efficiency % in Sandra is only 52.48!!! The program mentioned the risks of sharing bandwidth with the on-board VGA adapter (just like Paul D. Buck mentioned earlier in this thread), so when the new PCIExpress-2.0 card arrives Monday it will be interesting to see how it impacts the memory bandwidth efficiency. I was really shocked to see how low it really is on my machine!

I removed two banks of 4G FBDIMMs, making sure they were not Slots 0 & 1. My Bandwidth Efficiency dropped to 37.03%!!

So, even though XP Pro/32 cannot utilize anything above 4 gig, apparently the system can use the DIMMs to increase bandwidth. Interesting!!!


Yes, if running 1 instance shows example 50 credit/hour per core and 8 instances shows 30 credit/hour per core, it's a clear indication your computer is memory-bandwidth-limited in Rosetta@home. In this example Rosetta@home is likely maxed-out at 5 or 6 cores, meaning it's probably possible to find another non-memory-bandwidth-limited BOINC-project and run this for 25% of the time "for free", since Rosetta@home will get the same credit/day regardless of uses all 8 cores or only 6 of them...

Or, switching to faster memory would increase Rosetta@home-production...

If on the other hand 1 instance gives example 50 credit/hour per core and 8 instances gives 47 credit/hour per core, it indicates Rosetta@home is not memory-bandwidth-limited, and there's likely another reason for your computers mediocre Rosetta@home-production...


As for switching to 64-bit OS, this should be done regardless of whatever test-results you're getting, since running an OS that can't use 78% of installed memory doesn't make much sence...

Wouldn't expect OS-switch will change anything significantly for Rosetta@home...
... Except...
I've no idea if it's true or not for some mainboards, but if you're running 4x 4 GB-memory-sticks, it's maybe possible your mainboard somehow only uses the 1st. memory-stick in win32, so in practice it's single-channel-mode in win32...
23) Message boards : Number crunching : Strange problem with dual Xeon machine (Message 52272)
Posted 5 Apr 2008 by Dusty
Post:
Wow, so much to learn... Many thanks to everyone for helping answer my myriad questions regarding system performance!!!

I'm running two WU's right now and have about 3 hours till they are completed. Then I'll see what credit I received for them.

Yes, I was considering removing the other 3 sticks of 4-gig RAM; it's just that I don't have another FBDIMM-capable motherboard laying around....However, if it's somehow slowing me down leaving them in there, then I'll remove them!

Please see my rather lengthy post on Sandra testing this morning. I got some significant improvements in CPU performance; although memory bandwidth due to FSB is still a problem. I'm still trying to figure out how to increase FSB in my BIOS. I think it's the Ratio CMOS setting, and that's now set to the max of 8 (it was at 6). Actually, according to Sandra's Computer Overview section, my FSB is running at 1.33Ghz. Why I'm still getting an FSB-limited error under Memory Bandwidth testing is very confusing.



Yes, if running 1 instance shows example 50 credit/hour per core and 8 instances shows 30 credit/hour per core, it's a clear indication your computer is memory-bandwidth-limited in Rosetta@home. In this example Rosetta@home is likely maxed-out at 5 or 6 cores, meaning it's probably possible to find another non-memory-bandwidth-limited BOINC-project and run this for 25% of the time "for free", since Rosetta@home will get the same credit/day regardless of uses all 8 cores or only 6 of them...

Or, switching to faster memory would increase Rosetta@home-production...

If on the other hand 1 instance gives example 50 credit/hour per core and 8 instances gives 47 credit/hour per core, it indicates Rosetta@home is not memory-bandwidth-limited, and there's likely another reason for your computers mediocre Rosetta@home-production...


As for switching to 64-bit OS, this should be done regardless of whatever test-results you're getting, since running an OS that can't use 78% of installed memory doesn't make much sence...

Wouldn't expect OS-switch will change anything significantly for Rosetta@home...
... Except...
I've no idea if it's true or not for some mainboards, but if you're running 4x 4 GB-memory-sticks, it's maybe possible your mainboard somehow only uses the 1st. memory-stick in win32, so in practice it's single-channel-mode in win32...
24) Message boards : Number crunching : Strange problem with dual Xeon machine (Message 52270)
Posted 5 Apr 2008 by Dusty
Post:
Running the memory Bandwidth test in Sandra revealed a memory issue I'm trying to figure out. There were two notes after running the benchmark (My system was at the bottom of the chart as far as performance goes). The first note says, "System bandwidth appears FSB limited. Attempt to increase FSB." The second note says, "Low bandwith efficiency (advanced). Check memory timings and settings." I changed the memory configuration from Branch Sequencing to Branch Interleaving under the Northbridge Chipset Configuration menu, and I no longer get the "Low bandwidth efficiency" error message, but I still get the FSB Limited error message.

Where do I change FSB? In the BIOS under Advanced CPU Settings , there is no option for changing FSB. There IS an option for Ratio CMOS Setting, and I've set it to the max of 8. It was originally the default of 6. I also have Virtualization Technology disabled. I don't know what would be limiting my FSB. All BIOS options to slow the CPU down for overheating (CPU TM, Speedstep) are disabled.

There is also a Rank Interleaving option under the Northbridge Chipset Configuration. The default is 4:1, and that's what mine is set at. There is also a 1:1 and a 2:1 option, but I haven't tried those options yet. Would changing this improve performance, or is it already set at the best setting?

My CPUs are 1333 FSB capable, and I'm sure the ASUS DSEB-D16/SAS Mobo is also. There ia a note in the Mobo manual that says, "The FBDIMM 800 Mhz has to work with the 1600FSB CPU or above. Otherwise, the memory module downgrades and runs at the speed of 667Mhz." I'm using PC2-5300 DDR667 FBDIMMs anyway, and since the Harpertown CPUs only run at 1333FSB, the memory shouldn't be downgrading. Sandra shows my memory timings at 5.0-5-5-15.

One final note. After changing the Northbridge chipset from Branch Sequencing to Branch Interleaving, and forcing the BIOS Ratio CMOS setting from 6 to 8, I re-ran the Processor Arithmatic Test on Sandra, and this time my 2.66 Harpertown is beating the 2.33Ghz Clovertown comparison CPU. My numbers are now Dhrystone ALU 89834 MIPS and Whetstone iSSE3 77535 MFLOPS. I'm still getting the bandwidth FSB Limited error message under the Memory Bandwidth test, however.

I wonder if it would just balance out--less WUs complete, but fewer ones completed faster for more credit per WU....


I think the point was that if a test running say 2 WUs shows better credit per hour per core (with 2 cores) then when running 8 WUs at once, it would basically prove that memory is constrained and therefore it would indicate it might be worth installing an OS that can support all of your memory. Then you could expect to run all 8 at once and yield the same (better) credit per core you saw while running 2.
25) Message boards : Number crunching : Strange problem with dual Xeon machine (Message 52266)
Posted 5 Apr 2008 by Dusty
Post:
I see. I didn't catch that. Thank you! I'll give it a try right away.

I wonder if it would just balance out--less WUs complete, but fewer ones completed faster for more credit per WU....


I think the point was that if a test running say 2 WUs shows better credit per hour per core (with 2 cores) then when running 8 WUs at once, it would basically prove that memory is constrained and therefore it would indicate it might be worth installing an OS that can support all of your memory. Then you could expect to run all 8 at once and yield the same (better) credit per core you saw while running 2.

26) Message boards : Number crunching : Strange problem with dual Xeon machine (Message 52264)
Posted 5 Apr 2008 by Dusty
Post:
Thank you!! I completely forgot about the memory limitation ceiling for 32-bit machine. That explains why it only shows 4 gig of memory even though I have 16 gig installed.

I was looking in my BIOS to configure the memory. They have several different options: Rank Interleaving 1:1, 2:1 and 4:1. The default is 4:1, and that's where I left it. It also allows Branch Sparing, but it's default is disabled, so that's where I kept it too.

I'll have to see whether I get more credit per WU by running less WU's at once, but in the long run I wonder if it would just balance out--less WUs complete, but fewer ones completed faster for more credit per WU....



While win-64 will probably have little or no effect in Rosetta@home, didn't you say your system has 16 GB ram? Meaning, over 75% of the installed memory can't be used as long as runs 32-bit...

The BOINC-benchmark seems to be higher on 64-bit, especially integer, but this isn't a good indication of actual performance-increase.


I've no idea if Rosetta@home is influenced by cache-size and memory-bandwidth, so is possibly memory-bandwidth-limited then tries to run 8 instances even with large cache-size... One method to test this would be to see how performs then running only 1 instance, 2 instances, 3 ... upto 8. But, with the large variations in rosetta-wu's, should preferably test this by running the exact same wu on all cores...

Hmm, it would be possible to test if single/dual-channel-memory has any effect, even if runs 8 different wu's...


27) Message boards : Number crunching : Strange problem with dual Xeon machine (Message 52262)
Posted 5 Apr 2008 by Dusty
Post:
I downloaded Sandra Lite this morning, and the Benchmark test is coming up with the same Integer speed as Rosetta Benchmarks-approx 48563 Dhrystone MIPS. Strangely, one of the comparison CPU's in Sandra is a Clovertown 2.33 Ghz model (E5345) which shows 84426 MIPs. Whetstone is similar: 41932 MFLOPS for mine (2.66 Ghz E5430 Harpertown) and 58749 MFLOPS for the 2.33 Clovertown. Since the 2.33 Ghz Clovertown was actually cheaper than my Harpertown processors, I'm scratching my head at these results....


Could the difference be because he's using XP PRO/64 while I'm using Pro/32?

Nice machine :D

No - 64 bit isn't an advantage for BOINC/Rosetta at the moment (might be in the future). Even with those benchmarks you should be getting higher scores for your granted credit... What are your preferences set as - i.e. do you have 'use at least 8 CPUs' and use 100% of CPU?
You could download Sandra Lite and running a few benchmarks, but it sounds like it's running fine. Are there 8 Rosetta processes each using ~12% CPU utilisation? (easiest way to check is to get Task Manager up and have the 'CPU time' column showing and then sort the column by that so the longest running threads are at the top...
28) Message boards : Number crunching : Strange problem with dual Xeon machine (Message 52259)
Posted 5 Apr 2008 by Dusty
Post:
Thank you very, very much! I'm sure that my tweaking bios settings a couple times a day isn't helping at all, so I'm trying to do it as little as possible.

I really appreciate you taking the time to search for identical work units across my systems and compare them. I hadn't thought of that!

Mod.Sense - can you have a look at this as the scores don't look right. Any ideas?


I don't have any additional means to look in to this either. But let's compare your 2.66 Ghz Xeon with your 3Ghz P4.

I found WUs with the same name and batch number in the results list for each.

Xeon WU CPU seconds: 8,074 Models: 2 claimed: 33.27 granted: 10.39
P4 WU CPU seconds: 9,430 Models: 3 claimed: 20.60 granted: 15.57

Credit is granted based on the completed models. The Xeon only completed two, and so received 2/3'rds the credit of the P4 which completed 3.

But your Xeon is running 8 CPUs, and the P4 only 2. So if you calculated credit per hour of CPU on the above, the Xeon is pulling much more credit per hour. (37.06 vs 11.89). And once it has been reporting in work consistently for 2 weeks, you will see this reflected in the RAC for the Xeon.

29) Message boards : Number crunching : Strange problem with dual Xeon machine (Message 52257)
Posted 5 Apr 2008 by Dusty
Post:
Thanks; regarding the memory, that's a big relief. I keep thinking that something is grossly wrong in my CMOS, but for the most part I have the defaults set. I do have it set so that the CPU doesn't slow down if it gets hot, and I installed a 4-fan controller and are keeping the case fans on max.

I still can't get over why the Integer speed is so low.

Thanks for the link to CPUID! Running that, it says the CPU is running at 2.66Ghz, but I was surpised that the max bandwidth of the FBDIMM PC2-5300 DDR2 667 ram is only 333Mhz. I guess they split the bandwidth for each CPU? I have one 4-Gig Ram chip in Slot 0 of each of 4 banks. This RAM was recommended by ASUS as being compatible, but I just couldn't afford faster RAM.


Your RAM is fast enough. CPUZ reports 333MHz as it's DDR (double data rate) so 333MHz frequency is correct, and will have very little effect as it's pretty quick anyway, and the large cache on those CPUs makes a big difference too.

I'd leave CPUz running for a while and montior the CPU speed just incase it is dropping the CPU speed. It will do that if the CPUs get too hot too.

HTH
Danny

30) Message boards : Number crunching : Strange problem with dual Xeon machine (Message 52255)
Posted 5 Apr 2008 by Dusty
Post:
I just checked. It's set to the default 16 processors and 100% CPU time. In Task Manager, it shows each process is getting 12%. Thanks for the suggestions!

Could the difference be because he's using XP PRO/64 while I'm using Pro/32?

Nice machine :D

No - 64 bit isn't an advantage for BOINC/Rosetta at the moment (might be in the future). Even with those benchmarks you should be getting higher scores for your granted credit... What are your preferences set as - i.e. do you have 'use at least 8 CPUs' and use 100% of CPU?
You could download Sandra Lite and running a few benchmarks, but it sounds like it's running fine. Are there 8 Rosetta processes each using ~12% CPU utilisation? (easiest way to check is to get Task Manager up and have the 'CPU time' column showing and then sort the column by that so the longest running threads are at the top...

31) Message boards : Number crunching : Newbie Q&A, if you're new, have a view! (Message 52253)
Posted 5 Apr 2008 by Dusty
Post:
I've gone through the entire "Newbie Q&A," but I didn't find this question asked:

What are the optimum CMOS settings to maximize System/CPU/Memory efficiency for Rosetta WITHOUT overclocking?

For example, my Mobo has the option for "virtualization technology." Does having this enabled help or hurt Rosetta performance?

I'd like to tweak my CMOS and reboot, but because it's running 8 processes at once, I always lose credit for several of the processes because they aren't far enough along to remember how far I've gotten, and they start back at zero. It would be nice if these 'markers' were more frequent.

I know I'm asking a lot, but making this thread just about efficiency and not about overclocking would make it really useful for me, since I tend to just go with the default settings and don't know if it's hurting me.
32) Message boards : Number crunching : Strange problem with dual Xeon machine (Message 52249)
Posted 5 Apr 2008 by Dusty
Post:
Thanks. I haven't been using the system at all; just running Rosetta on it.

Comparing the stats in Rosetta of my 2.4Ghz QuadCore CPU to the 2.6Ghz dual Quad Xeon CPU is interesting. The 2.6 Ghz Dual Xeon CPU machine's FPS is 2623.61 and has an Integer speed of 4498.38, while the QuadCore measures 2355.9 FPS and 5410.25 Integer speed. And the Xeon processors have a 12MB L2 Cache. Hmmmmmm. Don't know what to make of that slower Integer speed.

Further, comparing it to another 2.66 Dual CPU Xeon X5355 machine (although not the same exact model) on BOINC showed that computer (owned by ROBiie) showed a FPS of 2531.02 but an astounding 8193.16 Integer speed!

That's the performance I was looking for--especially since my Xeon processors are the 45nm Hi-K models Intel's been ranting about. Could the difference be because he's using XP PRO/64 while I'm using Pro/32?

Thanks again!

a graphics card won't make any difference to Rosetta on that machine, but it might improve surfing etc if you use it for that?

There's definitely something wrong, but I can't see what. I don't recommend this often, but maybe a project reset is required...

Mod.Sense - can you have a look at this as the scores don't look right. Any ideas?
33) Message boards : Number crunching : Strange problem with dual Xeon machine (Message 52247)
Posted 4 Apr 2008 by Dusty
Post:
I see! This wasn't a cheap board by any means. I went with this one because I hadn't seen very great reviews of the Intel 5400X Dual LGA 771 board (although I'm normally a real proponent of genuine Intel). I'll have the video card by next week. I just finished putting in a Fan controller, since the CPU fans are running high (but not max). I think I need to upgrade the fans in this Thermaltake Armour case as well.

Thanks again!

The video on the mother board is very much an issue. Because ANY updates to video uses the main memory bus to make changes to video memory. So, the off board memory is the way to go, even with a cheap $29 card.

So, if you are using the system and even doing something like surfing it can make a big differnece.

This is why a slightly more expensive MB can make all the diffence in the world.

I once changed nothing more than the MB, same CPU, memory, Video card, HDD and saw a 20%+ increase in speed. Part was the way memory was accessed. single vs dual channel ... but, it was just a better MB and well, it was on sale ...

34) Message boards : Number crunching : Strange problem with dual Xeon machine (Message 52243)
Posted 4 Apr 2008 by Dusty
Post:
Thanks for the link to CPUID! Running that, it says the CPU is running at 2.66Ghz, but I was surpised that the max bandwidth of the FBDIMM PC2-5300 DDR2 667 ram is only 333Mhz. I guess they split the bandwidth for each CPU? I have one 4-Gig Ram chip in Slot 0 of each of 4 banks. This RAM was recommended by ASUS as being compatible, but I just couldn't afford faster RAM.

I'll work on bringing temps down, but since the CPU Step function is disabled, at least I know that isn't the problem. Thanks again!

if it is that machine (E5430 Xeon) then it has 12MB L2 cache which it looks like BOINC can't read/report correctly. It doesn't support HT - it's a true 2x4 core machine.

There is something wrong with the credit though - it should be claiming and receiving far more than it is. Is speedstep kicking in?

[/quote]
Speedstep turns down the speed of your CPUs when they're not in use. An idle priority task (rosetta) might not qualify as 'in use' and so speedstep might be kicking in. Easiest way to tell would be to run cpuz - http://www.cpuid.com/cpuz.php which will show you the speed of your CPUs in real-time. If it varies depending on what you're doing then Speedstep is enabled and might be the cause of the low scores/throughput. You'll be able to change the settings for that in the BIOS but I think there's a way to control it from Windows too.

You're almost certainly right about the L1/L2 issue too - it looks like BOINC is reporting the wrong number!

HTH[/quote]
[/quote]
35) Message boards : Number crunching : Strange problem with dual Xeon machine (Message 52241)
Posted 4 Apr 2008 by Dusty
Post:
Great suggestions! I checked the ASUS website, and I have the most current MB BIOS. None of the drivers that came with the Mobo were for XP Pro..just Win Server 2003, so I emailed their tech support and he gave me links to XP drivers for it. The video is on the motherboard; a measley 32 meg. I've ordered a PCI Express 2.0 card for it, but will that make much of difference since I don't use the screensaver anyway?

Anyway, thanks for the great suggestions! I hope to figure something out. I just can't understand out why the system anticipates I'll earn 30+ credits, but I only end up with about 10. I'll keep tinkering!

Mark

Other things to check are that you have the most up to date drivers for the motherboard, video card and OS...

You can get a brand new MB and have it have a really old driver/bios ...

A pain, but if this continues that needs to be added to the checklist.

Also check that heat is within tolerances as most processors have "self-help" that will slow the processor for overheat.

There may be other settings in the BIOS that can cause variable processor speeds.

Just for ha-has you could also try another project to see if the same kind of thing is happening there ... sometimes you have to turn over a lot of rocks to isolate an odd problem here and there ...

36) Message boards : Number crunching : Strange problem with dual Xeon machine (Message 52237)
Posted 4 Apr 2008 by Dusty
Post:
Thanks for the clarification! Since Speedstep is off in the BIOS, it isn't the problem. It's nice to know what it's for, however. There isn't much help in the Mobo manual about BIOS functionality (which is more often the case than not).
Mark

I'm not sure what "speedstep" is. Where would I see that reported as 'kicking in?'

if it is that machine (E5430 Xeon) then it has 12MB L2 cache which it looks like BOINC can't read/report correctly. It doesn't support HT - it's a true 2x4 core machine.

There is something wrong with the credit though - it should be claiming and receiving far more than it is. Is speedstep kicking in?


Speedstep turns down the speed of your CPUs when they're not in use. An idle priority task (rosetta) might not qualify as 'in use' and so speedstep might be kicking in. Easiest way to tell would be to run cpuz - http://www.cpuid.com/cpuz.php which will show you the speed of your CPUs in real-time. If it varies depending on what you're doing then Speedstep is enabled and might be the cause of the low scores/throughput. You'll be able to change the settings for that in the BIOS but I think there's a way to control it from Windows too.

You're almost certainly right about the L1/L2 issue too - it looks like BOINC is reporting the wrong number!

HTH

37) Message boards : Number crunching : Strange problem with dual Xeon machine (Message 52236)
Posted 4 Apr 2008 by Dusty
Post:
Regarding "Speedstep," I checked my CMOS, and Speedstep is currently disabled. Do you recommend I enable it?

if it is that machine (E5430 Xeon) then it has 12MB L2 cache which it looks like BOINC can't read/report correctly. It doesn't support HT - it's a true 2x4 core machine.

There is something wrong with the credit though - it should be claiming and receiving far more than it is. Is speedstep kicking in?

38) Message boards : Number crunching : Strange problem with dual Xeon machine (Message 52234)
Posted 4 Apr 2008 by Dusty
Post:
Checking the CMOS, the CPUs are being reported as having a 128k L1 Cache and a 12MB L2 cache. Perhaps that's why BOINC is reporting the system as having a 122K cache?

I presume this is the host you are referring to??

I suspect that the tasks that were active at the time you rebooted the machine had not reached a checkpoint yet, and therefore none of their work had been preserved. Some of the tasks being worked on lately are not able to reach a checkpoint very often, and so this further increases the odds of you seeing all 8 starting over at the same time. The good news is that as long as the machine is eventually able to run BOINC long enough, it will recover and continue normally with no intervention. And if this should happen to keep up due to how you use that machine, Rosetta will abort the tasks after 5 restarts in a row with no progress being made. And so then perhaps the new work you get will be able to checkpoint more frequently.

It looks like that host has a very very small L2 cache on the CPU. Indeed it's reporting in with 122K... not Meg, but K! In a nutshell, Rosetta seems to run better on processors with more L2 cache memory. And so your benchmark results are not in line with the actual results you are able to produce when crunching the Rosetta WUs.

This sort of thing is part of why you see so much talk on the internet about one processor being more powerful then another. It's not just as simple as comparing Ghz, and actual performance is very dependant on the type of work you measure the processor doing.

39) Message boards : Number crunching : Strange problem with dual Xeon machine (Message 52233)
Posted 4 Apr 2008 by Dusty
Post:
I'm not sure what "speedstep" is. Where would I see that reported as 'kicking in?'

if it is that machine (E5430 Xeon) then it has 12MB L2 cache which it looks like BOINC can't read/report correctly. It doesn't support HT - it's a true 2x4 core machine.

There is something wrong with the credit though - it should be claiming and receiving far more than it is. Is speedstep kicking in?

40) Message boards : Number crunching : Strange problem with dual Xeon machine (Message 52232)
Posted 4 Apr 2008 by Dusty
Post:
Yes, that is the computer. They are two, Quad-core 45nm Hi-K processors with 12MB L2 cache, so I'm really not sure why they are being reported as only having 122K of cache. I'm running them on an ASUS DSEB-D16/SAS Enterprise Server Mobo which has the Intel 5400 chipset. I'm running 16 Gig of RAM on that Mobo, which is probably overkill and doesn't benefit BOINC much. Also, I'm using the latest (5.10.45) version of BOINC. Perhaps there is something in the BIOS that I'm not seeing that is improperly reporting the CPU L2 cache.

Thanks for the information on "checkpoints." I wasn't aware of those. I've rebooted the system several times since getting it online, so that must have been why it kept resetting the processes to zero. Thanks for the clarification!

Mark

I presume this is the host you are referring to??

I suspect that the tasks that were active at the time you rebooted the machine had not reached a checkpoint yet, and therefore none of their work had been preserved. Some of the tasks being worked on lately are not able to reach a checkpoint very often, and so this further increases the odds of you seeing all 8 starting over at the same time. The good news is that as long as the machine is eventually able to run BOINC long enough, it will recover and continue normally with no intervention. And if this should happen to keep up due to how you use that machine, Rosetta will abort the tasks after 5 restarts in a row with no progress being made. And so then perhaps the new work you get will be able to checkpoint more frequently.

It looks like that host has a very very small L2 cache on the CPU. Indeed it's reporting in with 122K... not Meg, but K! In a nutshell, Rosetta seems to run better on processors with more L2 cache memory. And so your benchmark results are not in line with the actual results you are able to produce when crunching the Rosetta WUs.

This sort of thing is part of why you see so much talk on the internet about one processor being more powerful then another. It's not just as simple as comparing Ghz, and actual performance is very dependant on the type of work you measure the processor doing.



Previous 20 · Next 20



©2024 University of Washington
https://www.bakerlab.org