cpu optimization

Message boards : Number crunching : cpu optimization

To post messages, you must log in.

Previous · 1 · 2

AuthorMessage
FluffyChicken
Avatar

Send message
Joined: 1 Nov 05
Posts: 1260
Credit: 369,635
RAC: 0
Message 52371 - Posted: 10 Apr 2008, 20:37:57 UTC

iirc as mentioned above somewhere, they needed to have it all recoded in a modern language (C of some type) to make it effective as the old bodged together fortran/c code was restricting things. I assume this is what mini-rosetta is and it'll be tested there..

But what no developers, the project coders chiming in to talk about it .... :(
Team mauisun.org
ID: 52371 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Paul D. Buck

Send message
Joined: 17 Sep 05
Posts: 815
Credit: 1,812,737
RAC: 0
Message 52372 - Posted: 10 Apr 2008, 20:41:41 UTC - in response to Message 52371.  

But what no developers, the project coders chiming in to talk about it .... :(

Would you rather they were here spending time... or coding?

I'd pick coding ... :)

But that is just me ...
ID: 52372 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
The_Bad_Penguin
Avatar

Send message
Joined: 5 Jun 06
Posts: 2751
Credit: 4,271,025
RAC: 0
Message 52377 - Posted: 11 Apr 2008, 4:35:53 UTC - in response to Message 52372.  
Last modified: 11 Apr 2008, 4:46:42 UTC

@Paul D. Buck - From one of my other postings...

Can you really have your cake and eat it too? :-0

How can the Project justify expending programming resources for this, under the criteria you have advocated in your various posts in this thread?

And if they can do that, then why can't they spend 15 mins / week here?

For example, how was this possible?

"When David Baker, who also serves as a principal investigator for Howard Hughes Medical Institute, originally developed the code, it had to be run in serial - broken into manageable amounts of data, with each portion calculated in series, one after another.

Through a research collaboration, SDSC's expertise and supercomputing resources helped modify the Rosetta code to run in parallel on SDSC's massive supercomputers, dramatically speeding processing, and providing a testing ground for running the code on the world's fastest non-classified computer.
ID: 52377 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Paul D. Buck

Send message
Joined: 17 Sep 05
Posts: 815
Credit: 1,812,737
RAC: 0
Message 52380 - Posted: 11 Apr 2008, 14:46:20 UTC - in response to Message 52377.  

@Paul D. Buck - From one of my other postings...

Can you really have your cake and eat it too? :-0

How can the Project justify expending programming resources for this, under the criteria you have advocated in your various posts in this thread?

And if they can do that, then why can't they spend 15 mins / week here?

For example, how was this possible?

"When David Baker, who also serves as a principal investigator for Howard Hughes Medical Institute, originally developed the code, it had to be run in serial - broken into manageable amounts of data, with each portion calculated in series, one after another.

Through a research collaboration, SDSC's expertise and supercomputing resources helped modify the Rosetta code to run in parallel on SDSC's massive supercomputers, dramatically speeding processing, and providing a testing ground for running the code on the world's fastest non-classified computer.

Different person ...

Different problem ...

And I did not say I did not want project participation on the boards, as a matter of record I was one of the first, if not the first to point out that RAH, which in the initial stages, was one of the best projects for having project participation on the boards ... would lose that interest within 6 months ... I was off by 2 months ... their presence waned in 4 months ...

I have been one of the few, if not the only, person in the participant community that has tried to interest the projects in mobilizing and using their participants into a usable tool rather than a "passive" herd of cows in their farm of computers ...

But, I was labeled loony for even suggesting that there is such a thing as a BOINC Community ... yet, there is, and we and the projects all belong to it ... A rather lawless place as my recent metaphor goes in that the projects can, and do, act like the old western gunfighters and we, the cowed and subservient townspeople ...

The fact that some, if not most, of the gunslingers are benign does not make them any less what they are ... and you cannot always tell by looking ... look at a picture of "Billy the Kid" and he does not look the part either ...
ID: 52380 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
The_Bad_Penguin
Avatar

Send message
Joined: 5 Jun 06
Posts: 2751
Credit: 4,271,025
RAC: 0
Message 52382 - Posted: 11 Apr 2008, 15:01:49 UTC

@Paul D. Buck: I was orignially referring to Post 52339.

I think we are basically on the same page as far as community and communication, and I respect your background.

We can respectfully disagree as to where the "break-even" point is, in regards to making an effort to evaluate the potential impact of new technology on the Project.
ID: 52382 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Paul D. Buck

Send message
Joined: 17 Sep 05
Posts: 815
Credit: 1,812,737
RAC: 0
Message 52386 - Posted: 11 Apr 2008, 18:06:27 UTC - in response to Message 52382.  

We can respectfully disagree as to where the "break-even" point is, in regards to making an effort to evaluate the potential impact of new technology on the Project.

Agreed... :)

One of the hardest concepts I tried to "sell" was the thought that we can disagree with out being disagreeable. At the moment, I think we are still above the threshold (at least I HOPE I have not dipped below... I know no one has offended me ...)

I was only trying to make the point that I want the coders coding and testing. Not so much talking to me.

Historically, the spokespersons for the projects have tended to be the moderators and (at least in the past) volunteers like myself and others who tried to read EVERYTHING so we could answer questions.

In this case, particularly when there is (apparently) a new version in the wings is not the right time to be thinking about specific CPU optimization efforts. I have tended to be a little "shorter" in that I find I am tiring faster so ... perhaps welcome relief to those that never cared for my, ahem, longwindedness.

Anyway. Optimizations have their place, it is just, unfortunately not as simple as we would like ...

Just as an example, the brevety of an SQL statement is by no means an indication that the query will execute quickly (one i worked on was 4:20 and some seconds 4.5 plus million rows to pull two). Nor that one that fills two pages of 11x14 paper will be slow (15 ms) ...

Anyway, *MY* main intent was to attempt to fill-in the background so that the "why not" might be a little easier to understand ... because ... it seems so simple ... :)
ID: 52386 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Michael G.R.

Send message
Joined: 11 Nov 05
Posts: 264
Credit: 11,247,510
RAC: 0
Message 52683 - Posted: 24 Apr 2008, 15:55:47 UTC

"Version 1.15 of minirosetta is now released, which includes protocols for doing very fast optimization of multiple starting structures, improved methods for modeling protein structures based on constraints, and faster versions of our all-atom refinement protocol."

Seems like this new version does have some of the optimizations that have been mentioned here (not SSE, but better algorhythms).
ID: 52683 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
The_Bad_Penguin
Avatar

Send message
Joined: 5 Jun 06
Posts: 2751
Credit: 4,271,025
RAC: 0
Message 52805 - Posted: 30 Apr 2008, 12:39:14 UTC

Intel's upcoming AVX instructions go back to the future

This past IDF, Intel announced its next major X86 instruction set facelift - the Advanced Vector Extensions, or AVX. What's the big deal, you may ask? After all, there were countless MMX and SSE rounds till this day.

Well, it is a nice 3-operand, RISC-like approach, so you can finally do A+B=C in a single opcode: AMD's proposed SSE5 is supposed to go along this line as well. Later, with fused multiply-adds, this could even become A*B+C=D. Then, you got a more efficient instruction format with a lot of baggage (and length) reduced - again, one of major problems of X86 on efficient fixed-opcode length RISCs. No need to mention the good ship Itanic here, it can have more instruction FORMATS than some RISCs have instructions - that's how 'elegant' it is.

Then, AVX doubles the SSE register length to 256 bits - doubling the amount of data fitting in and, matched with doubled data paths, providing twice the FP throughput per clock in the Sandy Bridge CPU some two years from now. And, one day maybe, you could fit two quad-precision 128-bit FP numbers into each of these registers. Marvellous!
ID: 52805 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile (_KoDAk_)

Send message
Joined: 18 Jul 06
Posts: 109
Credit: 1,859,263
RAC: 0
Message 52843 - Posted: 3 May 2008, 7:54:12 UTC
Last modified: 3 May 2008, 7:56:25 UTC

AVX it is Geat,
but it will be in use, in 1.5 - 2 year when "4x core CPU with AVX" will cost 200-300$
and "CPU with AVX" will demands new MBRAM ((
ID: 52843 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
The_Bad_Penguin
Avatar

Send message
Joined: 5 Jun 06
Posts: 2751
Credit: 4,271,025
RAC: 0
Message 52845 - Posted: 3 May 2008, 11:08:56 UTC - in response to Message 52843.  

is that enough advanced notice for the progrgammers to have an optimized Rosie app ready ?!

ID: 52845 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
neil.hunter14

Send message
Joined: 9 May 06
Posts: 10
Credit: 278,867
RAC: 0
Message 53475 - Posted: 31 May 2008, 6:22:01 UTC - in response to Message 52845.  

is that enough advanced notice for the progrgammers to have an optimized Rosie app ready ?!




I've just hopped onto this thread, and flicked through it quickly. Seems like there's a catch-22 going on. Developers won't spend huge amounts of time optimising Code for the latest CPUs if there are many takers. But there won't be many takers until the Code yields significant performance benefits.

My personal view is that Project Developers (not just R@H) should keep on top of the current state-of-play. At the end of the day, we are using our PCs (and electricity bills) to support their research, so they should do their utmost to keep us interested and onboard their Project. Using a Marketing tool such as "optimised for SSE4 etc", would keep us going!!!

Neil.
ID: 53475 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1832
Credit: 119,860,059
RAC: 7,494
Message 53479 - Posted: 31 May 2008, 11:47:00 UTC - in response to Message 53475.  
Last modified: 31 May 2008, 11:51:04 UTC

is that enough advanced notice for the progrgammers to have an optimized Rosie app ready ?!




I've just hopped onto this thread, and flicked through it quickly. Seems like there's a catch-22 going on. Developers won't spend huge amounts of time optimising Code for the latest CPUs if there are many takers. But there won't be many takers until the Code yields significant performance benefits.

My personal view is that Project Developers (not just R@H) should keep on top of the current state-of-play. At the end of the day, we are using our PCs (and electricity bills) to support their research, so they should do their utmost to keep us interested and onboard their Project. Using a Marketing tool such as "optimised for SSE4 etc", would keep us going!!!

Neil.


Unfortunately, the point is that the code isn't static and therefore 'optimisation' isn't a one-time job. It would possibly need to be re-done (at least in-part) after every new Rosetta release.

The guys at the lab wouldn't need to be involved with optimisation of the app anyway though - it could be (and has been) attempted by others. I'd suggest the reason no-one has been successful is because it's not a trivial job.

The final point is that it might be very well optimised already - maybe SSE4 would help, but maybe it wouldn't. I think the general consensus was that SSE1-3 wouldn't have helped.
ID: 53479 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 · 2

Message boards : Number crunching : cpu optimization



©2024 University of Washington
https://www.bakerlab.org