All FFD_ units ending with Validate error

Message boards : Number crunching : All FFD_ units ending with Validate error

To post messages, you must log in.

Previous · 1 · 2 · 3 · Next

AuthorMessage
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2059
Credit: 40,463,815
RAC: 1,453
Message 78758 - Posted: 11 Sep 2015, 1:47:14 UTC - in response to Message 78752.  

Is it planned to run the routine to give credits to the units with Validate error?

thanks

It seem to run every day. No idea why it never shows on the summary page - that is annoying, I have to agree. It does show when you check each individual task though - an amount equal to the claimed credit.
ID: 78758 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Trotador

Send message
Joined: 30 May 09
Posts: 108
Credit: 291,214,977
RAC: 1,882
Message 78760 - Posted: 11 Sep 2015, 5:16:47 UTC - in response to Message 78758.  

Is it planned to run the routine to give credits to the units with Validate error?

thanks

It seem to run every day. No idea why it never shows on the summary page - that is annoying, I have to agree. It does show when you check each individual task though - an amount equal to the claimed credit.


For me it has not run in all the week, I have validate error wus with no credit as old as September 6. In total close to 300 wus.
ID: 78760 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Trotador

Send message
Joined: 30 May 09
Posts: 108
Credit: 291,214,977
RAC: 1,882
Message 78761 - Posted: 11 Sep 2015, 5:22:17 UTC - in response to Message 78757.  
Last modified: 11 Sep 2015, 5:22:34 UTC

Is it planned to run the routine to give credits to the units with Validate error?

thanks


If you check the task details, you will see the granted credit. For whatever reason, tasks given credit for errors do not show the granted credit on the task summary page.


I see now what you say, yes, the credit is granted in the task details page, so it is OK, weird that they do not show in the summary page.

thanks
ID: 78761 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
krypton
Volunteer moderator
Project developer
Project scientist

Send message
Joined: 16 Nov 11
Posts: 108
Credit: 2,164,309
RAC: 0
Message 78762 - Posted: 11 Sep 2015, 5:57:32 UTC

We'll check with DK about this tomorrow!
ID: 78762 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Betting Slip

Send message
Joined: 26 Sep 05
Posts: 71
Credit: 5,702,246
RAC: 0
Message 78764 - Posted: 11 Sep 2015, 7:31:46 UTC
Last modified: 11 Sep 2015, 7:48:21 UTC

I have had many of these units and because my runtime is 1 day I have lost 17 days of crunching and that's only those that remain in my list of WU's over 3 machines. I'm not in this for the credit but I would normally get well over 1000 credits for each day of prcessing so "300" is not good enough.

You are supposed to test when you change something, you didn't! It's little wonder that a lot of people sign up but don't stay long as a lot of "new" users to BOINC have "zero" tolerance to errors, they don't know what's going on and so abandon the project and you are always just left with the "hardcore" such as myself, who stick with you through "thick and thin" but you are trying my patience.

You don't write, you don't call, there is little information from you, yet you still expect us to crunch blindly, hopefully and obediently.

I often wonder how much belief there really is in BOINC and this and other projects. I think we are just a testbed and scientists have little confidence in BOINC and the public at large.

We're just a resource, that's what you think isn't it?

EDIT to add:

I think if this project and it's users were important to scientists we would not be waiting for optimizations in code to take advantage of extensions that have been around in CPU's for years if not decades such as SSE, SSE2, SSE3, SSE4 etc. Afterall these would reap you very generous rewards in computing power from existing users but you still haven't done it have you. It seems the idea of even using SSE has just crept int your consciousness.

Now if Rosetta, its users and its results were as important as you often say they are surely you would have done these things, in fact it beggars belief that you haven't and still maintain we and our results are of any importance.
ID: 78764 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Timo
Avatar

Send message
Joined: 9 Jan 12
Posts: 185
Credit: 45,649,459
RAC: 0
Message 78765 - Posted: 11 Sep 2015, 14:02:35 UTC
Last modified: 11 Sep 2015, 14:05:08 UTC

*sigh* I wish there was a way to down-vote a post, Betting Slip certainly does not represent any of my views. David, Sergey, and countless others who take time out of their day to visit these forums and even collaborate directly with some of us - you guys all do a fantastic job!

I know from my day job how challenging software development is - add bioinformatics onto that and wow.. Bioinformatics is hard, some of it is even NP-hard (wow a pun).

Keep calm and crunch on! XD
ID: 78765 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 78767 - Posted: 11 Sep 2015, 15:28:57 UTC

I would just point out that the Rosetta development community is comprised of bioinformatics and physics researchers that are focused on accuracy of prediction, and less focused on development methodologies, hardware technologies, state-of-the-art practices etc. This exemplifies why cross-discipline collaboration is so important in the research community.

The research paper each is working on at the time will be no better with better compiler options or etc. So, I would take the recent pursuit of such improvements as being above-and-beyond their scope and mission, and am encouraged to be seeing it.
Rosetta Moderator: Mod.Sense
ID: 78767 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1963
Credit: 9,099,488
RAC: 551
Message 78769 - Posted: 11 Sep 2015, 18:15:13 UTC - in response to Message 78767.  

The research paper each is working on at the time will be no better with better compiler options or etc.


??
With better compiler options you have faster app, so more science in less time
ID: 78769 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Betting Slip

Send message
Joined: 26 Sep 05
Posts: 71
Credit: 5,702,246
RAC: 0
Message 78772 - Posted: 11 Sep 2015, 21:58:42 UTC - in response to Message 78767.  


The research paper each is working on at the time will be no better with better compiler options or etc. So, I would take the recent pursuit of such improvements as being above-and-beyond their scope and mission, and am encouraged to be seeing it.


What you are saying is that if Rosetta were "faster" more "efficient" increasing throughput of WU's would have no effect on each individual scientists work.

How can the improvement of their own project be "above-and-beyond their scope and mission"?

My post was about more than an improvement to compiler options but you obviously didn't understand or chose to ignore.
ID: 78772 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2059
Credit: 40,463,815
RAC: 1,453
Message 78775 - Posted: 12 Sep 2015, 2:40:49 UTC - in response to Message 78761.  

Is it planned to run the routine to give credits to the units with Validate error?

thanks

If you check the task details, you will see the granted credit. For whatever reason, tasks given credit for errors do not show the granted credit on the task summary page.

I see now what you say, yes, the credit is granted in the task details page, so it is OK, weird that they do not show in the summary page.

thanks

Weird... and a little annoying... but certainly confusing, which results in unneccessary complaints, like we're seeing <sigh>
ID: 78775 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2059
Credit: 40,463,815
RAC: 1,453
Message 78776 - Posted: 12 Sep 2015, 2:54:39 UTC - in response to Message 78764.  

I'm not in this for the credit but I would normally get well over 1000 credits for each day of processing so "300" is not good enough.

I was going to say this is wrong, but I've just checked and it's right. What <I've> been saying is wrong - though in my defence I'd never seen it before.

Where I've been saying granted credit is given equal to claimed credit, the day-long tasks Betting Slip has that are coming up with a Validation Error are claiming over 500 or over 800 but granting only a flat 300.

This certainly shouldn't be happening.
ID: 78776 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 78797 - Posted: 15 Sep 2015, 16:47:35 UTC

@Betting Slip

I am just trying to point out that the (very pertinent and valid) issues you are raising is sort of similar to asking the automotive engineer to resolve the issue with the odor the production cars have in the seat cushions or a flaw with the specific tires that are used. It is just not how they spend the majority of their time. They are more focused on the MPG, aerodynamics, and other aspects. Such runtime variations increase the complexity of the code they are constantly trying to rework in ways to produce better protein models. So there are also some valid reasons for keeping the codebase as simple as possible.

So, I'm not trying to say such issues should not be pursued, just that it is going to be a paradigm shift and likely require some skills development, and the community of developers involved is not all directly under the control of any one person. What the community contributes, comes as it comes.

I'll also take this opp. to point out that I am not on the Project Team, simply a volunteer cruncher, with one person's perspective.
Rosetta Moderator: Mod.Sense
ID: 78797 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1829
Credit: 119,133,504
RAC: 2,487
Message 78799 - Posted: 15 Sep 2015, 22:18:45 UTC - in response to Message 78797.  

This is probably getting off-topic now, but to pre-empt a response and summarise as per my understanding, I think the point from the other threads here and on Ralph, are:

It is generally(?) accepted/understood that code overhauls which would be required for parallel execution like GPGPU, are not practical given the nature of the code, the fact that it is continually evolving, and the development time required for that (assuming it were viable/useful).

The suggestion to make use of modern compiler options to make use of modern CPU extensions for free, is the alternative which requires no code changes, but does require some dev time to a) implement it, b) test it and c) fix it when it inevitably results in something unexpected.

Visual Studio 2015 is out now, which I think was one of the issues for testing AVX2 on the code (from here: https://boinc.bakerlab.org/rosetta/forum_thread.php?id=6645)

Regardless of any of the above though, it is completely unfair for anyone to suggest what the project team should be doing with their time, given that none of us outside the lab have any idea what priorities will produce the best/quickest scientific results.
ID: 78799 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Dr. Merkwürdigliebe
Avatar

Send message
Joined: 5 Dec 10
Posts: 81
Credit: 2,657,273
RAC: 0
Message 78801 - Posted: 16 Sep 2015, 7:32:01 UTC - in response to Message 78797.  



So, I'm not trying to say such issues should not be pursued, just that it is going to be a paradigm shift and likely require some skills development, and the community of developers involved is not all directly under the control of any one person. What the community contributes, comes as it comes.


I don't quite get it. We already had a 64-Bit version with SSEwhatever enabled and a significant speedup.

Has this app yielded false results or were there just ancients CPUs choking on SSE?
ID: 78801 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
rjs5

Send message
Joined: 22 Nov 10
Posts: 273
Credit: 22,487,487
RAC: 1,032
Message 78802 - Posted: 16 Sep 2015, 14:15:28 UTC - in response to Message 78801.  



So, I'm not trying to say such issues should not be pursued, just that it is going to be a paradigm shift and likely require some skills development, and the community of developers involved is not all directly under the control of any one person. What the community contributes, comes as it comes.


I don't quite get it. We already had a 64-Bit version with SSEwhatever enabled and a significant speedup.

Has this app yielded false results or were there just ancients CPUs choking on SSE


I would be interested in the number of 32-bit hosts and number of ACTIVE 32-bit hosts. I suspect it is low but I would not be too surprised if it was significant. If the work of 32-bit hosts was below the percentage of improvement seen by a 64-bit optimization, then it might make sense to release only a 64-bit version .... if the decision was made to forego the 32-bit hosts. I thought the improvement was near 20% (from memory).


Compiling with any 64-bit option will cause the compiler to generate code that uses the SSE2 (or newer) instructions BUT!!! Rosetta code is compiling into SCALAR (one FP operation per instruction) code and not VECTOR (multiple FP operations per instruction) code. That means there would be little, if any, performance gain from AVX, AVX2, AVXn, ... Rosetta code will still just crunch one number at a time. IMO, it is not worth the "illegal opcode" errors that will result. If the Rosetta@Home team is incapable of generating a simple 64-bit -O -mSSE2 version, then building anything more complex is beyond their skill level.

It is pretty easy to break VECTOR code so the compiler is forced to generate SCALAR code again. With all the people making Rosetta changes, it would be really tough to clean up the coding problems to keep the compiled binary parallel.

There is some comfort in just having the Rosetta code consistently "work". I just put Rosetta on 10% resource setting and devote just a fraction of a host to them. It helps cool off my CPU from running the modern projects.


ID: 78802 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Chilean
Avatar

Send message
Joined: 16 Oct 05
Posts: 711
Credit: 26,694,507
RAC: 0
Message 78803 - Posted: 16 Sep 2015, 15:43:40 UTC - in response to Message 78802.  



So, I'm not trying to say such issues should not be pursued, just that it is going to be a paradigm shift and likely require some skills development, and the community of developers involved is not all directly under the control of any one person. What the community contributes, comes as it comes.


I don't quite get it. We already had a 64-Bit version with SSEwhatever enabled and a significant speedup.

Has this app yielded false results or were there just ancients CPUs choking on SSE


I would be interested in the number of 32-bit hosts and number of ACTIVE 32-bit hosts. I suspect it is low but I would not be too surprised if it was significant. If the work of 32-bit hosts was below the percentage of improvement seen by a 64-bit optimization, then it might make sense to release only a 64-bit version .... if the decision was made to forego the 32-bit hosts. I thought the improvement was near 20% (from memory).


Compiling with any 64-bit option will cause the compiler to generate code that uses the SSE2 (or newer) instructions BUT!!! Rosetta code is compiling into SCALAR (one FP operation per instruction) code and not VECTOR (multiple FP operations per instruction) code. That means there would be little, if any, performance gain from AVX, AVX2, AVXn, ... Rosetta code will still just crunch one number at a time. IMO, it is not worth the "illegal opcode" errors that will result. If the Rosetta@Home team is incapable of generating a simple 64-bit -O -mSSE2 version, then building anything more complex is beyond their skill level.

It is pretty easy to break VECTOR code so the compiler is forced to generate SCALAR code again. With all the people making Rosetta changes, it would be really tough to clean up the coding problems to keep the compiled binary parallel.

There is some comfort in just having the Rosetta code consistently "work". I just put Rosetta on 10% resource setting and devote just a fraction of a host to them. It helps cool off my CPU from running the modern projects.




The Rosetta code works really well at what it is intended to do; one of the best, if not, the best tertiary protein structure predictor out there. It isn't optimized to squeeze everything out of modern CPUs unfortunately.
ID: 78803 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Robby1959

Send message
Joined: 10 May 07
Posts: 38
Credit: 9,298,741
RAC: 0
Message 78838 - Posted: 21 Sep 2015, 1:13:12 UTC

I too have a computer that has lots of validate errors . it does more work units and gets fewer points than my other system.
ID: 78838 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1963
Credit: 9,099,488
RAC: 551
Message 78844 - Posted: 22 Sep 2015, 9:21:16 UTC - in response to Message 78802.  

I would be interested in the number of 32-bit hosts and number of ACTIVE 32-bit hosts. I suspect it is low but I would not be too surprised if it was significant. If the work of 32-bit hosts was below the percentage of improvement seen by a 64-bit optimization, then it might make sense to release only a 64-bit version .... if the decision was made to forego the 32-bit hosts. I thought the improvement was near 20% (from memory).


In firsts 1000 top hosts no 32 bit OS.
So, a simple 10% more for these rac monsters outclasses all the 32 bit power.


ID: 78844 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Link
Avatar

Send message
Joined: 4 May 07
Posts: 356
Credit: 382,349
RAC: 0
Message 78845 - Posted: 23 Sep 2015, 16:27:47 UTC - in response to Message 78844.  
Last modified: 23 Sep 2015, 16:28:29 UTC

In firsts 1000 top hosts no 32 bit OS.

On the first few pages this might be right, although it's hard to tell what all those Linux machines are. Anyway, there are still many powerfull machines using 32-bit OS, example host 1651904, currently with a RAC of almost 5000 on rank 250, ist using Windows 7 x86. That's not really slow.



So, a simple 10% more for these rac monsters outclasses all the 32 bit power.

You might want to explain how exactly you get this number.
.
ID: 78845 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Dr. Merkwürdigliebe
Avatar

Send message
Joined: 5 Dec 10
Posts: 81
Credit: 2,657,273
RAC: 0
Message 78846 - Posted: 23 Sep 2015, 17:04:40 UTC - in response to Message 78845.  

In firsts 1000 top hosts no 32 bit OS.

On the first few pages this might be right, although it's hard to tell what all those Linux machines are. Anyway, there are still many powerfull machines using 32-bit OS, example host 1651904, currently with a RAC of almost 5000 on rank 250, ist using Windows 7 x86. That's not really slow.

No way. That's one craptastic CPU. No way. Decimal points?!
ID: 78846 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 · 2 · 3 · Next

Message boards : Number crunching : All FFD_ units ending with Validate error



©2024 University of Washington
https://www.bakerlab.org