All FFD_ units ending with Validate error

Author	Message
Sid Celery Send message Joined: 11 Feb 08 Posts: 2465 Credit: 46,464,996 RAC: 274	Message 78758 - Posted: 11 Sep 2015, 1:47:14 UTC - in response to Message 78752. Is it planned to run the routine to give credits to the units with Validate error? thanks It seem to run every day. No idea why it never shows on the summary page - that is annoying, I have to agree. It does show when you check each individual task though - an amount equal to the claimed credit. ID: 78758 · Rating: 0 · rate: / Reply Quote

Trotador Send message Joined: 30 May 09 Posts: 108 Credit: 302,521,161 RAC: 1	Message 78760 - Posted: 11 Sep 2015, 5:16:47 UTC - in response to Message 78758. Is it planned to run the routine to give credits to the units with Validate error? thanks It seem to run every day. No idea why it never shows on the summary page - that is annoying, I have to agree. It does show when you check each individual task though - an amount equal to the claimed credit. For me it has not run in all the week, I have validate error wus with no credit as old as September 6. In total close to 300 wus. ID: 78760 · Rating: 0 · rate: / Reply Quote

Trotador Send message Joined: 30 May 09 Posts: 108 Credit: 302,521,161 RAC: 1	Message 78761 - Posted: 11 Sep 2015, 5:22:17 UTC - in response to Message 78757. Last modified: 11 Sep 2015, 5:22:34 UTC Is it planned to run the routine to give credits to the units with Validate error? thanks If you check the task details, you will see the granted credit. For whatever reason, tasks given credit for errors do not show the granted credit on the task summary page. I see now what you say, yes, the credit is granted in the task details page, so it is OK, weird that they do not show in the summary page. thanks ID: 78761 · Rating: 0 · rate: / Reply Quote

krypton Volunteer moderator Project developer Project scientist Send message Joined: 16 Nov 11 Posts: 108 Credit: 2,164,309 RAC: 0	Message 78762 - Posted: 11 Sep 2015, 5:57:32 UTC We'll check with DK about this tomorrow! ID: 78762 · Rating: 0 · rate: / Reply Quote

Betting Slip Send message Joined: 26 Sep 05 Posts: 71 Credit: 5,702,246 RAC: 0	Message 78764 - Posted: 11 Sep 2015, 7:31:46 UTC Last modified: 11 Sep 2015, 7:48:21 UTC I have had many of these units and because my runtime is 1 day I have lost 17 days of crunching and that's only those that remain in my list of WU's over 3 machines. I'm not in this for the credit but I would normally get well over 1000 credits for each day of prcessing so "300" is not good enough. You are supposed to test when you change something, you didn't! It's little wonder that a lot of people sign up but don't stay long as a lot of "new" users to BOINC have "zero" tolerance to errors, they don't know what's going on and so abandon the project and you are always just left with the "hardcore" such as myself, who stick with you through "thick and thin" but you are trying my patience. You don't write, you don't call, there is little information from you, yet you still expect us to crunch blindly, hopefully and obediently. I often wonder how much belief there really is in BOINC and this and other projects. I think we are just a testbed and scientists have little confidence in BOINC and the public at large. We're just a resource, that's what you think isn't it? EDIT to add: I think if this project and it's users were important to scientists we would not be waiting for optimizations in code to take advantage of extensions that have been around in CPU's for years if not decades such as SSE, SSE2, SSE3, SSE4 etc. Afterall these would reap you very generous rewards in computing power from existing users but you still haven't done it have you. It seems the idea of even using SSE has just crept int your consciousness. Now if Rosetta, its users and its results were as important as you often say they are surely you would have done these things, in fact it beggars belief that you haven't and still maintain we and our results are of any importance. ID: 78764 · Rating: 0 · rate: / Reply Quote

Timo Send message Joined: 9 Jan 12 Posts: 185 Credit: 45,661,974 RAC: 0	Message 78765 - Posted: 11 Sep 2015, 14:02:35 UTC Last modified: 11 Sep 2015, 14:05:08 UTC sigh I wish there was a way to down-vote a post, Betting Slip certainly does not represent any of my views. David, Sergey, and countless others who take time out of their day to visit these forums and even collaborate directly with some of us - you guys all do a fantastic job! I know from my day job how challenging software development is - add bioinformatics onto that and wow.. Bioinformatics is hard, some of it is even NP-hard (wow a pun). Keep calm and crunch on! XD ID: 78765 · Rating: 0 · rate: / Reply Quote

Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0	Message 78767 - Posted: 11 Sep 2015, 15:28:57 UTC I would just point out that the Rosetta development community is comprised of bioinformatics and physics researchers that are focused on accuracy of prediction, and less focused on development methodologies, hardware technologies, state-of-the-art practices etc. This exemplifies why cross-discipline collaboration is so important in the research community. The research paper each is working on at the time will be no better with better compiler options or etc. So, I would take the recent pursuit of such improvements as being above-and-beyond their scope and mission, and am encouraged to be seeing it. Rosetta Moderator: Mod.Sense ID: 78767 · Rating: 0 · rate: / Reply Quote

[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 2118 Credit: 12,390,943 RAC: 265	Message 78769 - Posted: 11 Sep 2015, 18:15:13 UTC - in response to Message 78767. The research paper each is working on at the time will be no better with better compiler options or etc. ?? With better compiler options you have faster app, so more science in less time ID: 78769 · Rating: 0 · rate: / Reply Quote

Betting Slip Send message Joined: 26 Sep 05 Posts: 71 Credit: 5,702,246 RAC: 0	Message 78772 - Posted: 11 Sep 2015, 21:58:42 UTC - in response to Message 78767. The research paper each is working on at the time will be no better with better compiler options or etc. So, I would take the recent pursuit of such improvements as being above-and-beyond their scope and mission, and am encouraged to be seeing it. What you are saying is that if Rosetta were "faster" more "efficient" increasing throughput of WU's would have no effect on each individual scientists work. How can the improvement of their own project be "above-and-beyond their scope and mission"? My post was about more than an improvement to compiler options but you obviously didn't understand or chose to ignore. ID: 78772 · Rating: 0 · rate: / Reply Quote

Sid Celery Send message Joined: 11 Feb 08 Posts: 2465 Credit: 46,464,996 RAC: 274	Message 78775 - Posted: 12 Sep 2015, 2:40:49 UTC - in response to Message 78761. Is it planned to run the routine to give credits to the units with Validate error? thanks If you check the task details, you will see the granted credit. For whatever reason, tasks given credit for errors do not show the granted credit on the task summary page. I see now what you say, yes, the credit is granted in the task details page, so it is OK, weird that they do not show in the summary page. thanks Weird... and a little annoying... but certainly confusing, which results in unneccessary complaints, like we're seeing <sigh> ID: 78775 · Rating: 0 · rate: / Reply Quote

Sid Celery Send message Joined: 11 Feb 08 Posts: 2465 Credit: 46,464,996 RAC: 274	Message 78776 - Posted: 12 Sep 2015, 2:54:39 UTC - in response to Message 78764. I'm not in this for the credit but I would normally get well over 1000 credits for each day of processing so "300" is not good enough. I was going to say this is wrong, but I've just checked and it's right. What <I've> been saying is wrong - though in my defence I'd never seen it before. Where I've been saying granted credit is given equal to claimed credit, the day-long tasks Betting Slip has that are coming up with a Validation Error are claiming over 500 or over 800 but granting only a flat 300. This certainly shouldn't be happening. ID: 78776 · Rating: 0 · rate: / Reply Quote

Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0	Message 78797 - Posted: 15 Sep 2015, 16:47:35 UTC @Betting Slip I am just trying to point out that the (very pertinent and valid) issues you are raising is sort of similar to asking the automotive engineer to resolve the issue with the odor the production cars have in the seat cushions or a flaw with the specific tires that are used. It is just not how they spend the majority of their time. They are more focused on the MPG, aerodynamics, and other aspects. Such runtime variations increase the complexity of the code they are constantly trying to rework in ways to produce better protein models. So there are also some valid reasons for keeping the codebase as simple as possible. So, I'm not trying to say such issues should not be pursued, just that it is going to be a paradigm shift and likely require some skills development, and the community of developers involved is not all directly under the control of any one person. What the community contributes, comes as it comes. I'll also take this opp. to point out that I am not on the Project Team, simply a volunteer cruncher, with one person's perspective. Rosetta Moderator: Mod.Sense ID: 78797 · Rating: 0 · rate: / Reply Quote

dcdc Send message Joined: 3 Nov 05 Posts: 1834 Credit: 124,260,318 RAC: 37	Message 78799 - Posted: 15 Sep 2015, 22:18:45 UTC - in response to Message 78797. This is probably getting off-topic now, but to pre-empt a response and summarise as per my understanding, I think the point from the other threads here and on Ralph, are: It is generally(?) accepted/understood that code overhauls which would be required for parallel execution like GPGPU, are not practical given the nature of the code, the fact that it is continually evolving, and the development time required for that (assuming it were viable/useful). The suggestion to make use of modern compiler options to make use of modern CPU extensions for free, is the alternative which requires no code changes, but does require some dev time to a) implement it, b) test it and c) fix it when it inevitably results in something unexpected. Visual Studio 2015 is out now, which I think was one of the issues for testing AVX2 on the code (from here: https://boinc.bakerlab.org/rosetta/forum_thread.php?id=6645) Regardless of any of the above though, it is completely unfair for anyone to suggest what the project team should be doing with their time, given that none of us outside the lab have any idea what priorities will produce the best/quickest scientific results. ID: 78799 · Rating: 0 · rate: / Reply Quote

Dr. Merkwürdigliebe Send message Joined: 5 Dec 10 Posts: 81 Credit: 2,657,273 RAC: 0	Message 78801 - Posted: 16 Sep 2015, 7:32:01 UTC - in response to Message 78797. So, I'm not trying to say such issues should not be pursued, just that it is going to be a paradigm shift and likely require some skills development, and the community of developers involved is not all directly under the control of any one person. What the community contributes, comes as it comes. I don't quite get it. We already had a 64-Bit version with SSEwhatever enabled and a significant speedup. Has this app yielded false results or were there just ancients CPUs choking on SSE? ID: 78801 · Rating: 0 · rate: / Reply Quote

rjs5 Send message Joined: 22 Nov 10 Posts: 274 Credit: 23,730,845 RAC: 0	Message 78802 - Posted: 16 Sep 2015, 14:15:28 UTC - in response to Message 78801. So, I'm not trying to say such issues should not be pursued, just that it is going to be a paradigm shift and likely require some skills development, and the community of developers involved is not all directly under the control of any one person. What the community contributes, comes as it comes. I don't quite get it. We already had a 64-Bit version with SSEwhatever enabled and a significant speedup. Has this app yielded false results or were there just ancients CPUs choking on SSE I would be interested in the number of 32-bit hosts and number of ACTIVE 32-bit hosts. I suspect it is low but I would not be too surprised if it was significant. If the work of 32-bit hosts was below the percentage of improvement seen by a 64-bit optimization, then it might make sense to release only a 64-bit version .... if the decision was made to forego the 32-bit hosts. I thought the improvement was near 20% (from memory). Compiling with any 64-bit option will cause the compiler to generate code that uses the SSE2 (or newer) instructions BUT!!! Rosetta code is compiling into SCALAR (one FP operation per instruction) code and not VECTOR (multiple FP operations per instruction) code. That means there would be little, if any, performance gain from AVX, AVX2, AVXn, ... Rosetta code will still just crunch one number at a time. IMO, it is not worth the "illegal opcode" errors that will result. If the Rosetta@Home team is incapable of generating a simple 64-bit -O -mSSE2 version, then building anything more complex is beyond their skill level. It is pretty easy to break VECTOR code so the compiler is forced to generate SCALAR code again. With all the people making Rosetta changes, it would be really tough to clean up the coding problems to keep the compiled binary parallel. There is some comfort in just having the Rosetta code consistently "work". I just put Rosetta on 10% resource setting and devote just a fraction of a host to them. It helps cool off my CPU from running the modern projects. ID: 78802 · Rating: 0 · rate: / Reply Quote

Chilean Send message Joined: 16 Oct 05 Posts: 711 Credit: 26,694,507 RAC: 0	Message 78803 - Posted: 16 Sep 2015, 15:43:40 UTC - in response to Message 78802. So, I'm not trying to say such issues should not be pursued, just that it is going to be a paradigm shift and likely require some skills development, and the community of developers involved is not all directly under the control of any one person. What the community contributes, comes as it comes. I don't quite get it. We already had a 64-Bit version with SSEwhatever enabled and a significant speedup. Has this app yielded false results or were there just ancients CPUs choking on SSE I would be interested in the number of 32-bit hosts and number of ACTIVE 32-bit hosts. I suspect it is low but I would not be too surprised if it was significant. If the work of 32-bit hosts was below the percentage of improvement seen by a 64-bit optimization, then it might make sense to release only a 64-bit version .... if the decision was made to forego the 32-bit hosts. I thought the improvement was near 20% (from memory). Compiling with any 64-bit option will cause the compiler to generate code that uses the SSE2 (or newer) instructions BUT!!! Rosetta code is compiling into SCALAR (one FP operation per instruction) code and not VECTOR (multiple FP operations per instruction) code. That means there would be little, if any, performance gain from AVX, AVX2, AVXn, ... Rosetta code will still just crunch one number at a time. IMO, it is not worth the "illegal opcode" errors that will result. If the Rosetta@Home team is incapable of generating a simple 64-bit -O -mSSE2 version, then building anything more complex is beyond their skill level. It is pretty easy to break VECTOR code so the compiler is forced to generate SCALAR code again. With all the people making Rosetta changes, it would be really tough to clean up the coding problems to keep the compiled binary parallel. There is some comfort in just having the Rosetta code consistently "work". I just put Rosetta on 10% resource setting and devote just a fraction of a host to them. It helps cool off my CPU from running the modern projects. The Rosetta code works really well at what it is intended to do; one of the best, if not, the best tertiary protein structure predictor out there. It isn't optimized to squeeze everything out of modern CPUs unfortunately. ID: 78803 · Rating: 0 · rate: / Reply Quote

Robby1959 Send message Joined: 10 May 07 Posts: 38 Credit: 9,298,741 RAC: 0	Message 78838 - Posted: 21 Sep 2015, 1:13:12 UTC I too have a computer that has lots of validate errors . it does more work units and gets fewer points than my other system. ID: 78838 · Rating: 0 · rate: / Reply Quote

[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 2118 Credit: 12,390,943 RAC: 265	Message 78844 - Posted: 22 Sep 2015, 9:21:16 UTC - in response to Message 78802. I would be interested in the number of 32-bit hosts and number of ACTIVE 32-bit hosts. I suspect it is low but I would not be too surprised if it was significant. If the work of 32-bit hosts was below the percentage of improvement seen by a 64-bit optimization, then it might make sense to release only a 64-bit version .... if the decision was made to forego the 32-bit hosts. I thought the improvement was near 20% (from memory). In firsts 1000 top hosts no 32 bit OS. So, a simple 10% more for these rac monsters outclasses all the 32 bit power. ID: 78844 · Rating: 0 · rate: / Reply Quote

Link Send message Joined: 4 May 07 Posts: 356 Credit: 382,349 RAC: 0	Message 78845 - Posted: 23 Sep 2015, 16:27:47 UTC - in response to Message 78844. Last modified: 23 Sep 2015, 16:28:29 UTC In firsts 1000 top hosts no 32 bit OS. On the first few pages this might be right, although it's hard to tell what all those Linux machines are. Anyway, there are still many powerfull machines using 32-bit OS, example host 1651904, currently with a RAC of almost 5000 on rank 250, ist using Windows 7 x86. That's not really slow. So, a simple 10% more for these rac monsters outclasses all the 32 bit power. You might want to explain how exactly you get this number. . ID: 78845 · Rating: 0 · rate: / Reply Quote

Dr. Merkwürdigliebe Send message Joined: 5 Dec 10 Posts: 81 Credit: 2,657,273 RAC: 0	Message 78846 - Posted: 23 Sep 2015, 17:04:40 UTC - in response to Message 78845. In firsts 1000 top hosts no 32 bit OS. On the first few pages this might be right, although it's hard to tell what all those Linux machines are. Anyway, there are still many powerfull machines using 32-bit OS, example host 1651904, currently with a RAC of almost 5000 on rank 250, ist using Windows 7 x86. That's not really slow. No way. That's one craptastic CPU. No way. Decimal points?! ID: 78846 · Rating: 0 · rate: / Reply Quote