Message boards : Number crunching : Problems and Technical Issues with Rosetta@home
Previous · 1 . . . 274 · 275 · 276 · 277 · 278 · 279 · 280 . . . 306 · Next
Author | Message |
---|---|
Greg_BE Send message Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
That's not our problem. This is a Rosetta place not a Folding. It's not our issue. Take it up with Folding. And you dilemma is for you to figure out, we don't need to know nor do we care about your dilemma. That is your alone to figure out. I'd like to comment. |
Greg_BE Send message Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
And again, not a word from the project. Wonder what it was this time. |
Kissagogo27 Send message Joined: 31 Mar 20 Posts: 86 Credit: 2,958,343 RAC: 2,758 |
i notice the very low update rate ~25KB/s and the "waiting for validation" about the results just uploaded ... |
Kissagogo27 Send message Joined: 31 Mar 20 Posts: 86 Credit: 2,958,343 RAC: 2,758 |
somes are validated now |
Kissagogo27 Send message Joined: 31 Mar 20 Posts: 86 Credit: 2,958,343 RAC: 2,758 |
État de l'exécution Travail Tasks ready to send 78401 Tâches en cours 112944 Workunits waiting for validation 21612 Workunits waiting for assimilation 5284 Workunits waiting for file deletion 0 Tasks waiting for file deletion 0 Transitioner backlog (hours) 0.00 Utilisateurs With credit 1379962 With recent credit 15782 Registered in past 24 hours 2 Ordinateurs With credit 4530477 With recent credit 32018 Registered in past 24 hours 7 Current GigaFLOPS 132514 |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2130 Credit: 41,424,155 RAC: 16,102 |
Project seems to be alive now I would hope so. The site hasn't been up to reallocate the tasks to anyone else, so we should get credit even having missed deadline as we're the first to return the task. That said, I wonder if any of the new tasks being issued might be resends... might be worth checking within an hour or two. |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2130 Credit: 41,424,155 RAC: 16,102 |
État de l'exécution Hopefully this is just a reflection of all the tasks being returned now the site is back up. Most of my tasks aren't validated yet, but some are. I have got one weird one though: Completed, can't validate Edit: It's just changed while I was typing this to validated and credited. That's weird - never seen that happen before. Sounds like we need to be a little patient as everything gets processed. Edit 2: I refreshed the page again and loads more of my tasks were Validated. We should definitely give the site a chance to blast through them |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2130 Credit: 41,424,155 RAC: 16,102 |
Project seems to be alive now I've got 15 tasks returned after deadline and they've all validated and credited. I have a further 6 awaiting validation. Just checking further, the tasks I returned after deadline have been reissued to other users 10 minutes before I returned them. One of the reissues has been cancelled by the Server. The others haven't. Which tells me that any tasks we're downloading may fall into the same bracket. My understanding is that if the tasks haven't been started, they will be cancelled by the server. <But> If they have been started they'll run to completion and <be awarded no credit> because the previous user has already been awarded them! This is all getting a bit ugly. Check the status of all your tasks online, people... |
Bryn Mawr Send message Joined: 26 Dec 18 Posts: 397 Credit: 12,254,928 RAC: 11,616 |
I'd like to comment. This is not a problem with Boinc. It is not a problem with Folding. It is a problem with your configuration which is preventing the two projects, which have no way of knowing the other is there, from working together. You have been given the configuration changes required, all that’s needed now is for you to try them. |
kotenok2000 Send message Joined: 22 Feb 11 Posts: 264 Credit: 507,897 RAC: 900 |
I hope they will still get points when script runs, because each task would still generate unique data. |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2130 Credit: 41,424,155 RAC: 16,102 |
I'd like to comment. I understand the issue better now. Irrespective of fault, it seems like all Boinc projects are having problems coexisting with Folding@home, evidenced by Grant's comment And the same issue is happening with your other projects. This is only a problem to the extent that tasks miss deadlines, which is what you have, so check these settings in turn: 1. Ensure "at most xx% of CPU time" is set to 100% for all Boinc tasks. 2. You may think Rosetta is set to 8hrs, but every one of your tasks runs to 43,200secs of CPU time, which is 12hrs. So go to your account online and within rosetta@home preferences reaffirm "Target CPU run time" is set explicitly to 8hrs and Update Preferences. Rosetta certainly thinks it's set to 12hrs. 3. If you still can't complete tasks within the deadline, reduce your cache size in Boinc, so you don't download too many tasks to complete before deadline. I think Point 2 will be the solution. Rosetta is a bit weird when non-default runtimes are set. They're all downloaded as if they're 8hrs tasks, but when it gets close to that runtime only then does it adjust the remaining time up toward 12hrs. So they run 4hrs longer, then projects the size of the rest of the cache as if it will be 8hrs again. It's been programmed to <not> adjust based on past history. I forget why but I do recall when it was deliberately made to work that way. |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2130 Credit: 41,424,155 RAC: 16,102 |
This will solve the <entirety> of your problems, while (coincidentally) massively increasing your contribution to <all> the projects you run within your preferred settings.He's running Folding at home as well. Ta, I didn't pick up the Folding@home involvement - that explains part of it. But I do think it's the Target CPU time aspect that's tipping things over the edge - partly because I'm set to 12hr tasks too and it is a bit weird, but I run a small enough cache and only two projects so it never affects me. The part about using 12hr tasks not changing the projected runtime of the rest of the Rosetta cache is something that was brought in... about 4 years ago. I'm pretty sure that's not a coincidence. Which means the the tipping point <is> a Rosetta issue after all |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1699 Credit: 18,186,917 RAC: 24,275 |
I'd like to comment.No you don't, you just ignore what you are told as to how to fix it. Twice now. The result of asking each time is the same, basically, the BOINC folk tell me the problem is Folding, the Folding folk tell me it is not.And since it is occurring with a BOINC project- actually all of your BOINC projects, not just this one- might it be somewhat obvious that those of us here doing BOINC work might have some idea of what is actually going on? While those at Folding- unless they do BOINC work as well- won't have the slightest idea of what you are complaining to them about? And if you had paid the slightest bit of attention to the responses i gave you previously, you would understand what the problem is & how to fix it. I have set no new tasks at both. I would seem to face a choice, I can support one or the other. Both are important to me.The third option would be to fix it so that both can co-exist, hundreds (if not a thousand +) of other people have done so. Twice i have told you what the problem is. Twice i have told you how you could fix the problem. And twice you have ignored completely everything you were told that would allow you to sort it out. So, yeah, not doing either of them is probably the best option for you. Grant Darwin NT |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1699 Credit: 18,186,917 RAC: 24,275 |
Which means the the tipping point <is> a Rosetta issue after allNope. If it took 12 hours to do 12 hours of work, there'd be no problem. But because it takes 24hrs to do 12hrs work, it's a big problem. Even set to 8 hours, it would still take 16hrs, so still Panic mode. Make it so the CPU isn't over committed, and all would be OK. His problem is purely down to it taking 2-4 times longer than it should to process any BOINC Tasks, because the CPU is also processing Folding work on the same CPU cores/threads- X cores/threads trying to process X+1 or X+2 applications (that are using 100% of each core/thread) is always going to cause problems. As long as the number of applications being run is equal to or less than the number of cores/threads, all will be well- so limiting the number of cores/threads available to BOINC so Folding has as many as it needs (1, 2, 4 or however many that is) would sort it out. Of course if "Use at most xx % of CPU time" is anything other than 100%, that would just add to the issues of doing Folding on the same cores/threads as BOINC work (as would any GPU Tasks from BOINC projects that require 1 core/thread per GPU Task being run to support it, and that too can be resolved, although it's more difficult than it needs to be). Grant Darwin NT |
robertmiles Send message Joined: 16 Jun 08 Posts: 1233 Credit: 14,324,975 RAC: 3,637 |
I remember from when I was running Folding@Home also that Folding@Home expects to use entire CPU cores, not just the available threads in that CPU core. An easy way to handle this is to start the Folding@Home program at least a full minute before starting any BOINC program. |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1997 Credit: 9,747,451 RAC: 10,562 |
I've got 15 tasks returned after deadline and they've all validated and credited. Good for you. I have a lot of "cancelled by the server" |
rilian Send message Joined: 16 Jun 07 Posts: 27 Credit: 3,008,219 RAC: 12,615 |
same here, i lost a hundred tasks :( i crunch for Ukraine. Join our team forums about Rosetta@home |
Jean-David Beyer Send message Joined: 2 Nov 05 Posts: 194 Credit: 6,540,448 RAC: 8,022 |
I've got 15 tasks returned after deadline and they've all validated and credited. So do I (although mostly they run OK). There seems to be something wrong with the server. It sends out a task, and before it returns its result or times out it sends the same one to me. Then the first user returns the result, and mine gets cancelled. Just plain sloppy. |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2130 Credit: 41,424,155 RAC: 16,102 |
Which means the the tipping point <is> a Rosetta issue after allNope. I don't completely agree. It's not just that a 12hr task (that Rosetta only shows Boinc as 8hrs for the bulk of its run) is taking 20-32hrs to complete, it's that the next tasks in the cache are showing 8hrs to Boinc but will also take 20-32hrs too. Changing the target runtime back to 8hrs, even with the folding@home contention, will take 7-11hrs out of the running tasks and a further 7-11hrs out of the cached tasks. 14-22hrs less processing time to complete tasks will make a huge difference to whether Panic mode arises. I'd guess <all> the difference. This is only an issue if the cache is set above a day. It can be made to work by ensuring Rosetta tasks only run for the time Adrian already thought they were set to (8hrs rather than 12hrs they actually run for). It can certainly be solved your way, but that gets a bit fiddly imo and doesn't resolve the confusion Rosetta runtime introduces. I'd rather my solution if I were him too, especially if RAM and disk space don't come into the equation. And we already know Adrian didn't like your solution, so let's see what he thinks of my alternative. It's entirely up to him. |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2130 Credit: 41,424,155 RAC: 16,102 |
I've got 15 tasks returned after deadline and they've all validated and credited. It was a very early call - in the first few hours. In the end I had 13 cancelled by the server, none of which had started to run. However, I did have 1 task that ran to completion, but came up with a validate error because the previous host reported it late. On balance, it could've been a lot worse on a 16-thread machine. I'll live with it. |
Message boards :
Number crunching :
Problems and Technical Issues with Rosetta@home
©2024 University of Washington
https://www.bakerlab.org