Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 56 · 57 · 58 · 59 · 60 · 61 · 62 . . . 309 · Next

AuthorMessage
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2141
Credit: 41,524,949
RAC: 10,364
Message 97578 - Posted: 24 Jun 2020, 20:22:21 UTC - in response to Message 97567.  

I do note the very large number of people who have left and continue to leave the project and it does not surprise me in the least.
It has to be time to rethink some things and that includes the three day time limit.

Before Seti shut down there were 40-50k hosts returning tasks daily here. Currently it's around 550k daily. Good observation.

After the massive influx of new users and hosts, the project did have a massive rethink, doubling the minimum runtime and cutting the deadline from 8 days to 3.
It was massively successful, enabling 10x as many tasks to be returned with more useful information in a lot less time, while not exceeding the server capacity (which was being hammered at the time)

That being the case, if you could tweak your settings appropriately to your ability to run and meet the deadline the project requires, that would be great.
Or you could continue being late to return your tasks and get no credit for them. Your choice.
ID: 97578 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1234
Credit: 14,338,560
RAC: 2,014
Message 97579 - Posted: 24 Jun 2020, 20:22:36 UTC - in response to Message 97577.  


Have you tried setting No new tasks about the time any more tasks downloaded will not finish in time?


Yes but it doesn't work out well enough. Reducing the cache helped better but i still have to cancel some. It doesn't help that i sometimes stop to crunch for Folding. :)

Folding@home? If so, I have Folding@home doing GPU work only, with BOINC projects taking up all of the CPU time they can without interfering with Folding@home or with my email reading. That allows both Folding@home and BOINC to crunch nearly 24 hours a day.
ID: 97579 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2141
Credit: 41,524,949
RAC: 10,364
Message 97580 - Posted: 24 Jun 2020, 20:31:43 UTC - in response to Message 97577.  

Have you tried setting No new tasks about the time any more tasks downloaded will not finish in time?

Yes but it doesn't work out well enough. Reducing the cache helped better but i still have to cancel some. It doesn't help that i sometimes stop to crunch for Folding. :)

I have no idea as I've never used it, but would it help to set something in Boinc's "Daily Schedules" tab under Computing Preferences? I wonder what it would do.
ID: 97580 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Daedalus

Send message
Joined: 1 Aug 08
Posts: 39
Credit: 10,107,661
RAC: 56
Message 97581 - Posted: 24 Jun 2020, 20:37:01 UTC - in response to Message 97580.  


I have no idea as I've never used it, but would it help to set something in Boinc's "Daily Schedules" tab under Computing Preferences? I wonder what it would do.


I don't know. It doesn't seem to forbid specific days. Just specific hours some days. In the meantime, i will try to reduce further the cache to see if it helps well enough. :)
ID: 97581 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1234
Credit: 14,338,560
RAC: 2,014
Message 97582 - Posted: 24 Jun 2020, 21:08:51 UTC - in response to Message 97581.  
Last modified: 24 Jun 2020, 21:11:03 UTC


I have no idea as I've never used it, but would it help to set something in Boinc's "Daily Schedules" tab under Computing Preferences? I wonder what it would do.


I don't know. It doesn't seem to forbid specific days. Just specific hours some days. In the meantime, i will try to reduce further the cache to see if it helps well enough. :)

That appears to mean that you can exclude certain days by excluding all of the hours on those days.

I'd expect how much it helps to depend on how well BOINC uses the excluded days in its calculations of how much work to download.
ID: 97582 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 97585 - Posted: 24 Jun 2020, 23:00:48 UTC - in response to Message 97573.  

The BOINC manager is not that good at predicting my compute time and anyway i am not home on weekends so i always have some tasks due to finish a day my computer will be off. So at the end of the week i have to cancel a batch. Systematically.

I understand they want their results fast but i cannot always provide.


Just leave the machine on. Computers enjoy crunching :-) I've got 6 running 24/7. 5 of them in another room where I can't hear the fans!
ID: 97585 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Arnav Sood

Send message
Joined: 20 Aug 18
Posts: 2
Credit: 11,782,086
RAC: 0
Message 97590 - Posted: 25 Jun 2020, 7:21:55 UTC

I've been seeing a lot of
error while downloading
tasks, and the website sometimes gives me the message "Project is Down."

Is this just me or are others seeing this too?
ID: 97590 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 97592 - Posted: 25 Jun 2020, 10:22:38 UTC - in response to Message 97590.  

Is this just me or are others seeing this too?

Me too.
https://boinc.bakerlab.org/rosetta/results.php?hostid=4684790
ID: 97592 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
2fifty6

Send message
Joined: 24 Mar 20
Posts: 2
Credit: 946,426
RAC: 0
Message 97602 - Posted: 25 Jun 2020, 14:16:19 UTC - in response to Message 97581.  


I have no idea as I've never used it, but would it help to set something in Boinc's "Daily Schedules" tab under Computing Preferences? I wonder what it would do.

I don't know. It doesn't seem to forbid specific days. Just specific hours some days. In the meantime, i will try to reduce further the cache to see if it helps well enough. :)

In this era of high speed, always-on internet connections, what purpose does the cache even serve? I have mine set to zero so it just downloads WUs as others finish and it works just fine. And since those WUs are always "fresh," I never have to worry about running up against the deadline.
ID: 97602 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 97603 - Posted: 25 Jun 2020, 14:33:38 UTC - in response to Message 97602.  
Last modified: 25 Jun 2020, 14:34:27 UTC


I have no idea as I've never used it, but would it help to set something in Boinc's "Daily Schedules" tab under Computing Preferences? I wonder what it would do.

I don't know. It doesn't seem to forbid specific days. Just specific hours some days. In the meantime, i will try to reduce further the cache to see if it helps well enough. :)

In this era of high speed, always-on internet connections, what purpose does the cache even serve? I have mine set to zero so it just downloads WUs as others finish and it works just fine. And since those WUs are always "fresh," I never have to worry about running up against the deadline.


I run Milkyway, where the tasks take one minute on a graphics card. I can't download them that often, the server doesn't allow it. I only set my cache to 3+3 hours though. For example, on my CPUs I currently have 66 CPU tasks running and 12 tasks queued. It doesn't seem right to me to be constantly downloading 1 task at a time, I'm sure the server prefers I download batches of say 6. In fact on my 24 core machines, they could be trying to get tasks from Rosetta extremely often if they got one at a time. What if one task finished at 3:00pm, so it downloaded a new task, then another core finished at 3:01pm? And another at 3:02pm? I suspect I'd hit a server limit of how often I can contact.
ID: 97603 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
2fifty6

Send message
Joined: 24 Mar 20
Posts: 2
Credit: 946,426
RAC: 0
Message 97606 - Posted: 25 Jun 2020, 15:20:03 UTC - in response to Message 97603.  

I run Milkyway, where the tasks take one minute on a graphics card. I can't download them that often, the server doesn't allow it. I only set my cache to 3+3 hours though. For example, on my CPUs I currently have 66 CPU tasks running and 12 tasks queued. It doesn't seem right to me to be constantly downloading 1 task at a time, I'm sure the server prefers I download batches of say 6. In fact on my 24 core machines, they could be trying to get tasks from Rosetta extremely often if they got one at a time. What if one task finished at 3:00pm, so it downloaded a new task, then another core finished at 3:01pm? And another at 3:02pm? I suspect I'd hit a server limit of how often I can contact.
Ok, that sounds like a reasonable fringe case. But for the vast majority of people whose computers take hours to run each WU, it doesn't seem like a cache of jobs really adds much benefit from a project standpoint. It just unnecessarily delays WU completion as they sit claimed-but-idle in someone's queue.
ID: 97606 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 97608 - Posted: 25 Jun 2020, 15:40:11 UTC - in response to Message 97606.  

Ok, that sounds like a reasonable fringe case. But for the vast majority of people whose computers take hours to run each WU, it doesn't seem like a cache of jobs really adds much benefit from a project standpoint. It just unnecessarily delays WU completion as they sit claimed-but-idle in someone's queue.


Agreed, for normal computers without Milkyway on GPU, or with huge numbers of cores. A long time ago I'd use a big cache on projects which had a lot of server downtime, but I don't think many do that much now. Even if they did, if you have two projects your computer can fall back on the other one.

But a queue of 3+3 hours (I think the Boinc default is pretty similar) isn't much, since it also includes the time for the running tasks to complete. Say you have 4 cores, and they have all recently started an 8 hour Rosetta task, they all have 7 hours to completion. No tasks will be queued, because Boinc can see all 4 cores are going to be busy for 7 hours. Only when it gets below 3 hours will it download anything. At this point with my setting, it would get one task per core, and if you have 24 cores like two of my machines do, zero queue would mean constantly hammering the server for work.
ID: 97608 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
IT_POWER_SALZKOTTEN

Send message
Joined: 19 Feb 20
Posts: 1
Credit: 1,110,154
RAC: 772
Message 97615 - Posted: 25 Jun 2020, 19:08:27 UTC - in response to Message 97592.  

i don't get new work, too, Some days ago the "NumberFields@home" had a problem: Expired SSL certificates. User action was required. Maybe here the same?
ID: 97615 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 97616 - Posted: 25 Jun 2020, 19:11:18 UTC - in response to Message 97615.  

i don't get new work, too, Some days ago the "NumberFields@home" had a problem: Expired SSL certificates. User action was required. Maybe here the same?


That was a while ago. If you're on the latest Boinc version that's not why.

And you can see in Boinc's messages whether it just says "Got no new tasks" or "SSL error". At the moment they've just run out, which you can see on their server status page.
ID: 97616 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1234
Credit: 14,338,560
RAC: 2,014
Message 97617 - Posted: 25 Jun 2020, 19:52:03 UTC
Last modified: 25 Jun 2020, 20:02:17 UTC

IT_POWER_SALZKOTTEN,

I see that your computer is still using BOINC 7.16.5. Upgrading to BOINC 7.16.7 is known to fix that problem for SOME of the BOINC projects running under Windows, so could you try that?

https://boinc.berkeley.edu/download_all.php

Also, Rosetta@home now has so many users that it cannot generate work fast enough to keep all the computers busy all of the time.

You might want to add another BOINC project to fill in for such times. I prefer World Community Grid for that purpose, since they are currently doing COVID-19 work. Choose their Open Pandemics subproject for that purpose, but you might want to allow work from their other subprojects as well to help with other medical research.

https://join.worldcommunitygrid.org?recruiterId=480838
ID: 97617 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2141
Credit: 41,524,949
RAC: 10,364
Message 97620 - Posted: 26 Jun 2020, 0:02:03 UTC - in response to Message 97594.  

Not sure if you count it as a sensible reason, but around 6,000 people are dying in the world of CV19 every day for the last few months.

Which is reducing as we become immune.

UK isn't the only country in the world, Peter.
The wave is predominantly in the Americas now and isn't significantly reducing. Several states in the US and in Brazil are still increasing.

And immunity, if it even exists, which is completely unproven atm, would require 20x as many to be infected as have currently been.
And that's not even allowing for whatever immunity as exists, will even last long enough for others to get to the level of a herd.
In past SARS epidemics, immunity that initially existed was found to be only 15% as strong after 9 months.
That is, herd immunity will <never> happen outside of vaccination. So we continue.

If it's any consolation, I've been back at work since the beginning of last week now that the building owners have allowed us to return - it had been a public liability issue for them.
Of the other 7 business on my floor, only 1 other has returned. The other 6 were allowed to open throughout, but chose not to due to the lack of footfall, which is continuing tbh. Last week we reached just 22% of the same week last year.
ID: 97620 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Cobes

Send message
Joined: 18 Mar 20
Posts: 1
Credit: 1,107,798
RAC: 0
Message 97621 - Posted: 26 Jun 2020, 0:31:19 UTC
Last modified: 26 Jun 2020, 0:32:31 UTC

Anyone else not getting work units? 3900X running 16 hours a day at idle...


EDIT: Yes, should have read the thread.
ID: 97621 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
[TA]Assimilator1
Avatar

Send message
Joined: 9 May 07
Posts: 7
Credit: 5,399,250
RAC: 0
Message 97622 - Posted: 26 Jun 2020, 0:50:38 UTC - in response to Message 97621.  
Last modified: 26 Jun 2020, 1:08:28 UTC

Anyone else not getting work units? 3900X running 16 hours a day at idle...


EDIT: Yes, should have read the thread.

Err, do fill us in as I don't see the answer recently ;) [edit] Nm, apparently just been bled dry of WUs.
Although, why now? They hadn't run out for some weeks AFAIK.
Team AnandTech - SETI@H, Muon1 DPAD, F@H, MW@H, A@H, LHC@H, POGS, R@H, DHEP, CPDN

Main - Ryzen 5 3600, TR Ultima90, MSI B450, 32GB DDR4 3200, RX580 8GB, Seasonic Prime PX-550
2nd - i7 4930k @4.1GHz, TR Ultra120 E, 16GB DDR3 1866, HD7870 XT 3GB(DS
ID: 97622 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1234
Credit: 14,338,560
RAC: 2,014
Message 97623 - Posted: 26 Jun 2020, 1:52:04 UTC

[TA]Assimilator1,

So you haven't read far enough back in this thread?

The number of computers trying to download Rosetta@home tasks is now at least 10 times as much as it was a few months ago.

The project team often doesn't have enough ideas for new work to generate 10 times as many tasks as a few months ago.
ID: 97623 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Stevie G

Send message
Joined: 15 Dec 18
Posts: 107
Credit: 865,910
RAC: 814
Message 97625 - Posted: 26 Jun 2020, 6:30:34 UTC - in response to Message 97568.  

Peter Hucker wrote, "I don't understand what you're getting at. You receive a task that takes 8 hours to complete, and you have to send it back in 3 days (9 times longer than it takes to do it). How can that possibly cause you not to get them done in time?

In my case, they could not be done in time because Rosetta sent me 26 tasks due in 3 days. My computer takes a little over 7 hours to complete one task. Completing them all would take 182 hours of computer time. There are only 72 hours in three days. The computer might be able to complete ten tasks in that time. The remaining 16 tasks will not be completed and will be reported as errors. (These do not include the 15 tasks that were reported as "errors while downloading," which seems to happen a lot with Rosetta, although it rarely happens with my other projects.)

Now do you understand?

Steven Gaber
ID: 97625 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 56 · 57 · 58 · 59 · 60 · 61 · 62 . . . 309 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org