Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 60 · 61 · 62 · 63 · 64 · 65 · 66 . . . 311 · Next

AuthorMessage
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 4,044
Message 97741 - Posted: 28 Jun 2020, 0:02:24 UTC - in response to Message 97737.  

I'd observe what Task Manager says about how much memory is is use, and how close it is to the maximum amount BOINC is allowed to use, before changing the memory settings. It's been at least months since I found a good reason to change the BOINC memory settings on my computer.


I generally set Boinc to 80%. And that's just a request, Boinc has been known to use more or less than that, a work unit doesn't always know how much memory it will need, and Boinc doesn't seem to be able to monitor it very well.
ID: 97741 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 4,044
Message 97742 - Posted: 28 Jun 2020, 0:03:46 UTC - in response to Message 97738.  
Last modified: 28 Jun 2020, 0:05:29 UTC

How do you want that plan adjusted to handle times when one of the two projects had no work to download for the last few days, or very little work like it's happened on Rosetta@home recently? Many people do not agree that the adjustment should take effect immediately, without even waiting for several tasks to be reported and receive credits.

Many people just don't have enough spare time to frequently adjust their project setting, like you appear to want to do.


It should be exactly as it currently is, but just reset the debts when the weights are changed by the user. This would have no effect on anyone not changing the weights, but would allow a user to suddenly decide they want more time spent on a project they consider important, without having to wait a few days for it to settle. When I changed from Rosetta 1 and two other projects on 0, to 1/5/25 ratio, I did not get lots of Rosetta running, so it wasn't doing the project I considered important.
ID: 97742 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 4,044
Message 97743 - Posted: 28 Jun 2020, 0:08:06 UTC - in response to Message 97739.  
Last modified: 28 Jun 2020, 0:08:29 UTC

EricM wrote:
I wonder if I should alter the memory settings?
In my recent observation, tasks are allocating anything from 300 to 3000, and averaging around 700, MB of memory each. At a limit of 75% of 8 GB for 4 cores, most tasks will be fine. But if nothing else is using the computer while BOINC is active, you could increase that to, say, 93%. (I don’t recommend 100% because the operating system will always need some memory for itself.)


I've had an 8GB computer completely lock up with a setting of 90%. Boinc using 90% (I think it had actually gone above that) and Windows using at least 10%, there were running tasks having to page constantly to the (mechanical) disk. I couldn't even get the task manager to respond it was so bogged down. I had to use the power button. I now always use 80% and that seems fine.
ID: 97743 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1234
Credit: 14,338,560
RAC: 826
Message 97744 - Posted: 28 Jun 2020, 0:08:37 UTC - in response to Message 97723.  

I've found that this will handle CPU time problems, However, the paused BOINC applications usually do NOT release any memory they have allocated, unless you shut down BOINC. Therefore, your plan does not help if the important contention is for memory instead of for CPU time, unless you use the setting for releasing memory from BOINC applications that are not running.


Surely the paused Boinc programs would be shoved in the swap file by Windows?

You might hope so, but that's not what actually happens. Many programs load memory in sections, keeping pointers to the start of those sections. If the sections are not reloaded to the same memory address where they were before, the program is likely to crash as soon as it tries to use something in an out of place section.

Instead, BOINC normally uses a method where the programs reach a point suitable for a checkpoint, write all of the memory in use but not marked as unchangeable to a checkpoint file, then go on. If there is an interruption that requires using a checkpoint, then the unchangeable part of the program will be reloaded as if the program was about to start over, then the changeable information is reloaded from the checkpoint file with suitable adjustments to pointers to where the sections are in memory, and then the program restarts from just after the point where it previously wrote the checkpoint.

Most, but not all, BOINC programs try to keep the last two checkpoint files written, so that if there was a problem writing the most recent one, the next older one can be used instead. A few BOINC projects using very big checkpoint files erase the previous one just before they write a new one. I suspect that those are projects that do not understand the structures of their programs well enough that they can separate the changeable sections from the unchangeable sections.
ID: 97744 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1734
Credit: 18,532,940
RAC: 17,945
Message 97745 - Posted: 28 Jun 2020, 0:10:31 UTC - in response to Message 97739.  
Last modified: 28 Jun 2020, 0:23:38 UTC

EricM wrote:
I wonder if I should alter the memory settings?
In my recent observation, tasks are allocating anything from 300 to 3000, and averaging around 700, MB of memory each. At a limit of 75% of 8 GB for 4 cores, most tasks will be fine. But if nothing else is using the computer while BOINC is active, you could increase that to, say, 93%. (I don’t recommend 100% because the operating system will always need some memory for itself.)
I've got my BOINC settings set for 95%
But i don't keep Tasks in memory when not active. If BOINC needs the RAM, it can use it. If other programmes need to run, then they do & the BOINC projects release the memory they were using.




I basically allow 1.3GB RAM per Task. Some use a lot more, most use a lot less, so overall as long as you allow 1.3GB per Task, set the number of cores/threads you use to match the available RAM in the system and you won't run in to issues.

eg 4 core/thread CPU 6GB min RAM.
12 core/thread CPU 16GB min RAM.
64 core/thread CPU 84GB min RAM.

The number of Tasks requiring more than 1GB are a lot less than there used to be, but they are still around & things could change back to larger RAM requirements in the future.
Given the low percentage of large RAM requirement Tasks overall, with really high core/thread count systems 1GB RAM per Task would be good enough not to run in to issues.
Grant
Darwin NT
ID: 97745 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1234
Credit: 14,338,560
RAC: 826
Message 97746 - Posted: 28 Jun 2020, 0:17:03 UTC - in response to Message 97735.  

I would still recommend reducing Store up to an additional to zero, though in my experience that setting doesn’t seem to influence the amount of work downloaded in any event.
It does, just in very odd ways- It is best to have it as low as possible.

Having a large value for Additional days will result the cache running down significantly below the Store at least value, before it then gets more work to build it back up to the Store at least + Additional days value (and the larger each of those values, the greater the run down before it rebuilds the cache again). If people really feel a need for a cache, set it in Store at least value. The Additional days value should be as small as possible.
Weird things are much less likely to occur that way.


It seems a very simple setting to me, and it should obviously do the following (although Boinc is never sensible):

I set 6 hours + 12 hours. The queue falls to 6 hours, then it downloads an extra 12. That's a minimum of 6 and a maximum of 6+12. It can't mean anything else.

Yes if your mean 6 hours + 12 hours of run time. No if you mean 6 hours + 12 hours of clock time.
ID: 97746 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
EHM-1
Avatar

Send message
Joined: 21 Mar 20
Posts: 23
Credit: 183,782
RAC: 0
Message 97747 - Posted: 28 Jun 2020, 0:17:30 UTC - in response to Message 97737.  
Last modified: 28 Jun 2020, 0:24:25 UTC

Well, you are all very helpful! I'm happy to say that some tweaks I tried an hour ago, before I read the past several posts, are somewhat in line with your suggestions. So I'm tentatively upgrading my self-assessment from dolt to simpleton. @Brian, your first sentence there, so simply put, immensely increased my understanding of how these work units are handled. I think that concept should be presented to every BOINC user on day one. Years ago, at the inception of BOINC, if I'd be told half of what you all have imparted to me today, I would have been a much better contributor all these years. Until now, seeking guidance in the forums has been frustrating and mostly unfruitful. Thankee!

PS- As I've been writing, BOINC is processing three work units simultaneously, which I assume means it's using three cores per my new CPU setting. I still have the switch-between interval set at 1,000 minutes, so I hope that does not cause my OpenPandemic tasks to expire. We'll see.

system: up-to-date Windows 10, Intel quad-core 3.6 GHz processor, 8 GB RAM
ID: 97747 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1734
Credit: 18,532,940
RAC: 17,945
Message 97748 - Posted: 28 Jun 2020, 0:22:29 UTC - in response to Message 97745.  
Last modified: 28 Jun 2020, 0:23:13 UTC

Double post.
Grant
Darwin NT
ID: 97748 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1234
Credit: 14,338,560
RAC: 826
Message 97749 - Posted: 28 Jun 2020, 0:25:29 UTC - in response to Message 97736.  

Peter Hucker wrote:
I guess my tariff is lower than yours, because it's about half that. I do remember saving 30% by changing supplier a long time ago.

Mind you, I'm on one of those direct debit things that, like Boinc, never manages to settle down. So I could be racking up a massive debt....

What I'd love is Economy 7. My friend has it (he's still got storage radiators!) and only pays 5p a unit at night!
Good point – I’m in a rented flat and still on the supplier and tariff I inherited when I moved in. It’s unlikely that’s the best deal I could get…

In another place I neglected the direct debit discrepancy for so long it ended up with the supplier owing me £2,500!

Is the average cost still cheaper on Economy 7, despite the higher daytime rate? You can still change, can’t you? Though I suppose you’d have to pay £££ for a new meter…

But you’re really pulling 400 W per machine? I know the X5600s were not reknowned for their power efficiency, but they’re only 95 W TDP.

Looks like you're counting only the power used by the CPU chip, but not the power used by the rest of the computer. The computer's power supply must be able to handle the surges of power used to start up various sections of the computer, plus the power it uses itself, with enough of a margin that it does not run into the inefficiency of being too close to its limits.
ID: 97749 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Stevie G

Send message
Joined: 15 Dec 18
Posts: 108
Credit: 866,895
RAC: 389
Message 97750 - Posted: 28 Jun 2020, 0:27:10 UTC
Last modified: 28 Jun 2020, 0:29:32 UTC

Having problems with Rosetta and World Community Grid.

I signed on to WCG and the computer started crunching.
However, it removed BONIC from my computer. In the WCG client, Asteroids and WCG are shown, as are SETI@Home and POGS, which are no longer active.

But Rosetta is MISSING!!

WCG won't let me on its forum page.

I had no active Rosetta tasks, but it should still show on my computer, along with the credits it has amassed.

Moreover, when I tried to log into BOINC, it says there is no such account for my email address. I've had a BOINC account for over 20 years.

What gives? I don't like how WGC has taken over my computer. Is that a common thing?

Steven Gaber
Oldsmar, FL
ID: 97750 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Brian Nixon

Send message
Joined: 12 Apr 20
Posts: 293
Credit: 8,432,366
RAC: 0
Message 97751 - Posted: 28 Jun 2020, 0:29:50 UTC - in response to Message 97740.  

They should have given you that earlier, I thought they had to by law
This was eight years ago; I suspect the rules were different then. I do notice they adjust the payment (down as well as up) more readily these days.

It's not the CPUs that eat the juice, it's the GPUs
Ah, of course. With my Rosetta blinkers on I hadn’t thought about non-CPU work…
ID: 97751 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mikey
Avatar

Send message
Joined: 5 Jan 06
Posts: 1895
Credit: 9,217,610
RAC: 822
Message 97752 - Posted: 28 Jun 2020, 0:30:17 UTC - in response to Message 97731.  

Easy answer to your hypothesis...the Boinc Developers made that formula and they are the only ones that can change it. Join there email group and ask them why it's the way it is, I'm sure Dr A will give you a response at some point.


Yeah ok.... they're not the listening type.


Agreed...in short it is what it is and will be for as long as the people in charge stay there, it's better to work with the settings then try to figure out why they work the way they do.
ID: 97752 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1234
Credit: 14,338,560
RAC: 826
Message 97753 - Posted: 28 Jun 2020, 0:55:41 UTC - in response to Message 97750.  
Last modified: 28 Jun 2020, 1:13:42 UTC

Bad post deleted.
ID: 97753 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1234
Credit: 14,338,560
RAC: 826
Message 97754 - Posted: 28 Jun 2020, 1:11:44 UTC - in response to Message 97753.  

Having problems with Rosetta and World Community Grid.

I signed on to WCG and the computer started crunching.
However, it removed BONIC from my computer. In the WCG client, Asteroids and WCG are shown, as are SETI@Home and POGS, which are no longer active.

But Rosetta is MISSING!!

WCG won't let me on its forum page.

I had no active Rosetta tasks, but it should still show on my computer, along with the credits it has amassed.

Moreover, when I tried to log into BOINC, it says there is no such account for my email address. I've had a BOINC account for over 20 years.

What gives? I don't like how WGC has taken over my computer. Is that a common thing?

Steven Gaber
Oldsmar, FL

Not very common.

First, check if you have BOINC installed twice, not just once. Two copies will not share a computer well.

You might have selected Rosetta@home under the Projects list, then accidentally clicked on Remove.

Also Rosetta@home may have just completed its switch from http to https by deleting any remaining Rosetta@home accounts for Windows computers still using the http connection.

For either of these, click on Tools under Advanced view, then Add project. Scroll down to Rosetta@home, then click on it.

If the line in the Project URL box does not start with https:, insert the s. Now click on Next, and set up the account like before. This should recover the saved credits. If your computer still has the same name as before, it should also recover how much that computer has done.

It's unclear what you mean by "log into BOINC". Give us more details if you need help with that.
ID: 97754 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Brian Nixon

Send message
Joined: 12 Apr 20
Posts: 293
Credit: 8,432,366
RAC: 0
Message 97755 - Posted: 28 Jun 2020, 1:23:08 UTC - in response to Message 97747.  

Eric,

You’re welcome. It is not obvious how BOINC works – and from what I’ve gathered Rosetta is unusual by having tasks that run for a fixed amount of (CPU) time within which they achieve a variable amount of work (depending on the performance of the machine), rather than a fixed amount of work which takes a variable amount of time to complete. It would indeed be nice if this were made clearer to new starters on the project Web pages.

Like you, I did not know any of this when I started. Where did I learn it? Right here in these forums. Yes, an amount of cruft has built up over the years – old, long threads with information that may or may not still be relevant. A lot of what’s written is unclear at best. And input from insiders who can tell us how it actually works is infrequent. But amongst it all are some useful nuggets – and passing those on is the least I can do.

Happy crunching!
ID: 97755 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Brian Nixon

Send message
Joined: 12 Apr 20
Posts: 293
Credit: 8,432,366
RAC: 0
Message 97757 - Posted: 28 Jun 2020, 1:41:48 UTC - in response to Message 97749.  

robertmiles wrote:
Looks like you're counting only the power used by the CPU chip
I assumed Peter was talking about measured consumption, not a number printed on the label of his power supply. Of course there’s overhead to consider on top of the CPU – but his numbers struck me as excessive. (By comparison, my machine with a [single] same-TDP CPU draws 120 W from the wall; a second CPU would need its own RAM and extra cooling, but not 185 W worth.) As Peter pointed out, it was his GPUs I hadn’t considered.
ID: 97757 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2146
Credit: 41,570,180
RAC: 8,210
Message 97760 - Posted: 28 Jun 2020, 3:38:52 UTC - in response to Message 97676.  

Hello Sid and all-
Not sure if this is related to what Steven is encountering:
On my computer Rosetta will run for days as expected, then do nothing for a day or two, then resume normal behavior. I've never seen any explanation for this. I'm currently in one of these down times. Yesterday I added a second project to my BOINC acct, and it is running as expected. As far as I know, Rosetta is still paused in mid-task. Anyone have an idea what might be causing this?

In short, no. Sounds very odd.
I run an i3-8350 remotely so I'm familiar with your PC, though mine has 16Gb RAM compared to your 8Gb, but that ought not to make any difference with just 4-cores.

The only thing I can ask is, on the Computing tab under Computing preferences in Boinc Manager, do you have anything in the section titled "When to suspend" ?
Everything in that section should be unselected.
In the section above, titled 'Usage Limits', both should be 100%
ID: 97760 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 97761 - Posted: 28 Jun 2020, 5:50:28 UTC - in response to Message 97760.  

I run an i3-8350 remotely so I'm familiar with your PC, though mine has 16Gb RAM compared to your 8Gb, but that ought not to make any difference with just 4-cores.

I think it does. He is only allowing 50% of the memory to be used by BOINC, so only 4 GB, or 1 GB/core.
Rosetta often uses more than 1 GB per work unit.

He is probably "waiting for memory" to become available.
ID: 97761 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Stevie G

Send message
Joined: 15 Dec 18
Posts: 108
Credit: 866,895
RAC: 389
Message 97763 - Posted: 28 Jun 2020, 6:27:17 UTC - in response to Message 97754.  

Not very common.

First, check if you have BOINC installed twice, not just once. Two copies will not share a computer well.

You might have selected Rosetta@home under the Projects list, then accidentally clicked on Remove.

Also Rosetta@home may have just completed its switch from http to https by deleting any remaining Rosetta@home accounts for Windows computers still using the http connection.

For either of these, click on Tools under Advanced view, then Add project. Scroll down to Rosetta@home, then click on it.

If the line in the Project URL box does not start with https:, insert the s. Now click on Next, and set up the account like before. This should recover the saved credits. If your computer still has the same name as before, it should also recover how much that computer has done.

It's unclear what you mean by "log into BOINC". Give us more details if you need help with that.[/quote]
ID: 97763 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Stevie G

Send message
Joined: 15 Dec 18
Posts: 108
Credit: 866,895
RAC: 389
Message 97764 - Posted: 28 Jun 2020, 6:41:00 UTC - in response to Message 97763.  
Last modified: 28 Jun 2020, 6:44:20 UTC

.It's unclear what you mean by "log into BOINC". Give us more details if you need help with that.


It seems that I can have either the BOINC Manager or the World Community Grid, but not both.

When I said I was trying to log into BOINC, I meant that I could not find the BOINC Manager. WCG had deleted it, along with Roosetta.

BOINC Help said there was no such account with my email address. I had to download BOINC again to get to the BOINC Manager. Rosetta shows up there, but not in my WCG account.

WCG does not show up in my FreeDC page. My computer already completed four WCG projects, but they don't show up anywhere. WCG help won't llet in, says to delete cookies, which I did, but still couldn't get in.

I may just opt out of the WCG. Too many problems. I'll just wait for Rosetta to perk up again, or maybe add a different project that is not so intrusive.
ID: 97764 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 60 · 61 · 62 · 63 · 64 · 65 · 66 . . . 311 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org