Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 87 · 88 · 89 · 90 · 91 · 92 · 93 . . . 309 · Next

AuthorMessage
Profile Garry Heather

Send message
Joined: 23 Nov 20
Posts: 10
Credit: 362,743
RAC: 0
Message 100984 - Posted: 1 Apr 2021, 15:23:51 UTC - in response to Message 100983.  
Last modified: 1 Apr 2021, 15:40:48 UTC

This is true, but lets have some context here. I just wanted my single Pi to be kept busy because the cost in leaving it on 24/7 is not insignificant to me. There are some people here with multiple monsters processing work units. My solitary Pi was never going to make a dent on their requirements so please do not think badly of me for trying to cache enough to to keep it busy for a couple of days.

I will complete the units currently being processed but suspect that this project is not for me. I have aborted my cached units back into the pool.
ID: 100984 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Brian Nixon

Send message
Joined: 12 Apr 20
Posts: 293
Credit: 8,432,366
RAC: 0
Message 100985 - Posted: 1 Apr 2021, 15:56:58 UTC - in response to Message 100984.  
Last modified: 1 Apr 2021, 16:22:05 UTC

Nobody was asking or expecting you to abort the jobs – but what’s done is done, and cannot be undone. It makes no difference to the project who runs them, so please don’t be dissuaded from participating. The ones that weren’t resends are already out to other hosts. My machines are out of Rosetta work primarily because of the way I chose to set them up, and I’m too lazy to go round and change them all just to work around a bug in the work unit configuration. It’s arguably better that machines capable of running the ‘big’ tasks don’t pick up the ‘small’ ones, so that less-powerful machines do have a chance to run something.
ID: 100985 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 12,028
Message 100989 - Posted: 1 Apr 2021, 17:58:17 UTC - in response to Message 100954.  


Don't you just hate folk who put @ in a sentence?

Like you just did? :^P
Are you one of those pricks who said "made you look" in the school playground as a kid? If so, how's the broken nose?
ID: 100989 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 12,028
Message 100990 - Posted: 1 Apr 2021, 17:59:25 UTC - in response to Message 100957.  

Duplicate post deleted.
You'd think there'd be a delete button. Who designs these things?
ID: 100990 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1233
Credit: 14,338,560
RAC: 2,456
Message 100993 - Posted: 1 Apr 2021, 18:45:40 UTC - in response to Message 100990.  

Duplicate post deleted.
You'd think there'd be a delete button. Who designs these things?

There's a workaround. If you use the same way to mark it as a duplicate every time, the software will see it as multiple identical posts, and delete all but one of them.
ID: 100993 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mrhastyrib

Send message
Joined: 18 Feb 21
Posts: 90
Credit: 2,541,890
RAC: 0
Message 100994 - Posted: 1 Apr 2021, 22:54:49 UTC - in response to Message 100977.  

should I set up any firewall rules?
Assuming it’s the same as on Windows:

The only thing that requires Internet access is the client, and it only makes HTTP(S) connections to the project servers. So you need to open tcp/80 and/or tcp/443 outbound (plus udp/53 or whatever else your DNS needs if that’s not handled by a separate resolver); everything else can be blocked.


Those ports seem to be open by default so I guess that I'm okay. Thanks for your reply.
ID: 100994 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mrhastyrib

Send message
Joined: 18 Feb 21
Posts: 90
Credit: 2,541,890
RAC: 0
Message 100995 - Posted: 1 Apr 2021, 23:17:02 UTC - in response to Message 100979.  

I have seen several periods of downtime where work units have not been deployed for days at a time.

For an individual host's circumstances it's fine if you have a specific reason

This kind of reminds me of the hoarding that takes place here (even prior to the pandemic). There's a supply problem, which leads to hoarding, which makes it worse.

Kind of remarkable that we have too much unused CPU time to go around.
ID: 100995 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mrhastyrib

Send message
Joined: 18 Feb 21
Posts: 90
Credit: 2,541,890
RAC: 0
Message 100996 - Posted: 1 Apr 2021, 23:25:33 UTC - in response to Message 100989.  


Are you one of those pricks who said "made you look" in the school playground as a kid? If so, how's the broken nose?


Woah, dude, where did that come from? Over the use of an "at" symbol?

If you get spun up that hard, that fast over what I write, maybe the better solution is to stop reading my posts, okay?
ID: 100996 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mrhastyrib

Send message
Joined: 18 Feb 21
Posts: 90
Credit: 2,541,890
RAC: 0
Message 100997 - Posted: 1 Apr 2021, 23:36:14 UTC - in response to Message 100984.  


I will complete the units currently being processed but suspect that this project is not for me.


Don't take it personally. There's a three roll limit on toilet paper here because of some hoarders (not you). That's the rule. But best practice for the community at large is for folks to take less, if they can. If everybody does it, then there is more likely to be a ready supply available, including for you. It's something worth repeating, just so everyone is aware of it.
ID: 100997 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1725
Credit: 18,378,164
RAC: 20,578
Message 100999 - Posted: 2 Apr 2021, 1:05:24 UTC - in response to Message 100923.  

Say hello to two less hosts after they finish their current tasks, @Rosetta. I don't know if I have the time that's required to provide the space that is needed.
You’re not alone. Look at the recent results graphs – ‘tasks in progress’ has dropped by around 200,000 (a third)…
In the past it has taken several days for In progress numbers to get back to their pre-work shortage numbers. And that's with out running out of work again only a few hours after new work started coming through (which occurred this time).
If we don't run out of work again over the next few days, we should see how things actually are by early next week.
A few days in and the impact of the mis-configured Work Units is becoming clearer. Looks like the amount of work being done has dropped by almost a third, and isn't showing any signs of recovering.
For all of the latest & greatest systems there are, there are an awful lot more older much more resource limited systems.


Grant
Darwin NT
ID: 100999 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mrhastyrib

Send message
Joined: 18 Feb 21
Posts: 90
Credit: 2,541,890
RAC: 0
Message 101000 - Posted: 2 Apr 2021, 1:25:33 UTC - in response to Message 100999.  
Last modified: 2 Apr 2021, 1:26:59 UTC



Looks like the profile of a dead body lying in a shallow grave. How metaphorical.
ID: 101000 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mikey
Avatar

Send message
Joined: 5 Jan 06
Posts: 1895
Credit: 9,214,047
RAC: 1,768
Message 101005 - Posted: 2 Apr 2021, 12:01:39 UTC - in response to Message 100999.  

Say hello to two less hosts after they finish their current tasks, @Rosetta. I don't know if I have the time that's required to provide the space that is needed.
You’re not alone. Look at the recent results graphs – ‘tasks in progress’ has dropped by around 200,000 (a third)…
In the past it has taken several days for In progress numbers to get back to their pre-work shortage numbers. And that's with out running out of work again only a few hours after new work started coming through (which occurred this time).
If we don't run out of work again over the next few days, we should see how things actually are by early next week.
A few days in and the impact of the mis-configured Work Units is becoming clearer. Looks like the amount of work being done has dropped by almost a third, and isn't showing any signs of recovering.
For all of the latest & greatest systems there are, there are an awful lot more older much more resource limited systems.



Just means more work for the rest of us!!
ID: 101005 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
jsm

Send message
Joined: 4 Apr 20
Posts: 3
Credit: 77,825,233
RAC: 32,838
Message 101006 - Posted: 2 Apr 2021, 14:28:03 UTC

Bandwidth usage massively increased in March
I migrated to Rosetta from Seti almost exactly one year ago. For eleven months there was little impact on my capped 50gb bandwidth allowance but in March the usage has more than doubled. I am using the same 6 computers and the same preferences so nothing on my side has changed. When my ISP notified me of the sudden cap half way through March I installed wireshark after a difficult setup to capture packets at the router rather than at specific computers. Imagine my horror when I found that the culprit was rosetta using over 1gb per 6 hours. This is unsustainable and I will either have to shell out for an expensive unlimited contract (because I have an Ultima connection at over 100mbps) or cut back on Rosetta work.
Has there been a significant project change which could be the cause of this increased usage or am I looking for another problem?
Any suggestions most welcome. I have clawed my way to league position 599 and would like to break 500 if possible.
Capt
ID: 101006 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 12,028
Message 101007 - Posted: 2 Apr 2021, 17:48:52 UTC - in response to Message 100993.  

Duplicate post deleted.
You'd think there'd be a delete button. Who designs these things?

There's a workaround. If you use the same way to mark it as a duplicate every time, the software will see it as multiple identical posts, and delete all but one of them.
Shouldn't it have already done that when the 2nd genuine one was posted?
ID: 101007 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 12,028
Message 101008 - Posted: 2 Apr 2021, 17:50:21 UTC - in response to Message 100995.  

I have seen several periods of downtime where work units have not been deployed for days at a time.

For an individual host's circumstances it's fine if you have a specific reason

This kind of reminds me of the hoarding that takes place here (even prior to the pandemic). There's a supply problem, which leads to hoarding, which makes it worse.

Kind of remarkable that we have too much unused CPU time to go around.
Same happens in real life with toilet paper because of the plandemic. Some people are selfish idiots.
ID: 101008 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 12,028
Message 101009 - Posted: 2 Apr 2021, 17:52:03 UTC - in response to Message 100996.  


Are you one of those pricks who said "made you look" in the school playground as a kid? If so, how's the broken nose?


Woah, dude, where did that come from? Over the use of an "at" symbol?

If you get spun up that hard, that fast over what I write, maybe the better solution is to stop reading my posts, okay?
No, because you did an "I know you are" variant saying I'd used @ when telling you not to use @.

Do people seriously say "dude"?

Anyway "prick" is a compliment, it means you have a big appendage.
ID: 101009 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 12,028
Message 101010 - Posted: 2 Apr 2021, 17:53:29 UTC - in response to Message 100997.  


I will complete the units currently being processed but suspect that this project is not for me.


Don't take it personally. There's a three roll limit on toilet paper here because of some hoarders (not you). That's the rule. But best practice for the community at large is for folks to take less, if they can. If everybody does it, then there is more likely to be a ready supply available, including for you. It's something worth repeating, just so everyone is aware of it.
That limit doesn't work, you just buy from more shops at once. Not that I do that with toilet paper, but I do a similar thing to buy more paracetamol (painkiller) than you're "allowed" by the government.
ID: 101010 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 12,028
Message 101011 - Posted: 2 Apr 2021, 17:55:07 UTC - in response to Message 100999.  

Say hello to two less hosts after they finish their current tasks, @Rosetta. I don't know if I have the time that's required to provide the space that is needed.
You’re not alone. Look at the recent results graphs – ‘tasks in progress’ has dropped by around 200,000 (a third)…
In the past it has taken several days for In progress numbers to get back to their pre-work shortage numbers. And that's with out running out of work again only a few hours after new work started coming through (which occurred this time).
If we don't run out of work again over the next few days, we should see how things actually are by early next week.
A few days in and the impact of the mis-configured Work Units is becoming clearer. Looks like the amount of work being done has dropped by almost a third, and isn't showing any signs of recovering.
For all of the latest & greatest systems there are, there are an awful lot more older much more resource limited systems.

That means nothing. For example I might (manually or Boinc did it) download a load of work from another project when this one runs out. Now that has to be completed before it will get work from Rosetta again.
ID: 101011 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 398
Credit: 12,294,748
RAC: 7,588
Message 101012 - Posted: 2 Apr 2021, 18:15:20 UTC

This problem with tasks erroring out with computation error is now getting serious.

Up until now my attitude has been “it’s only a few seconds a task, no sweat” but because I’m running a very small cache with multiple projects it runs Rosetta on a one out, one in basis so it gets one, errors it and then has to wait ages before it uploads the result and asks for another. Now it’s gone to the next level because my last n tasks have all errored it is extending the back off period to many hours before it will allow another request and I’m almost to the point where Rosetta is no longer running on my main machine.

Does the panel know (a) how long these errors will continue (b) how many good tasks I need to return to get back into Rosetta’s good books?
ID: 101012 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 12,028
Message 101013 - Posted: 2 Apr 2021, 18:23:38 UTC - in response to Message 101012.  

This problem with tasks erroring out with computation error is now getting serious.

Up until now my attitude has been “it’s only a few seconds a task, no sweat” but because I’m running a very small cache with multiple projects it runs Rosetta on a one out, one in basis so it gets one, errors it and then has to wait ages before it uploads the result and asks for another. Now it’s gone to the next level because my last n tasks have all errored it is extending the back off period to many hours before it will allow another request and I’m almost to the point where Rosetta is no longer running on my main machine.

Does the panel know (a) how long these errors will continue (b) how many good tasks I need to return to get back into Rosetta’s good books?

I do the same as you with a small buffer, but all that happens is Boinc builds up a Rosetta debt, and you'll end up doing more of them when they fix it.
ID: 101013 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 87 · 88 · 89 · 90 · 91 · 92 · 93 . . . 309 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org