Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 273 · 274 · 275 · 276 · 277 · 278 · 279 . . . 303 · Next

AuthorMessage
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2127
Credit: 41,266,340
RAC: 8,573
Message 109102 - Posted: 10 Apr 2024, 11:19:41 UTC - in response to Message 109101.  

Just need the Invalids re-validated & all will be good.

No optimism here

Nor here, if not done by now
ID: 109102 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2127
Credit: 41,266,340
RAC: 8,573
Message 109103 - Posted: 12 Apr 2024, 2:25:21 UTC - in response to Message 109100.  

And we're back...

Looks like the whole website went down for about 10hours today.
Couldn't even get to the Rosetta home page let alone upload results.
Everything going through fine now

And again, we're back.
Another 2-3hr outage of the entire website.

I did send another email - and mentioned how Validation didn't come back last time, so to double-check that.
And snuck in a request for revalidation of tasks from April 4-6 that all errored out, just in case they're in a good mood (I didn't mention it last time)

Anyone else notice the entire website went down again today?
For at least 6hrs when I was wondering why the backoff was up to several hrs
Not sure what's been going on recently
ID: 109103 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1995
Credit: 9,653,827
RAC: 7,305
Message 109104 - Posted: 12 Apr 2024, 8:03:42 UTC - in response to Message 109103.  

Anyone else notice the entire website went down again today?
For at least 6hrs when I was wondering why the backoff was up to several hrs
Not sure what's been going on recently


yes. Also Ralph, that is on other hw, went down for some hrs

And no work....
ID: 109104 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile rilian
Avatar

Send message
Joined: 16 Jun 07
Posts: 23
Credit: 2,878,958
RAC: 5,518
Message 109105 - Posted: 12 Apr 2024, 16:03:45 UTC

I see there are about 7000 tasks in progress, and one of my computers got one resend
i crunch for Ukraine. Join our team forums about Rosetta@home
ID: 109105 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jean-David Beyer

Send message
Joined: 2 Nov 05
Posts: 192
Credit: 6,475,014
RAC: 6,131
Message 109106 - Posted: 12 Apr 2024, 20:17:08 UTC - in response to Message 109103.  
Last modified: 12 Apr 2024, 20:17:44 UTC

Anyone else notice the entire website went down again today?
For at least 6hrs when I was wondering why the backoff was up to several hrs
Not sure what's been going on recently


Yes, but it is up right now and I just got a bunch of tasks -- Rosetta Beta 6.05.
ID: 109106 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 262
Credit: 499,663
RAC: 802
Message 109107 - Posted: 12 Apr 2024, 20:45:48 UTC

Graphics do not work with linux version of rosetta beta.

When i start graphical app it immediately closes and outputs this in stderrgfx.txt:

cat /var/lib/boinc/slots/6/stderrgfx.txt

ERROR: Unable to open file: /var/lib/boinc/projects/boinc.bakerlab.org_rosetta/../database/chemical/residue_type_sets/fa_standard/residue_types.txt

ERROR:: Exit from: src/core/chemical/GlobalResidueTypeSet.cc line: 145
23:25:39 (68987): called boinc_finish(0)


It should look for database at /var/lib/boinc/projects/boinc.bakerlab.org_rosetta/database_0f7f01a1b07/database , not /var/lib/boinc/projects/database
ID: 109107 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2127
Credit: 41,266,340
RAC: 8,573
Message 109108 - Posted: 13 Apr 2024, 0:08:39 UTC - in response to Message 109106.  

Anyone else notice the entire website went down again today?
For at least 6hrs when I was wondering why the backoff was up to several hrs
Not sure what's been going on recently

Yes, but it is up right now and I just got a bunch of tasks -- Rosetta Beta 6.05.

Looks like another million tasks got released.
I'm not going to say the site is running well recently, in a variety of ways, but what seems like a regular million tasks each week with a few blank days in between is the best we've had for a very long time.
If they can keep this going I won't be too unhappy, however much better it's been in the distant past.
ID: 109108 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile adrianxw
Avatar

Send message
Joined: 18 Sep 05
Posts: 653
Credit: 11,840,739
RAC: 23
Message 109116 - Posted: 16 Apr 2024, 6:01:20 UTC

I've set no new tasks again. The current jobs have 8 hours as their runtime, but here, they are running for three times that, (4GHz i7), which is pushing my system into panic mode.
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.
ID: 109116 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1689
Credit: 18,016,030
RAC: 23,735
Message 109117 - Posted: 16 Apr 2024, 7:09:30 UTC - in response to Message 109116.  
Last modified: 16 Apr 2024, 7:18:04 UTC

I've set no new tasks again. The current jobs have 8 hours as their runtime, but here, they are running for three times that, (4GHz i7), which is pushing my system into panic mode.
And the same issue is happening with your other projects.
Asteroids- 2hrs Runtime,1hr CPU time.
SIdock- 31.5hrs Runtime, 27hrs 40min CPU time.
Denis- 3hr 40min Runtime, 1hr CPU time.
Got to love Denis, almost 4 times as much time spent to do a given amount of work. Even worse than your Seti times.


And you have been told repeatedly how to resolve the issue, yet you continue to ignore that advice.
So why bother even posting about it?
Grant
Darwin NT
ID: 109117 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile adrianxw
Avatar

Send message
Joined: 18 Sep 05
Posts: 653
Credit: 11,840,739
RAC: 23
Message 109118 - Posted: 16 Apr 2024, 13:26:49 UTC - in response to Message 109117.  

I simply stated facts.
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.
ID: 109118 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 262
Credit: 499,663
RAC: 802
Message 109119 - Posted: 16 Apr 2024, 13:47:41 UTC

We should restrict registering to invites-only to reduce spam posts.
Projects that have implemented registration protection:
https://gene.disi.unitn.it/test/
https://srbase.my-firewall.org/sr5/
https://lhcathomedev.cern.ch/lhcathome-dev/
ID: 109119 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1233
Credit: 14,284,221
RAC: 1,121
Message 109120 - Posted: 16 Apr 2024, 19:50:28 UTC - in response to Message 109119.  

We should restrict registering to invites-only to reduce spam posts.
Projects that have implemented registration protection:
https://gene.disi.unitn.it/test/
https://srbase.my-firewall.org/sr5/
https://lhcathomedev.cern.ch/lhcathome-dev/


Restricting posts in most of the forums to users who have finished running at least one hour of work should also give such results.

ONE forum to asking and answering questions about how to complete that one hour should be enough for the users is needed, though. Those who use that one forum to post spam can then be restricted from posting anything at all.
ID: 109120 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1689
Credit: 18,016,030
RAC: 23,735
Message 109123 - Posted: 17 Apr 2024, 5:31:23 UTC - in response to Message 109118.  

I simply stated facts.
This forum is for Issues with Rosetta@home.
If you want to post just for the sake of posting, then make use of the Cafe Rosetta forum- that's what it is there for.
Grant
Darwin NT
ID: 109123 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile rilian
Avatar

Send message
Joined: 16 Jun 07
Posts: 23
Credit: 2,878,958
RAC: 5,518
Message 109125 - Posted: 17 Apr 2024, 14:16:22 UTC - in response to Message 109119.  

We should restrict registering to invites-only to reduce spam posts.
Projects that have implemented registration protection:
https://gene.disi.unitn.it/test/
https://srbase.my-firewall.org/sr5/
https://lhcathomedev.cern.ch/lhcathome-dev/


I think invite-only will deflect more prospective crunchers from project comparing to spam posts.. Maybe adding couple more moderators will help

It would be better to restrict posting based on RAC.. But allow some forum for tech questions if possible
i crunch for Ukraine. Join our team forums about Rosetta@home
ID: 109125 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 109127 - Posted: 17 Apr 2024, 18:50:30 UTC - in response to Message 109116.  

I've set no new tasks again. The current jobs have 8 hours as their runtime, but here, they are running for three times that, (4GHz i7), which is pushing my system into panic mode.



Check your system resources, what all is using your memory and CPU?
Your processor is also very old.
Launched August 5, 2015
Discontinued March 4, 2019 (desktop processors)


Maybe you should put a restriction on how many active tasks you have going at one time.

But look, everyone offers your suggestions and you say everything is working fine, I don't want to mess with anything. Then you come here and complain about this or that. So why do we even bother answering you anymore? You don't listen. If Grant says something needs to change, then listen to him. If you don't then your system will just continue to be "slow" or whatever the problem is now.


I think your processor is to slow for these high level tasks.
I have a AMD Ryzen 7 and I have no trouble chewing up these tasks.

You have lost 5 tasks to Time Out because you can not complete all the tasks you download in the 3 day period the project has given you to process the tasks. So again, the idea of restricting the amount of Rosetta work or other project work seems to make sense. Whatever it is, other projects, processor speed, something is slowing you down. You have a 2 day turn around time. You have 3 days to download, process and report back a task.

Grant might have some suggestions or thoughts, but on the other hand maybe he is is tired of suggesting things and being ignored.

Good luck figuring it out, I'm out of ideas.
ID: 109127 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2127
Credit: 41,266,340
RAC: 8,573
Message 109131 - Posted: 19 Apr 2024, 11:03:21 UTC - in response to Message 109116.  

I've set no new tasks again. The current jobs have 8 hours as their runtime, but here, they are running for three times that, (4GHz i7), which is pushing my system into panic mode.

This is not true. If it was true things would be slightly better, but still not ideal.

All your completed Rosetta tasks are running approximately 43,200 seconds, which is 12hrs target runtime, not the 8 hours Rosetta default.
This isn't a massive problem in itself. I switched to 12hrs too so I get a bit more from the rare amount of tasks we get nowadays, but it's simple to change that if you want (and if it's what you thought was already happening).

But the much bigger problem is your PC is taking between 72.000 and 116,000 secs, which is between 20 & 32hrs to run just 12hrs of Boinc work.

Tasks are taking that long because of 'not-Boinc' activity on your PC, not Boinc activity.

You need to change "use at most xx% of CPU time" so that the %age is much higher than (I'm guessing) your 40-50%.

Every higher % will reduce the time <you> tell Boinc to <not> work on tasks and take longer to complete.

If you're anxious about the effect of doing so, try 75% for a while, then edge it up another 5% if you notice no problems and stop when you reach a level of processing you're comfortable with, while reducing task runtime to completion.

This will solve the <entirety> of your problems, while (coincidentally) massively increasing your contribution to <all> the projects you run within your preferred settings.
ID: 109131 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1689
Credit: 18,016,030
RAC: 23,735
Message 109133 - Posted: 19 Apr 2024, 23:06:25 UTC - in response to Message 109131.  

This will solve the <entirety> of your problems, while (coincidentally) massively increasing your contribution to <all> the projects you run within your preferred settings.
He's running Folding at home as well.
He asked about this issue 4 years ago and ignored all advice as to how to fix it. He asked about it again about a month or so back, and once again refused to take any advice on how to resolve it.
He just likes to whinge about things he's not prepared to do anything about- ie Look in Task manager to see exactly what processes are using CPU time, and then limiting the number of cores/threads BOINC can use so it's not impacted by those used by Folding.
Grant
Darwin NT
ID: 109133 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile adrianxw
Avatar

Send message
Joined: 18 Sep 05
Posts: 653
Credit: 11,840,739
RAC: 23
Message 109134 - Posted: 20 Apr 2024, 7:01:38 UTC
Last modified: 20 Apr 2024, 7:03:58 UTC

I'd like to comment.

I see a problem, a problem that I should not be seeing. I try to make headway to resolve it, so ask. The result of asking each time is the same, basically, the BOINC folk tell me the problem is Folding, the Folding folk tell me it is not.

I have set no new tasks at both. I would seem to face a choice, I can support one or the other. Both are important to me.
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.
ID: 109134 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile rilian
Avatar

Send message
Joined: 16 Jun 07
Posts: 23
Credit: 2,878,958
RAC: 5,518
Message 109135 - Posted: 22 Apr 2024, 18:29:15 UTC

Project seems to be alive now
i crunch for Ukraine. Join our team forums about Rosetta@home
ID: 109135 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1995
Credit: 9,653,827
RAC: 7,305
Message 109136 - Posted: 22 Apr 2024, 18:40:29 UTC - in response to Message 109135.  

Project seems to be alive now


And i report some wus over the deadline.
I don't know if they will consider these as valid
ID: 109136 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 273 · 274 · 275 · 276 · 277 · 278 · 279 . . . 303 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org