Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 117 · 118 · 119 · 120 · 121 · 122 · 123 . . . 310 · Next

AuthorMessage
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1729
Credit: 18,491,225
RAC: 20,847
Message 102358 - Posted: 7 Aug 2021, 23:10:28 UTC - in response to Message 102357.  
Last modified: 7 Aug 2021, 23:18:12 UTC

Finally got my machine to start downloading tasks again (Problem at Boincstats) and now everything it's downloading is failing after a short computation time. Is there a way around this issue?
As mentioned a few posts earlier, the majority of the new Tasks will error out in a few seconds, but the rest will run OK.

As it is, it looks like that batch has all been sent out, so it'll just be resends again until another larger batch of work is released.



NB- since you are running more than one project, i would suggest reducing your cache size to a half day or less. At times like these where Rosetta is out of work, your system will do just Einstein, when Rosetta gets more work then it will do more of Rosetta until your Resource share settings are being met. No need to cache work.

eg
Computing preferences, Other,
           Store at least 0.4  days of work
Store up to an additional 0.01 days of work

Grant
Darwin NT
ID: 102358 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1234
Credit: 14,338,560
RAC: 1,227
Message 102359 - Posted: 7 Aug 2021, 23:17:44 UTC - in response to Message 102357.  

Finally got my machine to start downloading tasks again (Problem at Boincstats) and now everything it's downloading is failing after a short computation time. Is there a way around this issue?

Errors after very short computation times are usually due to errors in one of the input files. In that case, not much can be done except to have the project staff cancel every task that shares the defective input file. Getting them to notice the need to do do is usually quite difficult.

I saw two of the last batch I got fail quickly, but the other five have been running for an hour. This may mean that they have filtered out some, but not all, of the tasks with defective inputs.
ID: 102359 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
TribbleRED

Send message
Joined: 24 Jun 10
Posts: 2
Credit: 28,020,519
RAC: 3,870
Message 102360 - Posted: 8 Aug 2021, 1:57:27 UTC - in response to Message 102356.  

Thank you Grant. I'll give it a go
ID: 102360 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Albert H.

Send message
Joined: 31 Jan 14
Posts: 2
Credit: 8,917,754
RAC: 49
Message 102361 - Posted: 8 Aug 2021, 9:45:51 UTC

HI,
Since 3 days no new tasks, is there a problem on my side ?

08/08/2021 11:13:17 | Rosetta@home | Project requested delay of 31 seconds
08/08/2021 11:13:52 | Rosetta@home | Sending scheduler request: To fetch work.
08/08/2021 11:13:52 | Rosetta@home | Requesting new tasks for CPU
08/08/2021 11:13:53 | Rosetta@home | Scheduler request completed: got 0 new tasks
08/08/2021 11:13:53 | Rosetta@home | No tasks sent
08/08/2021 11:13:53 | Rosetta@home | Project requested delay of 31 seconds
08/08/2021 11:28:33 | Rosetta@home | Sending scheduler request: To fetch work.
08/08/2021 11:28:33 | Rosetta@home | Requesting new tasks for CPU
08/08/2021 11:28:34 | Rosetta@home | Scheduler request completed: got 0 new tasks
08/08/2021 11:28:34 | Rosetta@home | No tasks sent

Thanks
Albert H.
ID: 102361 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1729
Credit: 18,491,225
RAC: 20,847
Message 102362 - Posted: 8 Aug 2021, 10:00:17 UTC - in response to Message 102361.  

HI,
Since 3 days no new tasks, is there a problem on my side ?
The project is presently out of work.
There are some resends, and a couple of small batches of work being released, but it's down to luck as to whether you get any or not.
Hopefully come the start of the new work week in the US more work will be loaded up at some stage and will be readily available.
Grant
Darwin NT
ID: 102362 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Kissagogo27

Send message
Joined: 31 Mar 20
Posts: 86
Credit: 2,981,693
RAC: 1,845
Message 102367 - Posted: 8 Aug 2021, 19:28:18 UTC

i got only resent ^^

and some GB10_3CL errored out too,
ID: 102367 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2145
Credit: 41,555,266
RAC: 8,961
Message 102369 - Posted: 8 Aug 2021, 21:32:53 UTC - in response to Message 102362.  

Since 3 days no new tasks, is there a problem on my side ?
The project is presently out of work.
There are some resends, and a couple of small batches of work being released, but it's down to luck as to whether you get any or not.
Hopefully come the start of the new work week in the US more work will be loaded up at some stage and will be readily available.

I managed to get 2.64 seconds total runtime from my last 40 errored tasks #Blessed
Time to give WCG its head until they fix it - sometime whenever
ID: 102369 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Stevie G

Send message
Joined: 15 Dec 18
Posts: 108
Credit: 866,895
RAC: 578
Message 102370 - Posted: 8 Aug 2021, 23:27:21 UTC - in response to Message 102369.  

I also had nothing from Roseta for almost a week. Then I got a string or tasks that worked out as "errors while computing."

Must be something amiss with the new setup?

S. Gaber
ID: 102370 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 2002
Credit: 9,790,281
RAC: 4,437
Message 102371 - Posted: 9 Aug 2021, 8:44:47 UTC - in response to Message 102370.  

I also had nothing from Roseta for almost a week. Then I got a string or tasks that worked out as "errors while computing."

No work here and problems on Ralph
Seems: "Closed for holiday" :-P
ID: 102371 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Admin
Project administrator

Send message
Joined: 1 Jul 05
Posts: 5144
Credit: 0
RAC: 0
Message 102372 - Posted: 9 Aug 2021, 15:51:57 UTC

I've notified the researcher that submitted the jobs that are failing.

We are looking into the HTTP download issues at the moment.
ID: 102372 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile xroule
Avatar

Send message
Joined: 9 Feb 15
Posts: 4
Credit: 59,082,533
RAC: 7,316
Message 102373 - Posted: 10 Aug 2021, 0:27:57 UTC - in response to Message 102372.  

Not a new problem. It often and it last many days. Not very serious! :-(
I had to go back to W,C,G.
ID: 102373 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1234
Credit: 14,338,560
RAC: 1,227
Message 102374 - Posted: 10 Aug 2021, 2:45:18 UTC - in response to Message 102373.  

Not a new problem. It often and it last many days. Not very serious! :-(
I had to go back to W,C,G.

W.C.G. is now almost out of tasks, so you may need to add another project.
ID: 102374 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile xroule
Avatar

Send message
Joined: 9 Feb 15
Posts: 4
Credit: 59,082,533
RAC: 7,316
Message 102375 - Posted: 10 Aug 2021, 2:53:17 UTC - in response to Message 102374.  

I have lots of WCG w.u. in store. And yes I can join an other project. I can stop crunching altogether. All that does not solve the fact that this project runs out of W.U. just too often without explanation. That show how much they care about the crunchers. And no I do not know who (they) are.
ID: 102375 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1832
Credit: 119,860,059
RAC: 4,566
Message 102376 - Posted: 10 Aug 2021, 7:23:32 UTC - in response to Message 102375.  
Last modified: 10 Aug 2021, 7:24:30 UTC

Their job is to do research, not to create meaningless work, so sometimes there is down-time.

It is especially understandable at the moment with such big improvements being made in the field since AlphaFold 2's showing at CASP13 and more recently, since releasing their source code.

It would be good to have more information posted about what is happening behind the scenes and the different versions of Rosetta though, especially the vbox versions.
ID: 102376 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Albert H.

Send message
Joined: 31 Jan 14
Posts: 2
Credit: 8,917,754
RAC: 49
Message 102379 - Posted: 10 Aug 2021, 19:38:50 UTC

I got lots of work at the moment.

Albert
ID: 102379 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
UBT - wbiz

Send message
Joined: 5 Feb 21
Posts: 6
Credit: 968,608
RAC: 12
Message 102380 - Posted: 10 Aug 2021, 21:29:06 UTC - in response to Message 102376.  

Their job is to do research, not to create meaningless work, so sometimes there is down-time.

It is especially understandable at the moment with such big improvements being made in the field since AlphaFold 2's showing at CASP13 and more recently, since releasing their source code.

It would be good to have more information posted about what is happening behind the scenes and the different versions of Rosetta though, especially the vbox versions.


I thought RoseTTAFold was one step ahead of AlphaFold 2 now?
ID: 102380 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1832
Credit: 119,860,059
RAC: 4,566
Message 102383 - Posted: 11 Aug 2021, 8:48:10 UTC - in response to Message 102380.  

I don't think so - I think RoseTTaFold was closing the gap, but the reality is probably more complicated than that. I would guess that there are lots of areas where the are differences, like training improving the training, as well as the modelling.
ID: 102383 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1729
Credit: 18,491,225
RAC: 20,847
Message 102384 - Posted: 12 Aug 2021, 7:49:18 UTC

And it looks like we're out of work again.
Grant
Darwin NT
ID: 102384 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 102394 - Posted: 13 Aug 2021, 19:35:59 UTC - in response to Message 102384.  
Last modified: 13 Aug 2021, 20:11:52 UTC

Tasks ready to send 0
Tasks in progress 169215

And now that it is the weekend, nothing will change.
RAH is beginning to make me wonder if its stable or not.
Bugs, no work for days, etc.
This is not the RAH I started with.
ID: 102394 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Cobra

Send message
Joined: 9 Nov 05
Posts: 7
Credit: 16,618,655
RAC: 799
Message 102401 - Posted: 16 Aug 2021, 3:39:52 UTC - in response to Message 102394.  
Last modified: 16 Aug 2021, 3:40:15 UTC

Tasks ready to send 0
Tasks in progress 169215

And now that it is the weekend, nothing will change.
RAH is beginning to make me wonder if its stable or not.
Bugs, no work for days, etc.
This is not the RAH I started with.

We've been told to expect periods of down time. It's even at the top of the project home page.

https://boinc.bakerlab.org/rosetta/forum_thread.php?id=14290
https://boinc.bakerlab.org/rosetta/
ID: 102401 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 117 · 118 · 119 · 120 · 121 · 122 · 123 . . . 310 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org