Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 277 · 278 · 279 · 280 · 281 · 282 · 283 . . . 302 · Next

AuthorMessage
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1686
Credit: 18,007,137
RAC: 23,713
Message 109225 - Posted: 3 May 2024, 2:26:32 UTC - in response to Message 109224.  

Server is still dead.
It seem mostly up for me.
Nope.
The boinc-process server is still dead, that's according to the Server Staus page & the number of Tasks that are piling up waiting for Validation & Assimilation.
Waiting for Validation is over 325,000 now.

That's why even though people are returning work, their Credit isn't increasing & their RAC is going down.
Grant
Darwin NT
ID: 109225 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1686
Credit: 18,007,137
RAC: 23,713
Message 109226 - Posted: 3 May 2024, 4:21:57 UTC

I don't want to tempt fate, but the boinc-process server appears to be alive again (at least for now).
Grant
Darwin NT
ID: 109226 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1686
Credit: 18,007,137
RAC: 23,713
Message 109227 - Posted: 3 May 2024, 4:25:08 UTC

I really wish they'd fix the application error handling, or at least the data they send out to process. Got a bunch of Tasks that have errored out.

ERROR: Error in protocols::cyclic_peptide_predict::SimpleCycpepPredictpplication::set_up_n_to_c_cyclization_mover() function: residue 1 does not have a LOWER_CONNECT.
*deep sigh*
Grant
Darwin NT
ID: 109227 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1686
Credit: 18,007,137
RAC: 23,713
Message 109228 - Posted: 3 May 2024, 11:06:28 UTC - in response to Message 109226.  

I don't want to tempt fate, but the boinc-process server appears to be alive again (at least for now).
And the backlog has cleared.
Grant
Darwin NT
ID: 109228 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Chris Raisin

Send message
Joined: 18 May 16
Posts: 2
Credit: 5,537,464
RAC: 397
Message 109234 - Posted: 7 May 2024, 18:49:49 UTC

I am receiving a constant error message via BOINC re Rosetta@Home and I am not sure how to resolve it.

The message (relating solely to Rosetta@Home) is:

"Could not determine location of executable.
Could not find database. Either specify -database or set variable ROSETTA3_db"

Can someone advise where in user files (I assume) a configuration file relating to BOINC and Rosetta@Home needs modification?

Many thanks, Chris Raisin
ID: 109234 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1233
Credit: 14,281,662
RAC: 943
Message 109235 - Posted: 7 May 2024, 19:14:52 UTC - in response to Message 109234.  

I am receiving a constant error message via BOINC re Rosetta@Home and I am not sure how to resolve it.

The message (relating solely to Rosetta@Home) is:

"Could not determine location of executable.
Could not find database. Either specify -database or set variable ROSETTA3_db"

Can someone advise where in user files (I assume) a configuration file relating to BOINC and Rosetta@Home needs modification?

Many thanks, Chris Raisin


I've seen that message many times. Until those workunits get some hard to guess change, expect many more workunits running under Windows to have the same problem.
ID: 109235 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1686
Credit: 18,007,137
RAC: 23,713
Message 109236 - Posted: 7 May 2024, 21:43:28 UTC - in response to Message 109234.  

I am receiving a constant error message via BOINC re Rosetta@Home and I am not sure how to resolve it.

The message (relating solely to Rosetta@Home) is:

"Could not determine location of executable.
Could not find database. Either specify -database or set variable ROSETTA3_db"

Can someone advise where in user files (I assume) a configuration file relating to BOINC and Rosetta@Home needs modification?

Many thanks, Chris Raisin
Where are those error messages being shown?
Looking at your results, there are only 2 that have errored out,
ERROR: Error in protocols::cyclic_peptide_predict::SimpleCycpepPredictpplication::set_up_n_to_c_cyclization_mover() function: residue 1 does not have a LOWER_CONNECT.
Which has been an issue with some Tasks for ages now.

Other than what appears to be a heavily loaded system (11.5 hours to do 8 hours work, 4 hrs 15 min to do 3 hrs work), other than the 2 errored Tasks(due to a configuration issue with the Tasks themselves), all the others have processed & Validated without issue.
Grant
Darwin NT
ID: 109236 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1995
Credit: 9,646,358
RAC: 6,848
Message 109237 - Posted: 8 May 2024, 7:40:35 UTC - in response to Message 109236.  

Where are those error messages being shown?
Other than what appears to be a heavily loaded system (11.5 hours to do 8 hours work, 4 hrs 15 min to do 3 hrs work), other than the 2 errored Tasks(due to a configuration issue with the Tasks themselves), all the others have processed & Validated without issue.


Seems the message of the screensaver...
ID: 109237 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 393
Credit: 12,121,358
RAC: 3,867
Message 109250 - Posted: 15 May 2024, 8:12:34 UTC

A strange error, sadly I can only give a sketchy report but I hope it’s enough :-

Host = https://boinc.bakerlab.org/rosetta/results.php?hostid=6231982

Boinc 7.24.1, Ubuntu 22.04.4

I allowed Ubuntu to update and then rebooted, subsequent to this Boinc Manager disconnected after running for about a minute - the event log showed a Rosetta task restarting and immediately Boinc closing having received signal 15. This would repeat each time I restated the host and the Boinc service restarted.

I have now aborted all of the Rosetta tasks and this behaviour has now stopped.

(How) can a Rosetta task kill Boinc?

Just a notification as I’ve never heard this described before.
ID: 109250 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
MStenholm

Send message
Joined: 18 Apr 20
Posts: 18
Credit: 26,120,491
RAC: 18,191
Message 109255 - Posted: 16 May 2024, 5:10:07 UTC - in response to Message 109250.  

You ran out of memory. Six jobs of 2.6 GB and you have 16 GB.
ID: 109255 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1686
Credit: 18,007,137
RAC: 23,713
Message 109256 - Posted: 16 May 2024, 6:41:47 UTC - in response to Message 109255.  
Last modified: 16 May 2024, 6:50:26 UTC

You ran out of memory. Six jobs of 2.6 GB and you have 16 GB.
That might do it.
I've got half that many cores/threads & twice that amount of RAM and over the last couple of days when i had mostly Rosetta_VS Tasks there have been times i've had over 60% of my RAM in use.
Even without the 2GB + Tasks, there were plenty of others using 1-1.5GB.


But normally if lack of RAM is an issue, the Taks should have suspended with a "Waiting for memory" note. It shouldn't cause things to crash & burn.
Grant
Darwin NT
ID: 109256 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 393
Credit: 12,121,358
RAC: 3,867
Message 109257 - Posted: 16 May 2024, 8:03:43 UTC - in response to Message 109255.  

You ran out of memory. Six jobs of 2.6 GB and you have 16 GB.


Ach, I thought I had 32gb.

I remember now, the 2 sticks wouldn't play with each other :-(

The other machine has 64gb, I'll update this one to match

Thanks
ID: 109257 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1686
Credit: 18,007,137
RAC: 23,713
Message 109268 - Posted: 22 May 2024, 9:13:28 UTC
Last modified: 22 May 2024, 9:21:47 UTC

Looks like the boinc-process server is having issues yet again- Rosetta beta Validator & Assimilator are down (along with a few other processes). How far behind witll the Validator get this time?
Presently 11,825 Workunits waiting for Validation.
Grant
Darwin NT
ID: 109268 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1686
Credit: 18,007,137
RAC: 23,713
Message 109269 - Posted: 22 May 2024, 10:09:59 UTC - in response to Message 109268.  

Looks like the boinc-process server is having issues yet again- Rosetta beta Validator & Assimilator are down (along with a few other processes). How far behind witll the Validator get this time?
Presently 11,825 Workunits waiting for Validation.
Backlog is now 20,000, but Validator now shows as running. Will have to wait a while to see if it actually is.
Grant
Darwin NT
ID: 109269 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1995
Credit: 9,646,358
RAC: 6,848
Message 109270 - Posted: 22 May 2024, 12:06:54 UTC - in response to Message 109269.  

Backlog is now 20,000, but Validator now shows as running. Will have to wait a while to see if it actually is.

Now is 0. Validator queue is empty.
ID: 109270 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Dr Who Fan
Avatar

Send message
Joined: 28 May 06
Posts: 70
Credit: 268,055
RAC: 300
Message 109272 - Posted: 22 May 2024, 16:07:03 UTC

Me & all wingman Seeing lots of errors on Android due to what appears to be misconfigured Rosetta task:
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=1395840251
[ ERROR ]: Caught exception:


File: src/core/pack/dunbrack/SingleResidueDunbrackLibrary.hh:306
chi angle must be between -180 and 180: nan
 ------------------------ Begin developer's backtrace ------------------------- 
BACKTRACE:
 ------------------------- End developer's backtrace -------------------------- 


ID: 109272 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1686
Credit: 18,007,137
RAC: 23,713
Message 109273 - Posted: 23 May 2024, 6:17:05 UTC - in response to Message 109272.  

Me & all wingman Seeing lots of errors on Android due to what appears to be misconfigured Rosetta task:
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=1395840251
[ ERROR ]: Caught exception:


File: src/core/pack/dunbrack/SingleResidueDunbrackLibrary.hh:306
chi angle must be between -180 and 180: nan
 ------------------------ Begin developer's backtrace ------------------------- 
BACKTRACE:
 ------------------------- End developer's backtrace -------------------------- 
Been a problem for years now.
Grant
Darwin NT
ID: 109273 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1995
Credit: 9,646,358
RAC: 6,848
Message 109275 - Posted: 23 May 2024, 8:40:54 UTC - in response to Message 109273.  

chi angle must be between -180 and 180: nan

Been a problem for years now.


A great classic!!
ID: 109275 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
dcs1955

Send message
Joined: 2 Dec 22
Posts: 13
Credit: 6,026,425
RAC: 13,411
Message 109277 - Posted: 23 May 2024, 16:20:17 UTC

Waiting for Memory.... For the past two weeks I have had one of four core processes held up for needing memory.. It happens on two of my desktops with 16 GRAM. In over 8 years crunching WCG and Rosetta I have not had this happen. Since all the work is Rosetta Beta 6.04. Is this a known issue??
ID: 109277 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 259
Credit: 497,912
RAC: 658
Message 109278 - Posted: 23 May 2024, 16:22:47 UTC
Last modified: 23 May 2024, 16:23:27 UTC

RosettaVS tasks use more memory than 8a_hal
ID: 109278 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 277 · 278 · 279 · 280 · 281 · 282 · 283 . . . 302 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org