Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 277 · 278 · 279 · 280

AuthorMessage
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1509
Credit: 15,232,900
RAC: 23,453
Message 109225 - Posted: 3 May 2024, 2:26:32 UTC - in response to Message 109224.  

Server is still dead.
It seem mostly up for me.
Nope.
The boinc-process server is still dead, that's according to the Server Staus page & the number of Tasks that are piling up waiting for Validation & Assimilation.
Waiting for Validation is over 325,000 now.

That's why even though people are returning work, their Credit isn't increasing & their RAC is going down.
Grant
Darwin NT
ID: 109225 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1509
Credit: 15,232,900
RAC: 23,453
Message 109226 - Posted: 3 May 2024, 4:21:57 UTC

I don't want to tempt fate, but the boinc-process server appears to be alive again (at least for now).
Grant
Darwin NT
ID: 109226 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1509
Credit: 15,232,900
RAC: 23,453
Message 109227 - Posted: 3 May 2024, 4:25:08 UTC

I really wish they'd fix the application error handling, or at least the data they send out to process. Got a bunch of Tasks that have errored out.

ERROR: Error in protocols::cyclic_peptide_predict::SimpleCycpepPredictpplication::set_up_n_to_c_cyclization_mover() function: residue 1 does not have a LOWER_CONNECT.
*deep sigh*
Grant
Darwin NT
ID: 109227 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1509
Credit: 15,232,900
RAC: 23,453
Message 109228 - Posted: 3 May 2024, 11:06:28 UTC - in response to Message 109226.  

I don't want to tempt fate, but the boinc-process server appears to be alive again (at least for now).
And the backlog has cleared.
Grant
Darwin NT
ID: 109228 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Chris Raisin

Send message
Joined: 18 May 16
Posts: 2
Credit: 5,432,660
RAC: 1,496
Message 109234 - Posted: 7 May 2024, 18:49:49 UTC

I am receiving a constant error message via BOINC re Rosetta@Home and I am not sure how to resolve it.

The message (relating solely to Rosetta@Home) is:

"Could not determine location of executable.
Could not find database. Either specify -database or set variable ROSETTA3_db"

Can someone advise where in user files (I assume) a configuration file relating to BOINC and Rosetta@Home needs modification?

Many thanks, Chris Raisin
ID: 109234 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1226
Credit: 13,940,531
RAC: 3,125
Message 109235 - Posted: 7 May 2024, 19:14:52 UTC - in response to Message 109234.  

I am receiving a constant error message via BOINC re Rosetta@Home and I am not sure how to resolve it.

The message (relating solely to Rosetta@Home) is:

"Could not determine location of executable.
Could not find database. Either specify -database or set variable ROSETTA3_db"

Can someone advise where in user files (I assume) a configuration file relating to BOINC and Rosetta@Home needs modification?

Many thanks, Chris Raisin


I've seen that message many times. Until those workunits get some hard to guess change, expect many more workunits running under Windows to have the same problem.
ID: 109235 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1509
Credit: 15,232,900
RAC: 23,453
Message 109236 - Posted: 7 May 2024, 21:43:28 UTC - in response to Message 109234.  

I am receiving a constant error message via BOINC re Rosetta@Home and I am not sure how to resolve it.

The message (relating solely to Rosetta@Home) is:

"Could not determine location of executable.
Could not find database. Either specify -database or set variable ROSETTA3_db"

Can someone advise where in user files (I assume) a configuration file relating to BOINC and Rosetta@Home needs modification?

Many thanks, Chris Raisin
Where are those error messages being shown?
Looking at your results, there are only 2 that have errored out,
ERROR: Error in protocols::cyclic_peptide_predict::SimpleCycpepPredictpplication::set_up_n_to_c_cyclization_mover() function: residue 1 does not have a LOWER_CONNECT.
Which has been an issue with some Tasks for ages now.

Other than what appears to be a heavily loaded system (11.5 hours to do 8 hours work, 4 hrs 15 min to do 3 hrs work), other than the 2 errored Tasks(due to a configuration issue with the Tasks themselves), all the others have processed & Validated without issue.
Grant
Darwin NT
ID: 109236 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1887
Credit: 8,440,178
RAC: 10,663
Message 109237 - Posted: 8 May 2024, 7:40:35 UTC - in response to Message 109236.  

Where are those error messages being shown?
Other than what appears to be a heavily loaded system (11.5 hours to do 8 hours work, 4 hrs 15 min to do 3 hrs work), other than the 2 errored Tasks(due to a configuration issue with the Tasks themselves), all the others have processed & Validated without issue.


Seems the message of the screensaver...
ID: 109237 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 378
Credit: 11,011,089
RAC: 12,685
Message 109250 - Posted: 15 May 2024, 8:12:34 UTC

A strange error, sadly I can only give a sketchy report but I hope it’s enough :-

Host = https://boinc.bakerlab.org/rosetta/results.php?hostid=6231982

Boinc 7.24.1, Ubuntu 22.04.4

I allowed Ubuntu to update and then rebooted, subsequent to this Boinc Manager disconnected after running for about a minute - the event log showed a Rosetta task restarting and immediately Boinc closing having received signal 15. This would repeat each time I restated the host and the Boinc service restarted.

I have now aborted all of the Rosetta tasks and this behaviour has now stopped.

(How) can a Rosetta task kill Boinc?

Just a notification as I’ve never heard this described before.
ID: 109250 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
MStenholm

Send message
Joined: 18 Apr 20
Posts: 18
Credit: 23,343,683
RAC: 26,989
Message 109255 - Posted: 16 May 2024, 5:10:07 UTC - in response to Message 109250.  

You ran out of memory. Six jobs of 2.6 GB and you have 16 GB.
ID: 109255 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1509
Credit: 15,232,900
RAC: 23,453
Message 109256 - Posted: 16 May 2024, 6:41:47 UTC - in response to Message 109255.  
Last modified: 16 May 2024, 6:50:26 UTC

You ran out of memory. Six jobs of 2.6 GB and you have 16 GB.
That might do it.
I've got half that many cores/threads & twice that amount of RAM and over the last couple of days when i had mostly Rosetta_VS Tasks there have been times i've had over 60% of my RAM in use.
Even without the 2GB + Tasks, there were plenty of others using 1-1.5GB.


But normally if lack of RAM is an issue, the Taks should have suspended with a "Waiting for memory" note. It shouldn't cause things to crash & burn.
Grant
Darwin NT
ID: 109256 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 378
Credit: 11,011,089
RAC: 12,685
Message 109257 - Posted: 16 May 2024, 8:03:43 UTC - in response to Message 109255.  

You ran out of memory. Six jobs of 2.6 GB and you have 16 GB.


Ach, I thought I had 32gb.

I remember now, the 2 sticks wouldn't play with each other :-(

The other machine has 64gb, I'll update this one to match

Thanks
ID: 109257 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 277 · 278 · 279 · 280

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org