Computation errors

Message boards : Number crunching : Computation errors

To post messages, you must log in.

AuthorMessage
Profile Davide Cioni

Send message
Joined: 17 Jul 17
Posts: 5
Credit: 31,215
RAC: 1,087
Message 90595 - Posted: 30 Mar 2019, 18:06:19 UTC

Hi, since I've come back to this project I've been seeing some strange errors in some of my WUs, especially in the ones that study big proteins, here are a few examples:
-https://boinc.bakerlab.org/rosetta/result.php?resultid=1065314770
-https://boinc.bakerlab.org/rosetta/result.php?resultid=1065314768
-https://boinc.bakerlab.org/rosetta/result.php?resultid=1065460662

How can I keep these errors from happening?
ID: 90595 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
rjs5

Send message
Joined: 22 Nov 10
Posts: 249
Credit: 7,934,025
RAC: 8,231
Message 90599 - Posted: 31 Mar 2019, 16:29:03 UTC - in response to Message 90595.  

Hi, since I've come back to this project I've been seeing some strange errors in some of my WUs, especially in the ones that study big proteins, here are a few examples:
-https://boinc.bakerlab.org/rosetta/result.php?resultid=1065314770
-https://boinc.bakerlab.org/rosetta/result.php?resultid=1065314768
-https://boinc.bakerlab.org/rosetta/result.php?resultid=1065460662

How can I keep these errors from happening?


Rosetta developers were quite sloppy in their allocation and use of memory.

Task 1065460662 ran out of memory.
https://boinc.bakerlab.org/rosetta/result.php?resultid=1065460662

The other two error out with "Funzione non corretta" or "incorrect function"

When one WU runs out of memory, other WU may get strange error messages from function calls as developers don't always check the return results of all system calls.

The WU you are running are 64-bit and sometimes take large amounts of memory ... frequently over a GB each.

8gb should be enough to run 4 Rosetta 64-bit WU, so I would examine how memory is being used and change the workload.
Buy more memory if practical.
Lower the number of Rosetta WU running simultaneously with app_config.xml or BOINC -> OPTIONS -> COMPUTING PREFERENCES -> USAGE LIMITS
ID: 90599 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Davide Cioni

Send message
Joined: 17 Jul 17
Posts: 5
Credit: 31,215
RAC: 1,087
Message 90600 - Posted: 31 Mar 2019, 19:25:31 UTC - in response to Message 90599.  

Ok, thank you!
ID: 90600 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Number crunching : Computation errors



©2019 University of Washington
http://www.bakerlab.org