Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 96 · 97 · 98 · 99 · 100 · 101 · 102 . . . 311 · Next

AuthorMessage
mikey
Avatar

Send message
Joined: 5 Jan 06
Posts: 1895
Credit: 9,217,610
RAC: 822
Message 101254 - Posted: 12 Apr 2021, 13:19:53 UTC - in response to Message 101250.  

The 6.5GB problem goes away on an 8GB machine if you set it to use 100% memory. It never actually uses 100% since everything overestimates. I just changed my old Boinc-only machines [1] and Rosettas downloaded and ran.

[1] Who has 8GB on a machine they actually interact with? You could maybe load Windows 10 and 1 application. But dare to play a game, or use email and a photo editor at once and it'll grind to a halt. Another example of modern shoddy lazy bloated programming. I can boot Linux off a 1GB flash drive. Yet Windows is 20 times bigger.


Windows10 runs just fine with 8gb of ram, even on a laptop, and can even crunch Boinc projects quite well if you have the right processor and choose your projects wisely. Playing games is a whole other story though and you are correct unless you are playing a non competitive game like MineCraft or the sort. The size of the Windows OS is what it is it's not like it can be changed by any of us so you just learn to deal with what you have to deal with or you change to something else.
ID: 101254 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
MarkJ

Send message
Joined: 28 Mar 20
Posts: 72
Credit: 25,238,680
RAC: 0
Message 101255 - Posted: 12 Apr 2021, 13:27:43 UTC - in response to Message 101205.  
Last modified: 12 Apr 2021, 13:33:17 UTC

Over the course of this afternoon I’ve had 6 segv errors, all on files starting miniprotien.

Anyone else? Or do I start checking my hardware?

Its not just you. I've got 29 that failed across a number of machines. They are all miniprotein_relax8 series that have died after running for an hour.
BOINC blog
ID: 101255 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 4,044
Message 101256 - Posted: 12 Apr 2021, 14:14:59 UTC - in response to Message 101254.  

The 6.5GB problem goes away on an 8GB machine if you set it to use 100% memory. It never actually uses 100% since everything overestimates. I just changed my old Boinc-only machines [1] and Rosettas downloaded and ran.

[1] Who has 8GB on a machine they actually interact with? You could maybe load Windows 10 and 1 application. But dare to play a game, or use email and a photo editor at once and it'll grind to a halt. Another example of modern shoddy lazy bloated programming. I can boot Linux off a 1GB flash drive. Yet Windows is 20 times bigger.


Windows10 runs just fine with 8gb of ram, even on a laptop, and can even crunch Boinc projects quite well if you have the right processor and choose your projects wisely. Playing games is a whole other story though and you are correct unless you are playing a non competitive game like MineCraft or the sort. The size of the Windows OS is what it is it's not like it can be changed by any of us so you just learn to deal with what you have to deal with or you change to something else.
My Aunt doesn't play games. She finds 4GB (Hewlett Packard actually sold her a laptop with such a stupidly pitiful amount, which could not be upgraded!) unusable, and 8GB ok if she only runs one program at a time, 12GB was needed just to use email and a photo editor. If I make a computer for someone it has 16Gb, or 32GB for games or anything else demanding. I put 64GB in my own. Programmers don't code as neatly as they used to!
ID: 101256 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 4,044
Message 101257 - Posted: 12 Apr 2021, 14:17:21 UTC - in response to Message 101252.  

No idea what you think I've changed
I know. It's that damned Dunning-Kruger thingy.
No context, no conversation.

Unless you are a relative -- which you are not -- it's not my duty to compensate for your inability to keep up with a conversation due to age-related infirmities. I counsel making use of Google.
You seem confused. "Context" in this context (titter) means that you failed to quote enough text so I knew what the conversation was about. It has nothing to do with the hypothetical Dunning-Kruger bullshit. Virtually nobody can remember every single conversation they have, I'm probably in 200 of them.
ID: 101257 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 4,044
Message 101258 - Posted: 12 Apr 2021, 14:18:22 UTC - in response to Message 101255.  

Over the course of this afternoon I’ve had 6 segv errors, all on files starting miniprotien.

Anyone else? Or do I start checking my hardware?

Its not just you. I've got 29 that failed across a number of machines. They are all miniprotein_relax8 series that have died after running for an hour.
Same here, and on prehelical (although I didn't check the error type).
ID: 101258 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
CIA

Send message
Joined: 3 May 07
Posts: 100
Credit: 21,059,812
RAC: 0
Message 101260 - Posted: 12 Apr 2021, 16:38:23 UTC - in response to Message 101258.  
Last modified: 12 Apr 2021, 16:38:46 UTC

Over the course of this afternoon I’ve had 6 segv errors, all on files starting miniprotien.

Anyone else? Or do I start checking my hardware?

Its not just you. I've got 29 that failed across a number of machines. They are all miniprotein_relax8 series that have died after running for an hour.
Same here, and on prehelical (although I didn't check the error type).



Pretty much all of my mini protein_relax8 units are seconds (meaning they failed on another machine before I got them), and almost all of them are completing but taking 18 hours to do so. They are creating very few decoys.

Example: https://boinc.bakerlab.org/rosetta/result.php?resultid=1366333671
ID: 101260 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 4,044
Message 101262 - Posted: 12 Apr 2021, 17:30:41 UTC - in response to Message 101260.  

Over the course of this afternoon I’ve had 6 segv errors, all on files starting miniprotien.

Anyone else? Or do I start checking my hardware?

Its not just you. I've got 29 that failed across a number of machines. They are all miniprotein_relax8 series that have died after running for an hour.
Same here, and on prehelical (although I didn't check the error type).



Pretty much all of my mini protein_relax8 units are seconds (meaning they failed on another machine before I got them), and almost all of them are completing but taking 18 hours to do so. They are creating very few decoys.

Example: https://boinc.bakerlab.org/rosetta/result.php?resultid=1366333671
Have you changed the setting to allow 18 hours? Because all mine are sticking to the 8 hours. I'm getting 50% of the mini protein_relax8 completing in 8 hours, and the other 50% failing, usually taking 5 hours to do so.
ID: 101262 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
CIA

Send message
Joined: 3 May 07
Posts: 100
Credit: 21,059,812
RAC: 0
Message 101263 - Posted: 12 Apr 2021, 18:18:03 UTC - in response to Message 101262.  
Last modified: 12 Apr 2021, 18:28:58 UTC


Pretty much all of my mini protein_relax8 units are seconds (meaning they failed on another machine before I got them), and almost all of them are completing but taking 18 hours to do so. They are creating very few decoys.

Example: https://boinc.bakerlab.org/rosetta/result.php?resultid=1366333671
Have you changed the setting to allow 18 hours? Because all mine are sticking to the 8 hours. I'm getting 50% of the mini protein_relax8 completing in 8 hours, and the other 50% failing, usually taking 5 hours to do so.


During the latest drought I had this machine set to 36 hours, but Friday when it became clear the drought has ended I set it back to its normal default 8 hour runtime. So it's running for the standard 8hr and then 10 additional hours on top as others have mentioned before the auto-cutoff happens.

All my other machines are set to 36 hours, and while none of them have completed any of these longer units, some of them are showing signs it will happen to them also. For example on one machine I have a miniprotein WU that is only 57% done 22 hours in. I have a feeling it's going to crunch for 46 hours (set time limit +10hr cutoff).


/edit. Just to add a datapoint. While it's not conclusive, all the Miniprotein_relax8 units I'm getting that run long do "complete" and show as valid, even after going 10 hours over. Of these units that run over, many are "seconds" sent to me from other machines that failed to process the WU. My machine is running OSX and completes them fine (beyond running 10hrs over). All the failed machines are windows or linux based. That said, I know Macs make up a small percentage of computers on this project, so I might have just not gotten a resend from a Mac in my small sample.
ID: 101263 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mrhastyrib

Send message
Joined: 18 Feb 21
Posts: 90
Credit: 2,541,890
RAC: 0
Message 101264 - Posted: 12 Apr 2021, 21:50:03 UTC - in response to Message 101257.  

you failed to quote enough text so I knew what the conversation was about.


There was enough for you to recognize that I was replying to you, but not enough for you to remember what we were talking about, from a conversation within the past 24 hours, even though you knew it was you. Got it.

Just between us girls, isn't the real issue here the same as the one with "dood" and "@": you're immensely irritated at some features of my posting style. Including quoting only the essence of an exchange.

I think Letterman said it best: "An old man in a bathrobe on his front porch, shaking his fist at passing cars."
ID: 101264 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 404
Credit: 12,294,748
RAC: 2,551
Message 101265 - Posted: 12 Apr 2021, 22:23:43 UTC - in response to Message 101255.  

Over the course of this afternoon I’ve had 6 segv errors, all on files starting miniprotien.

Anyone else? Or do I start checking my hardware?

Its not just you. I've got 29 that failed across a number of machines. They are all miniprotein_relax8 series that have died after running for an hour.


Thanks, I was hoping it was the tasks rather than my hardware.
ID: 101265 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1234
Credit: 14,338,560
RAC: 826
Message 101268 - Posted: 13 Apr 2021, 0:59:28 UTC - in response to Message 101265.  

Over the course of this afternoon I’ve had 6 segv errors, all on files starting miniprotien.

Anyone else? Or do I start checking my hardware?

Its not just you. I've got 29 that failed across a number of machines. They are all miniprotein_relax8 series that have died after running for an hour.


Thanks, I was hoping it was the tasks rather than my hardware.

I've had several miniprotein_relax8 tasks fail also, but only one of them failed after one hour. The rest ran for at least two hours before failing. All were reissued to someone else, and either failed for that someone else as well, or aren't yet finished for that someone else.

I've thought of a possible reason why some tasks are set to ask for 6 GB of memory. Quite a bit more is loaded to produce a core dump if they fail, but isn't needed if they don't fail. Not the best idea, but possible.
ID: 101268 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
DizzyD

Send message
Joined: 23 Nov 20
Posts: 6
Credit: 1,438,330
RAC: 0
Message 101270 - Posted: 13 Apr 2021, 2:24:59 UTC - in response to Message 101263.  

/edit. Just to add a datapoint. While it's not conclusive, all the Miniprotein_relax8 units I'm getting that run long do "complete" and show as valid, even after going 10 hours over. Of these units that run over, many are "seconds" sent to me from other machines that failed to process the WU. My machine is running OSX and completes them fine (beyond running 10hrs over). All the failed machines are windows or linux based. That said, I know Macs make up a small percentage of computers on this project, so I might have just not gotten a resend from a Mac in my small sample.

I am also running on a Mac. The mini protein_relax8 units also do complete after ~18.7 hours and provide credit; however, the credit is in the "two-hundred" range for 67,000+ seconds of work. So, I've gone in and aborted all of the "ready to start" mini protein_relax8 units and now I have all pre-helical-bundles_round1_attempt1 queued up.
ID: 101270 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1734
Credit: 18,532,940
RAC: 17,945
Message 101272 - Posted: 13 Apr 2021, 7:38:49 UTC - in response to Message 101256.  

My Aunt doesn't play games. She finds 4GB (Hewlett Packard actually sold her a laptop with such a stupidly pitiful amount, which could not be upgraded!) unusable, and 8GB ok if she only runs one program at a time, 12GB was needed just to use email and a photo editor.
The issue is the photo editor.
I know several people running Windows 10 systems with 4GB of RAM with no issues (i was one for quite some time myself). Of course if you use software that requires huge amounts of RAM to do the work it needs to do- such as photo editing- then you need a system with the appropriate amount of RAM. That has always been the case.
It also helps (a massive amount) if you have a SSD and not a HDD.
Grant
Darwin NT
ID: 101272 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 2002
Credit: 9,790,281
RAC: 2,986
Message 101275 - Posted: 13 Apr 2021, 8:01:30 UTC

Still "- _abinitio_1_abinitio_" wus error.
Please, stop these wus
ID: 101275 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 404
Credit: 12,294,748
RAC: 2,551
Message 101276 - Posted: 13 Apr 2021, 8:09:31 UTC - in response to Message 101272.  

My Aunt doesn't play games. She finds 4GB (Hewlett Packard actually sold her a laptop with such a stupidly pitiful amount, which could not be upgraded!) unusable, and 8GB ok if she only runs one program at a time, 12GB was needed just to use email and a photo editor.
The issue is the photo editor.
I know several people running Windows 10 systems with 4GB of RAM with no issues (i was one for quite some time myself). Of course if you use software that requires huge amounts of RAM to do the work it needs to do- such as photo editing- then you need a system with the appropriate amount of RAM. That has always been the case.
It also helps (a massive amount) if you have a SSD and not a HDD.


I’ve just upgraded my Lenovo L520 Win10 laptop from 2gb to its max of 4gb and whilst it’s slightly faster it still runs fine with Firefox, boinctasksjs and libre office calc as its normal workload. My one failure has been to get ms team to access the built in mic - it sees it ok but I cannot get any volume from it.
ID: 101276 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mikey
Avatar

Send message
Joined: 5 Jan 06
Posts: 1895
Credit: 9,217,610
RAC: 822
Message 101279 - Posted: 13 Apr 2021, 13:24:35 UTC - in response to Message 101276.  

My Aunt doesn't play games. She finds 4GB (Hewlett Packard actually sold her a laptop with such a stupidly pitiful amount, which could not be upgraded!) unusable, and 8GB ok if she only runs one program at a time, 12GB was needed just to use email and a photo editor.
The issue is the photo editor.
I know several people running Windows 10 systems with 4GB of RAM with no issues (i was one for quite some time myself). Of course if you use software that requires huge amounts of RAM to do the work it needs to do- such as photo editing- then you need a system with the appropriate amount of RAM. That has always been the case.
It also helps (a massive amount) if you have a SSD and not a HDD.


I’ve just upgraded my Lenovo L520 Win10 laptop from 2gb to its max of 4gb and whilst it’s slightly faster it still runs fine with Firefox, boinctasksjs and libre office calc as its normal workload. My one failure has been to get ms team to access the built in mic - it sees it ok but I cannot get any volume from it.


4gb of ram running win10 is painful at best and you are unlikely to get any tasks from here at Rosetta until they make the needed changes to the minimum memory required for each task as it's current over the total amount you have, then when you add in the Windows overhead of about 1gb it would be a better idea to take that machine to another project that would also love to have your support. Lots of projects end out tasks that require less than 0.5gb of ram per task and some are down in the low 2 to 300mb range.
ID: 101279 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 404
Credit: 12,294,748
RAC: 2,551
Message 101280 - Posted: 13 Apr 2021, 13:40:42 UTC - in response to Message 101279.  

My Aunt doesn't play games. She finds 4GB (Hewlett Packard actually sold her a laptop with such a stupidly pitiful amount, which could not be upgraded!) unusable, and 8GB ok if she only runs one program at a time, 12GB was needed just to use email and a photo editor.
The issue is the photo editor.
I know several people running Windows 10 systems with 4GB of RAM with no issues (i was one for quite some time myself). Of course if you use software that requires huge amounts of RAM to do the work it needs to do- such as photo editing- then you need a system with the appropriate amount of RAM. That has always been the case.
It also helps (a massive amount) if you have a SSD and not a HDD.


I’ve just upgraded my Lenovo L520 Win10 laptop from 2gb to its max of 4gb and whilst it’s slightly faster it still runs fine with Firefox, boinctasksjs and libre office calc as its normal workload. My one failure has been to get ms team to access the built in mic - it sees it ok but I cannot get any volume from it.


4gb of ram running win10 is painful at best and you are unlikely to get any tasks from here at Rosetta until they make the needed changes to the minimum memory required for each task as it's current over the total amount you have, then when you add in the Windows overhead of about 1gb it would be a better idea to take that machine to another project that would also love to have your support. Lots of projects end out tasks that require less than 0.5gb of ram per task and some are down in the low 2 to 300mb range.


That laptop is underpowered and wouldn’t contribute much. I leave the crunching to my desktops whilst I do my day to day work / study on the laptop :-)

It’s the only Windows machine in the house, my wife’s a technophobe and won’t even try to use Linux.
ID: 101280 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
micha

Send message
Joined: 13 Feb 21
Posts: 1
Credit: 707,373
RAC: 0
Message 101282 - Posted: 13 Apr 2021, 14:25:57 UTC

Hi all, I am getting all of my rosetta tasks fail 10 seconds before end... I have 32gb of ram so it should not be a problem....

Is there any troubleshooting I can do?
One of the runs that failed: VXwWYtpw_BABBAAAA_B_A_YAAAAA_AAAAAA_CGGGGGGGGCGCGCGGGGGGCGGGGGGC_1-4_2-6_3-5.pdb_0001_abinitio_1_abinitio_SAVE_ALL_OUT_1390473_18 in rosetta_4.20_windows_x86_64.exe
ID: 101282 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
ww

Send message
Joined: 17 Mar 20
Posts: 3
Credit: 455,936
RAC: 0
Message 101283 - Posted: 13 Apr 2021, 15:21:28 UTC - in response to Message 101282.  
Last modified: 13 Apr 2021, 15:22:28 UTC

Is there any troubleshooting I can do?


If you can complete tasks from World Compute Grid or another project, then don't worry too much about your hardware or BOINC configuration. Rosetta is having software problems right now.
ID: 101283 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 404
Credit: 12,294,748
RAC: 2,551
Message 101284 - Posted: 13 Apr 2021, 15:37:49 UTC - in response to Message 101282.  

Hi all, I am getting all of my rosetta tasks fail 10 seconds before end... I have 32gb of ram so it should not be a problem....

Is there any troubleshooting I can do?
One of the runs that failed: VXwWYtpw_BABBAAAA_B_A_YAAAAA_AAAAAA_CGGGGGGGGCGCGCGGGGGGCGGGGGGC_1-4_2-6_3-5.pdb_0001_abinitio_1_abinitio_SAVE_ALL_OUT_1390473_18 in rosetta_4.20_windows_x86_64.exe


You’re having two types of failure, the one you mention, that failed about 20 seconds after it started because a file was missing from the download and another series of tasks with names starting miniprotein about half of which fail after a couple of hours.

Both of these are known faults and not a problem with your setup.
ID: 101284 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 96 · 97 · 98 · 99 · 100 · 101 · 102 . . . 311 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org