Message boards : Number crunching : Problems and Technical Issues with Rosetta@home
Author | Message |
---|---|
mikey Joined: 5 Jan 06 Posts: 1895 Credit: 9,214,047 RAC: 1,768 |
The 6.5GB problem goes away on an 8GB machine if you set it to use 100% memory. It never actually uses 100% since everything overestimates. I just changed my old BOINC-only machines [1] and Rosettas downloaded and ran. Windows 10 runs just fine with 8GB of RAM, even on a laptop, and can even crunch BOINC projects quite well if you have the right processor and choose your projects wisely. Playing games is a whole other story though, and you are correct unless you are playing a non-competitive game like Minecraft or the like. The size of the Windows OS is what it is. It's not like any of us can change it, so you just learn to deal with what you have or you change to something else. |
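The memory check described in the post above can be sketched roughly as follows. This is a simplified illustration, not BOINC's actual scheduler logic: the function name `task_fits` and the 50% comparison value are hypothetical, and only the 6.5GB, 8GB and 100% figures come from the post.

```python
def task_fits(task_ram_gb: float, total_ram_gb: float, max_ram_fraction: float) -> bool:
    """Return True if the task's declared memory requirement fits in the
    share of RAM that the client is allowed to use (simplified model)."""
    return task_ram_gb <= total_ram_gb * max_ram_fraction


# Figures from the post: a 6.5GB requirement on an 8GB machine at 100% memory.
print(task_fits(6.5, 8.0, 1.00))  # True

# Hypothetical lower setting for comparison (0.50 is an assumed value).
print(task_fits(6.5, 8.0, 0.50))  # False
```

Under this model the task is only sent when the allowed share of RAM is at least as large as the declared requirement, which would explain why raising the limit to 100% lets the tasks download even though actual usage stays lower.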
MarkJ Joined: 28 Mar 20 Posts: 72 Credit: 25,238,680 RAC: 0 |
Quoted: "Over the course of this afternoon I’ve had 6 segv errors, all on files starting miniprotein." It's not just you. I've got 29 that failed across a number of machines. They are all miniprotein_relax8 series that have died after running for an hour. BOINC blog |
Mr P Hucker Joined: 12 Aug 06 Posts: 1600 Credit: 12,116,986 RAC: 12,028 |
My Aunt doesn't play games. She finds 4GB (Hewlett Packard actually sold her a laptop with such a stupidly pitiful amount, which could not be upgraded!) unusable, and 8GB OK if she only runs one program at a time; 12GB was needed just to use email and a photo editor. If I make a computer for someone it has 16GB, or 32GB for games or anything else demanding. I put 64GB in my own. Programmers don't code as neatly as they used to! Quoted: "The 6.5GB problem goes away on an 8GB machine if you set it to use 100% memory. It never actually uses 100% since everything overestimates. I just changed my old Boinc-only machines [1] and Rosettas downloaded and ran." |
Mr P Hucker Joined: 12 Aug 06 Posts: 1600 Credit: 12,116,986 RAC: 12,028 |
You seem confused. "Context" in this context (titter) means that you failed to quote enough text so I knew what the conversation was about. It has nothing to do with the hypothetical Dunning-Kruger bullshit. Virtually nobody can remember every single conversation they have; I'm probably in 200 of them. Quoted: "No context, no conversation." Quoted: "No idea what you think I've changed" Quoted: "I know. It's that damned Dunning-Kruger thingy." |
Mr P Hucker Joined: 12 Aug 06 Posts: 1600 Credit: 12,116,986 RAC: 12,028 |
Same here, and on prehelical (although I didn't check the error type). Quoted: "Over the course of this afternoon I’ve had 6 segv errors, all on files starting miniprotein." |
CIA Joined: 3 May 07 Posts: 100 Credit: 21,059,812 RAC: 0 |
Quoted: "Same here, and on prehelical (although I didn't check the error type)." Quoted: "Over the course of this afternoon I’ve had 6 segv errors, all on files starting miniprotein." Pretty much all of my miniprotein_relax8 units are seconds (meaning they failed on another machine before I got them), and almost all of them are completing but taking 18 hours to do so. They are creating very few decoys. Example: https://boinc.bakerlab.org/rosetta/result.php?resultid=1366333671 |
Mr P Hucker Joined: 12 Aug 06 Posts: 1600 Credit: 12,116,986 RAC: 12,028 |
Have you changed the setting to allow 18 hours? Because all mine are sticking to the 8 hours. I'm getting 50% of the miniprotein_relax8 completing in 8 hours, and the other 50% failing, usually taking 5 hours to do so. Quoted: "Same here, and on prehelical (although I didn't check the error type)." Quoted: "Over the course of this afternoon I’ve had 6 segv errors, all on files starting miniprotein." |
CIA Joined: 3 May 07 Posts: 100 Credit: 21,059,812 RAC: 0 |
Quoted: "Have you changed the setting to allow 18 hours? Because all mine are sticking to the 8 hours. I'm getting 50% of the miniprotein_relax8 completing in 8 hours, and the other 50% failing, usually taking 5 hours to do so." During the latest drought I had this machine set to 36 hours, but on Friday, when it became clear the drought had ended, I set it back to its normal default 8 hour runtime. So it's running for the standard 8 hours and then, as others have mentioned, 10 additional hours on top before the auto-cutoff happens. All my other machines are set to 36 hours, and while none of them have completed any of these longer units, some of them are showing signs it will happen to them also. For example, on one machine I have a miniprotein WU that is only 57% done 22 hours in. I have a feeling it's going to crunch for 46 hours (set time limit + 10hr cutoff). /edit. Just to add a datapoint. While it's not conclusive, all the Miniprotein_relax8 units I'm getting that run long do "complete" and show as valid, even after going 10 hours over. Of these units that run over, many are "seconds" sent to me from other machines that failed to process the WU. My machine is running OSX and completes them fine (beyond running 10hrs over). All the failed machines are Windows or Linux based. That said, I know Macs make up a small percentage of computers on this project, so I might have just not gotten a resend from a Mac in my small sample. |
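The runtime arithmetic in the post above can be sketched as follows. The 10 hour allowance is taken from what posters report in this thread, not from official documentation, and the function names are hypothetical. The linear projection from "57% done at 22 hours" is an added assumption; progress on these units may well not be linear, which is why the poster expects the unit to run all the way to the cutoff.

```python
# Extra hours the watchdog is reported to allow beyond the target runtime.
# This figure comes from posts in this thread, not from official documentation.
WATCHDOG_EXTRA_HOURS = 10


def expected_cutoff_hours(target_runtime_hours: float) -> float:
    """Hours after which a unit that never finishes is reportedly cut off."""
    return target_runtime_hours + WATCHDOG_EXTRA_HOURS


def projected_total_hours(elapsed_hours: float, fraction_done: float) -> float:
    """Naive linear projection of total runtime from current progress."""
    if not 0 < fraction_done <= 1:
        raise ValueError("fraction_done must be in (0, 1]")
    return elapsed_hours / fraction_done


print(expected_cutoff_hours(8))    # 18
print(expected_cutoff_hours(36))   # 46
print(round(projected_total_hours(22, 0.57), 1))  # 38.6
```

The first two results match the 18 hour and 46 hour figures quoted in the thread.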
mrhastyrib Joined: 18 Feb 21 Posts: 90 Credit: 2,541,890 RAC: 0 |
Quoted: "you failed to quote enough text so I knew what the conversation was about." There was enough for you to recognize that I was replying to you, but not enough for you to remember what we were talking about, from a conversation within the past 24 hours, even though you knew it was you. Got it. Just between us girls, isn't the real issue here the same as the one with "dood" and "@": you're immensely irritated at some features of my posting style. Including quoting only the essence of an exchange. I think Letterman said it best: "An old man in a bathrobe on his front porch, shaking his fist at passing cars." |
Bryn Mawr Joined: 26 Dec 18 Posts: 398 Credit: 12,294,748 RAC: 7,588 |
Quoted: "Over the course of this afternoon I’ve had 6 segv errors, all on files starting miniprotein." Thanks, I was hoping it was the tasks rather than my hardware. |
robertmiles Joined: 16 Jun 08 Posts: 1233 Credit: 14,338,560 RAC: 2,456 |
Quoted: "Over the course of this afternoon I’ve had 6 segv errors, all on files starting miniprotein." I've had several miniprotein_relax8 tasks fail also, but only one of them failed after one hour. The rest ran for at least two hours before failing. All were reissued to someone else, and either failed for that someone else as well, or aren't yet finished for that someone else. I've thought of a possible reason why some tasks are set to ask for 6 GB of memory: quite a bit more is loaded to produce a core dump if they fail, but isn't needed if they don't fail. Not the best idea, but possible. |
DizzyD Joined: 23 Nov 20 Posts: 6 Credit: 1,438,330 RAC: 0 |
Quoted: "/edit. Just to add a datapoint. While it's not conclusive, all the Miniprotein_relax8 units I'm getting that run long do "complete" and show as valid, even after going 10 hours over. Of these units that run over, many are "seconds" sent to me from other machines that failed to process the WU. My machine is running OSX and completes them fine (beyond running 10hrs over). All the failed machines are windows or linux based. That said, I know Macs make up a small percentage of computers on this project, so I might have just not gotten a resend from a Mac in my small sample." I am also running on a Mac. The miniprotein_relax8 units also do complete after ~18.7 hours and provide credit; however, the credit is in the "two-hundred" range for 67,000+ seconds of work. So, I've gone in and aborted all of the "ready to start" miniprotein_relax8 units and now I have all pre-helical-bundles_round1_attempt1 queued up. |
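To put the credit figure above in perspective, here is a small sketch of the credit-per-hour arithmetic. The values 200 and 67,000 are round stand-ins for the "two-hundred range" and "67,000+ seconds" mentioned in the post, and `credit_per_hour` is a hypothetical helper, not part of BOINC.

```python
SECONDS_PER_HOUR = 3600


def credit_per_hour(credit: float, runtime_seconds: float) -> float:
    """Credit granted per hour of runtime."""
    return credit * SECONDS_PER_HOUR / runtime_seconds


# Round figures standing in for the values in the post.
print(round(67_000 / SECONDS_PER_HOUR, 1))        # 18.6 hours of runtime
print(round(credit_per_hour(200, 67_000), 2))     # 10.75 credits per hour
```

That works out to roughly 11 credits per hour for these long-running units.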
Grant (SSSF) Joined: 28 Mar 20 Posts: 1725 Credit: 18,378,164 RAC: 20,578 |
Quoted: "My Aunt doesn't play games. She finds 4GB (Hewlett Packard actually sold her a laptop with such a stupidly pitiful amount, which could not be upgraded!) unusable, and 8GB ok if she only runs one program at a time, 12GB was needed just to use email and a photo editor." The issue is the photo editor. I know several people running Windows 10 systems with 4GB of RAM with no issues (I was one for quite some time myself). Of course, if you use software that requires huge amounts of RAM to do the work it needs to do, such as photo editing, then you need a system with the appropriate amount of RAM. That has always been the case. It also helps (a massive amount) if you have an SSD and not an HDD. Grant Darwin NT |
[VENETO] boboviz Joined: 1 Dec 05 Posts: 2002 Credit: 9,780,807 RAC: 6,697 |
Still getting errors on "- _abinitio_1_abinitio_" WUs. Please stop these WUs. |
Bryn Mawr Joined: 26 Dec 18 Posts: 398 Credit: 12,294,748 RAC: 7,588 |
Quoted: "My Aunt doesn't play games. She finds 4GB (Hewlett Packard actually sold her a laptop with such a stupidly pitiful amount, which could not be upgraded!) unusable, and 8GB ok if she only runs one program at a time, 12GB was needed just to use email and a photo editor." Quoted: "The issue is the photo editor." I’ve just upgraded my Lenovo L520 Win10 laptop from 2GB to its max of 4GB, and whilst it’s only slightly faster, it still runs fine with Firefox, boinctasksjs and LibreOffice Calc as its normal workload. My one failure has been getting MS Teams to access the built-in mic: it sees it OK but I cannot get any volume from it. |
mikey Joined: 5 Jan 06 Posts: 1895 Credit: 9,214,047 RAC: 1,768 |
Quoted: "My Aunt doesn't play games. She finds 4GB (Hewlett Packard actually sold her a laptop with such a stupidly pitiful amount, which could not be upgraded!) unusable, and 8GB ok if she only runs one program at a time, 12GB was needed just to use email and a photo editor." Quoted: "The issue is the photo editor." 4GB of RAM running Win10 is painful at best, and you are unlikely to get any tasks from here at Rosetta until they make the needed changes to the minimum memory required for each task, as it's currently over the total amount you have. When you add in the Windows overhead of about 1GB, it would be a better idea to take that machine to another project that would also love to have your support. Lots of projects send out tasks that require less than 0.5GB of RAM per task, and some are down in the low 200 to 300MB range. |
Bryn Mawr Joined: 26 Dec 18 Posts: 398 Credit: 12,294,748 RAC: 7,588 |
Quoted: "My Aunt doesn't play games. She finds 4GB (Hewlett Packard actually sold her a laptop with such a stupidly pitiful amount, which could not be upgraded!) unusable, and 8GB ok if she only runs one program at a time, 12GB was needed just to use email and a photo editor." Quoted: "The issue is the photo editor." That laptop is underpowered and wouldn’t contribute much. I leave the crunching to my desktops whilst I do my day to day work / study on the laptop :-) It’s the only Windows machine in the house, my wife’s a technophobe and won’t even try to use Linux. |
micha Joined: 13 Feb 21 Posts: 1 Credit: 707,373 RAC: 0 |
Hi all, all of my Rosetta tasks are failing 10 seconds before the end... I have 32GB of RAM so it should not be a problem.... Is there any troubleshooting I can do? One of the runs that failed: VXwWYtpw_BABBAAAA_B_A_YAAAAA_AAAAAA_CGGGGGGGGCGCGCGGGGGGCGGGGGGC_1-4_2-6_3-5.pdb_0001_abinitio_1_abinitio_SAVE_ALL_OUT_1390473_18 in rosetta_4.20_windows_x86_64.exe |
ww Joined: 17 Mar 20 Posts: 3 Credit: 455,936 RAC: 0 |
Quoted: "Is there any troubleshooting I can do?" If you can complete tasks from World Community Grid or another project, then don't worry too much about your hardware or BOINC configuration. Rosetta is having software problems right now. |
Bryn Mawr Joined: 26 Dec 18 Posts: 398 Credit: 12,294,748 RAC: 7,588 |
Quoted: "Hi all, I am getting all of my rosetta tasks fail 10 seconds before end... I have 32gb of ram so it should not be a problem...." You’re having two types of failure: the one you mention, which failed about 20 seconds after it started because a file was missing from the download, and another series of tasks with names starting miniprotein, about half of which fail after a couple of hours. Both of these are known faults and not a problem with your setup. |
©2024 University of Washington
https://www.bakerlab.org