Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 150 · 151 · 152 · 153 · 154 · 155 · 156 . . . 309 · Next

AuthorMessage
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 104013 - Posted: 3 Jan 2022, 7:39:48 UTC - in response to Message 104011.  

Am counting on Dr. Baker to bring back the good old days.
The Good Old Days were pre-AI (artificial intelligence). The world has changed.
They have not bothered to update us though.



Dr. Baker is not involved in the tech side.
That is DEK and others.
Since years ago DEK and friends have disappeared.
There is no mod there is no updates, its a black hole with regards to RAH.
The AI system takes all their time now.
AND...they don't like change.
Look how you don't have any control on how many Pythons you get.
Its a all or nothing approach.

This is the new reality.
ID: 104013 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 104019 - Posted: 3 Jan 2022, 16:32:08 UTC - in response to Message 104013.  

Am counting on Dr. Baker to bring back the good old days.
The Good Old Days were pre-AI (artificial intelligence). The world has changed.
They have not bothered to update us though.



Dr. Baker is not involved in the tech side.
That is DEK and others.
Since years ago DEK and friends have disappeared.
There is no mod there is no updates, its a black hole with regards to RAH.
The AI system takes all their time now.
AND...they don't like change.
Look how you don't have any control on how many Pythons you get.
Its a all or nothing approach.

This is the new reality.


I don't see the problem. So it doesn't run on antique machines. So it doesn't work well with gridcoin. But normal machines are just fine. LHC has been using Vbox for ages and they don't have all this grumbling.
ID: 104019 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 104020 - Posted: 3 Jan 2022, 18:02:10 UTC - in response to Message 104019.  

I don't see the problem. So it doesn't run on antique machines. So it doesn't work well with gridcoin. But normal machines are just fine. LHC has been using Vbox for ages and they don't have all this grumbling.

See if you can figure out how to install Linux (Ubuntu is easy enough) and run the pythons.
Let us know how you do (or you could read this thread if it is not too much trouble).
ID: 104020 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 104021 - Posted: 3 Jan 2022, 18:14:04 UTC - in response to Message 104020.  

I don't see the problem. So it doesn't run on antique machines. So it doesn't work well with gridcoin. But normal machines are just fine. LHC has been using Vbox for ages and they don't have all this grumbling.

See if you can figure out how to install Linux (Ubuntu is easy enough) and run the pythons.
Let us know how you do (or you could read this thread if it is not too much trouble).

Most people use Windows, it's not like they're removing a huge amount of users. If you want things to run smoothly, don't use weird operating systems.
ID: 104021 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 104022 - Posted: 3 Jan 2022, 18:56:00 UTC - in response to Message 104021.  

Windows works better on the pythons, and Linux is weird. That is true enough.
But any normal project with a real administrator would have a sticky with Best Practices for running the pythons.

This is not a normal forum.
ID: 104022 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 104023 - Posted: 3 Jan 2022, 19:07:22 UTC - in response to Message 104022.  

If they weren't getting the results back fast enough, they'd give us some help, so I can only assume they're happy with the speed we're going at.

Are the pythons actual research, or is it a test program?
ID: 104023 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 104024 - Posted: 3 Jan 2022, 19:30:38 UTC - in response to Message 104023.  

If they weren't getting the results back fast enough, they'd give us some help, so I can only assume they're happy with the speed we're going at.

Are the pythons actual research, or is it a test program?

I wonder how happy they are. They don't tell us one way or the other, or what their goals are.
We only know how many they have in the hopper, which are a lot.

The pythons are research of sorts. We don't know much about that either.
https://boinc.bakerlab.org/rosetta/forum_thread.php?id=6893&postid=102761#102761
ID: 104024 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 104025 - Posted: 3 Jan 2022, 19:46:04 UTC - in response to Message 104024.  

Looks like this is why my Ryzen got banned (and I can't unban it since I'm doing it through a gridcoin pool): "We recommend the VirtualBox version that is included in the recommended BOINC Windows installer, since newer VirtualBox versions do not work correctly for all projects."

Yeah I'd like more information, but this is the only project that gives us the big hopper value, most just show the small customer facing buffer.
ID: 104025 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 104030 - Posted: 4 Jan 2022, 19:44:44 UTC - in response to Message 104025.  

Looks like this is why my Ryzen got banned (and I can't unban it since I'm doing it through a gridcoin pool): "We recommend the VirtualBox version that is included in the recommended BOINC Windows installer, since newer VirtualBox versions do not work correctly for all projects."

Yeah I'd like more information, but this is the only project that gives us the big hopper value, most just show the small customer facing buffer.



That's odd since I am running 6.1.30 and everything is fine.
Official version (included in BOINC download) is 6.1.12 from October last year.
ID: 104030 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 104031 - Posted: 4 Jan 2022, 19:47:59 UTC

Just purchased 32gb of RAM to boost my system.
When python was running this morning along with the other projects it ate up 97% of my memory.
Thats 24 GB gone in a heartbeat!
So I swap out some old 4GB modules and put in 2x 16GB plus 2X existing 8GB modules and I think I should have enough memory for awhile. The 4's go back to my first PC. Just been reusing them every new build.
ID: 104031 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 104032 - Posted: 4 Jan 2022, 20:01:04 UTC - in response to Message 104030.  

That's odd since I am running 6.1.30 and everything is fine.
Official version (included in BOINC download) is 6.1.12 from October last year.
I can't think why my Ryzen kept failing tasks. They said ok here, but they didn't validate at their end. It's fine with everything else in Boinc and otherwise and never goes wrong. It has 64GB of the fastest RAM available and is not overclocked.
ID: 104032 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 104033 - Posted: 4 Jan 2022, 22:45:19 UTC - in response to Message 104032.  

That's odd since I am running 6.1.30 and everything is fine.
Official version (included in BOINC download) is 6.1.12 from October last year.
I can't think why my Ryzen kept failing tasks. They said ok here, but they didn't validate at their end. It's fine with everything else in Boinc and otherwise and never goes wrong. It has 64GB of the fastest RAM available and is not overclocked.



Your Ryzen is showing all 0's, so there is no way to see what's going on from this side.
Maybe includes some links to the failed tasks or run some more and post the links to those tasks before they disappear.

What tasks are you trying to run? 4.2 or Python?
Are you doing any sort of OC on your cores?
One other option to try, kind of a catch all hail mary idea, run out all your RAH work and then reset the project.


That's about all I have for answers right now.
Jim1348 might have some other ideas....
ID: 104033 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 104034 - Posted: 4 Jan 2022, 23:25:27 UTC - in response to Message 104033.  

That's about all I have for answers right now.
Jim1348 might have some other ideas....

I looked at his setup and didn't see anything obviously wrong with it.
How long are they running? It should be around 2 to 4 hours or so.

As for validation, I would suspect an antivirus might be interfering with them.
Just excluding the project does not always work, since "real time protection" still inspects the packets, and maybe changes them.
I would temporarily disable the AV and see what happens.
ID: 104034 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 104036 - Posted: 5 Jan 2022, 1:47:18 UTC - in response to Message 104033.  
Last modified: 5 Jan 2022, 1:52:15 UTC

Your Ryzen is showing all 0's, so there is no way to see what's going on from this side.
Maybe includes some links to the failed tasks or run some more and post the links to those tasks before they disappear.
Sorry, you're probably looking at my own account, the Ryzen is currently operating through grcpool.com to earn gridcoins. This is the host ID for it currently: https://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=6167614 - and under application details, https://boinc.bakerlab.org/rosetta/host_app_versions.php?hostid=6167614 which shows for Python "Number of tasks completed 89" and "Consecutive valid tasks 0", and all the 4.2s worked ok, although I can't see the tasks. I can't run more as the server seems to have banned me from doing Python.

What tasks are you trying to run? 4.2 or Python?
Both. The Pythons failed. They looked ok here but did not validate.

Are you doing any sort of OC on your cores?
No.

One other option to try, kind of a catch all hail mary idea, run out all your RAH work and then reset the project.
Tried that, the project won't give me any. It says "no tasks sent".
ID: 104036 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 104037 - Posted: 5 Jan 2022, 1:50:18 UTC - in response to Message 104034.  

I looked at his setup and didn't see anything obviously wrong with it.
How long are they running? It should be around 2 to 4 hours or so.
Something like that. They show as completed ok here, but don't validate.

As for validation, I would suspect an antivirus might be interfering with them.
Just excluding the project does not always work, since "real time protection" still inspects the packets, and maybe changes them.
I would temporarily disable the AV and see what happens.
Same AV (AVG free) as on all 7 PCs, only this one failed. The one (the i5) that has run Python ok also has AVG with no exceptions set for Rosetta.
ID: 104037 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 104040 - Posted: 5 Jan 2022, 13:25:14 UTC - in response to Message 104037.  

Same AV (AVG free) as on all 7 PCs, only this one failed. The one (the i5) that has run Python ok also has AVG with no exceptions set for Rosetta.

Then you might have some corrupted files. I would set No New Work, abort all the ones you have in progress, and detach from the project.
Then, re-attach. It will download new files.
ID: 104040 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 104042 - Posted: 5 Jan 2022, 17:41:23 UTC - in response to Message 104040.  

Same AV (AVG free) as on all 7 PCs, only this one failed. The one (the i5) that has run Python ok also has AVG with no exceptions set for Rosetta.

Then you might have some corrupted files. I would set No New Work, abort all the ones you have in progress, and detach from the project.
Then, re-attach. It will download new files.

Already tried. I give up. If I leave gridcoin, I can get to the allow button. It seems to pay bugger all so I probably will.
ID: 104042 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 104044 - Posted: 5 Jan 2022, 22:44:56 UTC - in response to Message 104036.  

Your Ryzen is showing all 0's, so there is no way to see what's going on from this side.
Maybe includes some links to the failed tasks or run some more and post the links to those tasks before they disappear.
Sorry, you're probably looking at my own account, the Ryzen is currently operating through grcpool.com to earn gridcoins. This is the host ID for it currently: https://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=6167614 - and under application details, https://boinc.bakerlab.org/rosetta/host_app_versions.php?hostid=6167614 which shows for Python "Number of tasks completed 89" and "Consecutive valid tasks 0", and all the 4.2s worked ok, although I can't see the tasks. I can't run more as the server seems to have banned me from doing Python.

What tasks are you trying to run? 4.2 or Python?
Both. The Pythons failed. They looked ok here but did not validate.

Are you doing any sort of OC on your cores?
No.

One other option to try, kind of a catch all hail mary idea, run out all your RAH work and then reset the project.
Tried that, the project won't give me any. It says "no tasks sent".



I checked the links you sent, but I see no tasks.
Which version of BOINC and Vbox are you running?
You can always try what Jim has said in the past and downgrade to 5.x in Vbox and its add on package and see what happens. That will only affect python. Why 4.2 doesn't work is odd. That's based on the root program they started with years ago. Usually no problems.

Perhaps you getting data corruption as he says through GRCpool. I have never operated through that, so no idea if that could affect things or not.

You might as a last resort go to the BOINC forum as ask there.

It's quite late in the EU, so I don't have time to investigate.
RAH will take you off Python if you send out to many errors. I had that happen and was able to reconnect.
But first try working out where the problem is at with your 4.2 tasks when you get them.
And maybe try running a clean version (no outside websites or whatever, just straight comms to RAH) and see what happens.
ID: 104044 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 104045 - Posted: 6 Jan 2022, 17:27:23 UTC - in response to Message 104044.  

I checked the links you sent, but I see no tasks.
I think they're not shown after a certain time, I was looking at https://boinc.bakerlab.org/rosetta/host_app_versions.php?hostid=6167614 which shows 89 pythons completed, but 0 consecutive valid tasks.

Which version of BOINC and Vbox are you running?
Latest Boinc, latest Vbox 6.1, although the computer which works ok is on 5.2.

You can always try what Jim has said in the past and downgrade to 5.x in Vbox and its add on package and see what happens. That will only affect python. Why 4.2 doesn't work is odd. That's based on the root program they started with years ago. Usually no problems.
I downgraded Vbox but am unable to reset the ban as I don't have access to the grcpool Boinc account.

Perhaps you getting data corruption as he says through GRCpool. I have never operated through that, so no idea if that could affect things or not.
It didn't stop my other computer running it, it's still doing them now.

And maybe try running a clean version (no outside websites or whatever, just straight comms to RAH) and see what happens.
I could do, but I've decided to stick to Gridcoin as I've worked out it pays for the hardware in 1.5 years of crunching, so not to be sneezed at.
ID: 104045 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 104046 - Posted: 6 Jan 2022, 17:39:10 UTC - in response to Message 104045.  
Last modified: 6 Jan 2022, 17:39:28 UTC

Just got an email from the grcpool admin about the previous ID problem which I've fixed anyway, so I've asked him if he could press the allow button for me now I'm on the earlier Vbox version.
ID: 104046 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 150 · 151 · 152 · 153 · 154 · 155 · 156 . . . 309 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org