Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

Grant (SSSF)
Joined: 28 Mar 20
Posts: 1481
Credit: 14,583,258
RAC: 14,559
Message 104204 - Posted: 13 Jan 2022, 18:19:54 UTC - in response to Message 104196.  

And on the machine that I'm not sure has virtualization enabled (Ryzen 5 3600), I got a bunch of VBox jobs and Rosetta v4.20 jobs one time where all 11 of them errored out instantly.
Your account only shows 2 systems, one of which is a Ryzen 5 3600, which only shows 2 Tasks that errored out recently, both Python.

And the error message includes this:
VBoxManage.exe: error: AMD-V is disabled in the BIOS (or by the host OS) (VERR_SVM_DISABLED)
On that system, virtualisation (AMD-V) is disabled, either by a BIOS setting or by an OS setting.
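For anyone wanting to check their own failed tasks for the same thing, here is a minimal sketch that pulls VirtualBox status codes out of a task's stderr text. The function name and the regular expression are illustrative assumptions; the only code taken from this thread is VERR_SVM_DISABLED.

```python
import re

# Assumption: VirtualBox status codes appear as upper-case tokens starting
# with "VERR_", as in the VERR_SVM_DISABLED line quoted above.
VBOX_ERROR_CODE = re.compile(r"\bVERR_[A-Z0-9_]+\b")


def find_vbox_error_codes(stderr_text):
    """Return the distinct VERR_* codes in stderr_text, in order of first appearance."""
    seen = []
    for code in VBOX_ERROR_CODE.findall(stderr_text):
        if code not in seen:
            seen.append(code)
    return seen


sample = (
    "VBoxManage.exe: error: AMD-V is disabled in the BIOS "
    "(or by the host OS) (VERR_SVM_DISABLED)"
)
print(find_vbox_error_codes(sample))  # ['VERR_SVM_DISABLED']
```

Paste the stderr output from the task's result page into `sample` to see which codes show up.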
Grant
Darwin NT
ID: 104204
Greg_BE
Joined: 30 May 06
Posts: 5661
Credit: 5,699,284
RAC: 2,079
Message 104206 - Posted: 13 Jan 2022, 19:12:26 UTC - in response to Message 104202.  

I see Sidock gave me a load of Eprot tasks, which are 15 times longer than the 3CLpro ones. According to the server status there's just a bit of each to get done.

I've sent a private message to the "admin" user in here to see if they'll come and check this thread and work out why a lot of us are having difficulty with the pythons. Not sure if that mailbox is read. It's the account that posts news articles and removes off topic posts.



admin really doesn't monitor anything here.
I hit him up about a spammer in the news section: no action.
I hit him up some time back about being banned from Python and got nothing, and then he changed some stuff.

As I have said before, it will take more than just 2 computers to skew the results.
If the wingpersons running the tasks after you error out complete them ok, then it falls under the acceptable error rate. If every single computer was to come back with errors on multiple tasks, then they might take notice.

News and help here are pretty much non-existent. We are on our own to figure it out.
If you're getting VBox errors, then go over to the Oracle VirtualBox forum and post the section of the stderr file that shows the state of the machine, along with all the gibberish below it with the location errors and so on.
They might be able to help you better.
ID: 104206
dcdc
Joined: 3 Nov 05
Posts: 1829
Credit: 115,445,585
RAC: 54,963
Message 104207 - Posted: 13 Jan 2022, 19:46:48 UTC - in response to Message 104196.  

Have you tried running the leomoon app to check VBox works?
ID: 104207
dcdc
Joined: 3 Nov 05
Posts: 1829
Credit: 115,445,585
RAC: 54,963
Message 104211 - Posted: 13 Jan 2022, 19:46:56 UTC - in response to Message 104196.  
Last modified: 13 Jan 2022, 19:47:34 UTC

Oops, spammed!
ID: 104211
Mr P Hucker
Joined: 12 Aug 06
Posts: 1600
Credit: 9,692,114
RAC: 7,360
Message 104212 - Posted: 13 Jan 2022, 20:22:24 UTC - in response to Message 104211.  

Oops, spammed!
If you edit your message (you have to do it fairly soon) and change it to contain two spaces and nothing else, it will be automatically deleted.
ID: 104212
Mr P Hucker
Joined: 12 Aug 06
Posts: 1600
Credit: 9,692,114
RAC: 7,360
Message 104214 - Posted: 13 Jan 2022, 20:35:09 UTC
Last modified: 13 Jan 2022, 20:55:27 UTC

This address has a lot of contacts:

https://www.bakerlab.org/index.php/contact/

I have emailed David Baker directly.
ID: 104214
Greg_BE
Joined: 30 May 06
Posts: 5661
Credit: 5,699,284
RAC: 2,079
Message 104216 - Posted: 13 Jan 2022, 22:54:49 UTC - in response to Message 104214.  

This address has a lot of contacts:

https://www.bakerlab.org/index.php/contact/

I have emailed David Baker directly.



Highly unlikely he will do anything.
He's too far up the food chain.
That's like emailing a CEO about a webpage problem.
I think ipdamin is the only solution and again good luck with that.
ID: 104216
Mr P Hucker
Joined: 12 Aug 06
Posts: 1600
Credit: 9,692,114
RAC: 7,360
Message 104217 - Posted: 13 Jan 2022, 23:36:46 UTC - in response to Message 104216.  

This address has a lot of contacts:

https://www.bakerlab.org/index.php/contact/

I have emailed David Baker directly.
Highly unlikely he will do anything.
He's too far up the food chain.
That's like emailing a CEO about a webpage problem.
I think ipdamin is the only solution and again good luck with that.

I'm sure David will be interested to know about the problem. If he doesn't want to deal with it, he can pass it down the food chain to where it's relevant. If I don't hear anything in a while, I'll try the admin. I doubt the media contact will be of any help.
ID: 104217
Michael E.@ team Carl Sagan
Joined: 5 Apr 08
Posts: 16
Credit: 1,844,757
RAC: 183
Message 104220 - Posted: 14 Jan 2022, 3:30:37 UTC - in response to Message 104195.  

In your Task list, click on your computer number.
There will be an option to "Skip VirtualBox VM jobs". Make sure you don't skip them.


Thank you Jim1348!

The colorful Skip button at the bottom seems to be a toggle switch.
ID: 104220
Azrael
Joined: 5 Jan 06
Posts: 2
Credit: 13,963,176
RAC: 8,440
Message 104226 - Posted: 14 Jan 2022, 15:33:35 UTC
Last modified: 14 Jan 2022, 15:47:09 UTC

Hello everyone,
I have multiple work units that have been crunching for more than 1 day (one for 3 days), while normal is 3 to 6 hours.
They report only minutes remaining ... but never finish.

See https://boinc.bakerlab.org/rosetta/result.php?resultid=1463789734

Screenshot https://imgur.com/a/1YMF5B9

What should I do?
ID: 104226
dcdc
Joined: 3 Nov 05
Posts: 1829
Credit: 115,445,585
RAC: 54,963
Message 104227 - Posted: 14 Jan 2022, 15:51:01 UTC - in response to Message 104226.  
Last modified: 14 Jan 2022, 15:51:16 UTC

You'll have to abort them unfortunately, unless anyone here has any tricks to get them to finish? I've had to abort loads recently.
ID: 104227
Azrael
Joined: 5 Jan 06
Posts: 2
Credit: 13,963,176
RAC: 8,440
Message 104228 - Posted: 14 Jan 2022, 16:06:02 UTC - in response to Message 104227.  

Thanks, I'll do that. Have a good weekend!
ID: 104228
Jim1348
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 104229 - Posted: 14 Jan 2022, 17:17:38 UTC - in response to Message 104227.  
Last modified: 14 Jan 2022, 17:18:43 UTC

You'll have to abort them unfortunately, unless anyone here has any tricks to get them to finish? I've had to abort loads recently.

No tricks that I have found. But the project seems to be doing more server aborts (202) to weed them out. There are fewer than there used to be.
It looks like they were created with the wrong permissions, or missing files, or something like that.
They should get them fixed eventually.
ID: 104229
Greg_BE
Joined: 30 May 06
Posts: 5661
Credit: 5,699,284
RAC: 2,079
Message 104230 - Posted: 14 Jan 2022, 17:24:30 UTC - in response to Message 104226.  

Hello everyone,
I have multiple work units that have been crunching for more than 1 day (one for 3 days), while normal is 3 to 6 hours.
They report only minutes remaining ... but never finish.

See https://boinc.bakerlab.org/rosetta/result.php?resultid=1463789734

Screenshot https://imgur.com/a/1YMF5B9

What should I do?



Download and install eFMer's BoincTasks program. Then you can look at CPU usage. If it's 1% or thereabouts, the task is stuck in a loop, will not finish, and will have to be aborted.
And if you set the task progress display to x.xx%, you can see on each update (every 2 seconds or whatever) whether the percentage increases at all, or only by something like 0.02% per update. Again, if you see that kind of value, you have to abort it.
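As a rough sketch of that rule of thumb, the function below flags a task only when both signs show up together: CPU usage stays near idle and the progress figure barely moves. The name `looks_stuck`, the sample format, and both thresholds are assumptions for illustration. A healthy long task can also advance by tiny steps per update, so treat the result as a hint, not a verdict.

```python
def looks_stuck(samples, cpu_threshold=2.0, min_progress_gain=0.05):
    """Guess whether a task is stuck.

    samples: list of (cpu_percent, progress_percent) readings in time order,
             taken from whatever monitoring tool you use.
    cpu_threshold: CPU % at or below which the task counts as idle (assumed value).
    min_progress_gain: total progress gain, in percentage points, at or below
                       which the task counts as not advancing (assumed value).
    """
    if len(samples) < 2:
        return False  # not enough readings to judge

    max_cpu = max(cpu for cpu, _ in samples)
    gain = samples[-1][1] - samples[0][1]

    # Conservative choice: require BOTH low CPU and negligible progress.
    return max_cpu <= cpu_threshold and gain <= min_progress_gain


idle_task = [(1.0, 99.98), (0.9, 99.98), (1.1, 99.99)]
busy_task = [(98.0, 40.0), (99.0, 40.5)]
print(looks_stuck(idle_task), looks_stuck(busy_task))  # True False
```

Collecting the readings over several minutes instead of a few seconds makes the check less likely to misfire on a slow but healthy task.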

But just out of curiosity, have you shut off your system without first suspending BOINC, and then shutting down the connected client before exiting the program?

If you haven't shut down like that, it's just a bug. I have had that happen with LHC ATLAS tasks as well. It's just one of those things.
ID: 104230
Greg_BE
Joined: 30 May 06
Posts: 5661
Credit: 5,699,284
RAC: 2,079
Message 104231 - Posted: 14 Jan 2022, 17:27:27 UTC
Last modified: 14 Jan 2022, 17:37:24 UTC

Rosetta@home: Notice from server
rosetta python projects needs 18329.51MB more disk space. You currently have 743.98 MB available and it needs 19073.49 MB.


WTH? It needs 19 gigs of disk space? No freaking way! That's out of control!

Wow! Now LHC is kicking up.

It was trying to run 8 pythons at one time. And I see now the drive is full, so maybe I'll just buy a new 2TB digital drive and put BOINC on that on its own; my old drive can run Windows and the physical drive can stay as storage like it is now.
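The figures in the server notice above are at least self-consistent, and a quick check makes the size easier to picture. This is only a sketch: it assumes the notice's "MB" means 2**20 bytes, which the thread does not confirm, and `free_mib` is a hypothetical helper for looking at your own drive.

```python
import shutil

MIB = 2**20  # assumption: the notice counts "MB" as 2**20 bytes

needed_mb = 19073.49     # from the server notice
available_mb = 743.98    # from the server notice

shortfall_mb = needed_mb - available_mb
needed_gb_decimal = needed_mb * MIB / 1e9  # same amount in decimal gigabytes

print(f"shortfall: {shortfall_mb:.2f} MB")       # shortfall: 18329.51 MB
print(f"requested: {needed_gb_decimal:.2f} GB")  # requested: 20.00 GB


def free_mib(path="."):
    """Free space on the drive holding `path`, in units of 2**20 bytes."""
    return shutil.disk_usage(path).free / MIB
```

Under that assumption the request works out to a round 20 GB, and the 18329.51 MB shortfall is simply the requested amount minus what was free.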
ID: 104231
Jim1348
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 104232 - Posted: 14 Jan 2022, 17:47:30 UTC - in response to Message 104231.  

WTH? It needs 19 gigs of disk space? No freaking way! That's out of control!

Wow! Now LHC is kicking up.

Been there, done that.
https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=5736&postid=45603#45603
ID: 104232
Greg_BE
Joined: 30 May 06
Posts: 5661
Credit: 5,699,284
RAC: 2,079
Message 104233 - Posted: 14 Jan 2022, 18:35:21 UTC - in response to Message 104232.  
Last modified: 14 Jan 2022, 18:43:43 UTC

WTH? It needs 19 gigs of disk space? No freaking way! That's out of control!

Wow! Now LHC is kicking up.

Been there, done that.
https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=5736&postid=45603#45603


OK, interesting. So I manually set it to use 200GB and leave 8 free.
But what's interesting is that even before I did that, it went back to 10 Python, 1 QuChem (like ATLAS, I can only run one at a time without running into errors), 1 ATLAS, and I found a GPU WCG COVID task, along with the usual PrimeGrid, and now it's out of space again... sheesh. Going to have to put a project restrictor on RAH.
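One way to do that kind of restriction is BOINC's optional app_config.xml, whose project_max_concurrent element caps how many tasks from one project run at once. The sketch below just builds the file contents; the limit of 4 is a made-up example value, and where the file goes (the project's folder inside the BOINC data directory) depends on your install.

```python
import xml.etree.ElementTree as ET


def build_app_config(max_concurrent):
    """Return app_config.xml text that caps a project's concurrent tasks."""
    root = ET.Element("app_config")
    limit = ET.SubElement(root, "project_max_concurrent")
    limit.text = str(max_concurrent)
    return ET.tostring(root, encoding="unicode")


# Hypothetical limit: at most 4 Rosetta tasks at a time.
config_text = build_app_config(4)
print(config_text)
```

After saving the output as app_config.xml, the client has to re-read its config files (or be restarted) before the cap takes effect.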

Got enough RAM now. 60% usage. Guess I'll have to up the drive capacity.

What do you think of a WESTERN DIGITAL 1000GB Caviar Blue HDD? $49.50/43.40 Euro
SSDs cost much more for the same capacity.
Do HDDs hold up better than an SSD under loads of read/write operations?
ID: 104233



©2024 University of Washington
https://www.bakerlab.org