Problems and Technical Issues with Rosetta@home

Mr P Hucker
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 3,317
Message 104212 - Posted: 13 Jan 2022, 20:22:24 UTC - in response to Message 104211.  

Oops, spammed!
If you edit your message (you have to do it fairly soon) and change it to contain two spaces and nothing else, it will be automatically deleted.

Mr P Hucker
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 3,317
Message 104214 - Posted: 13 Jan 2022, 20:35:09 UTC
Last modified: 13 Jan 2022, 20:55:27 UTC

This address has a lot of contacts:

https://www.bakerlab.org/index.php/contact/

I have emailed David Baker directly.

Greg_BE
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 104216 - Posted: 13 Jan 2022, 22:54:49 UTC - in response to Message 104214.  

> This address has a lot of contacts:
>
> https://www.bakerlab.org/index.php/contact/
>
> I have emailed David Baker directly.

Highly unlikely he will do anything.
He's too far up the food chain.
That's like emailing a CEO about a webpage problem.
I think ipdamin is the only solution and, again, good luck with that.

Mr P Hucker
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 3,317
Message 104217 - Posted: 13 Jan 2022, 23:36:46 UTC - in response to Message 104216.  

> > This address has a lot of contacts:
> >
> > https://www.bakerlab.org/index.php/contact/
> >
> > I have emailed David Baker directly.
>
> Highly unlikely he will do anything.
> He's too far up the food chain.
> That's like emailing a CEO about a webpage problem.
> I think ipdamin is the only solution and, again, good luck with that.

I'm sure David will be interested to know about the problem. If he doesn't want to deal with it, he can pass it down the food chain to where it's relevant. If I don't hear anything in a while, I'll try the admin. I doubt the media contact will be of any help.

Michael E.@ team Carl Sagan
Joined: 5 Apr 08
Posts: 16
Credit: 1,947,553
RAC: 71
Message 104220 - Posted: 14 Jan 2022, 3:30:37 UTC - in response to Message 104195.  

> In your Task list, click on your computer number.
> There will be an option to "Skip VirtualBox VM jobs". Make sure you don't skip them.

Thank you, Jim1348!

The colorful Skip button at the bottom seems to be a toggle switch.

Azrael
Joined: 5 Jan 06
Posts: 2
Credit: 15,727,785
RAC: 304
Message 104226 - Posted: 14 Jan 2022, 15:33:35 UTC
Last modified: 14 Jan 2022, 15:47:09 UTC

Hello Everyone,
I have multiple workunits that have been crunching for more than a day (one for 3 days), while the normal run time is 3 to 6 hours.
They report only minutes until they are finished ... but they never finish.

See https://boinc.bakerlab.org/rosetta/result.php?resultid=1463789734

Screenshot https://imgur.com/a/1YMF5B9

What should I do?

dcdc
Joined: 3 Nov 05
Posts: 1832
Credit: 119,860,059
RAC: 2,520
Message 104227 - Posted: 14 Jan 2022, 15:51:01 UTC - in response to Message 104226.  
Last modified: 14 Jan 2022, 15:51:16 UTC

You'll have to abort them unfortunately, unless anyone here has any tricks to get them to finish? I've had to abort loads recently.

Azrael
Joined: 5 Jan 06
Posts: 2
Credit: 15,727,785
RAC: 304
Message 104228 - Posted: 14 Jan 2022, 16:06:02 UTC - in response to Message 104227.  

Thanks, I'll do that. Have a good weekend!

Jim1348
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 104229 - Posted: 14 Jan 2022, 17:17:38 UTC - in response to Message 104227.  
Last modified: 14 Jan 2022, 17:18:43 UTC

> You'll have to abort them unfortunately, unless anyone here has any tricks to get them to finish? I've had to abort loads recently.

No tricks that I have found. But the project seems to be doing more server aborts (202) to weed them out. There are fewer than there used to be.
It looks like they were created with the wrong permissions, or missing files, or something like that.
They should get them fixed eventually.

Greg_BE
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 104230 - Posted: 14 Jan 2022, 17:24:30 UTC - in response to Message 104226.  

> Hello Everyone,
> I have multiple workunits that have been crunching for more than a day (one for 3 days), while the normal run time is 3 to 6 hours.
> They report only minutes until they are finished ... but they never finish.
>
> See https://boinc.bakerlab.org/rosetta/result.php?resultid=1463789734
>
> Screenshot https://imgur.com/a/1YMF5B9
>
> What should I do?

Download and install eFMer's BoincTasks program. Then you can look at CPU usage. If it's around 1%, the task is stuck in a loop, will not finish, and will have to be aborted.
And if you set the task % display to x.xx%, you can see each time the program updates (every 2 seconds or whatever) whether the % increases at all or only by .02% per 2-second update. Again, if you see that kind of value, you have to abort it.

But just out of curiosity, have you shut off your system without first suspending BOINC and then shutting down the connected client before exiting the program?

If you haven't shut down like that, it's just a bug. I have had that happen with LHC ATLAS tasks as well. It's just one of those things.
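
The rule of thumb above can be sketched in Python. This is only an illustration: the function name `looks_stuck`, the sample lists, and the default thresholds (roughly 1% CPU, roughly 0.02% progress per refresh) are assumptions taken from the numbers in this post, not something BoincTasks or BOINC provides.

```python
def looks_stuck(cpu_percent_samples, progress_samples,
                cpu_threshold=1.5, step_threshold=0.02):
    """Return True if a task shows near-zero CPU use and almost no progress.

    cpu_percent_samples: CPU usage readings (percent), one per refresh.
    progress_samples: task progress readings (percent), one per refresh.
    Both thresholds are hypothetical defaults based on the post above.
    """
    if len(progress_samples) < 2 or not cpu_percent_samples:
        return False  # not enough readings to judge

    avg_cpu = sum(cpu_percent_samples) / len(cpu_percent_samples)
    # Largest progress increase seen between two consecutive refreshes.
    max_step = max(b - a for a, b in zip(progress_samples, progress_samples[1:]))
    # Small tolerance so a step of exactly 0.02 is not lost to float rounding.
    return avg_cpu <= cpu_threshold and max_step <= step_threshold + 1e-9


# A task idling at about 1% CPU whose progress never moves.
print(looks_stuck([1.0, 0.8, 1.2], [99.98, 99.98, 99.98]))
# A healthy task using a full core and advancing steadily.
print(looks_stuck([97.0, 99.0], [10.0, 10.5, 11.0]))
```

The readings would have to be copied by hand from the task list; a few refreshes' worth is enough to tell a stalled task from a slow one.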

Greg_BE
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 104231 - Posted: 14 Jan 2022, 17:27:27 UTC
Last modified: 14 Jan 2022, 17:37:24 UTC

Rosetta@home: Notice from server
rosetta python projects needs 18329.51MB more disk space. You currently have 743.98 MB available and it needs 19073.49 MB.


WTH? It needs 19 gigs of disk space? No freaking way! That's out of control!

Wow! Now LHC is kicking up.

It was trying to run 8 pythons at one time. And I see now the drive is full, so maybe I'll just buy a new 2TB digital drive and put BOINC on that on its own; my old drive can run Windows and the physical drive can stay as storage like it is now.
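
As a quick check of the numbers in that notice (a sketch only; the variable names are made up for illustration, and the 1024 × 1024 divisor is an assumption about how the server counts "MB"): the shortfall is the requirement minus what is available, and 19073.49 MB is what 20,000,000,000 bytes comes to in units of 1024 × 1024 bytes, so the request looks like a round 20 GB.

```python
MIB = 1024 * 1024  # bytes per "MB" as the notice appears to count them (assumption)

needed_mb = 19073.49     # "it needs 19073.49 MB"
available_mb = 743.98    # "You currently have 743.98 MB available"

# How much more space the notice is asking for.
shortfall_mb = needed_mb - available_mb
print(f"shortfall: {shortfall_mb:.2f} MB")

# 20 decimal gigabytes expressed in units of 1024*1024 bytes.
twenty_gb_in_mb = 20_000_000_000 / MIB
print(f"20 GB = {twenty_gb_in_mb:.2f} MB")
```

The first figure matches the 18329.51MB in the notice, and the second matches the 19073.49 MB requirement.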

Jim1348
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 104232 - Posted: 14 Jan 2022, 17:47:30 UTC - in response to Message 104231.  

> WTH? It needs 19 gigs of disk space? No freaking way! That's out of control!
>
> Wow! Now LHC is kicking up.

Been there, done that.
https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=5736&postid=45603#45603

Greg_BE
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 104233 - Posted: 14 Jan 2022, 18:35:21 UTC - in response to Message 104232.  
Last modified: 14 Jan 2022, 18:43:43 UTC

> > WTH? It needs 19 gigs of disk space? No freaking way! That's out of control!
> >
> > Wow! Now LHC is kicking up.
>
> Been there, done that.
> https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=5736&postid=45603#45603

OK, interesting. So I set it manually to use 200GB and leave 8 free.
But what is interesting is that even before I did that, it went back to 10 python, 1 QuChem (like ATLAS, I can only run one at a time without running into errors), 1 ATLAS, and I found a GPU WCG COVID task, along with the usual PrimeGrid, and now it's out of space again... sheesh. Going to have to put a project restrictor on RAH.

Got enough RAM now. 60% usage. Guess I'll have to up the drive capacity.

What do you think of a Western Digital 1000GB Caviar Blue HDD? $49.50 / 43.40 Euro.
SSDs cost much more for the same capacity.
Do HDDs hold up better under loads of read/write operations than an SSD?
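
One way to put a "project restrictor" on RAH is a BOINC `app_config.xml` that caps how many python tasks run at once. The sketch below generates such a file with Python. The app name `rosetta_python_projects` is the one the BOINC client lists in a log message later in this thread; the helper name `build_app_config` and the limit of 2 are purely illustrative assumptions.

```python
import xml.etree.ElementTree as ET


def build_app_config(limits):
    """Build app_config.xml text from a {app_name: max_concurrent} mapping."""
    root = ET.Element("app_config")
    for app_name, max_concurrent in limits.items():
        app = ET.SubElement(root, "app")
        ET.SubElement(app, "name").text = app_name
        ET.SubElement(app, "max_concurrent").text = str(max_concurrent)
    ET.indent(root, space="  ")  # Python 3.9+: puts each tag on its own line
    return ET.tostring(root, encoding="unicode") + "\n"


# The limit of 2 is an arbitrary example value, not a recommendation.
xml_text = build_app_config({"rosetta_python_projects": 2})
print(xml_text)
```

The output would be saved as `app_config.xml` in the Rosetta project folder inside the BOINC data directory, and the client should pick it up after a restart or after re-reading the config files from the Manager. Fewer python tasks running at once also means fewer VM images on disk at the same time.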

Mr P Hucker
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 3,317
Message 104234 - Posted: 14 Jan 2022, 18:44:06 UTC - in response to Message 104220.  

> > In your Task list, click on your computer number.
> > There will be an option to "Skip VirtualBox VM jobs". Make sure you don't skip them.
>
> Thank you, Jim1348!
>
> The colorful Skip button at the bottom seems to be a toggle switch.

Since I'm using Gridcoin, I can't access that button. Once I'm banned, I'm banned forever :-(

Mr P Hucker
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 3,317
Message 104235 - Posted: 14 Jan 2022, 18:45:46 UTC - in response to Message 104229.  

> > You'll have to abort them unfortunately, unless anyone here has any tricks to get them to finish? I've had to abort loads recently.
>
> No tricks that I have found. But the project seems to be doing more server aborts (202) to weed them out. There are fewer than there used to be.
> It looks like they were created with the wrong permissions, or missing files, or something like that.
> They should get them fixed eventually.

I get the odd one on my "good" i5. But I get them all like that on the other 6 computers, including a brand new Ryzen 9. Whatever the bug is, it's related to the computer in use. Or there are two bugs.

Mr P Hucker
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 3,317
Message 104236 - Posted: 14 Jan 2022, 18:46:48 UTC - in response to Message 104230.  

> Download and install eFMer's BoincTasks program. Then you can look at CPU usage. If it's around 1%, the task is stuck in a loop, will not finish, and will have to be aborted.
> And if you set the task % display to x.xx%, you can see each time the program updates (every 2 seconds or whatever) whether the % increases at all or only by .02% per 2-second update. Again, if you see that kind of value, you have to abort it.

+1. That program is amazing. I have 9 machines all displayed on one screen, very easy to watch what everything is doing, all colour coded, loads of information, I can control the whole lot very easily.

Mr P Hucker
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 3,317
Message 104237 - Posted: 14 Jan 2022, 18:48:12 UTC - in response to Message 104231.  
Last modified: 14 Jan 2022, 18:53:22 UTC

> Rosetta@home: Notice from server
> rosetta python projects needs 18329.51MB more disk space. You currently have 743.98 MB available and it needs 19073.49 MB.
>
> WTH? It needs 19 gigs of disk space? No freaking way! That's out of control!
>
> Wow! Now LHC is kicking up.
>
> It was trying to run 8 pythons at one time. And I see now the drive is full, so maybe I'll just buy a new 2TB digital drive and put BOINC on that on its own; my old drive can run Windows and the physical drive can stay as storage like it is now.

With LHC and Rosetta running on 24 cores, my 0.5TB SSD got too full, the only other thing on it is Fallout 4, a rather large computer game. So Boinc is demoted to the rust spinner, which unfortunately is making a little noise with write access once every second. ARGH!!! Boinc never listens to what I tell it to do!

Mr P Hucker
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 3,317
Message 104238 - Posted: 14 Jan 2022, 18:51:51 UTC - in response to Message 104233.  
Last modified: 14 Jan 2022, 18:52:05 UTC

> > > WTH? It needs 19 gigs of disk space? No freaking way! That's out of control!
> > >
> > > Wow! Now LHC is kicking up.
> >
> > Been there, done that.
> > https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=5736&postid=45603#45603
>
> OK, interesting. So I set it manually to use 200GB and leave 8 free.
> But what is interesting is that even before I did that, it went back to 10 python, 1 QuChem (like ATLAS, I can only run one at a time without running into errors), 1 ATLAS, and I found a GPU WCG COVID task, along with the usual PrimeGrid, and now it's out of space again... sheesh. Going to have to put a project restrictor on RAH.
>
> Got enough RAM now. 60% usage. Guess I'll have to up the drive capacity.
>
> What do you think of a Western Digital 1000GB Caviar Blue HDD? $49.50 / 43.40 Euro.
> SSDs cost much more for the same capacity.
> Do HDDs hold up better under loads of read/write operations than an SSD?

I think SSDs are actually now more reliable than HDDs, so get one if you can afford it, NVMe preferably, which is 8 times faster again; you can get adapter cards if your motherboard doesn't have a connector. And I've bought 4TB Sonnics HDDs for not much more; check on eBay. They are very good. I've used 7 of them; one was DOA, probably a clumsy postman, but they replaced it immediately, and the others have worked flawlessly ever since.

Greg_BE
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 104239 - Posted: 14 Jan 2022, 18:53:17 UTC - in response to Message 104236.  

> > Download and install eFMer's BoincTasks program. Then you can look at CPU usage. If it's around 1%, the task is stuck in a loop, will not finish, and will have to be aborted.
> > And if you set the task % display to x.xx%, you can see each time the program updates (every 2 seconds or whatever) whether the % increases at all or only by .02% per 2-second update. Again, if you see that kind of value, you have to abort it.
>
> +1. That program is amazing. I have 9 machines all displayed on one screen, very easy to watch what everything is doing, all colour coded, loads of information, I can control the whole lot very easily.

Very good! I use it to monitor the progress and also for the project update feature. Also, I like the color coding.
My main use for that program is to monitor RAM and CPU usage and project %.

.clair.
Joined: 2 Jan 07
Posts: 274
Credit: 26,399,595
RAC: 0
Message 104240 - Posted: 14 Jan 2022, 18:54:08 UTC - in response to Message 104231.  
Last modified: 14 Jan 2022, 19:09:03 UTC

> Rosetta@home: Notice from server
> rosetta python projects needs 18329.51MB more disk space. You currently have 743.98 MB available and it needs 19073.49 MB.
> WTH? It needs 19 gigs of disk space? No freaking way! That's out of control!
> Wow! Now LHC is kicking up.
> It was trying to run 8 pythons at one time. And I see now the drive is full, so maybe I'll just buy a new 2TB digital drive and put BOINC on that on its own; my old drive can run Windows and the physical drive can stay as storage like it is now.

Been getting those messages for a while. Here are some I copied; the first is only ten minutes after a reboot, even with 200GB free. Even rosetta got in on the act.
-------------------
26/12/2021 16:20:28 | | Starting BOINC client version 7.14.2 for windows_x86_64
26/12/2021 16:20:28 | | log flags: file_xfer, sched_ops, task
26/12/2021 16:20:28 | | Libraries: libcurl/7.47.1 OpenSSL/1.0.2g zlib/1.2.8
26/12/2021 16:20:28 | | Data directory: C:\ProgramData\BOINC
26/12/2021 16:20:28 | | Running under account clair
26/12/2021 16:20:30 | | CAL: ATI GPU 0: AMD Radeon HD 7870/7950/7970/R9 280/R9 280X series (Tahiti) (CAL version 1.4.1741, 3072MB, 3032MB available, 9472 GFLOPS peak)
26/12/2021 16:20:30 | | CAL: ATI GPU 1: AMD Radeon HD 7870/7950/7970/R9 280/R9 280X series (Tahiti) (CAL version 1.4.1741, 3072MB, 3032MB available, 9472 GFLOPS peak)
26/12/2021 16:20:30 | | OpenCL: AMD/ATI GPU 0: AMD Radeon HD 7870/7950/7970/R9 280/R9 280X series (Tahiti) (driver version 1084.4 (VM), device version OpenCL 1.2 AMD-APP (1084.4), 3072MB, 3032MB available, 9472 GFLOPS peak)
26/12/2021 16:20:30 | | OpenCL: AMD/ATI GPU 1: AMD Radeon HD 7870/7950/7970/R9 280/R9 280X series (Tahiti) (driver version 1084.4 (VM), device version OpenCL 1.2 AMD-APP (1084.4), 3072MB, 3032MB available, 9472 GFLOPS peak)
26/12/2021 16:20:30 | | Host name: KAMINSKI
26/12/2021 16:20:30 | | Processor: 48 GenuineIntel Intel(R) Xeon(R) CPU E5-2697 v2 @ 2.70GHz [Family 6 Model 62 Stepping 4]
26/12/2021 16:20:30 | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 cx16 sse4_1 sse4_2 popcnt aes f16c rdrand syscall nx lm avx vmx smx tm2 dca pbe fsgsbase smep
26/12/2021 16:20:30 | | OS: Microsoft Windows 7: Ultimate x64 Edition, Service Pack 1, (06.01.7601.00)
26/12/2021 16:20:30 | | Memory: 127.95 GB physical, 128.05 GB virtual
26/12/2021 16:20:30 | | Disk: 465.76 GB total, 226.80 GB free
26/12/2021 16:20:30 | | Local time is UTC +0 hours
26/12/2021 16:20:30 | | No WSL found.
26/12/2021 16:20:30 | | VirtualBox version: 5.2.8
26/12/2021 16:20:30 | Milkyway@Home | Found app_config.xml
26/12/2021 16:20:30 | | Config: use all coprocessors
26/12/2021 16:20:30 | Milkyway@Home | URL http://milkyway.cs.rpi.edu/milkyway/; Computer ID 742885; resource share 100
26/12/2021 16:20:30 | Rosetta@home | URL https://boinc.bakerlab.org/rosetta/; Computer ID 3279599; resource share 1
26/12/2021 16:20:30 | | General prefs: from http://einstein.phys.uwm.edu/ (last modified 28-Apr-2021 18:41:17)
26/12/2021 16:20:30 | | Host location: none
26/12/2021 16:20:30 | | General prefs: using your defaults
26/12/2021 16:20:30 | | Reading preferences override file
26/12/2021 16:20:30 | | Preferences:
26/12/2021 16:20:30 | | max memory usage when active: 129710.24 MB
26/12/2021 16:20:30 | | max memory usage when idle: 131020.45 MB
26/12/2021 16:20:32 | | max disk usage: 350.78 GB
26/12/2021 16:20:32 | | max CPUs used: 38
26/12/2021 16:20:32 | | (to change preferences, visit a project web site or select Preferences in the Manager)
26/12/2021 16:20:32 | | Setting up project and slot directories
26/12/2021 16:20:32 | | Checking active tasks
26/12/2021 16:20:32 | | Setting up GUI RPC socket
26/12/2021 16:20:32 | | Checking presence of 52 project files
26/12/2021 16:21:10 | Milkyway@Home | Sending scheduler request: To fetch work.
26/12/2021 16:21:10 | Milkyway@Home | Requesting new tasks for AMD/ATI GPU
26/12/2021 16:30:01 | Milkyway@Home | Scheduler request completed: got 0 new tasks
26/12/2021 16:31:26 | Rosetta@home | work fetch resumed by user
26/12/2021 16:31:26 | Rosetta@home | Sending scheduler request: To fetch work.
26/12/2021 16:31:26 | Rosetta@home | Requesting new tasks for CPU
26/12/2021 16:31:29 | Rosetta@home | Scheduler request completed: got 0 new tasks
26/12/2021 16:31:29 | Rosetta@home | No tasks sent
26/12/2021 16:31:29 | Rosetta@home | Message from server: Rosetta needs 1907.35MB more disk space. You currently have 0.00 MB available and it needs 1907.35 MB.
26/12/2021 16:31:29 | Rosetta@home | Message from server: rosetta python projects needs 19073.49MB more disk space. You currently have 0.00 MB available and it needs 19073.49 MB.


--------------------
and that is with - on the `Disk` tab
used by boinc - 123.97GB
free . available to BOINC - 226.19GB
used by other programs - 114.99GB
-------------------------------------

V Mem cut
C = 134GB free
Restart
-------------------------------
O wot fun

09/01/2022 00:36:42 | Rosetta@home | Your app_config.xml file refers to an unknown application 'max_bitch'. Known applications: 'rosetta_python_projects', 'rosetta'
---------------------------------
11/01/2022 03:09:28 | Rosetta@home | Message from server: rosetta python projects needs 19073.49MB more disk space. You currently have 0.00 MB available and it needs 19073.49 MB.
--------------------------------------
13/01/2022 10:52:12 | Rosetta@home | Message from server: rosetta python projects needs 19073.49MB more disk space. You currently have 0.00 MB available and it needs 19073.49 MB.
------------------------------------
Makes me think: is it the server that is low on disk space, not us?
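
For anyone who wants to pull these disk-space messages out of a saved event log, here is a rough Python sketch. The regular expression is written against the message wording shown in the log lines above; the function name `parse_disk_message` and the dictionary keys are invented for illustration, and other BOINC versions may word the message differently.

```python
import re

# Matches "<app> needs X MB more disk space. You currently have Y MB
# available and it needs Z MB", with or without spaces after the
# colon and the full stop.
DISK_MESSAGE = re.compile(
    r"Message from server:\s*(?P<app>.+?) needs (?P<more>[\d.]+)\s*MB more disk space\."
    r"\s*You currently have (?P<have>[\d.]+) MB available"
    r" and it needs (?P<need>[\d.]+) MB"
)


def parse_disk_message(line):
    """Return the app name and MB figures from one log line, or None."""
    match = DISK_MESSAGE.search(line)
    if match is None:
        return None
    return {
        "app": match.group("app"),
        "more_mb": float(match.group("more")),
        "have_mb": float(match.group("have")),
        "need_mb": float(match.group("need")),
    }


sample = ("11/01/2022 03:09:28 | Rosetta@home | Message from server: "
          "rosetta python projects needs 19073.49MB more disk space. "
          "You currently have 0.00 MB available and it needs 19073.49 MB.")
print(parse_disk_message(sample))
```

Running every line of the log through it and keeping the results that are not `None` gives a quick list of which app asked for how much and when the client reported 0.00 MB available.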



©2025 University of Washington
https://www.bakerlab.org