Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 322 · 323 · 324 · 325 · 326 · 327 · 328 · Next

AuthorMessage
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1804
Credit: 18,534,891
RAC: 2
Message 112330 - Posted: 28 Mar 2025, 21:15:09 UTC

Whatever the problem is, i still think it's all to do with the problem we're having with the lack of work that's Ready to send.
These issues both seem to have occurred around the same time frame.

Millions of Tasks queued up to process, but none Ready to send, and the Assimilators not keeping up with the very minimal load they have at present.
It's all related to Rosetta's servers and their issues.


Looking at the graphs, you can see periods where the Assimilator backlog clears, and there are spurts of Tasks ready to send for an hour or so (although more likely it's for 10-20 min or less- the updating of the stats occurs vey infrequently. It needs to be every 5min or so to really see what is going on, not every hour or so).
And the Tasks In progress climbs for a while. Then it all falls over again.


It's been a week of these issues, and still nothing has been done to rectify it.
Grant
Darwin NT
ID: 112330 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Boone

Send message
Joined: 16 Mar 20
Posts: 2
Credit: 1,300,961
RAC: 1,529
Message 112332 - Posted: 28 Mar 2025, 22:09:57 UTC - in response to Message 112326.  
Last modified: 28 Mar 2025, 22:10:40 UTC

Hi,
some config in ipv6 is wrong on their end.
You have to edit your hosts-file to use ipv4:

Linux: sudo nano /etc/hosts
Windows: notepad C:WindowsSystem32driversetchosts

paste these lines:

#Rosetta@home
128.95.160.156 boinc-files.bakerlab.org
128.95.160.156 bwsrv1.bakerlab.org
ID: 112332 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
CMHDan
Avatar

Send message
Joined: 16 May 20
Posts: 3
Credit: 328,826
RAC: 4,942
Message 112333 - Posted: 28 Mar 2025, 23:05:12 UTC - in response to Message 112332.  
Last modified: 28 Mar 2025, 23:46:19 UTC

some config in ipv6 is wrong on their end.
It looks as if they haven't put in any IPv6 DNS entries for boinc-files.bakerlab.org, but have for some the other servers.


128.95.160.156 boinc-files.bakerlab.org
128.95.160.156 bwsrv1.bakerlab.org
What you have listed above for boinc-files doesn't match DNS, you may want to try removing (updating, or commenting out) that entry.

boinc.bakerlab.org
          2607:4000:406::160:156
          2607:4000:406::160:157
          128.95.160.156
          128.95.160.157

bwsrv1.bakerlab.org
          2607:4000:406::160:156
          128.95.160.156

bwsrv2.bakerlab.org
          2607:4000:406::160:157
          128.95.160.157

boinc-files.bakerlab.org
          128.95.160.134
          128.95.160.135

I know using DNS in the past hasn't always worked, I've even had to hard code the boinc-files IP in the hosts file to get things working, but I removed that a couple weeks ago, with no issues. For what it's worth, I'm not getting the error on any of my machines.

edit: grammar / dns entries
ID: 112333 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5727
Credit: 5,968,085
RAC: 1,692
Message 112334 - Posted: 28 Mar 2025, 23:42:21 UTC

ID: 112334 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 279
Credit: 528,663
RAC: 81
Message 112335 - Posted: 28 Mar 2025, 23:46:51 UTC

For linux uncomment precedence ::ffff:0:0/96 100 in /etc/gai.conf This will make resolver prefer ipv4 addresses and connect to ipv6 address only if ipv4 address is not available.
ID: 112335 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2293
Credit: 43,197,319
RAC: 26,132
Message 112336 - Posted: 29 Mar 2025, 0:08:31 UTC - in response to Message 112324.  

Personally, I can only applaud the imagination and ingenuity of people who come up with workable solutions for the rest of us.

Well, this took off while I was at work...
ID: 112336 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Laird o' th' wee White Hoose

Send message
Joined: 11 May 20
Posts: 3
Credit: 1,339,392
RAC: 17,660
Message 112340 - Posted: 29 Mar 2025, 12:26:18 UTC

Server Status and operating.

Server Status shows all servers are green. However, when we try to communicate with Rosetta, we are greeted with a message that the feeder is inactive.

We cannot upload or download WUs to the project, and relocated computers cannot add to the project.

Hopefully someone will have a good idea to get this excellent, and important, project back on track.
ID: 112340 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile The Ancient One

Send message
Joined: 4 Oct 05
Posts: 13
Credit: 1,272,625
RAC: 1,671
Message 112341 - Posted: 29 Mar 2025, 13:40:52 UTC - in response to Message 112340.  

I have 45 tasks waiting to report for the last 4 days. There must be a problem with your sever which needs investigation.
[/img]
ID: 112341 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Doyle

Send message
Joined: 2 Apr 20
Posts: 1
Credit: 3,392,639
RAC: 5,216
Message 112343 - Posted: 29 Mar 2025, 14:24:44 UTC - in response to Message 112334.  

This workaround corrected the problem immediately. Thank you!
ID: 112343 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jean-David Beyer

Send message
Joined: 2 Nov 05
Posts: 217
Credit: 7,404,102
RAC: 7,277
Message 112344 - Posted: 29 Mar 2025, 14:27:41 UTC - in response to Message 112332.  

You have to edit your hosts-file to use ipv4:

Linux: sudo nano /etc/hosts
Windows: notepad C:WindowsSystem32driversetchosts

paste these lines:

#Rosetta@home
128.95.160.156 boinc-files.bakerlab.org
128.95.160.156 bwsrv1.bakerlab.org


Thank you. I already had the first line in there, but adding the second allowed the

Sat 29 Mar 2025 10:15:43 AM EDT | Rosetta@home | Reporting 42 completed tasks


reports to go up/
ID: 112344 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Boone

Send message
Joined: 16 Mar 20
Posts: 2
Credit: 1,300,961
RAC: 1,529
Message 112345 - Posted: 29 Mar 2025, 16:58:17 UTC - in response to Message 112333.  

What you have listed above for boinc-files doesn't match DNS, you may want to try removing (updating, or commenting out) that entry.

There is no need to disable ipv6 at all, only these two Rosetta-Servers can be reached with ipv4 with these IPs.
It works only this way at the moment:
128.95.160.156 boinc-files.bakerlab.org
128.95.160.156 bwsrv1.bakerlab.org

Adding an ipv6-Entry (2607:4000:406::160:156 bwsrv1.bakerlab.org) will result in the same "feeder nor running"-Error.
ID: 112345 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
CMHDan
Avatar

Send message
Joined: 16 May 20
Posts: 3
Credit: 328,826
RAC: 4,942
Message 112346 - Posted: 29 Mar 2025, 18:05:03 UTC - in response to Message 112345.  

There is no need to disable ipv6 at all, only these two Rosetta-Servers can be reached with ipv4 with these IPs.
It works only this way at the moment:
128.95.160.156 boinc-files.bakerlab.org
128.95.160.156 bwsrv1.bakerlab.org


I agree that you shouldn't have to disable ipv6, but the second part I don't agree with. I only have ipv4 internet connectivity, and set up a packet sniffer to see what R@H IP's my machines are connecting to. So far I've seen packets to/from .156, .157 and .134, uploading/downloading/updating, seem to use .156 and .157, and reporting seems to use .134 (hoping it uses .135 as well, but I've only had a couple WU to report complete since I started the capture, so not enough data yet).
ID: 112346 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5727
Credit: 5,968,085
RAC: 1,692
Message 112348 - Posted: 29 Mar 2025, 18:58:04 UTC - in response to Message 112340.  

Server Status and operating.

Server Status shows all servers are green. However, when we try to communicate with Rosetta, we are greeted with a message that the feeder is inactive.

We cannot upload or download WUs to the project, and relocated computers cannot add to the project.

Hopefully someone will have a good idea to get this excellent, and important, project back on track.



The only simple fast and dirty way to get back and running is to read the link I posted here https://boinc.bakerlab.org/rosetta/forum_thread.php?id=6893&postid=112334

The experts here are looking at other stuff.
ID: 112348 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5727
Credit: 5,968,085
RAC: 1,692
Message 112349 - Posted: 29 Mar 2025, 19:02:04 UTC - in response to Message 112344.  

You have to edit your hosts-file to use ipv4:

Linux: sudo nano /etc/hosts
Windows: notepad C:WindowsSystem32driversetchosts

paste these lines:

#Rosetta@home
128.95.160.156 boinc-files.bakerlab.org
128.95.160.156 bwsrv1.bakerlab.org


Thank you. I already had the first line in there, but adding the second allowed the

Sat 29 Mar 2025 10:15:43 AM EDT | Rosetta@home | Reporting 42 completed tasks


reports to go up/



Windows: notepad C:WindowsSystem32driversetchosts <--- huh? try breaking it down by directory. I got as far as drivers and then nothing matched.

Besides, why should we have to messing around with this kind of stuff?
RAH is supposed to be plug and play and none of this deep windows stuff.
ID: 112349 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5727
Credit: 5,968,085
RAC: 1,692
Message 112350 - Posted: 29 Mar 2025, 19:03:41 UTC - in response to Message 112346.  

There is no need to disable ipv6 at all, only these two Rosetta-Servers can be reached with ipv4 with these IPs.
It works only this way at the moment:
128.95.160.156 boinc-files.bakerlab.org
128.95.160.156 bwsrv1.bakerlab.org


I agree that you shouldn't have to disable ipv6, but the second part I don't agree with. I only have ipv4 internet connectivity, and set up a packet sniffer to see what R@H IP's my machines are connecting to. So far I've seen packets to/from .156, .157 and .134, uploading/downloading/updating, seem to use .156 and .157, and reporting seems to use .134 (hoping it uses .135 as well, but I've only had a couple WU to report complete since I started the capture, so not enough data yet).



ipv6 enabled gets the server error message. disabling it cures that unless someone has a easy and simple clear way of solving the inability of baker lab to keep their stuff in order.
ID: 112350 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
CMHDan
Avatar

Send message
Joined: 16 May 20
Posts: 3
Credit: 328,826
RAC: 4,942
Message 112351 - Posted: 29 Mar 2025, 20:26:36 UTC - in response to Message 112350.  
Last modified: 29 Mar 2025, 20:32:00 UTC

ipv6 enabled gets the server error message. disabling it cures that unless someone has a easy and simple clear way of solving the inability of baker lab to keep their stuff in order.


Yeah, I get that. Doesn't matter as much why it isn't working [properly] vs what workarounds DO work. It just doesn't make sense why turning off ipv6 works, the hosts file entry should override DNS, so it should only try those ipv4 addresses and not even try ipv6.
The workaround sounds like only option for now (for those having the issue), until they have someone fix it. Smells of a DNS issue - or a misconfiguration, I think they need to have someone come in to check all the servers and networking, make sure everything is configured and running as needed.
Taking care of systems/networks like this is what I used to do, frustrating to not be able to fix things from the user side. If they needed help, I'd love to help (I'm sure many of us would). But from what I can see no one from the project is actually watching the forums, so...
ID: 112351 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1804
Credit: 18,534,891
RAC: 2
Message 112353 - Posted: 29 Mar 2025, 21:08:05 UTC - in response to Message 112351.  

But from what I can see no one from the project is actually watching the forums, so...
And they don't respond to emails.
This has been an issue for over a week, along with the lack of work Ready to send and the Assimilator backing up. And the download server issue? That's been over 6 months now.
And Ralph has been broken for around 9 or more months, and recently they just blocked it off (or something else broke).
Forbidden
You don't have permission to access this resource.
Is all you get when trying Ralph.
Grant
Darwin NT
ID: 112353 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
PMH_UK

Send message
Joined: 9 Aug 08
Posts: 19
Credit: 1,243,749
RAC: 0
Message 112354 - Posted: 29 Mar 2025, 21:30:06 UTC - in response to Message 112349.  

C:WindowsSystem32driversetchosts - had back-slashes removed when posted (just tried)

C:|Windows|System32|drivers|etc|hosts - change | to back-slash
Paul.
ID: 112354 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5727
Credit: 5,968,085
RAC: 1,692
Message 112355 - Posted: 29 Mar 2025, 23:22:35 UTC - in response to Message 112353.  

But from what I can see no one from the project is actually watching the forums, so...
And they don't respond to emails.
This has been an issue for over a week, along with the lack of work Ready to send and the Assimilator backing up. And the download server issue? That's been over 6 months now.
And Ralph has been broken for around 9 or more months, and recently they just blocked it off (or something else broke).
Forbidden
You don't have permission to access this resource.
Is all you get when trying Ralph.



Grant - this has been the case since they stopped putting grad students on here and in turn started their focus on AI. That's where the money is. This project just gets stuff that does not fit the AI model.

I once emailed the secretary (?) about something here and that was a black hole.
You notice Dr. B does not even post here anymore.
So we are on our own to figure out solutions to their problems.
ID: 112355 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5727
Credit: 5,968,085
RAC: 1,692
Message 112356 - Posted: 29 Mar 2025, 23:24:21 UTC - in response to Message 112351.  

ipv6 enabled gets the server error message. disabling it cures that unless someone has a easy and simple clear way of solving the inability of baker lab to keep their stuff in order.


Yeah, I get that. Doesn't matter as much why it isn't working [properly] vs what workarounds DO work. It just doesn't make sense why turning off ipv6 works, the hosts file entry should override DNS, so it should only try those ipv4 addresses and not even try ipv6.
The workaround sounds like only option for now (for those having the issue), until they have someone fix it. Smells of a DNS issue - or a misconfiguration, I think they need to have someone come in to check all the servers and networking, make sure everything is configured and running as needed.
Taking care of systems/networks like this is what I used to do, frustrating to not be able to fix things from the user side. If they needed help, I'd love to help (I'm sure many of us would). But from what I can see no one from the project is actually watching the forums, so...



Do a Timothy McGee or Penolpe Garcia and hack their system. lol
ID: 112356 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 322 · 323 · 324 · 325 · 326 · 327 · 328 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2025 University of Washington
https://www.bakerlab.org