Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 311 · 312 · 313 · 314 · 315 · 316 · 317 . . . 352 · Next

AuthorMessage
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2475
Credit: 46,506,558
RAC: 3,757
Message 111943 - Posted: 22 Jan 2025, 15:34:50 UTC - in response to Message 111942.  

It's Wednesday.
And the servers are down.

As usual

It's Wednesday. Again.

Is that actually a thing about Wednesdays?
Or is it just so frequent that you could say it about any day of the week and have a high chance of being right?
Anyway, you are right.
How tiresome it is - just as I'm starting to return tasks too.
ID: 111943 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 430
Credit: 14,933,398
RAC: 19
Message 111944 - Posted: 22 Jan 2025, 19:20:56 UTC

Rats!

The dreaded chi angle error strikes 54 seconds before the task was due to end :-

https://boinc.bakerlab.org/rosetta/result.php?resultid=1594094456
ID: 111944 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 2124
Credit: 12,426,657
RAC: 2,579
Message 111947 - Posted: 23 Jan 2025, 16:00:18 UTC - in response to Message 111944.  

The dreaded chi angle error strikes 54 seconds before the task was due to end :-

https://boinc.bakerlab.org/rosetta/result.php?resultid=1594094456


Yeap!
And after 30 minutes :-(
File: C:cygwin64homeboinc4.17Rosettamainsourcesrccore/pack/dunbrack/SingleResidueDunbrackLibrary.hh:306
chi angle must be between -180 and 180: -nan(ind)

ID: 111947 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1895
Credit: 18,534,891
RAC: 0
Message 111953 - Posted: 24 Jan 2025, 6:47:41 UTC - in response to Message 111916.  

It's Wednesday.
And the servers are down.

As usual
And it's still dead.
Grant
Darwin NT
ID: 111953 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2475
Credit: 46,506,558
RAC: 3,757
Message 111955 - Posted: 24 Jan 2025, 8:09:50 UTC - in response to Message 111953.  

It's Wednesday.
And the servers are down.

As usual
And it's still dead.

And the front page info is frozen at what it was 24hrs ago fwiw
New tasks have popped up over that time, so I'm still running, but insufficient to provide any stock either here or at the project
ID: 111955 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1895
Credit: 18,534,891
RAC: 0
Message 111957 - Posted: 24 Jan 2025, 9:34:18 UTC - in response to Message 111953.  

It's Wednesday.
And the servers are down.

As usual
And it's still dead.
And now it's alive again.
Till the next time.
Grant
Darwin NT
ID: 111957 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 2124
Credit: 12,426,657
RAC: 2,579
Message 111958 - Posted: 24 Jan 2025, 14:20:19 UTC - in response to Message 111957.  

Till the next time.


Waiting for the next Wednesday, to see if it is the day....
ID: 111958 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
WezH

Send message
Joined: 6 Apr 20
Posts: 6
Credit: 6,243,401
RAC: 12,043
Message 111959 - Posted: 25 Jan 2025, 13:22:10 UTC - in response to Message 111944.  

Rats!

The dreaded chi angle error strikes 54 seconds before the task was due to end :-

https://boinc.bakerlab.org/rosetta/result.php?resultid=1594094456


It seems that admins cancelled those tasks, I have now several "Cancelled by server" -tasks
ID: 111959 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 430
Credit: 14,933,398
RAC: 19
Message 111960 - Posted: 25 Jan 2025, 15:52:29 UTC - in response to Message 111959.  

Rats!

The dreaded chi angle error strikes 54 seconds before the task was due to end :-

https://boinc.bakerlab.org/rosetta/result.php?resultid=1594094456


It seems that admins cancelled those tasks, I have now several "Cancelled by server" -tasks


No cancelled tasks here yet but I had 4 WUs error out after a combines 19 hours of processing - that’s nearly a whole armful! 🤪
ID: 111960 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
WezH

Send message
Joined: 6 Apr 20
Posts: 6
Credit: 6,243,401
RAC: 12,043
Message 111961 - Posted: 25 Jan 2025, 17:32:18 UTC - in response to Message 111960.  

Rats!

The dreaded chi angle error strikes 54 seconds before the task was due to end :-

https://boinc.bakerlab.org/rosetta/result.php?resultid=1594094456


It seems that admins cancelled those tasks, I have now several "Cancelled by server" -tasks


No cancelled tasks here yet but I had 4 WUs error out after a combines 19 hours of processing - that’s nearly a whole armful! 🤪


I got several cancelled but then I got more tasks which have chi angle errors from other computers...

One task is odd, it was chi angle error but then my host did complete it succesfully?
ID: 111961 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1895
Credit: 18,534,891
RAC: 0
Message 111962 - Posted: 25 Jan 2025, 22:38:42 UTC - in response to Message 111959.  
Last modified: 25 Jan 2025, 22:39:58 UTC

Rats!

The dreaded chi angle error strikes 54 seconds before the task was due to end :-

https://boinc.bakerlab.org/rosetta/result.php?resultid=1594094456

It seems that admins cancelled those tasks, I have now several "Cancelled by server" -tasks
No, they didn't.
All the Tasks on your system that i saw as being cancelled were resends- the other system was late in returning them, so they were re-issued to you.
Then the other system finally returned them, your system hadn't started them yet, so they were cancelled.

It's an automatic process (which really should be configured better so it doesn't occur so often. And they shouldn't count as errors either IMHO).
Grant
Darwin NT
ID: 111962 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2475
Credit: 46,506,558
RAC: 3,757
Message 111966 - Posted: 27 Jan 2025, 0:25:23 UTC - in response to Message 111886.  

Front page is finally updated and 3million tasks are showing!

Did I imagine that?
Where'd they go?

And now the front page shows ~2.6m tasks.
Let's hope those don't suddenly disappear like the last batch did,

Time to run down my WCG stock to make room for Rosetta
ID: 111966 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
sb4TEVvgZ2g6Vq1hGgztbFLwdGF

Send message
Joined: 28 Dec 24
Posts: 2
Credit: 2,793,928
RAC: 2,788
Message 111971 - Posted: 28 Jan 2025, 3:26:04 UTC - in response to Message 111966.  
Last modified: 28 Jan 2025, 3:49:23 UTC

For the past month, I have been unable to download anything from Rosetta using Ubuntu 22.04. I am running Rosetta tasks on both my Mac and my Windows PC, but the Linux PC just won't get past these issues:

Rosetta@home | Not requesting tasks: some download is stalled
...
Rosetta@home | Resetting project
Rosetta@home | Detaching from project
________________ | Fetching configuration file from https://boinc.bakerlab.org/rosetta/get_project_config.php
Rosetta@home | Fetching scheduler list
Rosetta@home | Master file download succeeded
Rosetta@home | Sending scheduler request: Project initialization.
Rosetta@home | Requesting new tasks for CPU
Rosetta@home | Scheduler request completed: got 1 new tasks
...
Rosetta@home | [http] [ID#1114] Info: TLSv1.2 (OUT), TLS alert, unknown CA (560):
Rosetta@home | [http] [ID#1114] Info: SSL certificate problem: unable to get local issuer certificate
...
Rosetta@home | [http] HTTP error: SSL peer certificate or SSH remote key was not OK
...
Rosetta@home | [file_xfer] http op done; retval -184 (transient HTTP error)
...
Rosetta@home | Temporarily failed download of database_f5ae1de8e1.zip: transient HTTP error
Rosetta@home | Temporarily failed download of LiberationSans-Regular.ttf: transient HTTP error
...
________________ | Project communication failed: attempting access to reference site
...
________________ | Internet access OK - project servers may be temporarily down.

I tried restarting BOINC and the computer, retrying and aborting the downloads, resetting and updating Rosetta, updating from the stable to the alpha release of BOINC, and followed the suggestion at https://boinc.bakerlab.org/rosetta/forum_thread.php?id=6198 to enable net.ipv4.tcp_timestamps and net.ipv4.tcp_window_scaling in /etc/sysctl.conf but nothing so far has affected the issue.

I'm sorry I am not a Linux or networking guru, so I am not sure what I should test next. Any suggestions?
ID: 111971 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 430
Credit: 14,933,398
RAC: 19
Message 111972 - Posted: 28 Jan 2025, 4:30:06 UTC - in response to Message 111971.  

Try following the fix in this message :-

https://boinc.bakerlab.org/rosetta/forum_thread.php?id=6893&postid=111896#111896

So, as you’re not a Linux guru :-

Carl-Alt-T to open a terminal

sudo nano /etc/hosts

Edit to add a new line to the end of the file :-

128.95.160.156 boinc-files.bakerlab.org

Then close up (ctrl-x) and close the terminal (exit).

Job’s a good un.
ID: 111972 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 288
Credit: 540,373
RAC: 0
Message 111974 - Posted: 28 Jan 2025, 15:08:08 UTC - in response to Message 111972.  

I press ctrl+s to save.
ID: 111974 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Random

Send message
Joined: 10 Mar 24
Posts: 8
Credit: 115,388
RAC: 0
Message 111976 - Posted: 29 Jan 2025, 0:58:38 UTC - in response to Message 111974.  

Rosetta was working fine in early Dec, then started it again 2 weeks ago and now I get WU's in my list, status is "downloading" in tasks but never do. In transfers they all show 0 progress, not even 1 %. I abort them and wait until the next restart, more WU's but same thing. I "reset" the project, another day and restart and still the same. Linux 7.18.0. I've read some about bad certificates which seems to me to be a security issue. I'm on Linux for better security, I don't want to chance it if Rosetta is turning into a threat. What's going on?
Thanks
ID: 111976 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 288
Credit: 540,373
RAC: 0
Message 111978 - Posted: 29 Jan 2025, 3:18:17 UTC

There are several download server ips for load balancing.
They have updated one server's certificate but didn't update others.
ID: 111978 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Random

Send message
Joined: 10 Mar 24
Posts: 8
Credit: 115,388
RAC: 0
Message 111979 - Posted: 29 Jan 2025, 3:57:02 UTC - in response to Message 111978.  

A certificate that's not updated seems like a security risk to me. Not what DC needs. When will all be fixed?
ID: 111979 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
sb4TEVvgZ2g6Vq1hGgztbFLwdGF

Send message
Joined: 28 Dec 24
Posts: 2
Credit: 2,793,928
RAC: 2,788
Message 111980 - Posted: 29 Jan 2025, 4:07:49 UTC - in response to Message 111972.  

Edit to add a new line to the end of the file :-
128.95.160.156 boinc-files.bakerlab.org

That fixed it! I had found a mention of a "hosts fix", but I couldn't find what it was. I added "128.95.160.156 boinc-files.bakerlab.org", rebooted, and retried each of the downloads to reset the timer, and now things are working fine.

Thank you so much Bryn Mawr!
ID: 111980 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jean-David Beyer

Send message
Joined: 2 Nov 05
Posts: 221
Credit: 7,572,744
RAC: 0
Message 111981 - Posted: 29 Jan 2025, 4:52:31 UTC - in response to Message 111980.  

That fixed it! I had found a mention of a "hosts fix", but I couldn't find what it was. I added "128.95.160.156 boinc-files.bakerlab.org", rebooted, and retried each of the downloads to reset the timer, and now things are working fine.


I do not believe you need to reboot anything to make this effective. Rebooting would not hurt anything though.
ID: 111981 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 311 · 312 · 313 · 314 · 315 · 316 · 317 . . . 352 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2025 University of Washington
https://www.bakerlab.org