Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 227 · 228 · 229 · 230 · 231 · 232 · 233 . . . 276 · Next

AuthorMessage
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5662
Credit: 5,703,329
RAC: 2,182
Message 106885 - Posted: 9 Sep 2022, 12:40:20 UTC - in response to Message 106874.  

Just FYI...if you have been a WCG cruncher and have not checked recently, they are back online with tasks.
Cancer and Pandemics are active.



I guess not...that was just test stuff.
Gees.
They've been going for quite a while now, but the tasks are sporadic, some are issued quite a few times a day, but not enough for everyone. Also, when you try to download the tasks, the downloads get stuck.

So I set up a tickle function, I got Boinc to ask for work every 5 minutes. Got all my machines running 24/7 on their tasks now. 110 CPU cores. There were 12 GPUS and 2 phones, but they're not giving those out anymore.



I think that is another project I give up on.
I was getting some work and then the downloads stalled and would not restart.
Now I am getting can not connect to project errors from Boinc and the website is dead with some sort of bug that the web says is critical and cannot connect. The forums were old, can not be reached now. Twitter has not been used in ages, the FB page is the same and cannot be messaged. I guess they don't want to be reached.
They used to be good, now they are useless.
ID: 106885 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 374
Credit: 10,709,223
RAC: 5,616
Message 106886 - Posted: 9 Sep 2022, 18:20:08 UTC - in response to Message 106885.  

WCG is down due to an expired certificate, they are now aware of this and working on it :-)
ID: 106886 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jean-David Beyer

Send message
Joined: 2 Nov 05
Posts: 172
Credit: 5,658,458
RAC: 3,363
Message 106887 - Posted: 9 Sep 2022, 19:00:34 UTC - in response to Message 106886.  

WCG is down due to an expired certificate, they are now aware of this and working on it :-)


That is a project run by one or two inexperienced trainees in their spare time. A full time experienced networking profesional would realize the certificate needed to be renewed, usually annually. It would be on his calender.
ID: 106887 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 374
Credit: 10,709,223
RAC: 5,616
Message 106889 - Posted: 10 Sep 2022, 0:24:05 UTC - in response to Message 106887.  

WCG is down due to an expired certificate, they are now aware of this and working on it :-)


That is a project run by one or two inexperienced trainees in their spare time. A full time experienced networking profesional would realize the certificate needed to be renewed, usually annually. It would be on his calender.


Well it certainly isn’t the first project to have this problem - I’d say it’s the third in the past couple of years.
ID: 106889 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 9,715,763
RAC: 8,614
Message 106890 - Posted: 10 Sep 2022, 1:10:31 UTC - in response to Message 106886.  

WCG is down due to an expired certificate, they are now aware of this and working on it :-)
I was told "2 hours" at 5pm UTC, it's now 8 hours after that and it's still not working. Incompetent lightweights.... I wish I lived in Canada, I'd pop over and sort it.
ID: 106890 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 9,715,763
RAC: 8,614
Message 106891 - Posted: 10 Sep 2022, 1:11:41 UTC - in response to Message 106889.  

WCG is down due to an expired certificate, they are now aware of this and working on it :-)


That is a project run by one or two inexperienced trainees in their spare time. A full time experienced networking profesional would realize the certificate needed to be renewed, usually annually. It would be on his calender.


Well it certainly isn’t the first project to have this problem - I’d say it’s the third in the past couple of years.
The whole of Boinc seems to be susceptible to this. It might be Boinc's fault and not WCG. In which case we all need yet another Boinc Manager update. Do we really need all this OCD security bullshit?
ID: 106891 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 9,715,763
RAC: 8,614
Message 106892 - Posted: 10 Sep 2022, 1:16:21 UTC
Last modified: 10 Sep 2022, 1:17:20 UTC

I just tried their website, and most browsers threw a girly hissy fit. Firefox however let me ignore the "risk of almost certain death" and I got this:

Proxy Error
The proxy server could not handle the request GET /.
Reason: Error during SSL Handshake with remote server

Does that mean their servers won't even talk to each other?
ID: 106892 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jean-David Beyer

Send message
Joined: 2 Nov 05
Posts: 172
Credit: 5,658,458
RAC: 3,363
Message 106893 - Posted: 10 Sep 2022, 5:38:18 UTC - in response to Message 106891.  

The whole of Boinc seems to be susceptible to this. It might be Boinc's fault and not WCG. In which case we all need yet another Boinc Manager update. Do we really need all this OCD security bullshit?


If this were a Boinc or Boinc Manager problem, would I not be experiencing this problem with my other Boinc projects as well? But I am not.

Furthermore, if this were just a Boinc problem, I would not be getting this

Down for Everyone or Just Me

Is Worldcommunitygrid.org down?
Checking if worldcommunitygrid.org is down or it is just you...
It's not just you! worldcommunitygrid.org is down.

ID: 106893 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 9,715,763
RAC: 8,614
Message 106894 - Posted: 10 Sep 2022, 5:45:44 UTC - in response to Message 106893.  

It happens with many Boinc projects, but one at a time. For some reason Boinc uses a huge list of certificates, which expire at different times.

I've complained to Opera I can't get onto their website, since Firefox lets me ignore the danger of death sign.

There is no problem anywhere, everything physically works, but just because something is one nanosecond out of date apparently we can't use it. Do you throw food out because it's only just past the use by date?
ID: 106894 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 228
Credit: 316,044
RAC: 1,488
Message 106896 - Posted: 10 Sep 2022, 9:07:19 UTC - in response to Message 106894.  

ID: 106896 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 9,715,763
RAC: 8,614
Message 106897 - Posted: 10 Sep 2022, 11:28:51 UTC - in response to Message 106896.  

HSTS developers apparently think so.
https://en.wikipedia.org/wiki/HTTP_Strict_Transport_Security
Except everything worked just fine back when we typed http. Just like our cars were fine before ABS and airbags.
ID: 106897 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 228
Credit: 316,044
RAC: 1,488
Message 106898 - Posted: 10 Sep 2022, 11:33:06 UTC - in response to Message 106897.  

It all happens because of Snowden.
ID: 106898 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 9,715,763
RAC: 8,614
Message 106899 - Posted: 10 Sep 2022, 11:47:38 UTC - in response to Message 106898.  

It all happens because of Snowden.

You mean keeping the government out of our lives? I can appreciate that. Amazing what they can't see through a VPN....
ID: 106899 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 9,715,763
RAC: 8,614
Message 106900 - Posted: 10 Sep 2022, 12:31:40 UTC

I wonder what the bottleneck is at WCG? I've noticed if I'm downloading tasks I've just been issued, they're downloaded much more easily than ones I'm retrying. This suggests to me a disk bottleneck, and the recent tasks are still in the cache. Universe benefitted immensely from going to SSD, perhaps WCG is still on the old rust spinners?
ID: 106900 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5662
Credit: 5,703,329
RAC: 2,182
Message 106905 - Posted: 11 Sep 2022, 23:02:36 UTC

In the middle of trying to reinstall it and get new work while updating other projects I get a transient error. So I manually start the download again for the 3 file segments not loaded and they download ok.

There is something funky going on with their web servers.
Stalls, transient errors, etc.
ID: 106905 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 9,715,763
RAC: 8,614
Message 106907 - Posted: 12 Sep 2022, 2:52:29 UTC - in response to Message 106905.  

In the middle of trying to reinstall it and get new work while updating other projects I get a transient error. So I manually start the download again for the 3 file segments not loaded and they download ok.

There is something funky going on with their web servers.
Stalls, transient errors, etc.
I seem to be getting everything the first time today, yesterday i got everything in the second try, before that it was several tries. It's improving.
ID: 106907 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5662
Credit: 5,703,329
RAC: 2,182
Message 106909 - Posted: 12 Sep 2022, 11:46:16 UTC - in response to Message 106907.  

Looks like they finally got the bugs out.
Whats going on here in RAH?
Are they still auto kicking people because of errors, either by the project or by the user?
ID: 106909 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 9,715,763
RAC: 8,614
Message 106910 - Posted: 12 Sep 2022, 12:12:23 UTC - in response to Message 106909.  

Looks like they finally got the bugs out.
Whats going on here in RAH?
Are they still auto kicking people because of errors, either by the project or by the user?
Probably, but Python is finally running out.
ID: 106910 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5662
Credit: 5,703,329
RAC: 2,182
Message 106913 - Posted: 12 Sep 2022, 19:52:42 UTC - in response to Message 106910.  
Last modified: 12 Sep 2022, 20:05:07 UTC

Looks like they finally got the bugs out.
Whats going on here in RAH?
Are they still auto kicking people because of errors, either by the project or by the user?
Probably, but Python is finally running out.



oh? then what?
ID: 106913 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 9,715,763
RAC: 8,614
Message 106915 - Posted: 13 Sep 2022, 3:00:26 UTC - in response to Message 106913.  

Looks like they finally got the bugs out.
Whats going on here in RAH?
Are they still auto kicking people because of errors, either by the project or by the user?
Probably, but Python is finally running out.



oh? then what?
The end of the world as we know it. You could always try the new 30GB per 4 cores multithreaded climate prediction tasks that are on their way over the next month. I may need to upgrade some machines!
ID: 106915 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 227 · 228 · 229 · 230 · 231 · 232 · 233 . . . 276 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org