Posts by David E K

41) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 86759)
Posted 27 Jun 2017 by David E K
Post:
I'll look into this.
42) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 86749)
Posted 26 Jun 2017 by David E K
Post:
I think this issue is fixed now thanks to our sys admin, Luki.

Here are his notes on the fix:


"Monday 6/26 20:12:06 2017 | | [http] HTTP error: Peer certificate cannot be authenticated with given CA certificates

I've seen this problem when imperfect clients don't like the order of certificates provided by the server. Browsers don't seem to care. I rearranged them and the SSL test passes with flying colors, including the certificate chain verification (click the IP address for details): https://www.ssllabs.com/ssltest/analyze.html?d=boinc.bakerlab.org

OpenSSL is happy too"
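
For anyone wondering what "the order of certificates" means in the notes above, here is a minimal sketch, not the actual check Luki ran. It assumes the usual convention that the server sends its own certificate first and that each following certificate is the issuer of the one before it. Certificates are reduced to made-up (subject, issuer) pairs, and the function name and all names in the sample data are hypothetical.

```python
from typing import List, Tuple

# A certificate reduced to (subject, issuer). Real certificates carry far more,
# but these two fields are enough to illustrate chain ordering.
Cert = Tuple[str, str]


def first_misordered(chain: List[Cert]) -> int:
    """Return the index of the first certificate whose issuer is not the
    subject of the certificate that follows it, or -1 if the order is fine."""
    for i in range(len(chain) - 1):
        _, issuer = chain[i]
        next_subject, _ = chain[i + 1]
        if issuer != next_subject:
            return i
    return -1


# Hypothetical chain in the expected order: leaf first, then its issuer.
good = [
    ("boinc.example.org", "Example Intermediate CA"),
    ("Example Intermediate CA", "Example Root CA"),
]

# The same certificates with the intermediate listed before the leaf.
bad = [good[1], good[0]]

print(first_misordered(good))
print(first_misordered(bad))
```

Browsers typically sort out a shuffled chain on their own, which would fit the note that they "don't seem to care", while a stricter client may give up at the first mismatch.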
43) Message boards : News : Outage notice (Message 86748)
Posted 26 Jun 2017 by David E K
Post:
Yep, some of us also ran into that issue. When we had our DNS problem, a short-term fix was to add the IPs to the hosts file, but we forgot to remove them. If you modified your hosts file, you need to remove those entries now for the new site to resolve correctly.
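
If you are not sure whether your hosts file still has a leftover entry, something like the following sketch could help you spot it. It only parses text that you pass in, so you would have to read your own hosts file yourself (its location depends on your operating system). The function name is hypothetical, and the IP address in the sample is a documentation placeholder, not a real project address.

```python
from typing import List, Tuple


def find_overrides(hosts_text: str, hostname: str) -> List[Tuple[int, str]]:
    """Return (line number, IP) for every hosts entry that names hostname."""
    wanted = hostname.lower()
    matches = []
    for line_no, raw in enumerate(hosts_text.splitlines(), start=1):
        # Drop anything after '#', then split into the IP and its host names.
        entry = raw.split("#", 1)[0].split()
        if len(entry) >= 2 and wanted in (name.lower() for name in entry[1:]):
            matches.append((line_no, entry[0]))
    return matches


# Made-up hosts file content; 192.0.2.10 is a placeholder address.
sample = """127.0.0.1 localhost
# temporary workaround during the DNS problem
192.0.2.10 boinc.bakerlab.org   # remove me
"""

print(find_overrides(sample, "boinc.bakerlab.org"))
```

Any line it reports is one to delete (or comment out) so that the name is resolved through DNS again.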
44) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 86738)
Posted 25 Jun 2017 by David E K
Post:
In Boinc Manager, I cannot add Rosetta@home to my new machine with "Add Project".

Environment:
macOS 10.12.5
Boinc 7.6.33

Steps to reproduce:
1. In Boinc Manager, choose "Add Project" menu
2. Choose Rosetta@home and click Next
3. Click on "I already have my account" radio button
4. Enter my Email address and password, and click Next
5. An error message is displayed, and the following lines appear in the Event Log:
Fetching configuration file from http://boinc.bakerlab.org/rosetta/get_project_config.php	
Project communication failed: attempting access to reference site	
Internet access OK - project servers may be temporarily down.	


Thank you.


I'm trying to figure out what is causing this. Does anyone have any suggestions? I contacted David Anderson for some help too. Thanks.
45) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 86720)
Posted 24 Jun 2017 by David E K
Post:
Website looks great, thanks for the upgrade.

https://boinc.bakerlab.org/rosetta/stats/

The stats URLs are empty. You guys might already be on it.


I'm fixing this way up high in the sky on a Boeing 737 :) The export is running now.
46) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 86717)
Posted 24 Jun 2017 by David E K
Post:
Willy from BoincStats is reporting that your "Stats exports directory is empty", hence we're not getting any credit for the results we've returned in the past couple of days...


I'll look into this later today.
47) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 86711)
Posted 23 Jun 2017 by David E K
Post:
Please,
- insert links "Home | Join | About | Participants | Community | Statistics" in the footer of forum, like old site
- insert "last modified" to know updates of forum

Do you plan to update the Ralph@Home server?



I'll add these links soon.

We do plan on updating Ralph but probably later in July or Aug since I'll be on vacation soon.
48) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 86710)
Posted 23 Jun 2017 by David E K
Post:
done!
49) Message boards : Number crunching : Piss poor Android version download (Message 81613)
Posted 20 Jun 2017 by David E K
Post:
I'm not sure if it's related, but we have been having issues with the UW network, and they recently sent out this advisory notice:

Outage 2017-06-20 01:46:00 PDT - 2017-06-20 03:06:00 PDT

Summary:

Update: Issues with a UW campus security device have been resolved: campus Internet and internal traffic is experiencing normal response times.

The issues with the McAfee/Intel Intrusion Protection device continued and Network Operations staff have placed the devices into bypass mode to restore network stability.

-Daniel L

Due to an issue with a UW campus security device, campus Internet and some internal traffic is experiencing high response times and interruptions. UW-IT Security Engineers are being engaged to investigate the issue.

If the issues with the McAfee/Intel Intrusion Protection device continue Network Operations staff will place the devices into bypass mode to restore network stability.

-Daniel L
UW-IT NOC


Affected Item(s):

UW : 2017-06-20 01:46:00 PDT - 2017-06-20 03:06:00 PDT
uwcr-ads-1 : 2017-06-20 01:46:00 PDT - 2017-06-20 03:06:00 PDT


Ref: INC0571620
50) Message boards : Number crunching : Access Violation Errors ( Computation error) (Message 81611)
Posted 20 Jun 2017 by David E K
Post:
This may be some instability related to a specific job/protocol. I'll contact the researcher whose jobs these are. There have been other recent issues brought up with these jobs.

Hi, just want to confirm whether or not this was a hardware issue:
Task: 922044334

Much appreciated.



These tasks have been having some random issues. The researcher was informed but I think there are still some lingering in our queue.
51) Message boards : Number crunching : Piss poor Android version download (Message 81607)
Posted 19 Jun 2017 by David E K
Post:
I was told the UW network was unstable last week, so that might also have been the cause.
52) Message boards : Number crunching : Server update/upgrade (Message 81597)
Posted 15 Jun 2017 by David E K
Post:
dekim

at

uw

dot

edu
53) Message boards : Number crunching : Server update/upgrade (Message 81585)
Posted 12 Jun 2017 by David E K
Post:
Would anyone like to check out the new site and provide some feedback before it goes live? If so, please send me an email and I'll respond with the temporary url.
54) Message boards : Number crunching : Access Violation Errors ( Computation error) (Message 81568)
Posted 9 Jun 2017 by David E K
Post:
This may be some instability related to a specific job/protocol. I'll contact the researcher whose jobs these are. There have been other recent issues brought up with these jobs.
55) Message boards : Number crunching : Server update/upgrade (Message 81561)
Posted 8 Jun 2017 by David E K
Post:
I've been pretty busy working on the new server and trying to update the Rosetta application. There have been a lot of changes to the Rosetta code that need to be addressed for the various platforms, mainly Windows. We hope to do the transition this month. The current server is operating as usual without any issues.
56) Message boards : Rosetta@home Science : MAJOR UPDATES to the Rosetta@home website (Message 81558)
Posted 8 Jun 2017 by David E K
Post:
It's coming along. We plan to release it sometime this month.
57) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 81545)
Posted 30 May 2017 by David E K
Post:
The Server Status page is a sea of red: clearly there are problems of some sort.


Still.... :-(



Sorry for the errors in the status page. I'll take a look. Everything is running as normal so you can ignore the page for now.
58) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 81532)
Posted 14 May 2017 by David E K
Post:
Hey,

I've received some work units which couldn't be finished due to a compute error. Interestingly, the second person calculating the same WU also got a compute error:
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=825566938
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=825557307
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=825537629 (still pending)

Is this common behaviour? What has happened there?

Best regards



This was a bad batch that a researcher accidentally sent out.
59) Message boards : Number crunching : Question for Researchers about waiting for results (Message 81504)
Posted 20 Apr 2017 by David E K
Post:
"how the heck do you manage to iterate in your experiments efficiently?"

We typically submit large batches of jobs per iteration. When we are satisfied with the results, we cancel jobs that are still queued, but jobs that are already on clients continue to run. Short turnaround times and machines that are continually crunching and networked would obviously make this more efficient.

"how do you ensure that you don't spend a whole two weeks waiting for a run to complete only to find out that there was a typo in the input sequences somewhere?"

We try to be careful :) and we almost never have to manually type sequences.

"what do you do while waiting for jobs to finish?"

There's always stuff to do. Depending on the researcher, one can prepare more jobs, analyze data, develop new methods, write, refactor, and debug code, think of and do other experiments (computational and/or wet lab), write papers, go to meetings, respond to forum posts, etc etc etc.....
60) Message boards : Number crunching : Stuck on uploading is a new problem? (Message 81500)
Posted 19 Apr 2017 by David E K
Post:
Our systems engineers have been working hard to figure out what was causing this odd behavior. It turned out to be out of their control: everything on our end (servers, local network, etc.) looked OK in their diagnostics and analysis. They contacted the right department, University of Washington IT, to figure out the issue and get it resolved for now.

Kudos to Luki, Darwin, Keith, and Patrick!





©2024 University of Washington
https://www.bakerlab.org