Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 29 · 30 · 31 · 32 · 33 · 34 · 35 . . . 55 · Next

AuthorMessage
belskiy

Send message
Joined: 2 Dec 13
Posts: 2
Credit: 2,411,125
RAC: 0
Message 77083 - Posted: 29 Jul 2014, 13:54:05 UTC

The same problem.
29.07.2014 17:26:36 | rosetta@home | update requested by user
29.07.2014 17:26:40 | rosetta@home | Sending scheduler request: Requested by user.
29.07.2014 17:26:40 | rosetta@home | Not requesting tasks: "no new tasks" requested via Manager
29.07.2014 17:27:02 | rosetta@home | Scheduler request failed: Couldn't connect to server
29.07.2014 17:27:06 | | Project communication failed: attempting access to reference site
29.07.2014 17:27:07 | | Internet access OK - project servers may be temporarily down.

ID: 77083 · Rating: 0 · rate: Rate + / Rate - Report as offensive
deerslayer_TA

Send message
Joined: 19 Feb 07
Posts: 3
Credit: 1,781,271
RAC: 0
Message 77084 - Posted: 29 Jul 2014, 14:42:05 UTC

Neither of my rigs can upload/download either.
ID: 77084 · Rating: 0 · rate: Rate + / Rate - Report as offensive
TJ

Send message
Joined: 29 Mar 09
Posts: 127
Credit: 4,799,890
RAC: 0
Message 77085 - Posted: 29 Jul 2014, 14:45:56 UTC

My rigs can't upload/download either. This project is becoming more and more frustrated. For weeks there where many errors in the WU's and now we can't upload and download for almost 24 hours,, as it started yesterday.

And do we get any answers, no off course not as usual.

I am considering another biology/medicine project to crunch for.
Greetings,
TJ.
ID: 77085 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Daniel Kohn

Send message
Joined: 30 Dec 05
Posts: 18
Credit: 2,899,939
RAC: 0
Message 77086 - Posted: 29 Jul 2014, 14:54:59 UTC
Last modified: 29 Jul 2014, 15:14:27 UTC

I cannot upload completed work either, starting this morning.
I run Windows 7 and ESET Smart Security - never had any issue until today and no software changed.
ID: 77086 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5664
Credit: 5,711,666
RAC: 1,135
Message 77087 - Posted: 29 Jul 2014, 15:16:08 UTC

Same old RAH problems as before....oh well..bad for them..good for other projects.
Oh yeah, forgot...same old communications problems as before. NONE!
ID: 77087 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Profile {CurlY BracketS}

Send message
Joined: 25 Sep 05
Posts: 1
Credit: 94,317
RAC: 0
Message 77088 - Posted: 29 Jul 2014, 15:46:16 UTC - in response to Message 77087.  

Hey Rosetta, are you mad? I got 12 crunchies waiting to upload. You claim your servers are up and running. You don't want me to do any work for you?

ID: 77088 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Profile The_Saint_(LDS)

Send message
Joined: 12 Aug 10
Posts: 6
Credit: 10,076,132
RAC: 0
Message 77089 - Posted: 29 Jul 2014, 15:48:49 UTC

Same here...Win XP, 7, 8 and 8.1...all were running fine until sometime yesterday, then no uploads and very, very few downloads (I was adding a new machine and was only able to get 4 units downloaded in the last 15 hrs). Machines are at different locations, different networks and ISPs, all running most recent software and were running fine until yesterday.

Not sure where the issue is, but it is not on my end, sounds like it is a common issue for multiple users and is very frustrating.
ID: 77089 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Killersocke@rosetta

Send message
Joined: 13 Nov 06
Posts: 29
Credit: 2,579,125
RAC: 0
Message 77090 - Posted: 29 Jul 2014, 16:26:11 UTC


Thats the Point why i stopped complete Rosetta
and kicked out from my BAM and Boinc Manager.

Regards



ID: 77090 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Polian
Avatar

Send message
Joined: 21 Sep 05
Posts: 152
Credit: 10,141,266
RAC: 0
Message 77092 - Posted: 29 Jul 2014, 16:45:38 UTC

They're just getting into work on the west coast, I'm sure it will be fixed soon. Not like they have 24x7 support and a NOC monitoring everything, sheesh
ID: 77092 · Rating: 0 · rate: Rate + / Rate - Report as offensive
krypton
Volunteer moderator
Project developer
Project scientist

Send message
Joined: 16 Nov 11
Posts: 108
Credit: 2,164,309
RAC: 0
Message 77093 - Posted: 29 Jul 2014, 16:53:18 UTC

Thanks for the reports!! We are looking at this now.
ID: 77093 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Profile Timo
Avatar

Send message
Joined: 9 Jan 12
Posts: 185
Credit: 45,644,940
RAC: 157
Message 77097 - Posted: 29 Jul 2014, 19:33:32 UTC
Last modified: 29 Jul 2014, 19:36:27 UTC

Some people here are incredibly immature and demanding - I'm talking to those who are stomping their feet and threatening to leave the project just 12 hours into what I'll call the work download/upload outage. Grow up people.

Sure it sucks when stuff breaks or stops working, but that's the nature of any complex system. BOINC and most boinc projects (R@H included) are very complex systems.

My computer cycles will continue to crunch for Rosetta, good luck on getting things fixed :).
ID: 77097 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Trotador

Send message
Joined: 30 May 09
Posts: 108
Credit: 275,511,655
RAC: 192,699
Message 77098 - Posted: 29 Jul 2014, 19:47:51 UTC - in response to Message 77097.  

Some people here are incredibly immature and demanding - I'm talking to those who are stomping their feet and threatening to leave the project just 12 hours into what I'll call the work download/upload outage. Grow up people.

Sure it sucks when stuff breaks or stops working, but that's the nature of any complex system. BOINC and most boinc projects (R@H included) are very complex systems.

My computer cycles will continue to crunch for Rosetta, good luck on getting things fixed :).


+1
ID: 77098 · Rating: 0 · rate: Rate + / Rate - Report as offensive
biodoc

Send message
Joined: 19 Feb 06
Posts: 14
Credit: 30,298,003
RAC: 1,986
Message 77103 - Posted: 29 Jul 2014, 21:01:47 UTC

Similar problems with my 4 computers:

7/29/2014 4:56:43 PM | rosetta@home | Started upload of tube9_26_A_tube9_26_B_patchdock_split_03_140727_SAVE_ALL_OUT__179886_89_0_0
7/29/2014 4:57:01 PM | rosetta@home | Temporarily failed upload of hc_centroids_1tsf_234_0.5_06-01-14_SAVE_ALL_OUT_168123_4458_0_0: transient HTTP error
7/29/2014 4:57:01 PM | rosetta@home | Backing off 00:06:23 on upload of hc_centroids_1tsf_234_0.5_06-01-14_SAVE_ALL_OUT_168123_4458_0_0
7/29/2014 4:57:04 PM | rosetta@home | Temporarily failed upload of tube9_26_A_tube9_26_B_patchdock_split_03_140727_SAVE_ALL_OUT__179886_89_0_0: connect() failed
7/29/2014 4:57:04 PM | rosetta@home | Backing off 00:05:20 on upload of tube9_26_A_tube9_26_B_patchdock_split_03_140727_SAVE_ALL_OUT__179886_89_0_0

ID: 77103 · Rating: 0 · rate: Rate + / Rate - Report as offensive
amgthis

Send message
Joined: 25 Mar 06
Posts: 81
Credit: 203,879,282
RAC: 0
Message 77104 - Posted: 29 Jul 2014, 22:12:36 UTC

I'm having a ton of problems in the last couple of weeks that I *thought* appeared to be DNS lookup related, but I've come to find that the only application with the problem is Rosetta. Reading here about server problems gives me a little bit of relief in thinking it's not all on my end, or a telco problem. At least there is some clue to server status with the project posted here. Thanks for the efforts.

/Mike

ID: 77104 · Rating: 0 · rate: Rate + / Rate - Report as offensive
keputnam

Send message
Joined: 18 Sep 05
Posts: 24
Credit: 2,084,465
RAC: 0
Message 77105 - Posted: 29 Jul 2014, 22:18:07 UTC
Last modified: 29 Jul 2014, 22:18:53 UTC

@Timo

"Some people here are incredibly immature and demanding - I'm talking to those who are stomping their feet and threatening to leave the project just 12 hours into what I'll call the work download/upload outage. Grow up people."

The problem is, it is NOT 12 hours into this, it is more like five days

Which is PLENTY Of time to at least acknowledge that there IS a problem
ID: 77105 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Murasaki
Avatar

Send message
Joined: 20 Apr 06
Posts: 303
Credit: 511,418
RAC: 0
Message 77106 - Posted: 29 Jul 2014, 22:36:59 UTC - in response to Message 77105.  
Last modified: 29 Jul 2014, 22:40:12 UTC

The problem is, it is NOT 12 hours into this, it is more like five days

Which is PLENTY Of time to at least acknowledge that there IS a problem


Except that it was only affecting a few individuals prior to today and the isolated error reports made on this forum suggested it was problems with the host computers or ISPs. It is only within the last day that the issue has escalated.

If you have been experiencing problems for the last 5 days, why was your last post 110 days ago? As the project team isn't psychic it is difficult for them to spot the early signs of a failure occuring when few people bother to post.

It is up to you whether you post or not, but it is a bit rich to then complain that they haven't fixed a problem you didn't report.
ID: 77106 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5664
Credit: 5,711,666
RAC: 1,135
Message 77107 - Posted: 29 Jul 2014, 22:52:42 UTC

There is no direct way to report a problem and we do not know if anyone from the project is reading or not. We also think that they have some sort of monitoring system that alerts them if something goes wrong, but I guess that is incorrect when it comes to technical things like this. Plus this has not just been a few hours, this has been in my case a full day or longer now.

I am a veteran RAH cruncher and I have seen lot of outages come and go and more often than not there is no acknowledgement from the team that there is something wrong. Yes the science side is very informative, but the IT side has never been that great in communication. Obviously this is deeper than what the server status page shows. I find it interesting that another project I do work for has quicker and better response time from someone IT related than this project does. That is what our issue is here. Other projects just as large or small as RAH have faster response times and better communications than here. All we want is better communication from the project.
ID: 77107 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Profile shanen
Avatar

Send message
Joined: 16 Apr 14
Posts: 195
Credit: 12,662,308
RAC: 0
Message 77108 - Posted: 29 Jul 2014, 22:59:55 UTC

Well, I sort of like their low-key attitude, but the poor communication encourages wild rumors. So have you heard the one about the NSA trainee who was assigned to hack Rosetta@home? He completely bollixed up the system, but at least there's "no real harm" while he learns from his mistake. He just bungled the installation of some large number of fake work units that were going to use our computers cameras to capture incriminating naked photos of our wives and girlfriends. Everyone at the NSA is sure he'll do much better on his next assignment. However, if this rumor catches on, then his next assignment may be cleaning the toilets for a few years.
ID: 77108 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Murasaki
Avatar

Send message
Joined: 20 Apr 06
Posts: 303
Credit: 511,418
RAC: 0
Message 77110 - Posted: 29 Jul 2014, 23:07:56 UTC - in response to Message 77107.  

There is no direct way to report a problem and we do not know if anyone from the project is reading or not. We also think that they have some sort of monitoring system that alerts them if something goes wrong, but I guess that is incorrect when it comes to technical things like this. Plus this has not just been a few hours, this has been in my case a full day or longer now.


Erm...

krypton wrote:
Thanks for the reports!! We are looking at this now.
ID: 77110 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1225
Credit: 13,895,098
RAC: 3,819
Message 77112 - Posted: 29 Jul 2014, 23:37:20 UTC

I've been getting errors like these for several hours now:

7/29/2014 4:43:13 PM | rosetta@home | Sending scheduler request: To fetch work.
7/29/2014 4:43:13 PM | rosetta@home | Requesting new tasks for CPU
7/29/2014 4:43:36 PM | rosetta@home | Scheduler request failed: Couldn't connect to server

I'm connected to several other BOINC projects though, so only Rosetta@Home is getting less work from my computers due to this. No Rosetta@Home workunits downloaded for several hours.

Note on something relevant to a few earlier posts in this thread: If you are running 32-bit workunits under a 64-bit version of Windows Vista, don't expect the total size of the workunits to come very close to the total amount of main memory you have. 64-bit Windows Vista needs to use an amount of main memory roughly equal to the amount used by the workunits to hold the SYSWOW64 modules needed to run their 32-bit applications. Therefore if all your workunits use 32-bit applications, their total size is limited to about half of your main memory. 64-bit Windows 7 needs much less memory for its SYSWOW64 modules, though.
ID: 77112 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Previous · 1 . . . 29 · 30 · 31 · 32 · 33 · 34 · 35 . . . 55 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org