Message boards : Number crunching : Problems with web site
Previous · 1 . . . 14 · 15 · 16 · 17 · 18 · 19 · Next
Author | Message |
---|---|
Greg_BE Send message Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
Hi people. I just checked here in December. Data does not exist. That says to me that they no longer support that function. |
robertmiles Send message Joined: 16 Jun 08 Posts: 1234 Credit: 14,338,560 RAC: 1,227 |
Could you check if all your servers are working? Whenever, I try to upload the output files from completed workunits, this now fails. Whenever I try to report one that was able to upload its output files, this also fails. No new workunits download. No error messages, just the usual delays after the server could not be contacted. |
mraswyp Send message Joined: 26 Feb 07 Posts: 2 Credit: 2,191,258 RAC: 0 |
None of my finished WUs have uploaded for over 24 hours. Is it just me? Says internet access ok, severs may be down. |
Jim_S Send message Joined: 26 Aug 06 Posts: 15 Credit: 497,976 RAC: 0 |
None of my finished WUs have uploaded for over 24 hours. Is it just me? Says internet access ok, severs may be down. I have a Stuck one also. ActCys_Ploop_abinitio_design_y053_010_74678_459_0_0 PEACE |
mraswyp Send message Joined: 26 Feb 07 Posts: 2 Credit: 2,191,258 RAC: 0 |
None of my finished WUs have uploaded for over 24 hours. Is it just me? Says internet access ok, severs may be down. 2/18/2013 4:09:03 AM | rosetta@home | Started upload of P2_1_s6_f5_2_2_abinitio_design_y073_004_74609_429_0_0 2/18/2013 4:09:03 AM | rosetta@home | Started upload of Ross3X3_SAVE_ALL_OUT_t074_010_74652_326_0_0 2/18/2013 4:09:35 AM | rosetta@home | Temporarily failed upload of P2_1_s6_f5_2_2_abinitio_design_y073_004_74609_429_0_0: transient HTTP error 2/18/2013 4:09:35 AM | rosetta@home | Backing off 5 hr 42 min 28 sec on upload of P2_1_s6_f5_2_2_abinitio_design_y073_004_74609_429_0_0 2/18/2013 4:09:35 AM | rosetta@home | Temporarily failed upload of Ross3X3_SAVE_ALL_OUT_t074_010_74652_326_0_0: transient HTTP error 2/18/2013 4:09:35 AM | rosetta@home | Backing off 5 hr 39 min 8 sec on upload of Ross3X3_SAVE_ALL_OUT_t074_010_74652_326_0_0 2/18/2013 4:09:35 AM | rosetta@home | Started upload of rb_02_13_36834_69831__t000__0_D3_SAVE_ALL_OUT_IGNORE_THE_REST_74604_2167_0_0 2/18/2013 4:09:35 AM | rosetta@home | Started upload of ActCys_Ploop_abinitio_design_relax_y036_002_74683_27_0_0 2/18/2013 4:09:48 AM | | Project communication failed: attempting access to reference site 2/18/2013 4:09:50 AM | | Internet access OK - project servers may be temporarily down. 2/18/2013 4:10:06 AM | rosetta@home | Temporarily failed upload of rb_02_13_36834_69831__t000__0_D3_SAVE_ALL_OUT_IGNORE_THE_REST_74604_2167_0_0: transient HTTP error 2/18/2013 4:10:06 AM | rosetta@home | Backing off 4 hr 15 min 27 sec on upload of rb_02_13_36834_69831__t000__0_D3_SAVE_ALL_OUT_IGNORE_THE_REST_74604_2167_0_0 2/18/2013 4:10:08 AM | rosetta@home | Temporarily failed upload of ActCys_Ploop_abinitio_design_relax_y036_002_74683_27_0_0: transient HTTP error 2/18/2013 4:10:08 AM | rosetta@home | Backing off 5 hr 40 min 56 sec on upload of ActCys_Ploop_abinitio_design_relax_y036_002_74683_27_0_0 2/18/2013 4:10:09 AM | | Project communication failed: attempting access to reference site 2/18/2013 4:10:10 AM | | Internet access OK - project servers may be temporarily down. |
mikey Send message Joined: 5 Jan 06 Posts: 1895 Credit: 9,214,786 RAC: 932 |
None of my finished WUs have uploaded for over 24 hours. Is it just me? Says internet access ok, severs may be down. I will try this again, as long as it says this: | Project communication failed: attempting access to reference site | Internet access OK - project servers may be temporarily down. IT IS NOT OUR PC'S, it is the Project that is down!! YES the Project went down the other day, then the weekend came, we KNOW they don't even come in on weekends at Rosetta and YES they will probably fix it this week. If it is easy they should fix it today, if it is a more serious problem, like broken parts or even missing or stolen parts, it could take a bit longer! Set up a backup program with zero percent as the amount you want to give it. Every time Rosie goes down you will crunch units from the other project, until then Rosie will get all your time. |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
we KNOW they don't even come in on weekends at Rosetta What is the source of your information here mikey? Rosetta Moderator: Mod.Sense |
Paul Vleugels Send message Joined: 1 Apr 09 Posts: 4 Credit: 643,821 RAC: 0 |
While running Rosette I've got 5 finished WU's ready to be send to the server, but my BOINC environment is constantly trying to upload the data to the server without any success for 3 days now. When I checked the server status yesterday and today I didn't see any issue, so what can be the reason for not starting the upload towards the server? I'm running v7.2.42 in a Windows 7 environment. |
dmargulis Send message Joined: 30 Apr 07 Posts: 2 Credit: 7,944,617 RAC: 763 |
Same issue here, but I have 43 WU's waiting to upload. The problem seems to have started on the night of 7/28: 7/28/2014 11:50:44 PM | rosetta@home | Computation for task rb_07_21_48261_94947_ab_stage0_h001___robetta_IGNORE_THE_REST_04_06_179542_9_0 finished 7/28/2014 11:50:46 PM | rosetta@home | Started upload of rb_07_21_48261_94947_ab_stage0_h001___robetta_IGNORE_THE_REST_04_06_179542_9_0_0 7/28/2014 11:51:10 PM | rosetta@home | Temporarily failed upload of rb_07_21_48261_94947_ab_stage0_h001___robetta_IGNORE_THE_REST_04_06_179542_9_0_0: connect() failed 7/28/2014 11:51:10 PM | rosetta@home | Backing off 00:03:05 on upload of rb_07_21_48261_94947_ab_stage0_h001___robetta_IGNORE_THE_REST_04_06_179542_9_0_0 7/28/2014 11:51:14 PM | | Project communication failed: attempting access to reference site 7/28/2014 11:51:17 PM | | Internet access OK - project servers may be temporarily down. Any thoughts? While running Rosette I've got 5 finished WU's ready to be send to the server, but my BOINC environment is constantly trying to upload the data to the server without any success for 3 days now. |
robl Send message Joined: 8 Dec 12 Posts: 4 Credit: 10,776,593 RAC: 0 |
Anyone know when this upload problem will be resolved. If the servers are online then it seems like DNS can't resolve the IP addr for the upload server. My same machines are not having an issue with E@H. |
Polian Send message Joined: 21 Sep 05 Posts: 152 Credit: 10,141,266 RAC: 0 |
Anyone know when this upload problem will be resolved. If the servers are online then it seems like DNS can't resolve the IP addr for the upload server. My same machines are not having an issue with E@H. I'm not seeing DNS problems at all. The names are being resolved. The servers aren't responding, likely due to significantly throttled traffic as the Rosetta problems thread and the front page indicates. It will be fixed when it's fixed. I'm sure they don't enjoy their own project not receiving needed data, either. In the mean time, attach to another project or turn the computer off, walk away, and come back later. |
shanen Send message Joined: 16 Apr 14 Posts: 195 Credit: 12,662,308 RAC: 0 |
I think it's the lack of useful news that is the main source of frustration here, and this thread is also lacking therein. The secondary source is idle machines, though right now it is down to one. Most of the other machines were managed to get dribbles of tasks to work on and haven't been too idle... However, in terms of competence of the project managers and system operators, the situation is not impressive, even by my low standards back when I was in graduate school. If I have any personal involvement in BOINC, it would be on the client side, but the server side is mature enough that lots of people are able to operate it pretty reliably. Not so for the University of Washington. By the way, I'm NOT trying to claim any credit for BOINC. If I had any involvement, it was more on the negative side in terms of criticisms of the ancestor s@h project and suggested improvements. Because of my caustic temperament, I don't know or care if I was involved, and I'm certain not to get any recognition. However, I also think it's just a coincidence that BOINC is structured the way it is, because my suggestions were, after all, entirely and intuitively obvious to the most casual observer. So in conclusion, how about a few wild guesses? Maybe something like "We're 35% sure we know what the problem is, and think it is about 55% fixed with a 25% estimate that the system will clear the backlog within 2 days, plus or minus four days. |
Miklos M Send message Joined: 8 Dec 13 Posts: 29 Credit: 5,277,251 RAC: 0 |
I wish we could be told how long this outage will last. Many units waiting to upload here. |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
Let me share what I can. I am not at UW, and so have no first-hand knowledge of this specific situation. But let me offer the following: In the past when such extended issues have arisen, they have extended the task deadlines so you still get the credit you would have if you had been able to upload on time. Typically such network problems are either in the "still a problem" category, or in the "now resolved" category, there really is not often an in between. Now that it has gone for a few days, the project will get slammed when things come back to normal. So even when the network is fully functional, it won't feel like it because most every active machine attached to the project wants the server's attention. It often takes 2 days of full operation before things really start to look normal so far as speed of upload and download, and getting new tasks when requested etc. This is because all the machines are out of work, and also because as they crunch for other projects, they later try to repay the debt to R@h to maintain your resource share. So, hosts that are running multiple projects start to hit R@h more than they normally would, especially in the first couple of days back in operation. So, I typically try to avoid hitting the server when it first comes up. I extend my runtime preference so the tasks I do get will run longer on my machine before being ready to report back (don't go overboard on changes to runtime preference, otherwise BOINC will tend to download too much work). This helps my machine put less strain on the server as it is trying to dig out from the backlog. If you don't wish to mess with such things, rest assured that the BOINC Manager retries are structured in a way to achieve many of the same net results as I do manually. But only if you resist the temptation to hit retry now on all of your uploads when the server comes back up. :) Rosetta Moderator: Mod.Sense |
Keith Jillings Send message Joined: 26 Sep 06 Posts: 7 Credit: 536,631 RAC: 0 |
I wondered if it was just me! The "Server Status" says all is working, but I've got a batch of work units that have been "Uploading" for several days. I get a variety of messages from "Project servers may be temporarily down" to "BOINC can't access internet" (which it can, because it is doing for other projects). |
amgthis Send message Joined: 25 Mar 06 Posts: 81 Credit: 203,879,282 RAC: 0 |
[quote][quote]Let me share what I can. I am not at UW, and so have no first-hand knowledge of this specific situation. But let me offer the following: Thanks for the update on the situation. Now would be a great time to suspend network communicatiion until the project gets a chance to get it's head back up above water... Thanks. /Mike |
BarryAZ Send message Joined: 27 Dec 05 Posts: 153 Credit: 30,843,285 RAC: 0 |
Alternatively, one could suspend processing on Rosetta until the IT folks back at UW address the back channel problem that they may have inadvertently created by making some change they may not be aware they made. One would think the IT folks could figure out what they changed -- but sometimes it is more complicated I suppose. This one has lingered for quite a while though. Perhaps it will get resolved in the coming week or so. [quote][quote]Let me share what I can. I am not at UW, and so have no first-hand knowledge of this specific situation. But let me offer the following: |
Paul Vleugels Send message Joined: 1 Apr 09 Posts: 4 Credit: 643,821 RAC: 0 |
Good to know I'm not the only one who's suffering from the issue I've seen. Will temporarily stall Rosetta calculations until the upload procedure is working again in a more reliable way. While running Rosette I've got 5 finished WU's ready to be send to the server, but my BOINC environment is constantly trying to upload the data to the server without any success for 3 days now. |
Elektra* Send message Joined: 12 Nov 05 Posts: 120 Credit: 493,260 RAC: 0 |
Can't update my user profile as there in no captcha shown and no input field for the required two words available Edit: Seems to be a problem with Chrome browser, with Internet Explorer a small pop-up says that only secure content is displayed, and I have to confirm that all content is shown before the captcha system becomes available. So what have I to do to get Chrome browser eligible for all of my Rosetta duties? Love, Michi |
David E K Volunteer moderator Project administrator Project developer Project scientist Send message Joined: 1 Jul 05 Posts: 1480 Credit: 4,334,829 RAC: 0 |
Can't update my user profile as there in no captcha shown and no input field for the required two words available Thanks for catching this! I identified the issue and fixed it. Regards! |
Message boards :
Number crunching :
Problems with web site
©2024 University of Washington
https://www.bakerlab.org