Problems with web site

Profile robertmiles

Joined: 16 Jun 08
Posts: 689
Credit: 9,663,307
RAC: 4,842
Message 71092 - Posted: 18 Aug 2011, 18:00:08 UTC

How about adding a link to a webpage explaining the different queues to each of the server status reports?
ID: 71092 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator
Project administrator

Joined: 22 Aug 06
Posts: 3550
Credit: 0
RAC: 0
Message 71106 - Posted: 19 Aug 2011, 23:32:48 UTC

Right, the jobs in the queue are not "ready to send" yet, and the tasks that are "ready to send" have to move to the feeder's internal memory (generally about 100 tasks) before a given task can be sent in a scheduler reply.
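
As an illustration of the flow described above, here is a minimal Python sketch. It is not BOINC's actual server code: the function names and the `FEEDER_SLOTS` constant are hypothetical, and the only figure taken from the post is the rough size of about 100 tasks.

```python
from collections import deque

# Assumed cache size, based only on "generally about 100 tasks" in the post.
FEEDER_SLOTS = 100


def refill_feeder(ready_to_send, feeder_cache, slots=FEEDER_SLOTS):
    """Move tasks from the ready-to-send queue into free feeder slots."""
    moved = 0
    while ready_to_send and len(feeder_cache) < slots:
        feeder_cache.append(ready_to_send.popleft())
        moved += 1
    return moved


def scheduler_reply(feeder_cache, requested):
    """Hand out up to `requested` tasks, taken only from the feeder cache."""
    sent = []
    while feeder_cache and len(sent) < requested:
        sent.append(feeder_cache.popleft())
    return sent


# 250 tasks are "ready to send", but only those in the cache can be sent.
ready = deque(f"task_{i}" for i in range(250))
cache = deque()
refill_feeder(ready, cache)
reply = scheduler_reply(cache, 5)
```

The point of the sketch is that a large "ready to send" count on the status page does not mean every one of those tasks can be handed out immediately; a task has to reach the small cache first.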
Rosetta Moderator: Mod.Sense
ID: 71106 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jesse Viviano

Joined: 14 Jan 10
Posts: 41
Credit: 1,501,581
RAC: 3
Message 71334 - Posted: 26 Sep 2011, 15:52:31 UTC

Here are two links that seem to get an error. When I click the "Results" link, which is between "Your account" and "Teams" on the front page, or when I click the "View" link to the right of "Reports and plots of results from active work units" on the Your account page, I get this error: "Request Error: Sorry, the data requested does not exist." Could this error be fixed? Thank you.
ID: 71334 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
P . P . L .

Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 71338 - Posted: 27 Sep 2011, 8:19:09 UTC

Hi people.

Any chance of getting this page fixed?

Reports and plots of results from active work units.

Been getting this for a while now.

Sorry, the data requested does not exist.

ID: 71338 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Darren LaKous

Joined: 13 Mar 10
Posts: 2
Credit: 196,031
RAC: 0
Message 71412 - Posted: 15 Oct 2011, 13:48:40 UTC - in response to Message 71338.  

Hi people.

Any chance of getting this page fixed?

Reports and plots of results from active work units.

Been getting this for a while now.

Sorry, the data requested does not exist.


Bump. I've been getting this for months.
ID: 71412 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Joined: 30 May 06
Posts: 4871
Credit: 3,716,670
RAC: 1,179
Message 71812 - Posted: 16 Dec 2011, 16:44:22 UTC - in response to Message 71412.  

Hi people.

Any chance of getting this page fixed?

Reports and plots of results from active work units.

Been getting this for a while now.

Sorry, the data requested does not exist.


Bump. I've been getting this for months.



I just checked here in December.
Data does not exist.
That says to me that they no longer support that function.
ID: 71812 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Joined: 16 Jun 08
Posts: 689
Credit: 9,663,307
RAC: 4,842
Message 75108 - Posted: 17 Feb 2013, 21:43:52 UTC

Could you check if all your servers are working? Whenever I try to upload the output files from completed workunits, the upload now fails. Whenever I try to report one that was able to upload its output files, this also fails. No new workunits download. No error messages, just the usual delays after the server could not be contacted.
ID: 75108 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile mraswyp

Joined: 26 Feb 07
Posts: 2
Credit: 2,191,258
RAC: 0
Message 75116 - Posted: 18 Feb 2013, 8:50:58 UTC

None of my finished WUs have uploaded for over 24 hours. Is it just me? It says internet access is OK, servers may be down.
ID: 75116 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Jim_S
Joined: 26 Aug 06
Posts: 15
Credit: 497,976
RAC: 0
Message 75117 - Posted: 18 Feb 2013, 10:03:06 UTC - in response to Message 75116.  

None of my finished WUs have uploaded for over 24 hours. Is it just me? It says internet access is OK, servers may be down.


I have a Stuck one also. ActCys_Ploop_abinitio_design_y053_010_74678_459_0_0


PEACE
ID: 75117 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile mraswyp

Joined: 26 Feb 07
Posts: 2
Credit: 2,191,258
RAC: 0
Message 75118 - Posted: 18 Feb 2013, 10:17:16 UTC - in response to Message 75117.  

None of my finished WUs have uploaded for over 24 hours. Is it just me? It says internet access is OK, servers may be down.


I have a Stuck one also. ActCys_Ploop_abinitio_design_y053_010_74678_459_0_0



2/18/2013 4:09:03 AM | rosetta@home | Started upload of P2_1_s6_f5_2_2_abinitio_design_y073_004_74609_429_0_0
2/18/2013 4:09:03 AM | rosetta@home | Started upload of Ross3X3_SAVE_ALL_OUT_t074_010_74652_326_0_0
2/18/2013 4:09:35 AM | rosetta@home | Temporarily failed upload of P2_1_s6_f5_2_2_abinitio_design_y073_004_74609_429_0_0: transient HTTP error
2/18/2013 4:09:35 AM | rosetta@home | Backing off 5 hr 42 min 28 sec on upload of P2_1_s6_f5_2_2_abinitio_design_y073_004_74609_429_0_0
2/18/2013 4:09:35 AM | rosetta@home | Temporarily failed upload of Ross3X3_SAVE_ALL_OUT_t074_010_74652_326_0_0: transient HTTP error
2/18/2013 4:09:35 AM | rosetta@home | Backing off 5 hr 39 min 8 sec on upload of Ross3X3_SAVE_ALL_OUT_t074_010_74652_326_0_0
2/18/2013 4:09:35 AM | rosetta@home | Started upload of rb_02_13_36834_69831__t000__0_D3_SAVE_ALL_OUT_IGNORE_THE_REST_74604_2167_0_0
2/18/2013 4:09:35 AM | rosetta@home | Started upload of ActCys_Ploop_abinitio_design_relax_y036_002_74683_27_0_0
2/18/2013 4:09:48 AM | | Project communication failed: attempting access to reference site
2/18/2013 4:09:50 AM | | Internet access OK - project servers may be temporarily down.
2/18/2013 4:10:06 AM | rosetta@home | Temporarily failed upload of rb_02_13_36834_69831__t000__0_D3_SAVE_ALL_OUT_IGNORE_THE_REST_74604_2167_0_0: transient HTTP error
2/18/2013 4:10:06 AM | rosetta@home | Backing off 4 hr 15 min 27 sec on upload of rb_02_13_36834_69831__t000__0_D3_SAVE_ALL_OUT_IGNORE_THE_REST_74604_2167_0_0
2/18/2013 4:10:08 AM | rosetta@home | Temporarily failed upload of ActCys_Ploop_abinitio_design_relax_y036_002_74683_27_0_0: transient HTTP error
2/18/2013 4:10:08 AM | rosetta@home | Backing off 5 hr 40 min 56 sec on upload of ActCys_Ploop_abinitio_design_relax_y036_002_74683_27_0_0
2/18/2013 4:10:09 AM | | Project communication failed: attempting access to reference site
2/18/2013 4:10:10 AM | | Internet access OK - project servers may be temporarily down.
ID: 75118 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mikey
Joined: 5 Jan 06
Posts: 1449
Credit: 5,776,642
RAC: 0
Message 75119 - Posted: 18 Feb 2013, 12:11:28 UTC - in response to Message 75117.  

None of my finished WUs have uploaded for over 24 hours. Is it just me? It says internet access is OK, servers may be down.


I have a Stuck one also. ActCys_Ploop_abinitio_design_y053_010_74678_459_0_0


I will try this again, as long as it says this:
| Project communication failed: attempting access to reference site
| Internet access OK - project servers may be temporarily down.

IT IS NOT OUR PC'S, it is the Project that is down!! YES the Project went down the other day, then the weekend came, we KNOW they don't even come in on weekends at Rosetta and YES they will probably fix it this week. If it is easy they should fix it today, if it is a more serious problem, like broken parts or even missing or stolen parts, it could take a bit longer! Set up a backup program with zero percent as the amount you want to give it. Every time Rosie goes down you will crunch units from the other project, until then Rosie will get all your time.
ID: 75119 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator
Project administrator

Joined: 22 Aug 06
Posts: 3550
Credit: 0
RAC: 0
Message 75135 - Posted: 18 Feb 2013, 20:05:14 UTC
Last modified: 18 Feb 2013, 20:14:20 UTC

we KNOW they don't even come in on weekends at Rosetta


What is the source of your information here, mikey?
Rosetta Moderator: Mod.Sense
ID: 75135 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Paul Vleugels

Joined: 1 Apr 09
Posts: 4
Credit: 620,780
RAC: 126
Message 77173 - Posted: 31 Jul 2014, 13:44:50 UTC

While running Rosetta I've got 5 finished WUs ready to be sent to the server, but my BOINC client has been constantly trying to upload the data to the server without any success for 3 days now.

When I checked the server status yesterday and today I didn't see any issue, so what can be the reason for the upload to the server not starting?

I'm running v7.2.42 in a Windows 7 environment.
ID: 77173 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
dmargulis

Joined: 30 Apr 07
Posts: 2
Credit: 4,533,557
RAC: 2,641
Message 77174 - Posted: 31 Jul 2014, 14:21:08 UTC - in response to Message 77173.  

Same issue here, but I have 43 WUs waiting to upload. The problem seems to have started on the night of 7/28:

7/28/2014 11:50:44 PM | rosetta@home | Computation for task rb_07_21_48261_94947_ab_stage0_h001___robetta_IGNORE_THE_REST_04_06_179542_9_0 finished
7/28/2014 11:50:46 PM | rosetta@home | Started upload of rb_07_21_48261_94947_ab_stage0_h001___robetta_IGNORE_THE_REST_04_06_179542_9_0_0
7/28/2014 11:51:10 PM | rosetta@home | Temporarily failed upload of rb_07_21_48261_94947_ab_stage0_h001___robetta_IGNORE_THE_REST_04_06_179542_9_0_0: connect() failed
7/28/2014 11:51:10 PM | rosetta@home | Backing off 00:03:05 on upload of rb_07_21_48261_94947_ab_stage0_h001___robetta_IGNORE_THE_REST_04_06_179542_9_0_0
7/28/2014 11:51:14 PM | | Project communication failed: attempting access to reference site
7/28/2014 11:51:17 PM | | Internet access OK - project servers may be temporarily down.

Any thoughts?

While running Rosetta I've got 5 finished WUs ready to be sent to the server, but my BOINC client has been constantly trying to upload the data to the server without any success for 3 days now.

When I checked the server status yesterday and today I didn't see any issue, so what can be the reason for the upload to the server not starting?

I'm running v7.2.42 in a Windows 7 environment.

ID: 77174 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robl

Joined: 8 Dec 12
Posts: 4
Credit: 8,254,003
RAC: 2
Message 77175 - Posted: 31 Jul 2014, 15:21:36 UTC - in response to Message 77174.  

Anyone know when this upload problem will be resolved? If the servers are online then it seems like DNS can't resolve the IP address for the upload server. My same machines are not having an issue with E@H.
ID: 77175 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Polian
Joined: 21 Sep 05
Posts: 152
Credit: 10,141,266
RAC: 0
Message 77176 - Posted: 31 Jul 2014, 16:24:11 UTC - in response to Message 77175.  

Anyone know when this upload problem will be resolved? If the servers are online then it seems like DNS can't resolve the IP address for the upload server. My same machines are not having an issue with E@H.


I'm not seeing DNS problems at all. The names are being resolved. The servers aren't responding, likely due to significantly throttled traffic, as the Rosetta problems thread and the front page indicate.

It will be fixed when it's fixed. I'm sure they don't enjoy their own project not receiving needed data, either. In the meantime, attach to another project or turn the computer off, walk away, and come back later.
ID: 77176 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile shanen
Joined: 16 Apr 14
Posts: 187
Credit: 11,682,361
RAC: 6,048
Message 77180 - Posted: 31 Jul 2014, 19:32:24 UTC

I think it's the lack of useful news that is the main source of frustration here, and this thread is also lacking therein. The secondary source is idle machines, though right now it is down to one. Most of the other machines managed to get dribbles of tasks to work on and haven't been too idle...

However, in terms of competence of the project managers and system operators, the situation is not impressive, even by my low standards back when I was in graduate school. If I have any personal involvement in BOINC, it would be on the client side, but the server side is mature enough that lots of people are able to operate it pretty reliably. Not so for the University of Washington.

By the way, I'm NOT trying to claim any credit for BOINC. If I had any involvement, it was more on the negative side in terms of criticisms of the ancestor s@h project and suggested improvements. Because of my caustic temperament, I don't know or care if I was involved, and I'm certain not to get any recognition. However, I also think it's just a coincidence that BOINC is structured the way it is, because my suggestions were, after all, entirely and intuitively obvious to the most casual observer.

So in conclusion, how about a few wild guesses? Maybe something like "We're 35% sure we know what the problem is, and think it is about 55% fixed with a 25% estimate that the system will clear the backlog within 2 days, plus or minus four days."
ID: 77180 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Miklos,M

Joined: 8 Dec 13
Posts: 23
Credit: 4,942,189
RAC: 0
Message 77181 - Posted: 31 Jul 2014, 20:09:42 UTC

I wish we could be told how long this outage will last. Many units waiting to upload here.
ID: 77181 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator
Project administrator

Joined: 22 Aug 06
Posts: 3550
Credit: 0
RAC: 0
Message 77183 - Posted: 31 Jul 2014, 22:03:38 UTC

Let me share what I can. I am not at UW, and so have no first-hand knowledge of this specific situation. But let me offer the following:

In the past when such extended issues have arisen, they have extended the task deadlines so you still get the credit you would have if you had been able to upload on time.

Typically such network problems are either in the "still a problem" category or in the "now resolved" category; there is rarely an in-between.

Now that it has gone for a few days, the project will get slammed when things come back to normal. So even when the network is fully functional, it won't feel like it because most every active machine attached to the project wants the server's attention. It often takes 2 days of full operation before things really start to look normal so far as speed of upload and download, and getting new tasks when requested etc.

This is because all the machines are out of work, and also because as they crunch for other projects, they later try to repay the debt to R@h to maintain your resource share. So, hosts that are running multiple projects start to hit R@h more than they normally would, especially in the first couple of days back in operation.

So, I typically try to avoid hitting the server when it first comes up. I extend my runtime preference so the tasks I do get will run longer on my machine before being ready to report back (don't go overboard on changes to runtime preference, otherwise BOINC will tend to download too much work). This helps my machine put less strain on the server as it is trying to dig out from the backlog.

If you don't wish to mess with such things, rest assured that the BOINC Manager retries are structured in a way to achieve many of the same net results as I do manually. But only if you resist the temptation to hit retry now on all of your uploads when the server comes back up. :)
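
To illustrate why spaced-out retries help a recovering server, here is a minimal Python sketch of randomized exponential backoff. This is a generic technique, not the BOINC client's actual retry algorithm; the function name and the `base` and `cap` values are assumptions chosen for the example.

```python
import random


def backoff_delay(failures, base=60.0, cap=6 * 3600.0, rng=random):
    """Seconds to wait before the next retry after `failures` failed attempts.

    `base` and `cap` are illustrative values, not BOINC settings.
    """
    # The retry window doubles with each failure until it reaches the cap.
    ceiling = min(cap, base * 2 ** (failures - 1))
    # Pick a random point in the upper half of the window so that many
    # clients that failed at the same moment do not all retry together.
    return rng.uniform(ceiling / 2, ceiling)


# Seeded generator so the example is repeatable.
rng = random.Random(42)
delays = [backoff_delay(n, rng=rng) for n in range(1, 11)]
```

Pressing "retry now" on every upload discards this spreading, which is why doing so when the server first comes back tends to add to the backlog.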
Rosetta Moderator: Mod.Sense
ID: 77183 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Keith Jillings

Joined: 26 Sep 06
Posts: 7
Credit: 536,631
RAC: 0
Message 77185 - Posted: 31 Jul 2014, 22:41:44 UTC

I wondered if it was just me! The "Server Status" says all is working, but I've got a batch of work units that have been "Uploading" for several days.
I get a variety of messages from "Project servers may be temporarily down" to "BOINC can't access internet" (which it can, because it is doing so for other projects).
ID: 77185 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote



©2020 University of Washington
http://www.bakerlab.org