SERVER PROBLEMS - 2.

Message boards : Number crunching : SERVER PROBLEMS - 2.

goraxan

Joined: 18 Jul 10
Posts: 6
Credit: 1,143,926
RAC: 0
Message 67347 - Posted: 25 Aug 2010, 21:21:39 UTC

OK, thanks. Now I've found where the cache properties are :)

P . P . L .
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 67404 - Posted: 27 Aug 2010, 21:52:50 UTC

It seems most servers are down; hopefully they can finally fix the problems.

Data-driven web pages boinc.bakerlab.org Running
Scheduler srv4.bakerlab.org Running
rah_make_work1 srv1 Not running
rah_make_work2 srv3 Not running
feeder srv4 Not running
file_deleter srv1 Not running
rah_validator_beta bk2 Not running
rah_validator_mini bk1 Not running
rah_assimilatorbeta1 bk1 Not running
rah_assimilatorbeta2 bk1 Not running
rah_assimilatorbeta3 bk2 Not running
rah_assimilatorbeta4 bk2 Not running
rah_assimilator_mini1 bk1 Not running
rah_assimilator_mini2 bk1 Not running
rah_assimilator_mini3 bk2 Not running
rah_assimilator_mini4 bk2 Not running
rah_assimilator_mini5 bk1 Not running
rah_assimilator_mini6 bk1 Not running
rah_assimilator_mini7 bk2 Not running
rah_assimilator_mini8 bk2 Not running
transitioner boinc Not running
db_purge srv1 Not running


Speedy
Joined: 25 Sep 05
Posts: 163
Credit: 826,597
RAC: 0
Message 67405 - Posted: 27 Aug 2010, 22:08:37 UTC

As of 27 Aug 2010 22:04:51 UTC

All servers are running.

Ready to send 3,280
In progress 258,996
Have a crunching good day!!

goraxan
Joined: 18 Jul 10
Posts: 6
Credit: 1,143,926
RAC: 0
Message 67413 - Posted: 28 Aug 2010, 8:37:31 UTC

Everything running but still problems :(

Ready to send 10
In progress 358,716

Evan
Joined: 10 Aug 08
Posts: 5
Credit: 39,050
RAC: 0
Message 67444 - Posted: 29 Aug 2010, 21:20:29 UTC



Still having some troubles.

CrazySpy
Joined: 29 Aug 10
Posts: 1
Credit: 11,961
RAC: 0
Message 67477 - Posted: 30 Aug 2010, 23:50:58 UTC - in response to Message 67444.  



Still having some troubles.


Hi, unfortunately I'm having the same problem.

I have two projects in my BOINC Manager:

Seti and Rosetta.

I'm able to process Seti tasks without problems, but I can't get any work from Rosetta... I only get "Communication deferred" in the Status message on the Projects tab.

Could anyone help me? I'm really interested in helping the Rosetta project.

Polian
Joined: 21 Sep 05
Posts: 152
Credit: 10,141,266
RAC: 0
Message 67481 - Posted: 31 Aug 2010, 3:00:07 UTC - in response to Message 67477.  

Could anyone help me? I'm really interested in helping the Rosetta project.


You're not doing anything wrong. It appears the servers are working, although they are churning out workunits slowly and work is not always available at the moment.

However, without official word from the project, this is just speculation. When your client gets "lucky" you'll receive some work. My PCs have been crunching *most* of the time.

Bill Hepburn
Joined: 18 Sep 05
Posts: 14
Credit: 14,975,271
RAC: 0
Message 67482 - Posted: 31 Aug 2010, 4:07:27 UTC - in response to Message 67477.  


Hi, unfortunately I'm having the same problem.

I have two projects in my BOINC Manager:

Seti and Rosetta.

I'm able to process Seti tasks without problems, but I can't get any work from Rosetta... I only get "Communication deferred" in the Status message on the Projects tab.

Could anyone help me? I'm really interested in helping the Rosetta project.


Remember that Seti goes into its weekly three-day outage starting Tuesday morning California time (UTC-7). So, if they don't get Rosetta fixed, you may run out of work. Over the years, Rosetta has been one of the most reliable BOINC projects, so I remain optimistic.



P . P . L .
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 67483 - Posted: 31 Aug 2010, 4:07:29 UTC
Last modified: 31 Aug 2010, 4:07:48 UTC

Down again.

Data-driven web pages boinc.bakerlab.org Running
Scheduler srv4.bakerlab.org Running
rah_make_work1 srv1 Not running
rah_make_work2 srv3 Not running
feeder srv4 Not running
file_deleter srv1 Not running
rah_validator_beta bk2 Not running
rah_validator_mini bk1 Not running
rah_assimilatorbeta1 bk1 Not running
rah_assimilatorbeta2 bk1 Not running
rah_assimilatorbeta3 bk2 Not running
rah_assimilatorbeta4 bk2 Not running
rah_assimilator_mini1 bk1 Not running
rah_assimilator_mini2 bk1 Not running
rah_assimilator_mini3 bk2 Not running
rah_assimilator_mini4 bk2 Not running
rah_assimilator_mini5 bk1 Not running
rah_assimilator_mini6 bk1 Not running
rah_assimilator_mini7 bk2 Not running
rah_assimilator_mini8 bk2 Not running
transitioner boinc Not running
db_purge srv1 Not running

Running: Program is operating normally

Not Running: Program failed or ran out of work

mikey
Joined: 5 Jan 06
Posts: 1898
Credit: 12,723,752
RAC: 682
Message 67485 - Posted: 31 Aug 2010, 9:34:34 UTC - in response to Message 67481.  

Could anyone help me? I'm really interested in helping the Rosetta project.


You're not doing anything wrong. It appears the servers are working, although they are churning out workunits slowly and work is not always available at the moment.

However, without official word from the project, this is just speculation. When your client gets "lucky" you'll receive some work. My PCs have been crunching *most* of the time.


Over in another thread, this came from a person who knows what is going on:
"Even after the cause of the go-slow is identified and fixed it may be a week before we are back up to normal operations. The servers normally make work available in the tens of thousands of tasks per hour; right now idle crunchers are probably requesting at least 100,000 tasks. It will take time to clear that backlog even when running at full capacity."

The thread is here: Message boards : Number crunching : no work units
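
The reasoning in that quote can be put as rough arithmetic. This is only a sketch: the 100,000-task backlog comes from the quote, but both hourly rates below are assumed numbers chosen for illustration, not project figures, and `hours_to_clear` is a made-up helper name. The point is that a backlog only shrinks by the spare capacity (what the servers produce minus what crunchers keep requesting), so recovery can take much longer than the backlog divided by raw output.

```python
def hours_to_clear(backlog, produced_per_hour, requested_per_hour):
    """Hours until a task backlog is gone, or None if it never shrinks.

    Only the surplus (production minus ongoing demand) reduces the backlog.
    """
    surplus = produced_per_hour - requested_per_hour
    if surplus <= 0:
        return None
    return backlog / surplus


# Backlog taken from the quote; both hourly rates are assumptions.
print(hours_to_clear(100_000, 20_000, 19_000))  # 100.0 hours, roughly 4 days
print(hours_to_clear(100_000, 20_000, 10_000))  # 10.0 hours
```

With little spare capacity the backlog lingers for days; extra capacity shortens the wait dramatically.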

Murasaki
Joined: 20 Apr 06
Posts: 303
Credit: 511,418
RAC: 0
Message 67494 - Posted: 31 Aug 2010, 18:17:58 UTC - in response to Message 67485.  
Last modified: 31 Aug 2010, 18:23:10 UTC

Over in another thread, this came from a person who knows what is going on:


I think a bit of clarification is needed here. The comment was by a person who made some educated guesses about what is going on based on previous experiences.

I have no more knowledge than any other volunteer who has trawled these forums over the past few years. I can speculate based on similar situations in the past, but we are still awaiting official comments from the project team. They may actually surprise me and upgrade the capacity so the backlog is cleared much sooner.

Murasaki
Joined: 20 Apr 06
Posts: 303
Credit: 511,418
RAC: 0
Message 67570 - Posted: 3 Sep 2010, 18:23:31 UTC - in response to Message 67494.  

They may actually surprise me and upgrade the capacity so the backlog is cleared much sooner.


As the project has jumped from 32 Teraflops to 108 Teraflops in less than 48 hours my prediction of a week to recover looks abysmal in hindsight. I doubt I will be able to get employment as either a fortune teller or a weatherman.

P . P . L .
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 67882 - Posted: 30 Sep 2010, 22:46:06 UTC
Last modified: 30 Sep 2010, 22:47:25 UTC

Hi.

Well that was a big one, glad to see the servers coming back up.

Greg_BE
Joined: 30 May 06
Posts: 5770
Credit: 6,139,760
RAC: 1
Message 67884 - Posted: 30 Sep 2010, 23:17:28 UTC

server status says everything is up and running, but I get

10/1/2010 1:16:20 AM rosetta@home Sending scheduler request: To report completed tasks.
10/1/2010 1:16:20 AM rosetta@home Reporting 14 completed tasks, not requesting new tasks
10/1/2010 1:16:22 AM Project communication failed: attempting access to reference site
10/1/2010 1:16:22 AM rosetta@home Scheduler request failed: Couldn't connect to server
10/1/2010 1:16:23 AM Internet access OK - project servers may be temporarily down.


so I guess they are in overload mode again

jesse1919
Joined: 1 Jul 10
Posts: 8
Credit: 2,680,869
RAC: 0
Message 67893 - Posted: 1 Oct 2010, 5:45:40 UTC

I haven't been able to upload or download all day. Servers are all green but now
"TeraFLOPS estimate: 0.642"
That's not good. Hope they figure it out.



Greg_BE
Joined: 30 May 06
Posts: 5770
Credit: 6,139,760
RAC: 1
Message 67902 - Posted: 1 Oct 2010, 7:41:05 UTC - in response to Message 67893.  

When they go down for a while, everyone's program keeps trying to send information to the server. When the servers come back online, they get pounded by everyone's program trying to reach them at once. So it says it can't reach the server, which is true, since it is overloaded. Just wait for the server to get unburied and your program will update as soon as it can get through.
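
As an illustration only (a generic sketch, not BOINC's actual retry code): if every client retried on the same fixed schedule, they would all hit a recovering server at the same moment. Randomized exponential backoff spreads those retries out. The helper name `backoff_delay`, the 60-second base, and the 4-hour cap are assumptions made up for this example.

```python
import random


def backoff_delay(failures, base=60.0, cap=4 * 3600.0, rng=random):
    """Return a randomized wait in seconds before the next retry.

    Hypothetical helper using "full jitter" exponential backoff: the
    upper bound doubles with each consecutive failure, up to `cap`,
    and the actual wait is drawn uniformly from [0, bound].
    """
    bound = min(cap, base * (2 ** failures))
    return rng.uniform(0.0, bound)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so repeated runs match
    for failures in range(6):
        print(failures, round(backoff_delay(failures, rng=rng), 1))
```

Because each client draws its own random wait, the requests arrive spread over the whole window instead of in one burst.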


I haven't been able to upload or download all day. Servers are all green but now
"TeraFLOPS estimate: 0.642"
That's not good. Hope they figure it out.




Chris Holvenstot
Joined: 2 May 10
Posts: 220
Credit: 9,106,918
RAC: 0
Message 67907 - Posted: 1 Oct 2010, 10:52:51 UTC

Greg -

I hate to be contrary (you believe that?) but I don't think that this is a simple case of servers being overloaded at this point - if it were you would see a few jobs squeaking through now and then.

Which does not seem to be the case - things appear to be locked up tighter than Fort Knox. I have not been able to report a completed task, finish an upload, or get a new unit of work since the servers started coming back up yesterday.

And judging from a few of the "stats" pages my closest competitors are in the same boat.

Further, the project's TeraFLOPS estimate has remained static at 0.642 during this time frame, another indication that nothing is getting through.

So methinks the good folks out in Washington are still working on getting all of their systems, and the network which connects them, back up and functional. Yes, I know that the "Server Status Board" is all "green", but what the heck, seeing is believing, right?

Have patience my friend, I'm sure that getting the project up and functional again is their top priority.

CH


mikey
Joined: 5 Jan 06
Posts: 1898
Credit: 12,723,752
RAC: 682
Message 67908 - Posted: 1 Oct 2010, 12:45:04 UTC - in response to Message 67907.  

Greg -

I hate to be contrary (you believe that?) but I don't think that this is a simple case of servers being overloaded at this point - if it were you would see a few jobs squeaking through now and then.

Which does not seem to be the case - things appear to be locked up tighter than Fort Knox. I have not been able to report a completed task, finish an upload, or get a new unit of work since the servers started coming back up yesterday.

And judging from a few of the "stats" pages my closest competitors are in the same boat.

Further, the project's TeraFLOPS estimate has remained static at 0.642 during this time frame, another indication that nothing is getting through.

So methinks the good folks out in Washington are still working on getting all of their systems, and the network which connects them, back up and functional. Yes, I know that the "Server Status Board" is all "green", but what the heck, seeing is believing, right?

Have patience my friend, I'm sure that getting the project up and functional again is their top priority.

CH


We talked about the 'server status' page on another project: it is only as good as the data given to it, so if the data is bad or nonexistent, the 'server status' page will be inaccurate.

Dave Mickey
Joined: 29 Dec 07
Posts: 33
Credit: 4,136,957
RAC: 0
Message 67912 - Posted: 1 Oct 2010, 13:10:25 UTC

The view from over here is much the same as Chris reports. I've got 3 hosts, each with units to upload and older units to report (that got uploaded before the outage) and none of them gets even a peep out of the servers. Update requests are NACKed immediately. None of them has made a contact since the problems began.
Can anybody report that they do get server requests completed? Are they limping along under a flood, or are they just not serving anything?

This project is my *other* project, and where I keep my current prefs because it's always been so solid in terms of server availability. So I'm confident the sys folks will get it back. Just hope it's soon, because between this and SETI's big problems, most of my machines are draining quickly.

Dave

TJ
Joined: 29 Mar 09
Posts: 127
Credit: 4,799,890
RAC: 0
Message 67916 - Posted: 1 Oct 2010, 14:36:03 UTC

Same here, nothing is going through and uploading is pending.
This has been going on for more than 24 hours, and on the main page there is no information about what is going on. Einstein@home is my main project and when they are down for a few hours I do something for mankind.

part of message:
01/10/2010 16:29:51|rosetta@home|Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 3 completed tasks
01/10/2010 16:29:53||Project communication failed: attempting access to reference site
01/10/2010 16:29:54||Internet access OK - project servers may be temporarily down.
01/10/2010 16:29:56|rosetta@home|Scheduler request failed: Couldn't connect to server

Some information from the admins would be nice. Einstein@home does that perfectly (in most cases).
Greetings,
TJ.

©2025 University of Washington
https://www.bakerlab.org