Message boards : Number crunching : SERVER PROBLEMS - 2.
Author | Message |
---|---|
goraxan Send message Joined: 18 Jul 10 Posts: 6 Credit: 1,143,926 RAC: 0 |
OK, thanks. Now I've found where the cache properties are :) |
P . P . L . Send message Joined: 20 Aug 06 Posts: 581 Credit: 4,865,274 RAC: 0 |
It seems most servers are down; hopefully they can finally fix the problems. Data-driven web pages (boinc.bakerlab.org): Running; Scheduler (srv4.bakerlab.org): Running; rah_make_work1 (srv1): Not running; rah_make_work2 (srv3): Not running; feeder (srv4): Not running; file_deleter (srv1): Not running; rah_validator_beta (bk2): Not running; rah_validator_mini (bk1): Not running; rah_assimilatorbeta1 (bk1): Not running; rah_assimilatorbeta2 (bk1): Not running; rah_assimilatorbeta3 (bk2): Not running; rah_assimilatorbeta4 (bk2): Not running; rah_assimilator_mini1 (bk1): Not running; rah_assimilator_mini2 (bk1): Not running; rah_assimilator_mini3 (bk2): Not running; rah_assimilator_mini4 (bk2): Not running; rah_assimilator_mini5 (bk1): Not running; rah_assimilator_mini6 (bk1): Not running; rah_assimilator_mini7 (bk2): Not running; rah_assimilator_mini8 (bk2): Not running; transitioner (boinc): Not running; db_purge (srv1): Not running |
Speedy Send message Joined: 25 Sep 05 Posts: 163 Credit: 808,337 RAC: 0 |
As of 27 Aug 2010 22:04:51 UTC, all servers are running. Ready to send: 3,280. In progress: 258,996. Have a crunching good day!! |
goraxan Send message Joined: 18 Jul 10 Posts: 6 Credit: 1,143,926 RAC: 0 |
Everything is running, but there are still problems :( Ready to send: 10. In progress: 358,716. |
Evan Send message Joined: 10 Aug 08 Posts: 5 Credit: 39,050 RAC: 0 |
Still having some troubles. |
CrazySpy Send message Joined: 29 Aug 10 Posts: 1 Credit: 11,961 RAC: 0 |
Hi, unfortunately I'm having the same problem. I have two projects in my BOINC Manager: SETI and Rosetta. I'm able to process SETI tasks without any problem, but I can't get any work from Rosetta. I only get "Communication deferred" in the Status message on the Projects tab. Could anyone help me? I'm really interested in helping the Rosetta project. |
Polian Send message Joined: 21 Sep 05 Posts: 152 Credit: 10,141,266 RAC: 0 |
"Could anyone help me? I'm really interested in helping the Rosetta project." You're not doing anything wrong. It appears the servers are working, although they are churning out workunits slowly, and work is not always available at the moment. However, without official word from the project, this is just speculation. When your client gets "lucky" you'll receive some work. My PCs have been crunching *most* of the time. |
Bill Hepburn Send message Joined: 18 Sep 05 Posts: 14 Credit: 14,953,680 RAC: 479 |
Remember that SETI goes into its weekly three-day outage starting Tuesday morning California time (UTC-7). So, if they don't get Rosetta fixed, you may run out of work. Over the years, Rosetta has been one of the most reliable BOINC projects, so I remain optimistic. |
P . P . L . Send message Joined: 20 Aug 06 Posts: 581 Credit: 4,865,274 RAC: 0 |
Down again. Data-driven web pages (boinc.bakerlab.org): Running; Scheduler (srv4.bakerlab.org): Running; rah_make_work1 (srv1): Not running; rah_make_work2 (srv3): Not running; feeder (srv4): Not running; file_deleter (srv1): Not running; rah_validator_beta (bk2): Not running; rah_validator_mini (bk1): Not running; rah_assimilatorbeta1 (bk1): Not running; rah_assimilatorbeta2 (bk1): Not running; rah_assimilatorbeta3 (bk2): Not running; rah_assimilatorbeta4 (bk2): Not running; rah_assimilator_mini1 (bk1): Not running; rah_assimilator_mini2 (bk1): Not running; rah_assimilator_mini3 (bk2): Not running; rah_assimilator_mini4 (bk2): Not running; rah_assimilator_mini5 (bk1): Not running; rah_assimilator_mini6 (bk1): Not running; rah_assimilator_mini7 (bk2): Not running; rah_assimilator_mini8 (bk2): Not running; transitioner (boinc): Not running; db_purge (srv1): Not running. (Running: program is operating normally. Not running: program failed or ran out of work.) |
mikey Send message Joined: 5 Jan 06 Posts: 1895 Credit: 9,217,610 RAC: 674 |
"Could anyone help me? I'm really interested in helping the Rosetta project." Over in another thread, this came from a person who knows what is going on: "Even after the cause of the go-slow is identified and fixed it may be a week before we are back up to normal operations. The servers normally make work available in the tens of thousands of tasks per hour; right now idle crunchers are probably requesting at least 100,000 tasks. It will take time to clear that backlog even when running at full capacity." The thread is here: Message boards : Number crunching : no work units |
Murasaki Send message Joined: 20 Apr 06 Posts: 303 Credit: 511,418 RAC: 0 |
"Over in another thread this came out by a person that knows what is going on:" I think a bit of clarification is needed here. The comment was by a person who made some educated guesses about what is going on based on previous experiences. I have no more knowledge than any other volunteer who has trawled these forums over the past few years. I can speculate based on similar situations in the past, but we are still awaiting official comments from the project team. They may actually surprise me and upgrade the capacity so the backlog is cleared much sooner. |
Murasaki Send message Joined: 20 Apr 06 Posts: 303 Credit: 511,418 RAC: 0 |
"They may actually surprise me and upgrade the capacity so the backlog is cleared much sooner." As the project has jumped from 32 Teraflops to 108 Teraflops in less than 48 hours, my prediction of a week to recover looks abysmal in hindsight. I doubt I will be able to get employment as either a fortune teller or a weatherman. |
P . P . L . Send message Joined: 20 Aug 06 Posts: 581 Credit: 4,865,274 RAC: 0 |
Hi. Well, that was a big one. Glad to see the servers coming back up. |
Greg_BE Send message Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
Server status says everything is up and running, but I get: 10/1/2010 1:16:20 AM rosetta@home Sending scheduler request: To report completed tasks. / 10/1/2010 1:16:20 AM rosetta@home Reporting 14 completed tasks, not requesting new tasks / 10/1/2010 1:16:22 AM Project communication failed: attempting access to reference site / 10/1/2010 1:16:22 AM rosetta@home Scheduler request failed: Couldn't connect to server / 10/1/2010 1:16:23 AM Internet access OK - project servers may be temporarily down. So I guess they are in overload mode again. |
jesse1919 Send message Joined: 1 Jul 10 Posts: 8 Credit: 2,680,869 RAC: 0 |
I haven't been able to upload or download all day. Servers are all green, but now it says "TeraFLOPS estimate: 0.642". That's not good. Hope they figure it out. |
Greg_BE Send message Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
When they go down for a while, everyone's program is looking to send information to the server. When the servers come back online, they get pounded with everyone's program trying to access the server. So it says it can't reach the server, which is true, since it is overloaded. Just wait for the server to get unburied and your program will update as soon as it can get through. "I haven't been able to upload or download all day. Servers are all green but now" |
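The pile-up described in the post above, where every client retries the moment the servers return, is commonly softened on the client side by spacing retries out. Below is a minimal Python sketch of that idea, assuming exponential backoff with random jitter. The function name, the 60-second base delay, and the 4-hour cap are hypothetical values chosen for illustration and are not taken from the BOINC client's actual retry logic.

```python
import random


def retry_delays(attempts, base=60.0, cap=4 * 3600.0, rng=None):
    """Return wait times in seconds for a run of failed requests.

    Hypothetical illustration: each failure doubles the upper bound on the
    wait (never exceeding `cap`), and the actual wait is drawn uniformly
    below that bound so that many clients do not all retry at once.
    """
    rng = rng or random.Random()
    delays = []
    for attempt in range(attempts):
        upper = min(cap, base * (2 ** attempt))
        delays.append(rng.uniform(0, upper))
    return delays


if __name__ == "__main__":
    # Print one possible schedule for six consecutive failures.
    for number, delay in enumerate(retry_delays(6), start=1):
        print(f"retry {number}: wait {delay:.0f} s")
```

Because each wait is random, two clients that fail at the same moment are unlikely to come back at the same moment, which spreads the load on a recovering server over time.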
Chris Holvenstot Send message Joined: 2 May 10 Posts: 220 Credit: 9,106,918 RAC: 0 |
Greg - I hate to be contrary (you believe that?) but I don't think that this is a simple case of servers being overloaded at this point - if it were, you would see a few jobs squeaking through now and then, which does not seem to be the case - things appear to be locked up tighter than Fort Knox. I have not been able to report a completed task, finish an upload, or get a new unit of work since the servers started coming back up yesterday. And judging from a few of the "stats" pages, my closest competitors are in the same boat. Further, the project's TeraFLOPS estimate has remained static at 0.642 during this time frame, another indication that nothing is getting through. So methinks the good folks out in Washington are still working on getting all of their systems and the network which connects them back up and functional - yes, I know that the "Server Status Board" is all "green" but what the heck, seeing is believing, right? Have patience my friend, I'm sure that getting the project up and functional again is their top priority. CH |
mikey Send message Joined: 5 Jan 06 Posts: 1895 Credit: 9,217,610 RAC: 674 |
Greg - We talked about the 'server status' page on another project. It is only as good as the data given to it, so if the data is bad or nonexistent, the 'server status' page will be inaccurate. |
Dave Mickey Send message Joined: 29 Dec 07 Posts: 33 Credit: 4,136,957 RAC: 0 |
The view from over here is much the same as Chris reports. I've got 3 hosts, each with units to upload and older units to report (that got uploaded before the outage) and none of them gets even a peep out of the servers. Update requests are NACKed immediately. None of them has made a contact since the problems began. Can anybody report that they do get server requests completed? Are they limping along under a flood, or are they just not serving anything? This project is my *other* project, and where I keep my current prefs because it's always been so solid in terms of server availability. So I'm confident the sys folks will get it back. Just hope it's soon, because between this and SETI's big problems, most of my machines are draining quickly. Dave |
TJ Send message Joined: 29 Mar 09 Posts: 127 Credit: 4,799,890 RAC: 0 |
Same here, nothing is going through and uploads are pending. This has been going on for more than 24 hours, and the main page has no information about what is going on. Einstein@home is my main project, and when they are down for a few hours I do something for mankind. Part of the message log: 01/10/2010 16:29:51|rosetta@home|Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 3 completed tasks / 01/10/2010 16:29:53||Project communication failed: attempting access to reference site / 01/10/2010 16:29:54||Internet access OK - project servers may be temporarily down. / 01/10/2010 16:29:56|rosetta@home|Scheduler request failed: Couldn't connect to server. Some information from the admins would be nice. Einstein@home does that perfectly (in most cases). Greetings, TJ. |
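For anyone who wants to pick the failures out of a longer message log, here is a small Python sketch. It assumes the `timestamp|project|message` layout seen in the lines quoted in the post above. That layout is inferred from those lines only, not from BOINC documentation, and the helper names are hypothetical.

```python
def parse_log_line(line):
    """Split 'timestamp|project|message' into a dict.

    The project field may be empty, as in the quoted lines that are not
    tied to a specific project. Only the first two pipes are treated as
    separators, so a pipe inside the message text is preserved.
    """
    timestamp, project, message = line.split("|", 2)
    return {"timestamp": timestamp, "project": project, "message": message}


def failed_requests(lines):
    """Return parsed entries whose message mentions a failure."""
    entries = (parse_log_line(line) for line in lines)
    return [entry for entry in entries if "failed" in entry["message"].lower()]


# Sample lines copied from the post above.
SAMPLE = [
    "01/10/2010 16:29:51|rosetta@home|Sending scheduler request: Requested by user. "
    "Requesting 0 seconds of work, reporting 3 completed tasks",
    "01/10/2010 16:29:53||Project communication failed: attempting access to reference site",
    "01/10/2010 16:29:54||Internet access OK - project servers may be temporarily down.",
    "01/10/2010 16:29:56|rosetta@home|Scheduler request failed: Couldn't connect to server",
]

if __name__ == "__main__":
    for entry in failed_requests(SAMPLE):
        print(entry["timestamp"], "-", entry["message"])
```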