Rosetta@home

No work from project with 89,369 queued

  UW Seal
 
[ Home ] [ Join ] [ About ] [ Participants ] [ Community ] [ Statistics ]
  [ login/out ]


Advanced search
Message boards : Number crunching : No work from project with 89,369 queued

Sort
AuthorMessage
[BAT] tutta55 Profile
Avatar

Joined: Sep 16 05
Posts: 60
ID: 66
Credit: 99,832
RAC: 0
Message 2038 - Posted 2 Nov 2005 9:51:41 UTC
Last modified: 2 Nov 2005 9:54:45 UTC

I am getting intermittent 'No work from project' replies from the Rosetta server. Last time it took 3 attempts of my client to get work. Yet I see in the status that 89,369 wu are queued. What is causing this? I have my work queue set a 0.1 days. But with this going on it might be safer to take a higher number to avoid running dry.

____________

BOINC.BE: For Belgians who love the smell of glowing red cpu's in the morning
Tutta55's Lair

pe
Avatar

Joined: Sep 17 05
Posts: 4
ID: 100
Credit: 219,401
RAC: 80
Message 2041 - Posted 2 Nov 2005 11:24:53 UTC

same here.
doenst seem to get any new work downloaded.

greetz, pe.
____________

AnRM
Avatar

Joined: Sep 18 05
Posts: 123
ID: 353
Credit: 1,355,486
RAC: 0
Message 2045 - Posted 2 Nov 2005 12:04:47 UTC
Last modified: 2 Nov 2005 12:06:21 UTC

'Huston, we have a problem'.....no download, no work and I notice that the 'Queued' have exceeded 100,000........help! One box dry already.
____________

Webmaster Yoda Profile
Avatar

Joined: Sep 17 05
Posts: 161
ID: 75
Credit: 162,253
RAC: 0
Message 2047 - Posted 2 Nov 2005 12:52:07 UTC
Last modified: 2 Nov 2005 12:53:19 UTC

Just curious...

I have 5 PC's running Rosetta. Two are running BOINC 5.2.2 and have plenty of work. One is running 4.68 and is also getting work. The other two are running 4.45 and suddenly cannot get work (so they're crunching SETI and Predictor instead).

Not sure if it means anything, probably coincidence?


____________
*** Join BOINC@Australia today ***

stephan_t Profile
Avatar

Joined: Oct 20 05
Posts: 129
ID: 5636
Credit: 35,464
RAC: 0
Message 2048 - Posted 2 Nov 2005 13:02:13 UTC
Last modified: 2 Nov 2005 13:04:34 UTC

Running 522 here - not getting any work either

Edit: Just to clarify, I have 4 boxes, currently away so only able to monitor 1. It has 20 WU queued, but usually can I set the 'network update' to say, 3 days, and still get another 40 WU on top of that. Right now I can't (it says 'no new work available'.
____________
Team CFVault.com
http://www.cfvault.com

KSMarksPsych Profile
Avatar

Joined: Oct 15 05
Posts: 199
ID: 4774
Credit: 22,337
RAC: 0
Message 2050 - Posted 2 Nov 2005 13:32:54 UTC - in response to Message ID 2047.

Just curious...

I have 5 PC's running Rosetta. Two are running BOINC 5.2.2 and have plenty of work. One is running 4.68 and is also getting work. The other two are running 4.45 and suddenly cannot get work (so they're crunching SETI and Predictor instead).

Not sure if it means anything, probably coincidence?




Just thought I'd add that I'm running 4.45 and I'm getting work ok. Must be lucky ;)
____________
Kathryn :o)
The BOINC FAQ Service
The Unofficial BOINC Wiki
The Trac System
More BOINC information than you can shake a stick of RAM at.

Webmaster Yoda Profile
Avatar

Joined: Sep 17 05
Posts: 161
ID: 75
Credit: 162,253
RAC: 0
Message 2051 - Posted 2 Nov 2005 13:40:33 UTC
Last modified: 2 Nov 2005 13:40:54 UTC

Thanks - must have been coincidence.

I managed to get a handful of WU for these two PCs - hopefully there will be more in an hour or two when they run dry again

____________
*** Join BOINC@Australia today ***

stephan_t Profile
Avatar

Joined: Oct 20 05
Posts: 129
ID: 5636
Credit: 35,464
RAC: 0
Message 2052 - Posted 2 Nov 2005 13:55:59 UTC

Actually this reminds me of something I thought about the other day - potentially someone with a large number of boxes (say, the behemoth Housing and Food Services) could set their network settings at 10 days (the max) and update all their boxes at the same time. Surely that would suck the project dry of WUs. Or would it? Just a thought.
____________
Team CFVault.com
http://www.cfvault.com

Nothing But Idle Time

Joined: Sep 28 05
Posts: 209
ID: 1675
Credit: 139,545
RAC: 0
Message 2053 - Posted 2 Nov 2005 15:02:36 UTC - in response to Message ID 2051.

Thanks - must have been coincidence.

I managed to get a handful of WU for these two PCs - hopefully there will be more in an hour or two when they run dry again


For what it is worth: I'm on v5.2.5 and have been getting work regularly for days. Suddenly overnight I received only 2 WU mixed in with the No Work From Project messages. LHC has no work and Rosetta has no work (supposedly) so I am down to one project at work.
____________

adrianxw Profile
Avatar

Joined: Sep 18 05
Posts: 535
ID: 402
Credit: 1,115,782
RAC: 2,162
Message 2055 - Posted 2 Nov 2005 15:11:03 UTC
Last modified: 2 Nov 2005 15:12:12 UTC

I have no Rosetta wu's either, just the No work... message. Core client 4.25. Server status now shows 165,000+ wu's queued.
____________
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.

KWSN_Dagger

Joined: Oct 19 05
Posts: 5
ID: 5429
Credit: 2,803
RAC: 0
Message 2056 - Posted 2 Nov 2005 15:13:45 UTC

I run 5.2.2, and i'm getting no work from any of the projects i contribute to except for Einstein.

Bok Profile

Joined: Sep 17 05
Posts: 54
ID: 71
Credit: 3,411,417
RAC: 0
Message 2057 - Posted 2 Nov 2005 15:19:35 UTC

I'm also getting no work on a number of machines. Even a detach/attach is failing to receive new work...

:(

Just when I was really pushing the project too

Bok
____________
Free-DC

Stats for all projects

Custom Stats

Charles Dennett Profile
Avatar

Joined: Sep 27 05
Posts: 92
ID: 1447
Credit: 1,082,242
RAC: 0
Message 2058 - Posted 2 Nov 2005 15:29:52 UTC

I'm getting new WU's from time to time and getting no work the rest of the time. My backup project (seti) has kicked in when I have no RAH work.

Remember, the servers are on the west coast of the U.S. As I type this from the east coast it is 10:30 AM here and 7:30 AM there. The people are probably just getting into work now so give them a chance to size up the situation and fix it. Probabby will be just a restart of the servers.


____________

Andrew

Joined: Sep 19 05
Posts: 162
ID: 527
Credit: 105,512
RAC: 0
Message 2059 - Posted 2 Nov 2005 15:55:11 UTC

I'm using Trux's 5.3.1 client, and I've noticed that I'd get the "No work..." message , and then at 10:05 EST, I downloaded 2 WU, and since then I get the "No work..." msg again. :(

Looking at my msg log, I noticed that I started get no work at around 06:45 EST (11:45 UTC, i believe) this morning.

When did everyone else start seeing the "No work..." msg?
____________

Nothing But Idle Time

Joined: Sep 28 05
Posts: 209
ID: 1675
Credit: 139,545
RAC: 0
Message 2060 - Posted 2 Nov 2005 16:13:14 UTC - in response to Message ID 2059.

I'm using Trux's 5.3.1 client, and I've noticed that I'd get the "No work..." message , and then at 10:05 EST, I downloaded 2 WU, and since then I get the "No work..." msg again. :(

Looking at my msg log, I noticed that I started get no work at around 06:45 EST (11:45 UTC, i believe) this morning.

When did everyone else start seeing the "No work..." msg?


2:49:57 am EST
____________

stephan_t Profile
Avatar

Joined: Oct 20 05
Posts: 129
ID: 5636
Credit: 35,464
RAC: 0
Message 2061 - Posted 2 Nov 2005 16:19:43 UTC

Yeah, that's it, it's dry as the desert right now. No more wus. On the plus side of thing it's the first time some of my computers cpus temperature went below 60 celcius, and it's noticeably cooler in the room :)

____________
Team CFVault.com
http://www.cfvault.com

stephan_t Profile
Avatar

Joined: Oct 20 05
Posts: 129
ID: 5636
Credit: 35,464
RAC: 0
Message 2062 - Posted 2 Nov 2005 16:20:20 UTC

BTW - this should be reflected on boincstats tonight - should see a noticeable drop in returned WUs.
____________
Team CFVault.com
http://www.cfvault.com

[BAT] tutta55 Profile
Avatar

Joined: Sep 16 05
Posts: 60
ID: 66
Credit: 99,832
RAC: 0
Message 2063 - Posted 2 Nov 2005 16:28:40 UTC

Could this be the cause? From the Seti website:

November 1, 2005
Give your computer some variety - check out Rosetta@home. This new BOINC-based project aims to solve the ab initio protein structure prediction problem, and to design new chemical catalysts and potential HIV vaccines. Participants who found the lowest energy structures have already been acknowledged in a scientific paper describing the results.


Maybe there's a surge of new subscribers causing a serious backlog in work production.
____________

BOINC.BE: For Belgians who love the smell of glowing red cpu's in the morning
Tutta55's Lair

Bok Profile

Joined: Sep 17 05
Posts: 54
ID: 71
Credit: 3,411,417
RAC: 0
Message 2064 - Posted 2 Nov 2005 16:47:33 UTC - in response to Message ID 2063.


Maybe there's a surge of new subscribers causing a serious backlog in work production.


I don't see that many new users today....

http://stats.free-dc.org/new/newusers.php?proj=rah

Bok
____________
Free-DC

Stats for all projects

Custom Stats

Paul D. Buck Profile

Joined: Sep 17 05
Posts: 815
ID: 269
Credit: 1,812,737
RAC: 0
Message 2065 - Posted 2 Nov 2005 16:49:40 UTC

An epidemic ... I have at least one computer that is also trying unsuccessfully to get work too ...

Worse, I just stopped a 1% work unit with 6 hours on it ... :(

Waiting to see if it starts running this time ...
____________

Scribe
Avatar

Joined: Nov 2 05
Posts: 284
ID: 8368
Credit: 157,359
RAC: 0
Message 2066 - Posted 2 Nov 2005 17:08:57 UTC

Problems here also
____________

Red Squirrel

Joined: Sep 26 05
Posts: 13
ID: 1200
Credit: 3,613
RAC: 0
Message 2067 - Posted 2 Nov 2005 17:09:09 UTC

Getting same thing here. It first asked for work at 15:30 UTC today and got the "No work from project" message. I've also noticed that the server status on Rosetta's home page says "Server status as of 30 Oct 2005 13:42:37 UTC".
Is this a related problem or just a coincidence?
____________

Rebirther Profile
Avatar

Joined: Sep 17 05
Posts: 116
ID: 152
Credit: 41,315
RAC: 0
Message 2069 - Posted 2 Nov 2005 18:05:44 UTC

02/11/2005 19:03:43|rosetta@home|Message from server: Server has software problem
02/11/2005 19:03:43|rosetta@home|Project is down
02/11/2005 19:03:48|rosetta@home|Deferring communication with project for 59 minutes and 54 seconds

:( Hmm, how long does it take?
____________

Bok Profile

Joined: Sep 17 05
Posts: 54
ID: 71
Credit: 3,411,417
RAC: 0
Message 2070 - Posted 2 Nov 2005 18:10:09 UTC

I just noticed that too, now it's back. I think they may have been fixing the issue. Seems ok now...fingers crossed. I'll check the rest of my clients..

Bok
____________
Free-DC

Stats for all projects

Custom Stats

Nothing But Idle Time

Joined: Sep 28 05
Posts: 209
ID: 1675
Credit: 139,545
RAC: 0
Message 2071 - Posted 2 Nov 2005 18:10:41 UTC - in response to Message ID 2069.
Last modified: 2 Nov 2005 18:17:35 UTC


...

:( Hmm, how long does it take?


My thought also. Is everyone asleep at the switchboard? What good is donating unused CPU cycles if they continue to remain unused? I'm being sarcastic of course, I have another project to churn. Still can't be good for the Rosetta project to leave the hosts in limbo. Rosetta, EAH thanks you!

____________

[B@H] Ray Profile
Avatar

Joined: Sep 20 05
Posts: 118
ID: 657
Credit: 100,251
RAC: 0
Message 2073 - Posted 2 Nov 2005 18:18:51 UTC
Last modified: 2 Nov 2005 18:21:31 UTC

I have 2 systems running BOINC 4.45.

One ran out and switched to CPDN. (usually runs Rosetta 75% of time, SETI and CPDN get smaller amounts). Was not watching to see if that was intermittent or just stopped getting Rosetta.

Other system usually runs CPDN most the time but has 17 Rosetta WU's ( half 1pvaA, the long running ones ), suspended CPDN on this one to run those Rosetta units. This system usually runs CPDN about 65%, most of the rest goes to Rosetta and a small amount to SETI. Sure wish that I could send some of these to the other system.

Will be good when the 1st can download again and get these two back to what thet usually do. But for now SETI seems to get the most benifit of the Rosetta not handing out work, the RAC there has dropped a lot with doing a lot more Rosetta and CPDN anyways.

Ray
____________


Pizza@Home Rays Place Rays place Forums

ColdRain~old
Avatar

Joined: Nov 1 05
Posts: 27
ID: 8112
Credit: 33,378
RAC: 0
Message 2074 - Posted 2 Nov 2005 18:25:04 UTC - in response to Message ID 2064.


Maybe there's a surge of new subscribers causing a serious backlog in work production.


I don't see that many new users today....

http://stats.free-dc.org/new/newusers.php?proj=rah

Bok

I'm new :P
Trying to switch-over my puters to R@H because FaD is closing down as of Dec 16 ...

Hence, a lot of FaD crunchers is looking for a new project. Most want a medical tainted project. Could be Rosetta, or Predictor or even Folding. Who'll tell :)
I made my choice ;)
____________

David E K Profile
Forum moderator
Project administrator
Project developer
Project scientist

Joined: Jul 1 05
Posts: 960
ID: 14
Credit: 2,338,909
RAC: 1,239
Message 2077 - Posted 2 Nov 2005 18:38:46 UTC

Looks like the database load is high. Working on a fix now.

AnRM
Avatar

Joined: Sep 18 05
Posts: 123
ID: 353
Credit: 1,355,486
RAC: 0
Message 2079 - Posted 2 Nov 2005 18:44:35 UTC - in response to Message ID 2071.
Last modified: 2 Nov 2005 18:49:38 UTC

[quote]
...

:( Hmm, how long does it take?


My thought also. Is everyone asleep ......."

>Exactly! The first indication that trouble was brewing was appromimately 2 AM their local time this morning. I'm sure if your system went down at that time it would take until the next morning for you to react as well. To my knowledge, this is the first time this problem has appeared at R@H and I hope they have it up soon.
____________

Rebirther Profile
Avatar

Joined: Sep 17 05
Posts: 116
ID: 152
Credit: 41,315
RAC: 0
Message 2080 - Posted 2 Nov 2005 18:54:43 UTC

I hope we have not reached the maximum capacity :o
____________

Nothing But Idle Time

Joined: Sep 28 05
Posts: 209
ID: 1675
Credit: 139,545
RAC: 0
Message 2082 - Posted 2 Nov 2005 19:12:28 UTC - in response to Message ID 2079.


...

:( Hmm, how long does it take?

My thought also. Is everyone asleep ......."

>Exactly!


Yes we're impatient. That's easy when isolated in our own little fiefdoms and rely solely on message boards to know what is going on, to get comfort in knowing that someone else is sharing the same experience or not, to get the attention of the project team. I would have expected at least an acknowledgment of the problem before now in spite of the difference in time zones. Don't those dedicated university types show up to work at 6 am? (chuckle, chuckle).
____________

Scribe
Avatar

Joined: Nov 2 05
Posts: 284
ID: 8368
Credit: 157,359
RAC: 0
Message 2083 - Posted 2 Nov 2005 19:13:46 UTC

Now I am getting "Project is down"!!!!!!
____________

kb7rzf Profile
Avatar

Joined: Oct 7 05
Posts: 16
ID: 3186
Credit: 35,427
RAC: 0
Message 2084 - Posted 2 Nov 2005 19:26:01 UTC

This is what I'm getting:

"11/2/2005 11:12:43 AM|rosetta@home|Message from server: Project is temporarily shut down for maintenance
11/2/2005 11:12:43 AM|rosetta@home|Project is down"

I still got plenty of WU's, so I'm not worried, and got Seti and Seti Beta to crunch too, so no big deal here. :-)

Jeremy

____________

David E K Profile
Forum moderator
Project administrator
Project developer
Project scientist

Joined: Jul 1 05
Posts: 960
ID: 14
Credit: 2,338,909
RAC: 1,239
Message 2086 - Posted 2 Nov 2005 19:29:40 UTC

I was doing a server restart. The project may be down intermittently (for just a few minutes each time) for the next hour or so as I purge the database.

David E K Profile
Forum moderator
Project administrator
Project developer
Project scientist

Joined: Jul 1 05
Posts: 960
ID: 14
Credit: 2,338,909
RAC: 1,239
Message 2087 - Posted 2 Nov 2005 19:31:34 UTC - in response to Message ID 2080.

I hope we have not reached the maximum capacity :o


Nothing to worry about. Just have to purge the database of old WUs.

AnRM
Avatar

Joined: Sep 18 05
Posts: 123
ID: 353
Credit: 1,355,486
RAC: 0
Message 2088 - Posted 2 Nov 2005 19:33:03 UTC - in response to Message ID 2082.
Last modified: 2 Nov 2005 19:41:42 UTC

....... I would have expected at least an acknowledgment of the problem before now in spite of the difference in time zones. Don't those dedicated university types show up to work at 6 am? (chuckle, chuckle).[/quote]

>Can't argue about the acknowlegment part. They usually do communicate very well, however, maybe they can't communicate and scratch their heads at the same time?:).....cheers, Rog.
P.S.... I see David is on the job.....thanks for the info!
____________

Divide Overflow

Joined: Sep 17 05
Posts: 82
ID: 129
Credit: 921,382
RAC: 0
Message 2089 - Posted 2 Nov 2005 19:52:54 UTC

I'm not getting any work either, but I'm also seeing a message that the project is down. The message also says that there is a server problem.

The main project page seems to show that the project is still up and running.
____________

David E K Profile
Forum moderator
Project administrator
Project developer
Project scientist

Joined: Jul 1 05
Posts: 960
ID: 14
Credit: 2,338,909
RAC: 1,239
Message 2091 - Posted 2 Nov 2005 20:00:54 UTC - in response to Message ID 2088.
Last modified: 3 Nov 2005 6:27:23 UTC

They usually do communicate very well, however, maybe they can't communicate and scratch their heads at the same time?


And sleep. -edit :)

AnRM
Avatar

Joined: Sep 18 05
Posts: 123
ID: 353
Credit: 1,355,486
RAC: 0
Message 2099 - Posted 2 Nov 2005 20:59:56 UTC - in response to Message ID 2091.
Last modified: 2 Nov 2005 21:08:23 UTC

They usually do communicate very well, however, maybe they can't communicate and scratch their heads at the same time?:)


And sleep.


>I hear you, David!.....Cheers, Rog.
P.S. I see you removed my 'smily' at the end of the quote so..... I put it back in:)
____________

Doug Worrall Profile
Avatar

Joined: Sep 19 05
Posts: 60
ID: 514
Credit: 58,445
RAC: 0
Message 2101 - Posted 2 Nov 2005 21:46:58 UTC - in response to Message ID 2089.

I'm not getting any work either, but I'm also seeing a message that the project is down. The message also says that there is a server problem.

The main project page seems to show that the project is still up and running.



Same here.
Last night it started,got a couple w/u today,last night got to end of 3hour w/u would not finish,Grrrr,Refuse to get 5.2.6.Enough is enough Boinc,get your Shxt
Together.3 Releases in 4 days.If I cannot get work.Toooooo Badddd!
Doug
____________

Webmaster Yoda Profile
Avatar

Joined: Sep 17 05
Posts: 161
ID: 75
Credit: 162,253
RAC: 0
Message 2110 - Posted 3 Nov 2005 1:51:00 UTC
Last modified: 3 Nov 2005 1:55:10 UTC

David (Kim)

I guess you may be sleeping now or at least not at work, but it looks like the problem still isn't solved. None of my PC's are getting any work from Rosetta. Maybe a short announcement on the home page would be appropriate if it's going to take a while to fix?


____________
*** Join BOINC@Australia today ***

Poohbear

Joined: Nov 2 05
Posts: 5
ID: 8438
Credit: 10,331
RAC: 0
Message 2111 - Posted 3 Nov 2005 3:19:52 UTC

Just joined today, got 1 WU and now I'm having the same problem. "No work from project." Get it fixed or the 1 WU I have done will be the only one.


____________

John McLeod VII
Avatar

Joined: Sep 17 05
Posts: 108
ID: 314
Credit: 190,419
RAC: 0
Message 2113 - Posted 3 Nov 2005 3:32:44 UTC - in response to Message ID 2111.

Just joined today, got 1 WU and now I'm having the same problem. "No work from project." Get it fixed or the 1 WU I have done will be the only one.


A bit tense are we? Just leave it attached with the other projects and let the client work it out.
____________


BOINC WIKI

kb7rzf Profile
Avatar

Joined: Oct 7 05
Posts: 16
ID: 3186
Credit: 35,427
RAC: 0
Message 2114 - Posted 3 Nov 2005 3:37:39 UTC

Was able to upload a WU just fine about 5pm PST, my computer has not requested any more work yet from Rosetta. But, as I said in my last post, i still have plenty of WU's for rosetta, since my machine doesn't crunch them that fast, and I also have other projects that take their fair share. Way I look at it, is it gets fixed when it gets fixed. The "big guys" know whats wrong, as David has already said, and when its fixed, I'm more than certain he will let us all know, or you will start getting more WU's. Pretty simple there I think. :-) Happy crunching.

Jeremy
____________

Fuzzy Hollynoodles Profile
Avatar

Joined: Oct 7 05
Posts: 234
ID: 3088
Credit: 15,020
RAC: 0
Message 2116 - Posted 3 Nov 2005 3:59:14 UTC - in response to Message ID 2113.
Last modified: 3 Nov 2005 3:59:49 UTC


A bit tense are we? Just leave it attached with the other projects and let the client work it out.


I'm thinking the same. I can't get any WU's from Seti either, but LHC have created WU's I'm crunching right now. And since LHC has 50 % of my share, I guess the debt is in it's favour. So let me crunch LHC WU's untill the people at Rosetta and Seti has sorted it out or my BOINC manager does.

That's the benefit of BOINC, one can crunch more than one project and never have an idle processor!


____________
"I'm trying to maintain a shred of dignity in this world." - Me

EclipseHA

Joined: Nov 3 05
Posts: 12
ID: 8667
Credit: 122,749
RAC: 0
Message 2120 - Posted 3 Nov 2005 4:38:35 UTC

I attached a Linux box and a Windoze box a few hours back. Per the stats, there seems to be plenty of work ready to send, but on both systems, I'm getting "no work available", now every 10-20 minutes.

Is there a problem with the servers, or maybe the network?

Something's not right, and it would be nice to understand if I should retry tomorrow, next week, next month, or next year.

Thanks in advance.
____________

Shaktai Profile
Avatar

Joined: Sep 21 05
Posts: 56
ID: 701
Credit: 575,419
RAC: 0
Message 2122 - Posted 3 Nov 2005 4:59:46 UTC - in response to Message ID 2120.


Is there a problem with the servers, or maybe the network?

Something's not right, and it would be nice to understand if I should retry tomorrow, next week, next month, or next year.

Thanks in advance.


Read David Kim's posts above. It looks like they are having to purge the database of completed work units. Might take a few hours I'd guess. Might want to give it another try tomorrow or a little later this evening. They are on Pacific (Washingtion state) time, so might still get it back up tonight though. I just fired up Einstein for the evening or until Rosetta comes back. Plenty of work, the database just choked on the unpurged finished work units.
____________

Team MacNN - The best Macintosh team ever.

Fuzzy Hollynoodles Profile
Avatar

Joined: Oct 7 05
Posts: 234
ID: 3088
Credit: 15,020
RAC: 0
Message 2126 - Posted 3 Nov 2005 6:53:35 UTC

I managed to download a WU, but now I get this message:

11/3/2005 7:41:45 AM|rosetta@home|Sending scheduler request to http://boinc.bakerlab.org/rosetta_cgi/cgi
11/3/2005 7:41:45 AM|rosetta@home|Reason: Requested by user
11/3/2005 7:41:45 AM|rosetta@home|Requesting 8640 seconds of new work
11/3/2005 7:42:00 AM|rosetta@home|Scheduler request to http://boinc.bakerlab.org/rosetta_cgi/cgi succeeded
11/3/2005 7:42:00 AM|rosetta@home|Message from server: Server has software problem
11/3/2005 7:42:00 AM|rosetta@home|Project is down

11/3/2005 7:42:05 AM|rosetta@home|Deferring communication with project for 59 minutes and 54 seconds



____________

"I'm trying to maintain a shred of dignity in this world." - Me

Webmaster Yoda Profile
Avatar

Joined: Sep 17 05
Posts: 161
ID: 75
Credit: 162,253
RAC: 0
Message 2127 - Posted 3 Nov 2005 7:33:08 UTC - in response to Message ID 2126.
Last modified: 3 Nov 2005 7:33:46 UTC

I managed to download a WU, but now I get this message:
11/3/2005 7:42:00 AM|rosetta@home|Project is down[/b]


See news on Rosetta home page.

____________
*** Join BOINC@Australia today ***

[BAT] tutta55 Profile
Avatar

Joined: Sep 16 05
Posts: 60
ID: 66
Credit: 99,832
RAC: 0
Message 2136 - Posted 3 Nov 2005 10:15:37 UTC

=-( 1 machine dry. 3 running on fumes. No more work consistently now. I know I can let them download from other projects, but it will be with reluctance.
____________

BOINC.BE: For Belgians who love the smell of glowing red cpu's in the morning
Tutta55's Lair

Scribe
Avatar

Joined: Nov 2 05
Posts: 284
ID: 8368
Credit: 157,359
RAC: 0
Message 2138 - Posted 3 Nov 2005 11:52:22 UTC - in response to Message ID 2127.

I managed to download a WU, but now I get this message:
11/3/2005 7:42:00 AM|rosetta@home|Project is down[/b]


See news on Rosetta home page.


The news on the Home page tells you nothing....can you print it here in a post what you mean please?
____________

Webmaster Yoda Profile
Avatar

Joined: Sep 17 05
Posts: 161
ID: 75
Credit: 162,253
RAC: 0
Message 2139 - Posted 3 Nov 2005 12:19:02 UTC - in response to Message ID 2138.
Last modified: 3 Nov 2005 12:20:02 UTC

The news on the Home page tells you nothing....can you print it here in a post what you mean please?


November 2, 2005
We are experiencing problems with our database and as a result the project will be down intermittently.

Also see David Kim's posts in this thread
____________
*** Join BOINC@Australia today ***

Scribe
Avatar

Joined: Nov 2 05
Posts: 284
ID: 8368
Credit: 157,359
RAC: 0
Message 2140 - Posted 3 Nov 2005 12:21:30 UTC

That was a long time ago and the problem cannot be defined as 'intermittent'.

We could do with an update and forecast of a fix please.
____________

AnRM
Avatar

Joined: Sep 18 05
Posts: 123
ID: 353
Credit: 1,355,486
RAC: 0
Message 2142 - Posted 3 Nov 2005 13:11:27 UTC

Hope it's not a virus problem.....I seem to recall that PP@H had a virus attack at one point....??
____________

rbpeake Profile

Joined: Sep 25 05
Posts: 168
ID: 1036
Credit: 246,593
RAC: 0
Message 2146 - Posted 3 Nov 2005 13:31:05 UTC - in response to Message ID 2140.

That was a long time ago and the problem cannot be defined as 'intermittent'.

We could do with an update and forecast of a fix please.


They probably are asleep now, it being Pacific time. So, as they say, "patience is a virtue". :)

____________
Regards,
Bob P.

rbpeake Profile

Joined: Sep 25 05
Posts: 168
ID: 1036
Credit: 246,593
RAC: 0
Message 2147 - Posted 3 Nov 2005 13:32:16 UTC - in response to Message ID 2140.
Last modified: 3 Nov 2005 13:36:06 UTC

Accidental repeat of message, sorry! :(
____________
Regards,
Bob P.

Andrew

Joined: Sep 19 05
Posts: 162
ID: 527
Credit: 105,512
RAC: 0
Message 2153 - Posted 3 Nov 2005 13:58:34 UTC

Just an update...

It's 9am EST here, and I'm still getting "No work..." message.
____________

Poohbear

Joined: Nov 2 05
Posts: 5
ID: 8438
Credit: 10,331
RAC: 0
Message 2161 - Posted 3 Nov 2005 16:26:49 UTC - in response to Message ID 2113.

Just joined today, got 1 WU and now I'm having the same problem. "No work from project." Get it fixed or the 1 WU I have done will be the only one.


A bit tense are we? Just leave it attached with the other projects and let the client work it out.



Yes, I was a bit tense at the time. It wasn't the only thing I was having a problem with at that time. Other problem gone, and I am now waiting patiently until they get whatever is wrong fixed and my computer can resume work.
In the meantime I have five other projects going.

____________

rbpeake Profile

Joined: Sep 25 05
Posts: 168
ID: 1036
Credit: 246,593
RAC: 0
Message 2164 - Posted 3 Nov 2005 16:38:42 UTC - in response to Message ID 2161.

Yes, I was a bit tense at the time. It wasn't the only thing I was having a problem with at that time.


I know, we all have those kind of days! :(
____________
Regards,
Bob P.

Andrew

Joined: Sep 19 05
Posts: 162
ID: 527
Credit: 105,512
RAC: 0
Message 2165 - Posted 3 Nov 2005 16:39:46 UTC - in response to Message ID 2161.

Yes, I was a bit tense at the time. It wasn't the only thing I was having a problem with at that time. Other problem gone, and I am now waiting patiently until they get whatever is wrong fixed and my computer can resume work.
In the meantime I have five other projects going.


Glad to hear that Poohbear.

And that's what Boinc is all about... If one project isn't producing work at least your other projects will keep you computer busy :)
____________

[B@H] Ray Profile
Avatar

Joined: Sep 20 05
Posts: 118
ID: 657
Credit: 100,251
RAC: 0
Message 2173 - Posted 3 Nov 2005 18:03:43 UTC

Got plunty of work last night but not today. Still have some on the system that runs mostely Rosetta, but LHC has work now so I set that to 50% until Rosetta is working correct again. LHC and CPDN will keep that system busy. Only 4 waiting to be done for Rosetta there.

Still have plunty of Rosetta on the other system that usually runs CPDN most of the time but going down, should last another 4 to 5 days on that system.
____________


Pizza@Home Rays Place Rays place Forums

David E K Profile
Forum moderator
Project administrator
Project developer
Project scientist

Joined: Jul 1 05
Posts: 960
ID: 14
Credit: 2,338,909
RAC: 1,239
Message 2178 - Posted 3 Nov 2005 18:33:14 UTC
Last modified: 3 Nov 2005 18:33:27 UTC

Here is an update. The database is still being purged. It has gone through over 500,000 WU's and results but there is close to another 500,000 to go. The purging and archiving is also burdening the database server of course. I ask for peoples' patience as it is taking a while to write out the data into archive files. I do not want to just delete the WU records without archiving them so I can still link the structure predictions (output data) with who and what computer did them to give everyone feedback and credit.

We'll be sure to add new hardware in the future if necessary. I'd like to acknowledge our hard working servers with an image :)

Bok Profile

Joined: Sep 17 05
Posts: 54
ID: 71
Credit: 3,411,417
RAC: 0
Message 2179 - Posted 3 Nov 2005 18:40:34 UTC

David,

thanks for that. I for one will be patient :)

Seems to be a long time to do the purging though? What is the bottleneck do you know ? I work with large volume DB's, mostly Oracle & DB2 mind, and I know this can't be a fair comparison, but I've run applications which unload millions of transactions in a matter of minutes.

I presume this is mysql or PostGres ? I don't want to serve up the obvious but are the tables optimized, is it IO writing to disk etc etc.

Be interesting (at least to me) to know :)

Thanks again!

Bok


____________
Free-DC

Stats for all projects

Custom Stats

David E K Profile
Forum moderator
Project administrator
Project developer
Project scientist

Joined: Jul 1 05
Posts: 960
ID: 14
Credit: 2,338,909
RAC: 1,239
Message 2180 - Posted 3 Nov 2005 18:48:19 UTC

Hi Bok,

We use mysql and I haven't optimized the set up yet but I'm glad you mentioned it because that is one of the first things on my list to do. It is also writing a bunch of data to disk to generate archive files. I'm also printing out debug statements since this is the first time using the purge utility and I want to monitor it's progress.

Bok Profile

Joined: Sep 17 05
Posts: 54
ID: 71
Credit: 3,411,417
RAC: 0
Message 2181 - Posted 3 Nov 2005 18:55:17 UTC

It can be quite amazing how much mysql improves with just running

optimize table xxxxxx

frequently. Though it's only if there is regular fragmentation of the data.

I ran into fairly recently on my stats system, adding that prior to every run fixed a lot of issues with speed.

Bok
____________
Free-DC

Stats for all projects

Custom Stats

David E K Profile
Forum moderator
Project administrator
Project developer
Project scientist

Joined: Jul 1 05
Posts: 960
ID: 14
Credit: 2,338,909
RAC: 1,239
Message 2184 - Posted 3 Nov 2005 19:05:56 UTC

Great!, thanks Bok, I'll look into it.

Thomas J Leary DDS Profile

Joined: Oct 23 05
Posts: 1
ID: 6314
Credit: 17,582
RAC: 0
Message 2201 - Posted 3 Nov 2005 22:30:23 UTC

I have been on Seti for years but cannot get BOINC to work. Please help.
____________

rbpeake Profile

Joined: Sep 25 05
Posts: 168
ID: 1036
Credit: 246,593
RAC: 0
Message 2206 - Posted 3 Nov 2005 22:46:13 UTC - in response to Message ID 2201.

I have been on Seti for years but cannot get BOINC to work. Please help.


The Rosetta server is temporarily down for data purging. Please see this thread

http://boinc.bakerlab.org/rosetta/forum_thread.php?id=272

____________
Regards,
Bob P.

David E K Profile
Forum moderator
Project administrator
Project developer
Project scientist

Joined: Jul 1 05
Posts: 960
ID: 14
Credit: 2,338,909
RAC: 1,239
Message 2217 - Posted 4 Nov 2005 1:29:00 UTC

Please let me know if anyone is still experiencing problems getting work. The database load is down but I am going to continue purging tonight.

Divide Overflow

Joined: Sep 17 05
Posts: 82
ID: 129
Credit: 921,382
RAC: 0
Message 2218 - Posted 4 Nov 2005 2:04:49 UTC - in response to Message ID 2217.
Last modified: 4 Nov 2005 2:06:51 UTC

Please let me know if anyone is still experiencing problems getting work. The database load is down but I am going to continue purging tonight.

I'm not able to get any work right now, nor am I able to report a handful of my last results.
EDIT: I see that the database is down, so you must be working on that purge at the moment. Thanks for keeping the server status page updated!
____________

Webmaster Yoda Profile
Avatar

Joined: Sep 17 05
Posts: 161
ID: 75
Credit: 162,253
RAC: 0
Message 2220 - Posted 4 Nov 2005 2:16:32 UTC - in response to Message ID 2217.
Last modified: 4 Nov 2005 2:18:49 UTC

Please let me know if anyone is still experiencing problems getting work. The database load is down but I am going to continue purging tonight.


Most of my computers now have some Rosetta work units but I still get intermittent problems, like (times are GMT+8):

4/11/2005 10:12:25 AM No schedulers responded
and
4/11/2005 10:13:53 AM No work from project
and
4/11/2005 10:18:34 AM Project is down


____________
*** Join BOINC@Australia today ***

David E K Profile
Forum moderator
Project administrator
Project developer
Project scientist

Joined: Jul 1 05
Posts: 960
ID: 14
Credit: 2,338,909
RAC: 1,239
Message 2222 - Posted 4 Nov 2005 2:18:30 UTC

Sorry, I'm still working on the database server. But hope to finish within the next hour.

David E K Profile
Forum moderator
Project administrator
Project developer
Project scientist

Joined: Jul 1 05
Posts: 960
ID: 14
Credit: 2,338,909
RAC: 1,239
Message 2226 - Posted 4 Nov 2005 4:15:31 UTC

I am purging the database more overnight. The load on the database has reduced though so let me know if you are still experiencing problems getting work.

Carlos

Joined: Oct 29 05
Posts: 1
ID: 7401
Credit: 189,283
RAC: 20
Message 2227 - Posted 4 Nov 2005 4:59:52 UTC - in response to Message ID 2226.

I am purging the database more overnight. The load on the database has reduced though so let me know if you are still experiencing problems getting work.


Time is Central Time Zone (USA)

11/3/2005 10:55:17 PM|rosetta@home|Fetching master file
11/3/2005 10:55:22 PM|rosetta@home|Master file download succeeded
11/3/2005 10:55:27 PM|rosetta@home|Sending scheduler request to http://boinc.bakerlab.org/rosetta_cgi/cgi
11/3/2005 10:55:27 PM|rosetta@home|Reason: To fetch work
11/3/2005 10:55:27 PM|rosetta@home|Requesting 43200 seconds of new work
11/3/2005 10:55:32 PM|rosetta@home|Scheduler request to http://boinc.bakerlab.org/rosetta_cgi/cgi succeeded
11/3/2005 10:55:32 PM|rosetta@home|No work from project
11/3/2005 10:55:37 PM|rosetta@home|Deferring communication with project for 54 seconds
11/3/2005 10:56:33 PM|rosetta@home|Sending scheduler request to http://boinc.bakerlab.org/rosetta_cgi/cgi
11/3/2005 10:56:33 PM|rosetta@home|Reason: To fetch work
11/3/2005 10:56:33 PM|rosetta@home|Requesting 43200 seconds of new work
11/3/2005 10:56:38 PM|rosetta@home|Scheduler request to http://boinc.bakerlab.org/rosetta_cgi/cgi succeeded
11/3/2005 10:56:38 PM|rosetta@home|No work from project
11/3/2005 10:56:43 PM|rosetta@home|Deferring communication with project for 54 seconds
11/3/2005 10:57:39 PM|rosetta@home|Sending scheduler request to http://boinc.bakerlab.org/rosetta_cgi/cgi
11/3/2005 10:57:39 PM|rosetta@home|Reason: To fetch work
11/3/2005 10:57:39 PM|rosetta@home|Requesting 43200 seconds of new work
11/3/2005 10:57:44 PM|rosetta@home|Scheduler request to http://boinc.bakerlab.org/rosetta_cgi/cgi succeeded
11/3/2005 10:57:44 PM|rosetta@home|No work from project
11/3/2005 10:57:49 PM|rosetta@home|Deferring communication with project for 54 seconds

____________

stephan_t Profile
Avatar

Joined: Oct 20 05
Posts: 129
ID: 5636
Credit: 35,464
RAC: 0
Message 2232 - Posted 4 Nov 2005 6:42:39 UTC

As everyone probably noticed, it seems to be working again now (7am UTC).
____________
Team CFVault.com
http://www.cfvault.com

Scribe
Avatar

Joined: Nov 2 05
Posts: 284
ID: 8368
Credit: 157,359
RAC: 0
Message 2233 - Posted 4 Nov 2005 7:06:44 UTC

Yup, WUs coming down!
____________

Webmaster Yoda Profile
Avatar

Joined: Sep 17 05
Posts: 161
ID: 75
Credit: 162,253
RAC: 0
Message 2244 - Posted 4 Nov 2005 13:14:17 UTC - in response to Message ID 2226.
Last modified: 4 Nov 2005 13:14:55 UTC

I am purging the database more overnight. The load on the database has reduced though so let me know if you are still experiencing problems getting work.


Plenty of work now, thanks David.

I think the purge is now so far ahead that it purges results the minute they have come in and validated (and credit has been granted). All I see now on all my hosts is work units not yet completed (even minutes after they have been reported).

Hopefully these results aren't lost to the science?
____________
*** Join BOINC@Australia today ***

Doug Worrall Profile
Avatar

Joined: Sep 19 05
Posts: 60
ID: 514
Credit: 58,445
RAC: 0
Message 2245 - Posted 4 Nov 2005 13:18:16 UTC - in response to Message ID 2244.

I am purging the database more overnight. The load on the database has reduced though so let me know if you are still experiencing problems getting work.


Plenty of work now, thanks David.

I think the purge is now so far ahead that it purges results the minute they have come in and validated (and credit has been granted). All I see now on all my hosts is work units not yet completed (even minutes after they have been reported).

Hopefully these results aren't lost to the science?



Same here,
Finish w/u gets uploaded given credit "Dissapears" Magic,into cyber space,I hope not?
Thanks
Doug
____________

Honza

Joined: Sep 18 05
Posts: 48
ID: 434
Credit: 173,517
RAC: 0
Message 2272 - Posted 4 Nov 2005 16:34:46 UTC - in response to Message ID 2226.

I am purging the database more overnight. The load on the database has REDUCED though so let me know if you are still experiencing problems getting work.


But the homepage says:
Our database is being purged and old workunits and results are being archived. As a result, the load on the server is HIGH and work flow has been reduced.

I think the homepage should state "load on the server is HIGH" other way round -> "As a result, the load on the server is LOWER/REDUCED/etc."

My English is not very good but this put me in confusion on first reading :-)
____________

David E K Profile
Forum moderator
Project administrator
Project developer
Project scientist

Joined: Jul 1 05
Posts: 960
ID: 14
Credit: 2,338,909
RAC: 1,239
Message 2276 - Posted 4 Nov 2005 17:50:37 UTC

The database purge has finished. Completed WU's and results should now be kept in the database for 1 week before they get purged and archived so users can view their results. WU's and results (and the credits granted) that have been purged and archived have not been lost. The credits for each user is totaled and kept in the database. The result output (structures and rmsd vs energy) can and will still be linked to users and computers for the results that have been archived. Thanks for everyone's patience!!

Charles Dennett Profile
Avatar

Joined: Sep 27 05
Posts: 92
ID: 1447
Credit: 1,082,242
RAC: 0
Message 2277 - Posted 4 Nov 2005 17:56:56 UTC - in response to Message ID 2276.

The database purge has finished. Completed WU's and results should now be kept in the database for 1 week before they get purged and archived so users can view their results. WU's and results (and the credits granted) that have been purged and archived have not been lost. The credits for each user is totaled and kept in the database. The result output (structures and rmsd vs energy) can and will still be linked to users and computers for the results that have been archived. Thanks for everyone's patience!!


Good work, David! Enjoy the weekend!


____________

Nothing But Idle Time

Joined: Sep 28 05
Posts: 209
ID: 1675
Credit: 139,545
RAC: 0
Message 2292 - Posted 4 Nov 2005 20:52:28 UTC - in response to Message ID 2276.

The database purge has finished. Completed WU's and results should now be kept in the database for 1 week before they get purged and archived so users can view their results. WU's and results (and the credits granted) that have been purged and archived have not been lost. The credits for each user is totaled and kept in the database. The result output (structures and rmsd vs energy) can and will still be linked to users and computers for the results that have been archived. Thanks for everyone's patience!!


Your message is welcomed but I feel uneasy because I would like to know if the real reason for the downtime has been permanently remedied. For the period preceding the downtime I observed what seemed to be a large number of Work Units that ran perhaps 1/4 of the usual time (tests?). If the effect was to quadruple the number of WU processed in a given time period leading to a data base overflow, could we experience this phenomenon again or has it been resolved?
____________

Doug Worrall Profile
Avatar

Joined: Sep 19 05
Posts: 60
ID: 514
Credit: 58,445
RAC: 0
Message 2293 - Posted 4 Nov 2005 21:00:51 UTC - in response to Message ID 2276.

The database purge has finished. Completed WU's and results should now be kept in the database for 1 week before they get purged and archived so users can view their results. WU's and results (and the credits granted) that have been purged and archived have not been lost. The credits for each user is totaled and kept in the database. The result output (structures and rmsd vs energy) can and will still be linked to users and computers for the results that have been archived. Thanks for everyone's patience!!


Dave,
Very good way to do it,
Database,was wondering how the changes were affecting the "Results" tab
and now understand.Thanks for the help explaining this Process.
Aslso have a great Weekend
Doug Worrall
____________

rbpeake Profile

Joined: Sep 25 05
Posts: 168
ID: 1036
Credit: 246,593
RAC: 0
Message 2294 - Posted 4 Nov 2005 21:07:53 UTC - in response to Message ID 2292.

Your message is welcomed but I feel uneasy because I would like to know if the real reason for the downtime has been permanently remedied. For the period preceding the downtime I observed what seemed to be a large number of Work Units that ran perhaps 1/4 of the usual time (tests?). If the effect was to quadruple the number of WU processed in a given time period leading to a data base overflow, could we experience this phenomenon again or has it been resolved?


The database I believe had not been purged since the project started on BOINC, which included several weeks of short work units. Therefore if I may read between the lines, a weekly purge should remedy the problem, and if not I suppose the project team would make any necessary adjustments to prevent a similar "outage" in the future.

This is what happened with Einstein@home, and after the first "emergency" purge the project has been running very smoothly ever since. My two cents. :)

____________
Regards,
Bob P.

David E K Profile
Forum moderator
Project administrator
Project developer
Project scientist

Joined: Jul 1 05
Posts: 960
ID: 14
Credit: 2,338,909
RAC: 1,239
Message 2296 - Posted 4 Nov 2005 21:33:04 UTC

Here are the steps we are or will be taking in the near future:

1. Weekly database purge and archive to keep the wu and results tables at managable sizes.
2. Adding more memory to the database server. Memory to fill out our server will be ordered today which will at least double the current size to 8gigs (we still have to check how many slots are available but an excess will be ordered regardless).
3. We are currently looking into getting a very beafy database server.
4. The size of work units will be increased (note however that they will still depend on factors like the size of the protein and what kind of prediction is being run).

We will also get any necessary hardware to handle increased demand.

Bok Profile

Joined: Sep 17 05
Posts: 54
ID: 71
Credit: 3,411,417
RAC: 0
Message 2297 - Posted 4 Nov 2005 22:13:52 UTC

Sounds good David however I presume you are also going to tune mysql? Is it actually using all that memory for instance?

Bok
____________
Free-DC

Stats for all projects

Custom Stats

David E K Profile
Forum moderator
Project administrator
Project developer
Project scientist

Joined: Jul 1 05
Posts: 960
ID: 14
Credit: 2,338,909
RAC: 1,239
Message 2301 - Posted 4 Nov 2005 23:02:30 UTC

Hi Bok,

Sure will. Can't let the memory just sit there unused.

Andrew

Joined: Sep 19 05
Posts: 162
ID: 527
Credit: 105,512
RAC: 0
Message 2351 - Posted 5 Nov 2005 16:43:48 UTC

I'm getting "Now work..." msg again. :(
____________

John

Joined: Nov 5 05
Posts: 1
ID: 9556
Credit: 3,835
RAC: 0
Message 2378 - Posted 5 Nov 2005 22:06:07 UTC

After 5 failed attempts I finally got work. I'm a newbie and the signup process was a snap. I like the new BOINC 5.2.6. Email and password, now that's the way to go.

Divide Overflow

Joined: Sep 17 05
Posts: 82
ID: 129
Credit: 921,382
RAC: 0
Message 2632 - Posted 8 Nov 2005 16:05:21 UTC

David,

Do you have any plans yet for a regular schedule for maintenance (backups, purging old results, etc.) or is this still being assessed?

____________

David E K Profile
Forum moderator
Project administrator
Project developer
Project scientist

Joined: Jul 1 05
Posts: 960
ID: 14
Credit: 2,338,909
RAC: 1,239
Message 2644 - Posted 8 Nov 2005 17:48:46 UTC

The state of our servers and expansion for the future is still being assessed. Two dual opterons are going to be ordered today and we are considering possibly getting a 16gig server. Once we have our architecture set and stable, we will set up a regular maintenance schedule for backups, purging, cleanup..etc. Our goal is to have a system that may be capable of handling Seti-like usage.

Message boards : Number crunching : No work from project with 89,369 queued


Home | Join | About | Participants | Community | Statistics

Copyright © 2017 University of Washington

Last Modified: 10 Nov 2010 1:51:38 UTC
Back to top ^