Not getting any new tasks

Message boards : Number crunching : Not getting any new tasks

Jochen
Joined: 6 Jun 06
Posts: 133
Credit: 3,847,433
RAC: 0
Message 62375 - Posted: 22 Jul 2009, 17:07:25 UTC - in response to Message 62374.  
Last modified: 22 Jul 2009, 17:11:05 UTC

That could be helped, maybe, by resetting the RDCF to 1. Obviously the BOINC manager thinks those 20 units will be enough to meet your needs; that could be because the RDCF is seriously out of whack.


Thanks for your reply. I have checked the DCF in the client_state.xml. It was 0.96xxxx on that machine. AFAIR everything equal to or lower than 1 is fine... I have changed it to 1.000000, but no change so far. I even detached from LHC on that machine yesterday. Rosetta is now the only project.
On the machine that is fine, the DCF is even lower (0.8xxxxx).
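
For anyone who would rather check the DCF without opening the file by hand, here is a minimal sketch in Python. It assumes the value is stored per project in a `<duration_correction_factor>` element inside each `<project>` block of client_state.xml, next to `<master_url>`; the sample XML and the `read_dcf` helper are made up for illustration, so compare against your own file before relying on it.

```python
import xml.etree.ElementTree as ET


def read_dcf(xml_text):
    """Return {master_url: duration_correction_factor} for each project found.

    Assumption: each <project> block carries <master_url> and
    <duration_correction_factor> child elements.
    """
    root = ET.fromstring(xml_text)
    factors = {}
    for project in root.iter("project"):
        url = project.findtext("master_url", default="(unknown)")
        dcf = project.findtext("duration_correction_factor")
        if dcf is not None:
            factors[url.strip()] = float(dcf)
    return factors


# Hypothetical, heavily trimmed stand-in for a real client_state.xml.
SAMPLE = """<client_state>
  <project>
    <master_url>http://example.invalid/project/</master_url>
    <duration_correction_factor>0.960000</duration_correction_factor>
  </project>
</client_state>"""

if __name__ == "__main__":
    for url, dcf in read_dcf(SAMPLE).items():
        print(f"{url}: DCF = {dcf:.6f}")
```

To try it on a real machine, read the contents of client_state.xml into a string and pass that to `read_dcf` instead of `SAMPLE`. Stop the BOINC client before editing the file by hand, since the client may overwrite it while running.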

The Q9650 and the i7 920 are less than 2 months old, and to be honest, I had the same problems on the two machines I had before. I can't tell when this started and actually I didn't really care, because both machines were due to be replaced within the next few weeks when I noticed it.

Joe

P.S.: In my desperation, I just increased the cache from 3 to 5 days on that one machine - and now it is downloading WUs. 47 so far. At least I now have a cache for one day.

EDIT: Oh, come on, now all active WUs have changed to high priority. I give in. I'm done with BOINC.
ID: 62375 · Rating: 0
mikey
Joined: 5 Jan 06
Posts: 1726
Credit: 5,947,552
RAC: 1,416
Message 62421 - Posted: 25 Jul 2009, 11:51:28 UTC - in response to Message 62375.  
Last modified: 25 Jul 2009, 11:55:21 UTC

Your problem is the version of BOINC you are using! On the newer machines you are using BOINC 6.6.36; that version, along with all 6.6.x versions, has some SERIOUS scheduling problems! I recommend you drop to version 6.4.7 or something earlier. On one machine you are using version 5.4.9 and it seems to be working just fine. You can get all versions of BOINC here: http://boincdl.ssl.berkeley.edu/dl/
You can also go to the home page and, under "Download BOINC", click on the "All Versions" link; 6.4.7 is there too.
I have been crunching for BOINC since the early Beta days. There are good versions and not quite so good versions, and there probably always will be. Find one you like and stick with it until you find something better. Distributed computing is a worthwhile thing to do, and if you can find a project that you can believe in, then you have a keeper! IMHO Rosie is such a project!!
ID: 62421 · Rating: 0
Greg_BE
Joined: 30 May 06
Posts: 4875
Credit: 4,472,466
RAC: 313
Message 62430 - Posted: 25 Jul 2009, 23:00:19 UTC - in response to Message 62421.  

6.4.7 is a non-GPU program. I think it's 6.5.2 or something in that area that allows for GPU processors.
ID: 62430 · Rating: 0
mikey
Joined: 5 Jan 06
Posts: 1726
Credit: 5,947,552
RAC: 1,416
Message 62451 - Posted: 26 Jul 2009, 11:02:59 UTC - in response to Message 62430.  
Last modified: 26 Jul 2009, 11:04:01 UTC

6.4.7 is a non-GPU program. I think it's 6.5.2 or something in that area that allows for GPU processors.


You are correct... Paul Buck says 6.5.0 is the best for GPU crunching. I do not crunch thru BOINC, so I can't vouch for that though.
ID: 62451 · Rating: 0
brilor
Joined: 31 Mar 08
Posts: 8
Credit: 124,013
RAC: 0
Message 62521 - Posted: 27 Jul 2009, 20:38:48 UTC

Haven't been able to download any new tasks for 2 days. Messages say "download failed". This is Mac OS X 10.5.7 and BOINC 6.6.36 running *only* Rosetta. I tried deleting all of BOINC (data files, app, prefs) from the machine and doing a new install, but now I receive "Scheduler request completed: got 0 new tasks". Prefs are defaults upon install with no limits. Is there a fix on my end?

Thanks....Brian S.
ID: 62521 · Rating: 0
Murasaki
Joined: 20 Apr 06
Posts: 303
Credit: 511,418
RAC: 0
Message 62523 - Posted: 27 Jul 2009, 21:25:55 UTC - in response to Message 62521.  

Haven't been able to download any new tasks for 2 days. Messages say "download failed". This is Mac OS X 10.5.7 and BOINC 6.6.36 running *only* Rosetta. I tried deleting all of BOINC (data files, app, prefs) from the machine and doing a new install, but now I receive "Scheduler request completed: got 0 new tasks". Prefs are defaults upon install with no limits. Is there a fix on my end?

Thanks....Brian S.


There was a major bug on the server side, so nothing wrong with your computer. It looks like it has been resolved now, but almost every participant is trying to download new work at the same time. The servers are busy handing out work as fast as they can but it may be a few hours before everything is back to normal.
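
As an aside on why recovery takes hours when everyone asks for work at once: a common way for clients to spread their retries is randomized exponential backoff. The sketch below is only an illustration of that general technique, not the BOINC client's actual retry logic; the function name `backoff_delay` and the 60-second base and 4-hour cap are assumptions picked for the example.

```python
import random


def backoff_delay(attempt, base=60.0, cap=4 * 3600.0, rng=random):
    """Seconds to wait before retry number `attempt` (0-based).

    The upper bound doubles with each failed attempt up to `cap`, and the
    actual wait is drawn uniformly below that bound so that many clients
    which failed at the same moment do not all retry at the same moment.
    """
    ceiling = min(cap, base * (2 ** attempt))
    return rng.uniform(0, ceiling)


if __name__ == "__main__":
    for attempt in range(6):
        limit = min(4 * 3600.0, 60.0 * 2 ** attempt)
        wait = backoff_delay(attempt)
        print(f"retry {attempt}: wait up to {limit:.0f}s, picked {wait:.0f}s")
```

The random draw matters more than the doubling here: without it, every host that failed during the outage would come back on the same schedule and hit the scheduler in waves.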
ID: 62523 · Rating: 0
Gen_X_Accord
Joined: 5 Jun 06
Posts: 154
Credit: 279,018
RAC: 0
Message 62527 - Posted: 27 Jul 2009, 23:13:52 UTC

I didn't just run out of work for Rosetta, but for Folding@home's GPU client as well. I am running the regular Folding client until I can get something for Rosetta. I'm watching those teraflops drop on the main page.
ID: 62527 · Rating: 0
Michael G.R.
Joined: 11 Nov 05
Posts: 264
Credit: 11,246,222
RAC: 1,210
Message 62541 - Posted: 28 Jul 2009, 4:53:29 UTC

I've been running some World Community Grid projects in the meantime. Don't let those CPU cycles go to waste!
ID: 62541 · Rating: 0
Heidi1
Joined: 11 Aug 07
Posts: 49
Credit: 1,760,864
RAC: 54
Message 62542 - Posted: 28 Jul 2009, 5:25:13 UTC

I was at least able to finally upload my finished WUs. My downloading bonked out because that bad database file mentioned on the home page had been queued up for me, so it messed up my downloading a little bit. It's now been cleared out of my crunching list.

Now, I can't get work again. The messages list says there's no work from server, but the RAH home page and server status are both saying everything's fine. I guess it's being overloaded again like last week.
ID: 62542 · Rating: 0
Gen_X_Accord
Joined: 5 Jun 06
Posts: 154
Credit: 279,018
RAC: 0
Message 62543 - Posted: 28 Jul 2009, 6:26:53 UTC

I now have one Rosetta task running on one core, and am running Folding on another core. That's different. Just waiting for more Rosy.
ID: 62543 · Rating: 0
Sid Celery
Joined: 11 Feb 08
Posts: 1471
Credit: 27,582,793
RAC: 25,639
Message 62555 - Posted: 28 Jul 2009, 12:19:59 UTC
Last modified: 28 Jul 2009, 12:34:20 UTC

6 new tasks started coming down here about 50 minutes ago. No download errors. One job completed 99 decoys in 30 minutes for some reason and has gone back already. 9 more tasks coming down now. Hopefully all goes well.

It must be about 4am Seattle time. Well done over there.

Edit: Validate error :(
ID: 62555 · Rating: 0
Emigdio Lopez Laburu
Joined: 25 Feb 06
Posts: 61
Credit: 40,240,061
RAC: 0
Message 62562 - Posted: 28 Jul 2009, 17:07:27 UTC

I still have problems receiving new tasks... and my WU buffer is running out... :(

I understand that someone on the R@H team is still working on this issue.

At this moment there are 1,365,000 WUs ready to send, and the number is increasing rapidly because it seems that the transfer process has stopped.

In addition... I would like to see more news from the people responsible for Rosetta, updating their "customers" about this issue... if this is not very complicated...
ID: 62562 · Rating: 0
David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Joined: 1 Jul 05
Posts: 1018
Credit: 4,003,213
RAC: 0
Message 62563 - Posted: 28 Jul 2009, 18:19:52 UTC

Our scheduler server is getting hammered right now with work requests. It will take some time for it to settle down, unfortunately. Hang in there and you should start getting jobs.
ID: 62563 · Rating: 0
Sparrow
Joined: 15 Aug 06
Posts: 4
Credit: 352,722
RAC: 0
Message 62568 - Posted: 28 Jul 2009, 20:55:27 UTC

Finally I was able to download a WU again! There were a lot of failed attempts this evening again, but some minutes ago it worked. Only one single WU was downloaded, but it is a start :-)
ID: 62568 · Rating: 0
Emigdio Lopez Laburu
Joined: 25 Feb 06
Posts: 61
Credit: 40,240,061
RAC: 0
Message 62588 - Posted: 29 Jul 2009, 8:40:49 UTC

Yes.

For my part, it seems that things are back to working normally.

Thanks, team.
ID: 62588 · Rating: 0
Gen_X_Accord
Joined: 5 Jun 06
Posts: 154
Credit: 279,018
RAC: 0
Message 62594 - Posted: 29 Jul 2009, 11:57:54 UTC

I have a few work units. They seem to be hit and miss. I am also having trouble uploading finished work units. Rosetta is having MAJOR problems, folks. Looks like WCG is going to be getting some attention from me in the meantime. I'll have at least one core crunching Rosy, but the other will probably be WCG until Rosy gets its problems fixed, and maybe even after Rosy gets its problems fixed. And the Folding GPU client is working for me again. So since I have the potential to run 3 different work units or projects at once, I'm going to spread the computing potential around instead of it all being focused on Rosy. (Which I cannot utilize my GPU for anyway.)
ID: 62594 · Rating: 0
Sid Celery
Joined: 11 Feb 08
Posts: 1471
Credit: 27,582,793
RAC: 25,639
Message 62596 - Posted: 29 Jul 2009, 14:35:28 UTC - in response to Message 62563.  

Our scheduler server is getting hammered right now with work requests. It will take some time for it to settle down unfortunately. Hang in there and you should start getting jobs.

Can I ask, David, was this a server issue? Is the server still getting a lot of hits? After 24+ hours of work being available there are still a lot of errors/failures/retries in uploads and downloads (it does all go through eventually). When the previous problems arose there were no errors reported on the Server Status page.

What I'm really asking is if the problem has been resolved or if it's possible we could see continuing issues at unexpected times. Things still don't seem quite right, even now.
ID: 62596 · Rating: 0
Emigdio Lopez Laburu
Joined: 25 Feb 06
Posts: 61
Credit: 40,240,061
RAC: 0
Message 62597 - Posted: 29 Jul 2009, 17:12:11 UTC

The scheduler is not working again. It seems that the problems are not yet solved.
ID: 62597 · Rating: 0
HW&JC
Joined: 2 May 08
Posts: 20
Credit: 6,179,041
RAC: 1,739
Message 62598 - Posted: 29 Jul 2009, 17:21:37 UTC - in response to Message 62596.  

What I'm really asking is if the problem has been resolved or if it's possible we could see continuing issues at unexpected times. Things still don't seem quite right, even now.

Either you're a fortune teller or a curse. Servers disabled.

Hopefully it's to put in a more permanent fix.
ID: 62598 · Rating: 0
joseps
Joined: 25 Jun 06
Posts: 72
Credit: 8,173,820
RAC: 0
Message 62608 - Posted: 30 Jul 2009, 0:20:09 UTC - in response to Message 62597.  

The scheduler is not working again. It seems that the problems are not yet solved.


I believe you are right. There must be a big problem with the server. I am concerned too. Rosetta is my favorite. Since we are just enthusiastic volunteers, let's just relax and wait. For the time being, take some WUs from World Community Grid. They have lots of work.
Right now my biggest concern is the temperature in Seattle, 90°F to 102°F for the next few days or more. I have no air conditioning in my computer room. My room ambient temperature is 95°F. I am shutting off all 5 of my quad machines to avoid shortening their life. The machines are all dedicated to Rosetta@home. All our major appliance stores have run out of window air conditioners.
joseps
I turned off my 5 computers when I went on vacation. When I returned today, I could not upload work. Need work units to run computers.
joseps
ID: 62608 · Rating: 0

©2021 University of Washington
https://www.bakerlab.org