Results pending and results still uploading

Message boards : Number crunching : Results pending and results still uploading

To post messages, you must log in.

AuthorMessage
TJ

Send message
Joined: 29 Mar 09
Posts: 127
Credit: 4,799,890
RAC: 0
Message 65386 - Posted: 22 Feb 2010, 12:09:07 UTC

Since 20 February my results that have been uploaded are still pending for credit. That is not a problem as it could take a few daqys. However since last night finished WU's are still uploading.
This is a part of messages:
22/02/2010 12:41:30 rosetta@home Requesting new tasks
22/02/2010 12:41:46 rosetta@home Scheduler request failed: Couldn't resolve host name
22/02/2010 12:42:46 rosetta@home Sending scheduler request: To fetch work.
22/02/2010 12:42:46 rosetta@home Requesting new tasks
22/02/2010 12:43:02 rosetta@home Scheduler request failed: Couldn't resolve host name

Can someone of the dev please look at this?
Thanks.
Greetings,
TJ.
ID: 65386 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Conan
Avatar

Send message
Joined: 11 Oct 05
Posts: 146
Credit: 3,211,099
RAC: 972
Message 65387 - Posted: 22 Feb 2010, 12:29:45 UTC - in response to Message 65386.  
Last modified: 22 Feb 2010, 12:31:42 UTC

Since 20 February my results that have been uploaded are still pending for credit. That is not a problem as it could take a few daqys. However since last night finished WU's are still uploading.
This is a part of messages:
22/02/2010 12:41:30 rosetta@home Requesting new tasks
22/02/2010 12:41:46 rosetta@home Scheduler request failed: Couldn't resolve host name
22/02/2010 12:42:46 rosetta@home Sending scheduler request: To fetch work.
22/02/2010 12:42:46 rosetta@home Requesting new tasks
22/02/2010 12:43:02 rosetta@home Scheduler request failed: Couldn't resolve host name

Can someone of the dev please look at this?
Thanks.


I also have added a message under Server Problems about the Pending Issue.
I now have over 2 pages of Pending work units.

The message about "Couldn't Resolve Host Name" usually means that your local Internet connection has been interrupted towards the Boinc Server and your local Boinc Client (on your computer) can't talk to the Rosetta Server.

I am not getting that message myself so I can talk to the servers (from Australia), but I have gotten it in the past for short periods even though I can still use my own internet quite OK.

I would say it is an issue with the Rosetta Boinc server losing it's internet connection or a problem with their internet Provider, but also check that your own internet is still working OK.
ID: 65387 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
TJ

Send message
Joined: 29 Mar 09
Posts: 127
Credit: 4,799,890
RAC: 0
Message 65388 - Posted: 22 Feb 2010, 13:04:10 UTC - in response to Message 65387.  

Thanks for your input Conan,

I have checked my internet connection beforehand and Einstein and Milkyway where and are still "talking" with my computers.

Perhaps Rosetta solves it in the course of the day.
Greetings,
TJ.
ID: 65388 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Aroundomaha

Send message
Joined: 11 Sep 08
Posts: 14
Credit: 54,456,950
RAC: 13,197
Message 65389 - Posted: 22 Feb 2010, 16:38:28 UTC - in response to Message 65386.  

I noticed a huge drop off in my average credits over the week-end and now see I have 16,000 pending credits. It appears that everthing sicne around Friday evening has been put into pending.

Since 20 February my results that have been uploaded are still pending for credit. That is not a problem as it could take a few daqys. However since last night finished WU's are still uploading.
This is a part of messages:
22/02/2010 12:41:30 rosetta@home Requesting new tasks
22/02/2010 12:41:46 rosetta@home Scheduler request failed: Couldn't resolve host name
22/02/2010 12:42:46 rosetta@home Sending scheduler request: To fetch work.
22/02/2010 12:42:46 rosetta@home Requesting new tasks
22/02/2010 12:43:02 rosetta@home Scheduler request failed: Couldn't resolve host name

Can someone of the dev please look at this?
Thanks.

ID: 65389 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mad_Max

Send message
Joined: 31 Dec 09
Posts: 192
Credit: 14,211,999
RAC: 11,890
Message 65390 - Posted: 22 Feb 2010, 17:22:39 UTC

Most of my tasks over the last 3 days, stuck in a state of "pending" too.
Apparently this is due to the validator failure (rah_validator_mini = Not running)
I only get credits for "rossmann3x3..." tasks. Seems because it runs on different application (Rosetta Beta 5.98), with own validator (rah_validator_beta = Running)
So what you have to wait until devs fix rah_validator_mini...
ID: 65390 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile AndyAtlanta

Send message
Joined: 26 Mar 06
Posts: 8
Credit: 3,018,913
RAC: 0
Message 65391 - Posted: 22 Feb 2010, 18:19:39 UTC

I have 3 screens' worth of pending, total value approx. 3300 credits.
ID: 65391 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 1337
Credit: 24,629,392
RAC: 12,566
Message 65392 - Posted: 22 Feb 2010, 18:44:03 UTC
Last modified: 22 Feb 2010, 19:41:32 UTC

The validator is back running now and I've noticed a few WUs have already been awarded.

A big day for credits today to make up for the weekend. Panic over.

Edit: Well that's fastest 2000 credits I've ever had!

Just 3 left to validate, but 2 validate errors in the process. I'll cope.
ID: 65392 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
jjwhalen
Avatar

Send message
Joined: 20 Dec 06
Posts: 4
Credit: 399,398
RAC: 0
Message 65395 - Posted: 22 Feb 2010, 22:25:06 UTC - in response to Message 65392.  



Just 3 left to validate, but 2 validate errors in the process. I'll cope.


I only had 3 Rosetta WUs stuck waiting for the mini validator to come back up, but 2 of them (291252049 & 291252024) also failed to validate. After reading the previous post, I suspect the validator wasn't feeling well when it processed them. Based on this host's (1184796) past performance, I think a server-side, rather than a client-side issue is more likely.

I'm not whining about a few hundred credits here...you can keep 'em, I have more. I really believe project administration should take a look at this validator's performance since it came back up. But as the philosopher says, "that's just my opinion, I could be wrong."

Best wishes:)

ID: 65395 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mad_Max

Send message
Joined: 31 Dec 09
Posts: 192
Credit: 14,211,999
RAC: 11,890
Message 65396 - Posted: 23 Feb 2010, 0:18:49 UTC

I have validate error too: https://boinc.bakerlab.org/rosetta/workunit.php?wuid=291359005
And these WU is the same type as listed in the jjwhalen's post: placestub_1sq2_*
ID: 65396 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
LizzieBarry

Send message
Joined: 25 Feb 08
Posts: 76
Credit: 201,862
RAC: 0
Message 65397 - Posted: 23 Feb 2010, 1:08:51 UTC - in response to Message 65395.  

I'm not whining about a few hundred credits here...you can keep 'em, I have more. I really believe project administration should take a look at this validator's performance since it came back up. But as the philosopher says, "that's just my opinion, I could be wrong."

If I'm not mistaken another job runs each day (or two) that picks up all the validate errors and awards them the same value as the claimed credit. Isn't that right, someone?

If so, I'd be more concerned that the project gets the benefit of the processing. Does the job's results get accepted properly for the science part?
ID: 65397 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Gorkan

Send message
Joined: 13 Sep 07
Posts: 10
Credit: 151,300
RAC: 0
Message 65398 - Posted: 23 Feb 2010, 1:52:01 UTC

Sent in 44 wu's after validator went down.
17 Validate errors
4 still pending

Come on guys, your just spinnin my power meter here.
Im sure Alabama power loves you for it but I'm just not feelin the "love"
ID: 65398 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
P . P . L .

Send message
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 65399 - Posted: 23 Feb 2010, 4:14:54 UTC

I got 4 validate errors out of 12 tasks, not good odds

some of them have been since the validator restarted. :(

ID: 65399 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Gorkan

Send message
Joined: 13 Sep 07
Posts: 10
Credit: 151,300
RAC: 0
Message 65402 - Posted: 23 Feb 2010, 19:46:51 UTC

I'm still getting the validate errors on wu's sent yesterday.
have flushed everything and starting fresh - even my drink
dunno if its gonna fix things, but I can say the drink tastes better!
wish me luck ;)


ID: 65402 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 1337
Credit: 24,629,392
RAC: 12,566
Message 65412 - Posted: 24 Feb 2010, 17:30:39 UTC - in response to Message 65395.  
Last modified: 24 Feb 2010, 17:31:47 UTC

Just 3 left to validate, but 2 validate errors in the process. I'll cope.

I only had 3 Rosetta WUs stuck waiting for the mini validator to come back up, but 2 of them (291252049 & 291252024) also failed to validate. After reading the previous post, I suspect the validator wasn't feeling well when it processed them...

I'm not whining about a few hundred credits here... you can keep 'em, I have more. I really believe project administration should take a look at this validator's performance since it came back up. But as the philosopher says, "that's just my opinion, I could be wrong."

As mentioned earlier, both those jobs have been awarded full claimed credit in spite of the validate error. One was sent to a wingman and validated properly. Another went out again and produced another validate error. Short story, everyone here gets the credits so nothing to worry about there.

The server is coming up green on everything, but there are still a few validate errors and delays in awarding them, so it does appear there's a remaining issue somewhere.

Could someone at TPTB take a look at it and report back please?
ID: 65412 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 1337
Credit: 24,629,392
RAC: 12,566
Message 65417 - Posted: 25 Feb 2010, 8:42:18 UTC - in response to Message 65412.  
Last modified: 25 Feb 2010, 8:42:33 UTC

The server is coming up green on everything, but there are still a few validate errors and delays in awarding them, so it does appear there's a remaining issue somewhere.

Could someone at TPTB take a look at it and report back please?

I noticed the server went down for a while last night and came back up soon enough. Now validation is going through much better. I assume this is all fixed now, but a confirmation would still be appreciated.
ID: 65417 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 929
Credit: 10,488,745
RAC: 4,310
Message 65431 - Posted: 27 Feb 2010, 4:41:26 UTC - in response to Message 65387.  

The message about "Couldn't Resolve Host Name" usually means that your local Internet connection has been interrupted towards the Boinc Server and your local Boinc Client (on your computer) can't talk to the Rosetta Server.


Likely to depend on what ISP you're using. For mine, it usually means that the DNS server you're using for turning host names into IP addresses isn't working, even if the rest of the internet is still working and you can still reach any other hosts for which your computer has saved the IP address from the last time it contacted that host.
ID: 65431 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 929
Credit: 10,488,745
RAC: 4,310
Message 65432 - Posted: 27 Feb 2010, 4:57:47 UTC - in response to Message 65397.  

I'm not whining about a few hundred credits here...you can keep 'em, I have more. I really believe project administration should take a look at this validator's performance since it came back up. But as the philosopher says, "that's just my opinion, I could be wrong."

If I'm not mistaken another job runs each day (or two) that picks up all the validate errors and awards them the same value as the claimed credit. Isn't that right, someone?

If so, I'd be more concerned that the project gets the benefit of the processing. Does the job's results get accepted properly for the science part?


I'd expect to be more like manual intervention to inspect any workunits for which too many people got validate errors, and then either adjust the validator and run it on on those workunits again, or manually determine the valid parts of the workunit outputs and award credits for just those parts.

What you suggest might be true for the workunits closer to beta testing of new program sections, though, since for those workunits, testing the software is more important than getting scientific results.
ID: 65432 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Number crunching : Results pending and results still uploading



©2020 University of Washington
https://www.bakerlab.org