computaiton error

Message boards : Number crunching : computaiton error

To post messages, you must log in.

AuthorMessage
Dom

Send message
Joined: 2 Aug 10
Posts: 14
Credit: 187,991
RAC: 0
Message 67079 - Posted: 3 Aug 2010, 15:32:14 UTC
Last modified: 3 Aug 2010, 15:35:52 UTC

I have tryed to find the answer before posting this so please don't shout at me to go read this or that page :).

I am new to Rosetta and have let my PC crunch numbers for a few days now but have noticed some client errors or computaiton error on my results page? Is this something I need to worry about? I am just trying to save time for more crunching.

My Results

Any help or advice welcome.

Dom
ID: 67079 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Dom

Send message
Joined: 2 Aug 10
Posts: 14
Credit: 187,991
RAC: 0
Message 67081 - Posted: 3 Aug 2010, 16:12:52 UTC - in response to Message 67080.  

Looking at your results it is not as if you're only returning computation errors. There can't be a major problem. On the other hand, nothing seems to be wrong with the tasks themselves, as fas I can see. I don't know if you overclock but if you do, you may want to back off a little on the OC.


Nope its an off the shelf gaming PC, I just dont have time to play game these days. So wanted to put the pc to better use.



ID: 67081 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1829
Credit: 115,506,360
RAC: 56,818
Message 67082 - Posted: 3 Aug 2010, 16:40:14 UTC

Hi Dom

The best method for testing is almost always Prime95 for four hours or so - just download from step 2 under 'Setup Instructions for New Users'. Then run 'Stress test only', choose option 2 and let it run. If it doesn't give any errors then Rosetta (and everything else) should be fine.

Danny
ID: 67082 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Dom

Send message
Joined: 2 Aug 10
Posts: 14
Credit: 187,991
RAC: 0
Message 67083 - Posted: 3 Aug 2010, 16:45:48 UTC

Good Idea, I have used that in the past to check memory problems. I will give it a try thanks.

ID: 67083 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jochen

Send message
Joined: 6 Jun 06
Posts: 133
Credit: 3,847,433
RAC: 0
Message 67084 - Posted: 3 Aug 2010, 17:29:13 UTC - in response to Message 67083.  

Good Idea, I have used that in the past to check memory problems. I will give it a try thanks.



You probably know, but just to make sure: Keep an eye on the temperatures.

It could be a temperature related problem. How old is the computer? Ever cleaned the dust filtes, or the coolers and fans?
I have to clean the dust filters every 6 months...

cu

Joe

ID: 67084 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Dom

Send message
Joined: 2 Aug 10
Posts: 14
Credit: 187,991
RAC: 0
Message 67089 - Posted: 4 Aug 2010, 0:42:01 UTC

3 months old and in a clean room well almost, but yes I do clean the fans/ducting each month.

Prime95 found no problems so I will just monitor the problem for a week before I try anything else.



ID: 67089 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mikey
Avatar

Send message
Joined: 5 Jan 06
Posts: 1894
Credit: 8,767,285
RAC: 12,464
Message 67095 - Posted: 4 Aug 2010, 10:29:41 UTC - in response to Message 67090.  

3 months old and in a clean room well almost, but yes I do clean the fans/ducting each month.

Prime95 found no problems so I will just monitor the problem for a week before I try anything else.

EDIT. Sorry for the double post. Can't see how I remove it.


YOU can't remove it, normally we just put the words 'double post' in it and let the Moderators remove it. But it is okay it happens to all of us sometimes. As for the errors have you set Boinc so the suspended units stay in memory? That is a setting on the webpage under Your Account, Computing Preferences and then you will see this:
"Leave applications in memory while suspended?
(suspended applications will consume swap space if 'yes') yes"

As you can see mine is set to yes, change yours to yes, if it is not already and the errors might go away.
ID: 67095 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Dom

Send message
Joined: 2 Aug 10
Posts: 14
Credit: 187,991
RAC: 0
Message 67109 - Posted: 5 Aug 2010, 3:38:03 UTC


I am still getting errors but I will try the "Leave applications in memory while suspended? (suspended applications will consume swap space if 'yes')" Suggestion.

I do use the pc a little now and then when its crunching so maybe that is the problem.

Thanks for the help guys.


ID: 67109 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Dom

Send message
Joined: 2 Aug 10
Posts: 14
Credit: 187,991
RAC: 0
Message 67201 - Posted: 13 Aug 2010, 17:21:54 UTC

Not a happy bunny.

It looks like I have 20 computation errors in 10 days.
And now the server is also down and I cant find any information on when its due back up.

I may have to give up on this project.


ID: 67201 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
quel

Send message
Joined: 18 May 10
Posts: 2
Credit: 543,099
RAC: 0
Message 67203 - Posted: 13 Aug 2010, 17:56:23 UTC

It seems the server is starting to come back to life. srv4.bakerlab.org wasn't even responding to web requests but it is now. I was able to upload all my finished jobs but I cannot send the scheduler request to report them. Here is the output I get:
13-Aug-2010 12:47:58 [rosetta@home] Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 2 completed tasks
13-Aug-2010 12:48:03 [rosetta@home] Scheduler request succeeded: got 0 new tasks
13-Aug-2010 12:48:03 [rosetta@home] Message from server: Server error: can't attach shared memory

And the tasks are still listed as ready to report.
ID: 67203 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mikey
Avatar

Send message
Joined: 5 Jan 06
Posts: 1894
Credit: 8,767,285
RAC: 12,464
Message 67212 - Posted: 14 Aug 2010, 10:55:05 UTC - in response to Message 67203.  

It seems the server is starting to come back to life. srv4.bakerlab.org wasn't even responding to web requests but it is now. I was able to upload all my finished jobs but I cannot send the scheduler request to report them. Here is the output I get:
13-Aug-2010 12:47:58 [rosetta@home] Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 2 completed tasks
13-Aug-2010 12:48:03 [rosetta@home] Scheduler request succeeded: got 0 new tasks
13-Aug-2010 12:48:03 [rosetta@home] Message from server: Server error: can't attach shared memory

And the tasks are still listed as ready to report.


Time is on your side, they are working on it and as of right now I think everything is back up and running, they are all green anyway!
https://boinc.bakerlab.org/rosetta/rah_status.php
ID: 67212 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Number crunching : computaiton error



©2024 University of Washington
https://www.bakerlab.org