Minirosetta v1.28 bug thread

Message boards : Number crunching : Minirosetta v1.28 bug thread

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · 6 · Next

AuthorMessage
Winston_Smith

Send message
Joined: 24 Apr 08
Posts: 2
Credit: 24,918
RAC: 0
Message 53881 - Posted: 20 Jun 2008, 18:59:30 UTC

Thnx for the tip, shame I wasted that CPU time. I know this is het mini 1.28 problem, but the malfunctioning tasks are 5.96. Is 1.28 causing the problem with 5.96?
ID: 53881 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mike.Gibson

Send message
Joined: 3 Nov 07
Posts: 19
Credit: 311,844
RAC: 0
Message 53883 - Posted: 20 Jun 2008, 23:03:22 UTC

Mini 1.28 computation finished on WU 155747918 when time to completion still over 40 minutes. Job reported automatically but a validation error was recorded. Are these factors linked? It seems to have been a waste of 19.3 hours. Can anything be done to recover the WU & points?
ID: 53883 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5658
Credit: 5,670,291
RAC: 2,328
Message 53884 - Posted: 20 Jun 2008, 23:11:45 UTC

I just noticed this t405 unit that crashed with a huge call stack
https://boinc.bakerlab.org/rosetta/result.php?resultid=170539088
ID: 53884 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile rochester new york
Avatar

Send message
Joined: 2 Jul 06
Posts: 2842
Credit: 2,020,043
RAC: 0
Message 53900 - Posted: 21 Jun 2008, 17:45:30 UTC

ID: 53900 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5658
Credit: 5,670,291
RAC: 2,328
Message 53902 - Posted: 21 Jun 2008, 18:31:53 UTC - in response to Message 53900.  

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=154312469


you went past the deadline, it is still due to run since it has not reported yet. personally i would abort it. 2x compute errors is enough.
ID: 53902 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Cosmo_vk

Send message
Joined: 23 Jun 08
Posts: 4
Credit: 338,981
RAC: 0
Message 53919 - Posted: 23 Jun 2008, 12:27:56 UTC

Errors with t405 and minirosetta 1.28:
https://boinc.bakerlab.org/rosetta/result.php?resultid=173022693
https://boinc.bakerlab.org/rosetta/result.php?resultid=172998319
https://boinc.bakerlab.org/rosetta/result.php?resultid=172998314

Boinc:5.10.30
ID: 53919 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Nothing But Idle Time

Send message
Joined: 28 Sep 05
Posts: 209
Credit: 139,545
RAC: 0
Message 53927 - Posted: 23 Jun 2008, 14:43:18 UTC

On a happy note from my perspective and just so the news isn't all bad... I don't recall ever seeing a mini-Rosetta task of any version that didn't run on my Intel/Windows XP computer using Boinc 5.10.13 (don't know if I got a t405 or not). I'm either very lucky with the tasks I'm assigned or there is something very special about my computer setup. Hope my luck continues and...knock on wood!
ID: 53927 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile rochester new york
Avatar

Send message
Joined: 2 Jul 06
Posts: 2842
Credit: 2,020,043
RAC: 0
Message 53928 - Posted: 23 Jun 2008, 15:00:31 UTC - in response to Message 53902.  

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=154312469


you went past the deadline, it is still due to run since it has not reported yet. personally i would abort it. 2x compute errors is enough.

right you are ....thanks
ID: 53928 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Speedy
Avatar

Send message
Joined: 25 Sep 05
Posts: 163
Credit: 800,690
RAC: 173
Message 53972 - Posted: 24 Jun 2008, 21:57:59 UTC

The current version is minirosetta_database_rev23035.zip.

I just tried to delete the old minirosetta_database's.

I have Rev20940, 21566, 22619 & 23035. I deleted all but Rev23035, (the size of the old db's are 43.7MB) when I opened up boinc manger it started 2 download these again so I closed bm & replaced them so I don"t use my data unnecessarily. has this happened to any one else?
Cheers
Speedy
Have a crunching good day!!
ID: 53972 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Nothing But Idle Time

Send message
Joined: 28 Sep 05
Posts: 209
Credit: 139,545
RAC: 0
Message 53981 - Posted: 25 Jun 2008, 4:26:41 UTC - in response to Message 53972.  

The current version is minirosetta_database_rev23035.zip.

I just tried to delete the old minirosetta_database's.

I have Rev20940, 21566, 22619 & 23035. I deleted all but Rev23035, (the size of the old db's are 43.7MB) when I opened up boinc manger it started 2 download these again so I closed bm & replaced them so I don"t use my data unnecessarily. has this happened to any one else?
Cheers
Speedy
I think you have to not only delete the physical DB files but you also have to remove any reference to these files from the client_state.xml file as well, otherwise Boinc will think the DBs are simply missing and try to download them again. Unless you are good with xml it is best to wait for David Kim to release the proper software...so please hurry.
ID: 53981 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Cosmo_vk

Send message
Joined: 23 Jun 08
Posts: 4
Credit: 338,981
RAC: 0
Message 53983 - Posted: 25 Jun 2008, 5:54:55 UTC

all my minirosetta WU go to error. Why?

https://boinc.bakerlab.org/rosetta/result.php?resultid=173208692
https://boinc.bakerlab.org/rosetta/result.php?resultid=173207777
https://boinc.bakerlab.org/rosetta/result.php?resultid=173205772
https://boinc.bakerlab.org/rosetta/result.php?resultid=173175602
https://boinc.bakerlab.org/rosetta/result.php?resultid=173175601
and others...
ID: 53983 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
WHopkins

Send message
Joined: 9 Aug 06
Posts: 2
Credit: 27,250
RAC: 0
Message 53991 - Posted: 25 Jun 2008, 20:17:11 UTC

All 10 rb_06_24_11702_20999_T0468_* tasks I've downloaded today have failed with computation errors on initilization.
ID: 53991 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1018
Credit: 4,334,829
RAC: 0
Message 53992 - Posted: 25 Jun 2008, 20:36:52 UTC

These are odd errors. Can you try to reset the project on your client? I wonder if the database file got corrupted. We'll look into this further.
ID: 53992 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1018
Credit: 4,334,829
RAC: 0
Message 53993 - Posted: 25 Jun 2008, 20:48:32 UTC - in response to Message 53983.  

all my minirosetta WU go to error. Why?

https://boinc.bakerlab.org/rosetta/result.php?resultid=173208692
https://boinc.bakerlab.org/rosetta/result.php?resultid=173207777
https://boinc.bakerlab.org/rosetta/result.php?resultid=173205772
https://boinc.bakerlab.org/rosetta/result.php?resultid=173175602
https://boinc.bakerlab.org/rosetta/result.php?resultid=173175601
and others...



The errors may indicate a corrupted database file. Can you try resetting the project?
ID: 53993 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Cosmo_vk

Send message
Joined: 23 Jun 08
Posts: 4
Credit: 338,981
RAC: 0
Message 54001 - Posted: 26 Jun 2008, 2:47:05 UTC - in response to Message 53993.  

The errors may indicate a corrupted database file. Can you try resetting the project?

I will try to do it today
ID: 54001 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Cosmo_vk

Send message
Joined: 23 Jun 08
Posts: 4
Credit: 338,981
RAC: 0
Message 54004 - Posted: 26 Jun 2008, 10:14:09 UTC

I restart the project, but I received the result that nothing has changed( 0.13% and error):
https://boinc.bakerlab.org/rosetta/result.php?resultid=173664257
https://boinc.bakerlab.org/rosetta/result.php?resultid=173664256
https://boinc.bakerlab.org/rosetta/result.php?resultid=173664242

How to get the job only for the Rosetta 5.96(5.98)?

Aleksey Vlasov,
Russian Federation
Team: Russia
ID: 54004 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Pepo
Avatar

Send message
Joined: 28 Sep 05
Posts: 115
Credit: 101,358
RAC: 0
Message 54006 - Posted: 26 Jun 2008, 12:01:25 UTC - in response to Message 54004.  

How to get the job only for the Rosetta 5.96(5.98)?

You'd have to write your own anonymous platform description file app_info.xml for Rosetta project, which would contain just the required application(s) and describe all other necessary files. I've done few for SETI@Home in the past, but Rosetta's files requirements are superior to those of SETI.

It would be simpler for you to set Rosetta to NoNewTasks for the time being and try again later.

Or someone can volunteer...

Peter
ID: 54006 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
JordanWeber

Send message
Joined: 24 Apr 08
Posts: 4
Credit: 716,009
RAC: 0
Message 54008 - Posted: 26 Jun 2008, 16:00:57 UTC

I regret to inform you (Rosetta team) that I will no longer be participating much longer with the Rosetta@home project on 1 of my 3 computers (which donates approx 500 credits daily, which may not seem like much to you). Because of the fact that any minirosetta tasks that get downloaded, never start and stay in a blank "CPU time"/Running state getting nothing done ever (this was reported over a month ago, and opt outs have been requested by others as well prior)

Because of this I have been forced to abort minirosetta tasks, which for the past 3 days now have been the only things I have been getting (1 in 100 being 5.96 tasks) Meaning every day I come in and MANUALLY cancel 75+ tasks (until your servers state I have gotten my daily quota), and only getting 1 5.96 tasks if I'm lucky, which is too much of a hassle. Seeing as you still have left no way for me to opt out of the minirosetta project, and I've been forced to use a non-working application, I will have to end participation in the project on that computer.

Most of the information about my computer should already be in my profile, but if you need additional information or need help debugging I am always happy to help, so feel free to leave me a private message. Sorry
ID: 54008 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1018
Credit: 4,334,829
RAC: 0
Message 54012 - Posted: 26 Jun 2008, 17:01:00 UTC - in response to Message 54004.  

I restart the project, but I received the result that nothing has changed( 0.13% and error):
https://boinc.bakerlab.org/rosetta/result.php?resultid=173664257
https://boinc.bakerlab.org/rosetta/result.php?resultid=173664256
https://boinc.bakerlab.org/rosetta/result.php?resultid=173664242

How to get the job only for the Rosetta 5.96(5.98)?

Aleksey Vlasov,
Russian Federation
Team: Russia



We're looking into this. It's not a general error but seems specific to your computer. The errors now are all similar and coming from the same area in the code according to the trace in stderr.
ID: 54012 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1018
Credit: 4,334,829
RAC: 0
Message 54013 - Posted: 26 Jun 2008, 17:10:03 UTC - in response to Message 54008.  

I regret to inform you (Rosetta team) that I will no longer be participating much longer with the Rosetta@home project on 1 of my 3 computers (which donates approx 500 credits daily, which may not seem like much to you). Because of the fact that any minirosetta tasks that get downloaded, never start and stay in a blank "CPU time"/Running state getting nothing done ever (this was reported over a month ago, and opt outs have been requested by others as well prior)

Because of this I have been forced to abort minirosetta tasks, which for the past 3 days now have been the only things I have been getting (1 in 100 being 5.96 tasks) Meaning every day I come in and MANUALLY cancel 75+ tasks (until your servers state I have gotten my daily quota), and only getting 1 5.96 tasks if I'm lucky, which is too much of a hassle. Seeing as you still have left no way for me to opt out of the minirosetta project, and I've been forced to use a non-working application, I will have to end participation in the project on that computer.

Most of the information about my computer should already be in my profile, but if you need additional information or need help debugging I am always happy to help, so feel free to leave me a private message. Sorry



Can you please run the Ralph project with this computer since minirosetta is under constant development and may become stable for this particular machine? I don't really know what could be going on. I'm going to update mini soon within a few days.


ID: 54013 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 · 6 · Next

Message boards : Number crunching : Minirosetta v1.28 bug thread



©2024 University of Washington
https://www.bakerlab.org