Problems with Rosetta stable version 5.69 and beta version 5.77

Message boards : Number crunching : Problems with Rosetta stable version 5.69 and beta version 5.77

To post messages, you must log in.

Previous · 1 · 2 · 3

AuthorMessage
Mod.Sense
Volunteer moderator
Project administrator

Send message
Joined: 22 Aug 06
Posts: 3458
Credit: 0
RAC: 0
Message 47522 - Posted: 8 Oct 2007, 21:02:24 UTC - in response to Message 47521.  

Thank you for your replies guys

So the benchmark is a bit falsified on some of our machines, ok but here is a problem all the same http://boinc.bakerlab.org/rosetta/workunit.php?wuid=99953073

The benchmark seem to be right, a core2quad is a fast cpu, 2 point for 11 000 seconde of calculation is very strange, i think it\'s a problem on some wu because the other wu calculated are right http://boinc.bakerlab.org/rosetta/results.php?hostid=573215&offset=40


David, Rhiju, check this out. The result shows two! completion sections. One shows 29 decoys and 10494 seconds of CPU, the other shows 30 decoys and 10861.7 seconds of CPU. ...and either one would normally have granted more then 2 credits. It\'s almost like it completed the task once and then later ran another model on it.

This is Windows XP Pro. on Intel Core2 Quad. With only 1GB of memory for 4CPUs, it would say this machine is probably memory constrained. From what I can tell, more of the machines reporting Linux problems are memory constrained as well.

ZeuZ, I didn\'t mean to say any of the bechmarks were falsified (although some people do that, and that is a big part of why Rosetta made a more independant credit system). I only meant that the work measured in the benchmark is trivial (simple) when compared to running Rosetta. So it is possible for one machine to show benchmarks twice as high as another, and yet it does not get twice as much work done.
Rosetta Moderator: Mod.Sense
ID: 47522 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
ziegenmelker

Send message
Joined: 26 Jul 06
Posts: 10
Credit: 26,061
RAC: 0
Message 47535 - Posted: 9 Oct 2007, 7:32:41 UTC
Last modified: 9 Oct 2007, 7:33:40 UTC

This host really tries out hard to get a valid WU. :-(
Btw. when I shut down the machine yesterday, there were afair 9(!) instances of Rosetta@home in memory, each using 79MB of RAM. At that time one WU was aktive, another one was at some % and waiting to run again.

<core_client_version>5.10.8</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
Graphics are disabled due to configuration...
# cpu_run_time_pref: 14400
# random seed: 2692519
pure virtual method called
SIGSEGV: segmentation violation
Stack trace (9 frames):
[0x8cdfe17]
[0x8cdac0c]
[0xffffe500]
[0x8d65433]
[0x8d4b794]
[0x8cdc897]
[0x8cddeb5]
[0x8cd6ea5]
[0x8d777fa]

Exiting...
terminate called without an active exception
SIGABRT: abort called
Stack trace (19 frames):
[0x8cdfe17]
[0x8cdac0c]
[0xffffe500]
[0x8d4b224]
[0x8d38b0e]
[0x8d35e9d]
[0x8d35ed2]
[0x8d355b5]
[0x8be23b3]
[0x8bea61d]
[0x8b50074]
[0x8c31c58]
[0x849a8a1]
[0x80dad6d]
[0x85c5a97]
[0x86eda4f]
[0x86edafa]
[0x8d44164]
[0x8048111]

Exiting...
Graphics are disabled due to configuration...
# cpu_run_time_pref: 14400
SIGSEGV: segmentation violation
Stack trace (13 frames):
[0x8cdfe17]
[0x8cdac0c]
[0xffffe500]
[0x8c4a1db]
[0x8b51266]
[0x8c31c58]
[0x849a87c]
[0x80dad6d]
[0x85c5a97]
[0x86eda4f]
[0x86edafa]
[0x8d44164]
[0x8048111]

Exiting...
SIGSEGV: segmentation violation

</stderr_txt>
]]>


Right now two WUs are waiting to run:
Rosetta Beta 5.80: 1ubi__BOINC_ABRELAX_SHORTREL... 85,247 %
Rosetta 5.69: CNTRL_01ABRELAX_SAVE_ALL_OU... 9,768

Nothing related to Rosetta in memory. If this is going to change, I will report here.

cu,
Michael

edit: spelling
ID: 47535 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator
Project administrator

Send message
Joined: 22 Aug 06
Posts: 3458
Credit: 0
RAC: 0
Message 47540 - Posted: 9 Oct 2007, 12:39:58 UTC

Michael, BOINC only runs one task per CPU. But it can get more tasks started if it starts to use more memory then your preference. These tasks are preempted and go to a \"waiting for memory\" state. On Windows at least, these may still look like they consume memory, but it is only in the swap file while they await BOINC to move them back to a state of \"running\". So you may want to review your General Preferences for how much memory BOINC is allowed to use.

Please add your comments to the Linux thread as you study it further.
Rosetta Moderator: Mod.Sense
ID: 47540 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
ziegenmelker

Send message
Joined: 26 Jul 06
Posts: 10
Credit: 26,061
RAC: 0
Message 47546 - Posted: 9 Oct 2007, 16:08:36 UTC - in response to Message 47540.  

Please add your comments to the Linux thread as you study it further.


Thanks, I\'ll do so.

cu,
Michal
ID: 47546 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1605
Credit: 44,291,141
RAC: 31,439
Message 47606 - Posted: 10 Oct 2007, 21:41:49 UTC

i\'ve just had a stalled task:

10/10/2007 22:41:00|rosetta@home|Restarting task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1cc8A-_filters_1782_552194_0 using rosetta version 569

did net stop/start boinc and it\'s continued running again.


ID: 47606 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Luuklag

Send message
Joined: 13 Sep 07
Posts: 262
Credit: 4,171
RAC: 0
Message 47855 - Posted: 19 Oct 2007, 9:02:24 UTC

also could some1 have a look at my topic abbrelax WU\'s, which is spelled wrong xD 3 failed WU\'s in a row
ID: 47855 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Admin

Send message
Joined: 13 Apr 07
Posts: 42
Credit: 260,782
RAC: 0
Message 48159 - Posted: 30 Oct 2007, 18:41:08 UTC

Somethings wrong but i dont know what. Im continually getting validation errors on my results and not one has gone through successfully. You can check my results at:
http://boinc.bakerlab.org/rosetta/results.php?userid=164784

Can anyone help me?

I detached as i thought that might be an issue, but to no use. Ive suspended until i can figure this out.
ID: 48159 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile KSMarksPsych
Avatar

Send message
Joined: 15 Oct 05
Posts: 199
Credit: 22,337
RAC: 0
Message 48177 - Posted: 31 Oct 2007, 10:41:43 UTC - in response to Message 48159.  

Somethings wrong but i dont know what. Im continually getting validation errors on my results and not one has gone through successfully. You can check my results at:
http://boinc.bakerlab.org/rosetta/results.php?userid=164784

Can anyone help me?

I detached as i thought that might be an issue, but to no use. Ive suspended until i can figure this out.


You\'re running Vista. There\'s two things to check. Is BOINC installed to the default directory (c:\\program files\\boinc)? That generally causes trouble. If so, uninstall BOINC and reinstall to somewhere outside c:\\program files. I use c:\\boinc and it\'s fine. Do you exit out of BOINC before shutting down the computer? BOINC usually doesn\'t have enough time to do its housekeeping with Vista\'s super speedy shutdown. There\'s a registry hack here or you can just shut down BOINC manually (file -> exit out of the manager or stop the service).

Kathryn :o)
The BOINC FAQ Service
The Unofficial BOINC Wiki
The Trac System
More BOINC information than you can shake a stick of RAM at.
ID: 48177 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Admin

Send message
Joined: 13 Apr 07
Posts: 42
Credit: 260,782
RAC: 0
Message 48178 - Posted: 31 Oct 2007, 11:44:12 UTC - in response to Message 48177.  

Somethings wrong but i dont know what. Im continually getting validation errors on my results and not one has gone through successfully. You can check my results at:
http://boinc.bakerlab.org/rosetta/results.php?userid=164784

Can anyone help me?

I detached as i thought that might be an issue, but to no use. Ive suspended until i can figure this out.


You\'re running Vista. There\'s two things to check. Is BOINC installed to the default directory (c:\\program files\\boinc)? That generally causes trouble. If so, uninstall BOINC and reinstall to somewhere outside c:\\program files. I use c:\\boinc and it\'s fine. Do you exit out of BOINC before shutting down the computer? BOINC usually doesn\'t have enough time to do its housekeeping with Vista\'s super speedy shutdown. There\'s a registry hack here or you can just shut down BOINC manually (file -> exit out of the manager or stop the service).


Well ive always had it in the default directory before and its never been an issue, but ill uninstall it to a different directory, as for shutting down I usually shut down boinc before down but sometimes theres an occasion where i put it to sleep while boinc is running and it starts up when i boot it back up. Is that a problem?
ID: 48178 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile KSMarksPsych
Avatar

Send message
Joined: 15 Oct 05
Posts: 199
Credit: 22,337
RAC: 0
Message 48219 - Posted: 1 Nov 2007, 10:05:09 UTC - in response to Message 48178.  

i put it to sleep while boinc is running and it starts up when i boot it back up. Is that a problem?


No idea. I\'ve never tried putting the computer to sleep and waking it back up. When I do need to take it to work (vary rarely) I just shut down BOINC and the entire computer because it ends up running on battery power all day (I can\'t plug it in at work because they don\'t have at 110-220 converter there).
Kathryn :o)
The BOINC FAQ Service
The Unofficial BOINC Wiki
The Trac System
More BOINC information than you can shake a stick of RAM at.
ID: 48219 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Admin

Send message
Joined: 13 Apr 07
Posts: 42
Credit: 260,782
RAC: 0
Message 48240 - Posted: 1 Nov 2007, 19:23:05 UTC - in response to Message 48219.  

i put it to sleep while boinc is running and it starts up when i boot it back up. Is that a problem?


No idea. I\'ve never tried putting the computer to sleep and waking it back up. When I do need to take it to work (vary rarely) I just shut down BOINC and the entire computer because it ends up running on battery power all day (I can\'t plug it in at work because they don\'t have at 110-220 converter there).


Well I did what was suggested and reinstalled boic and put it on the c drive and results are fine now, strange but thanks for the help. I think it might have been the putting to sleep thing, ill stop doing that and see how things go.
ID: 48240 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Admin

Send message
Joined: 13 Apr 07
Posts: 42
Credit: 260,782
RAC: 0
Message 48260 - Posted: 1 Nov 2007, 23:37:43 UTC - in response to Message 48240.  

i put it to sleep while boinc is running and it starts up when i boot it back up. Is that a problem?


No idea. I\'ve never tried putting the computer to sleep and waking it back up. When I do need to take it to work (vary rarely) I just shut down BOINC and the entire computer because it ends up running on battery power all day (I can\'t plug it in at work because they don\'t have at 110-220 converter there).


Well I did what was suggested and reinstalled boic and put it on the c drive and results are fine now, strange but thanks for the help. I think it might have been the putting to sleep thing, ill stop doing that and see how things go.


You can forget that i just said because i check and two units went through fine but there are two that turned into validation errors i dont see anything in my error log, and i dont wanna keep on if they just turn into errors, im kinda stuck!

My results page:
http://boinc.bakerlab.org/rosetta/results.php?userid=164784
ID: 48260 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Admin

Send message
Joined: 13 Apr 07
Posts: 42
Credit: 260,782
RAC: 0
Message 48376 - Posted: 5 Nov 2007, 15:14:32 UTC - in response to Message 48260.  

i put it to sleep while boinc is running and it starts up when i boot it back up. Is that a problem?


No idea. I\'ve never tried putting the computer to sleep and waking it back up. When I do need to take it to work (vary rarely) I just shut down BOINC and the entire computer because it ends up running on battery power all day (I can\'t plug it in at work because they don\'t have at 110-220 converter there).


Well I did what was suggested and reinstalled boic and put it on the c drive and results are fine now, strange but thanks for the help. I think it might have been the putting to sleep thing, ill stop doing that and see how things go.


You can forget that i just said because i check and two units went through fine but there are two that turned into validation errors i dont see anything in my error log, and i dont wanna keep on if they just turn into errors, im kinda stuck!

My results page:
http://boinc.bakerlab.org/rosetta/results.php?userid=164784


I dont know whats going on, but this is constantly continuing so im going to pull out of Rosetta for the time being. Sorry guys.
ID: 48376 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator
Project administrator

Send message
Joined: 22 Aug 06
Posts: 3458
Credit: 0
RAC: 0
Message 48379 - Posted: 5 Nov 2007, 17:30:22 UTC
Last modified: 5 Nov 2007, 17:31:39 UTC

Admin, I\'ve never seen so many errors on Windows before.

Have you tried detaching and reattaching to Rosetta? This will bring down a fresh copy of the control files that drive the application, in case one may have become corrupted somehow.

Is there anything else about your environment that might not be typical of other people running Windows? I guess the main difference is you are running Windows Vista.
Rosetta Moderator: Mod.Sense
ID: 48379 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Admin

Send message
Joined: 13 Apr 07
Posts: 42
Credit: 260,782
RAC: 0
Message 48382 - Posted: 5 Nov 2007, 17:46:12 UTC - in response to Message 48379.  

Admin, I\'ve never seen so many errors on Windows before.

Have you tried detaching and reattaching to Rosetta? This will bring down a fresh copy of the control files that drive the application, in case one may have become corrupted somehow.

Is there anything else about your environment that might not be typical of other people running Windows? I guess the main difference is you are running Windows Vista.


Been there done that, i also tried moving boinc to the c drive, and if i put the computer to sleep i shutdown boinc before hand, and i make sure it has enough time to exit. I just dont know what to do, and i dont wanna keep sending you guys errors, since it doesnt help anyone. Nothing is showing in my error log so i have NO idea whats going on. ANY help would be appreciated at this point. But im thinking of detaching for now so i dont send any more errors.
ID: 48382 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator
Project administrator

Send message
Joined: 22 Aug 06
Posts: 3458
Credit: 0
RAC: 0
Message 48385 - Posted: 5 Nov 2007, 18:13:08 UTC

I certainly can see your point, and desire to exit until problems are resolved. There are some BOINC and potentially some Rosetta problems with Vista. Going forward, it will be important to have some numbers to see if any changes improve or correct Vista problems. Could I ask that instead of leaving, you just lower your resource share so that you do a few Rosetta tasks each week? That way the project will continue to have some data from Vista coming in to help assess future changes. You see, this is one of those cases where \"failure\" is still informative and helps work towards improvement and helps the project.
Rosetta Moderator: Mod.Sense
ID: 48385 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Admin

Send message
Joined: 13 Apr 07
Posts: 42
Credit: 260,782
RAC: 0
Message 48386 - Posted: 5 Nov 2007, 18:21:48 UTC - in response to Message 48385.  

I certainly can see your point, and desire to exit until problems are resolved. There are some BOINC and potentially some Rosetta problems with Vista. Going forward, it will be important to have some numbers to see if any changes improve or correct Vista problems. Could I ask that instead of leaving, you just lower your resource share so that you do a few Rosetta tasks each week? That way the project will continue to have some data from Vista coming in to help assess future changes. You see, this is one of those cases where \"failure\" is still informative and helps work towards improvement and helps the project.


If you want me to continue i sure will. Just for my curiosity what are you able to gain from the WU\'s such as mine? I feel it could potentially be an issue with shutting down or putting the computer to sleep, although i seem to exit boinc for some reason i think it might be corrupting the files. I dont know why but the only WUs that went through successfully is when the computer and been on and not shut off at all.
ID: 48386 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator
Project administrator

Send message
Joined: 22 Aug 06
Posts: 3458
Credit: 0
RAC: 0
Message 48398 - Posted: 5 Nov 2007, 21:23:11 UTC

I\'m just trying to say that if everyone with Vista had the same problem you are having, and they stopped running Rosetta, then it will be difficult to prove when you\'ve resolved the problems, because you have no Vista users left. So the knowledge that \"nope, that didn\'t correct the problem either\" is useful.

I haven\'t seen posts by many with Vista, and so have not reviewed many Vista hosts. But yours seems to be having exceptional problems. It might also be helpful if you attach to Ralph (where changes to Rosetta are tested).


Rosetta Moderator: Mod.Sense
ID: 48398 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 · 2 · 3

Message boards : Number crunching : Problems with Rosetta stable version 5.69 and beta version 5.77



©2018 University of Washington
http://www.bakerlab.org