Message boards : Number crunching : Problems with Rosetta stable version 5.69 and beta version 5.77
Previous · 1 · 2 · 3
Author | Message |
---|---|
ziegenmelker Send message Joined: 26 Jul 06 Posts: 10 Credit: 26,061 RAC: 0 |
This host really tries out hard to get a valid WU. :-( Btw. when I shut down the machine yesterday, there were afair 9(!) instances of Rosetta@home in memory, each using 79MB of RAM. At that time one WU was aktive, another one was at some % and waiting to run again. <core_client_version>5.10.8</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63) </message> <stderr_txt> Graphics are disabled due to configuration... # cpu_run_time_pref: 14400 # random seed: 2692519 pure virtual method called SIGSEGV: segmentation violation Stack trace (9 frames): [0x8cdfe17] [0x8cdac0c] [0xffffe500] [0x8d65433] [0x8d4b794] [0x8cdc897] [0x8cddeb5] [0x8cd6ea5] [0x8d777fa] Exiting... terminate called without an active exception SIGABRT: abort called Stack trace (19 frames): [0x8cdfe17] [0x8cdac0c] [0xffffe500] [0x8d4b224] [0x8d38b0e] [0x8d35e9d] [0x8d35ed2] [0x8d355b5] [0x8be23b3] [0x8bea61d] [0x8b50074] [0x8c31c58] [0x849a8a1] [0x80dad6d] [0x85c5a97] [0x86eda4f] [0x86edafa] [0x8d44164] [0x8048111] Exiting... Graphics are disabled due to configuration... # cpu_run_time_pref: 14400 SIGSEGV: segmentation violation Stack trace (13 frames): [0x8cdfe17] [0x8cdac0c] [0xffffe500] [0x8c4a1db] [0x8b51266] [0x8c31c58] [0x849a87c] [0x80dad6d] [0x85c5a97] [0x86eda4f] [0x86edafa] [0x8d44164] [0x8048111] Exiting... SIGSEGV: segmentation violation </stderr_txt> ]]> Right now two WUs are waiting to run: Rosetta Beta 5.80: 1ubi__BOINC_ABRELAX_SHORTREL... 85,247 % Rosetta 5.69: CNTRL_01ABRELAX_SAVE_ALL_OU... 9,768 Nothing related to Rosetta in memory. If this is going to change, I will report here. cu, Michael edit: spelling |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
Michael, BOINC only runs one task per CPU. But it can get more tasks started if it starts to use more memory then your preference. These tasks are preempted and go to a "waiting for memory" state. On Windows at least, these may still look like they consume memory, but it is only in the swap file while they await BOINC to move them back to a state of "running". So you may want to review your General Preferences for how much memory BOINC is allowed to use. Please add your comments to the Linux thread as you study it further. Rosetta Moderator: Mod.Sense |
ziegenmelker Send message Joined: 26 Jul 06 Posts: 10 Credit: 26,061 RAC: 0 |
Please add your comments to the Linux thread as you study it further. Thanks, I'll do so. cu, Michal |
dcdc Send message Joined: 3 Nov 05 Posts: 1832 Credit: 119,860,059 RAC: 7,494 |
i've just had a stalled task: 10/10/2007 22:41:00|rosetta@home|Restarting task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1cc8A-_filters_1782_552194_0 using rosetta version 569 did net stop/start boinc and it's continued running again. |
Luuklag Send message Joined: 13 Sep 07 Posts: 262 Credit: 4,171 RAC: 0 |
also could some1 have a look at my topic abbrelax WU's, which is spelled wrong xD 3 failed WU's in a row |
Admin Send message Joined: 13 Apr 07 Posts: 42 Credit: 260,782 RAC: 0 |
Somethings wrong but i dont know what. Im continually getting validation errors on my results and not one has gone through successfully. You can check my results at: https://boinc.bakerlab.org/rosetta/results.php?userid=164784 Can anyone help me? I detached as i thought that might be an issue, but to no use. Ive suspended until i can figure this out. |
KSMarksPsych Send message Joined: 15 Oct 05 Posts: 199 Credit: 22,337 RAC: 0 |
Somethings wrong but i dont know what. Im continually getting validation errors on my results and not one has gone through successfully. You can check my results at: You're running Vista. There's two things to check. Is BOINC installed to the default directory (c:program filesboinc)? That generally causes trouble. If so, uninstall BOINC and reinstall to somewhere outside c:program files. I use c:boinc and it's fine. Do you exit out of BOINC before shutting down the computer? BOINC usually doesn't have enough time to do its housekeeping with Vista's super speedy shutdown. There's a registry hack here or you can just shut down BOINC manually (file -> exit out of the manager or stop the service). Kathryn :o) The BOINC FAQ Service The Unofficial BOINC Wiki The Trac System More BOINC information than you can shake a stick of RAM at. |
Admin Send message Joined: 13 Apr 07 Posts: 42 Credit: 260,782 RAC: 0 |
Somethings wrong but i dont know what. Im continually getting validation errors on my results and not one has gone through successfully. You can check my results at: Well ive always had it in the default directory before and its never been an issue, but ill uninstall it to a different directory, as for shutting down I usually shut down boinc before down but sometimes theres an occasion where i put it to sleep while boinc is running and it starts up when i boot it back up. Is that a problem? |
KSMarksPsych Send message Joined: 15 Oct 05 Posts: 199 Credit: 22,337 RAC: 0 |
i put it to sleep while boinc is running and it starts up when i boot it back up. Is that a problem? No idea. I've never tried putting the computer to sleep and waking it back up. When I do need to take it to work (vary rarely) I just shut down BOINC and the entire computer because it ends up running on battery power all day (I can't plug it in at work because they don't have at 110-220 converter there). Kathryn :o) The BOINC FAQ Service The Unofficial BOINC Wiki The Trac System More BOINC information than you can shake a stick of RAM at. |
Admin Send message Joined: 13 Apr 07 Posts: 42 Credit: 260,782 RAC: 0 |
i put it to sleep while boinc is running and it starts up when i boot it back up. Is that a problem? Well I did what was suggested and reinstalled boic and put it on the c drive and results are fine now, strange but thanks for the help. I think it might have been the putting to sleep thing, ill stop doing that and see how things go. |
Admin Send message Joined: 13 Apr 07 Posts: 42 Credit: 260,782 RAC: 0 |
i put it to sleep while boinc is running and it starts up when i boot it back up. Is that a problem? You can forget that i just said because i check and two units went through fine but there are two that turned into validation errors i dont see anything in my error log, and i dont wanna keep on if they just turn into errors, im kinda stuck! My results page: https://boinc.bakerlab.org/rosetta/results.php?userid=164784 |
Admin Send message Joined: 13 Apr 07 Posts: 42 Credit: 260,782 RAC: 0 |
i put it to sleep while boinc is running and it starts up when i boot it back up. Is that a problem? I dont know whats going on, but this is constantly continuing so im going to pull out of Rosetta for the time being. Sorry guys. |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
Admin, I've never seen so many errors on Windows before. Have you tried detaching and reattaching to Rosetta? This will bring down a fresh copy of the control files that drive the application, in case one may have become corrupted somehow. Is there anything else about your environment that might not be typical of other people running Windows? I guess the main difference is you are running Windows Vista. Rosetta Moderator: Mod.Sense |
Admin Send message Joined: 13 Apr 07 Posts: 42 Credit: 260,782 RAC: 0 |
Admin, I've never seen so many errors on Windows before. Been there done that, i also tried moving boinc to the c drive, and if i put the computer to sleep i shutdown boinc before hand, and i make sure it has enough time to exit. I just dont know what to do, and i dont wanna keep sending you guys errors, since it doesnt help anyone. Nothing is showing in my error log so i have NO idea whats going on. ANY help would be appreciated at this point. But im thinking of detaching for now so i dont send any more errors. |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
I certainly can see your point, and desire to exit until problems are resolved. There are some BOINC and potentially some Rosetta problems with Vista. Going forward, it will be important to have some numbers to see if any changes improve or correct Vista problems. Could I ask that instead of leaving, you just lower your resource share so that you do a few Rosetta tasks each week? That way the project will continue to have some data from Vista coming in to help assess future changes. You see, this is one of those cases where "failure" is still informative and helps work towards improvement and helps the project. Rosetta Moderator: Mod.Sense |
Admin Send message Joined: 13 Apr 07 Posts: 42 Credit: 260,782 RAC: 0 |
I certainly can see your point, and desire to exit until problems are resolved. There are some BOINC and potentially some Rosetta problems with Vista. Going forward, it will be important to have some numbers to see if any changes improve or correct Vista problems. Could I ask that instead of leaving, you just lower your resource share so that you do a few Rosetta tasks each week? That way the project will continue to have some data from Vista coming in to help assess future changes. You see, this is one of those cases where "failure" is still informative and helps work towards improvement and helps the project. If you want me to continue i sure will. Just for my curiosity what are you able to gain from the WU's such as mine? I feel it could potentially be an issue with shutting down or putting the computer to sleep, although i seem to exit boinc for some reason i think it might be corrupting the files. I dont know why but the only WUs that went through successfully is when the computer and been on and not shut off at all. |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
I'm just trying to say that if everyone with Vista had the same problem you are having, and they stopped running Rosetta, then it will be difficult to prove when you've resolved the problems, because you have no Vista users left. So the knowledge that "nope, that didn't correct the problem either" is useful. I haven't seen posts by many with Vista, and so have not reviewed many Vista hosts. But yours seems to be having exceptional problems. It might also be helpful if you attach to Ralph (where changes to Rosetta are tested). Rosetta Moderator: Mod.Sense |
Message boards :
Number crunching :
Problems with Rosetta stable version 5.69 and beta version 5.77
©2024 University of Washington
https://www.bakerlab.org