Message boards : Number crunching : Problems with Rosetta version 5.43
Author | Message |
---|---|
FluffyChicken Joined: 1 Nov 05 Posts: 1260 Credit: 369,635 RAC: 0 |
Hi, your other projects have more than likely just taken their share of the queue; it will be fine in the long run. Team mauisun.org |
RebelRex Joined: 15 Dec 06 Posts: 3 Credit: 109,567 RAC: 0 |
I am running Linux with BOINC 5.4.9 and Rosetta 5.43 on a dual-CPU AMD 2600MP with 2 GB RAM. My graphics button is grayed out, so I have no graphics options, which I understand is normal for the Linux version. I do find my Rosetta work units "halting", or saying they are running when they are not (no CPU activity), whenever I make changes within my personal settings. Hitting UPDATE makes no difference. A reboot fixes it. I also find, with the "Switch between applications" setting in General Preferences, that when BOINC switches from one work unit to another, the work unit it left never gets worked on again until the computer reboots. When it resumes that work unit (before the reboot) it says "running", but again there is no CPU activity. I just stay away from changing my General Preferences and have set "Switch between applications" to 540 minutes to keep BOINC from switching between work units. It takes about 5-6 hours to complete a work unit, so 540 minutes is plenty. |
288VKYUjwsXfAaTXn6SFJC4LVPRf Joined: 16 Dec 05 Posts: 31 Credit: 153,110 RAC: 0 |
This workunit failed when I wanted to view the graphics. I don't know why this happened. I do this multiple times each day with different WUs and never had a problem in the last month. https://boinc.bakerlab.org/rosetta/result.php?resultid=52886714 |
Joachim Joined: 26 Nov 06 Posts: 5 Credit: 518,439 RAC: 84 |
"I am running Linux, with Boinc 5.4.9 and Rosetta 5.43." I see the same phenomenon. I'm running SUSE Linux 10.0 with 512 MB RAM on an AMD Athlon 2400+. When BOINC resumes the Rosetta task, Rosetta is shown as active, but top shows four threads that are doing nothing (no CPU consumption). If I stop BOINC and then restart it, Rosetta works until the next pause is required. On BOINC startup I also see another mistake with Rosetta: it ignores my settings for home: 2006-12-19 19:10:40 [---] Starting BOINC client version 5.4.11 for i686-pc-linux-gnu 2006-12-19 19:10:40 [---] libcurl/7.16.0 OpenSSL/0.9.8d zlib/1.2.3 2006-12-19 19:10:40 [---] Data directory: /home/cheffe/BOINC 2006-12-19 19:10:40 [---] Processor: 1 AuthenticAMD AMD Athlon(tm) XP 2400+ 2006-12-19 19:10:40 [---] Memory: 503.70 MB physical, 2.72 GB virtual 2006-12-19 19:10:40 [---] Disk: 107.32 GB total, 98.38 GB free 2006-12-19 19:10:40 [Einstein@Home] URL: http://einstein.phys.uwm.edu/; Computer ID: 210695; location: home; project prefs: home 2006-12-19 19:10:40 [SETI@home] URL: http://setiathome.berkeley.edu/; Computer ID: 1909898; location: home; project prefs: home 2006-12-19 19:10:40 [rosetta@home] URL: https://boinc.bakerlab.org/rosetta/; Computer ID: 362206; location: ; project prefs: default ^^ ^^ 2006-12-19 19:10:40 [---] General prefs: from SETI@home (last modified 2006-11-28 23:26:41) 2006-12-19 19:10:40 [---] General prefs: using separate prefs for home 2006-12-19 19:10:40 [---] Local control only allowed 2006-12-19 19:10:40 [---] Listening on port 31416 ^^ Here the "home" value is missing, as you can see by comparing with Einstein@Home and SETI@home. Joachim Dinos are not dead. They are alive and well and living in data centers all around you. They speak in tongues and work strange magics with computers. Beware the dino! |
Chu Joined: 23 Feb 06 Posts: 120 Credit: 112,439 RAC: 0 |
The graphics problem seems to happen randomly, and you just caught one. "This workunit failed when I wanted to view the graphics. I don't know why this happened. I am doing this action multiple times each day with different WU's. Never had a problem the last month." |
Chu Joined: 23 Feb 06 Posts: 120 Credit: 112,439 RAC: 0 |
For those of you who have reported R@H not restarting after being paused on the Linux client: did you see this problem only with the 5.43 application? "I am running Linux, with Boinc 5.4.9 and Rosetta 5.43." |
RebelRex Joined: 15 Dec 06 Posts: 3 Credit: 109,567 RAC: 0 |
I have only run 5.43... "For those of you who have reported the problem of R@H not starting after being paused on linux client, did you see this problem just for 5.43 application?" |
Jim Joined: 15 Oct 06 Posts: 22 Credit: 5,410,546 RAC: 0 |
Chu, no, this problem has been happening on my Linux machines since around the 13th or 15th of November. At first I figured it was just my machines. I run two different "flavors" of Linux, Linspire and Mandriva. It has happened on both OSes, but mainly on the Linspire machine. I have tried: (1) Turning OFF the "leave in memory" option when switching to another project. Didn't help. (2) Extending the processing time from the default 60 minutes to a longer period. Didn't help. (3) Rebooting the computer. That works. (4) Suspending and resuming the task. Didn't help. (5) Tonight I installed a newer version of Linux on machine 1. (Will watch and see what happens overnight.) Machine 1: 1.3 GHz AMD processor (Linspire Linux), 377 MB of RAM (has only finished about half a dozen Rosetta WUs in the last month). Machine 2: 475 MHz AMD K6 processor (Mandriva 2007 Linux), 256 MB of RAM (fails to process only once in a while, but when it does, it is after switching projects/tasks). As the other folks have noticed, the project says it is RUNNING, but the CPU time, To Completion time and Progress percentage never change. (The first time, it was "hung" for 5 days before I noticed it.) BOINC 5.4.9 with Rosetta 5.4.x onward seems to be a problem. Good luck, Jim |
Jim Joined: 15 Oct 06 Posts: 22 Credit: 5,410,546 RAC: 0 |
Chu, concerning the problem with Linux machines not processing after switching tasks/projects: I upgraded my Linspire Linux (Debian 3.0 [2.6.10]) to Freespire Linux (2.6.14). At this point the system has completed 3 WUs without any problems. I am also running SETI and WCG (Genome Comparison) on that machine. So it appears at this point (to me) that users who have this problem need to upgrade or change to a different Linux OS. I will post IF this problem reappears. Jim |
RebelRex Joined: 15 Dec 06 Posts: 3 Credit: 109,567 RAC: 0 |
Switching tasks/projects works fine. It's when BOINC switches back to the "unfinished" WU that mine hangs (it says it's running but it's not). I'm not sure it's a Rosetta issue rather than a BOINC one. Chu |
Jim Joined: 15 Oct 06 Posts: 22 Credit: 5,410,546 RAC: 0 |
RebelRex, it appears I wasn't clear in describing the problem. We are talking about the same problem with the "unfinished" WU and switching tasks/projects. Upgrading my Linux DID solve the "unfinished" WU problem for me. It's now been over 24 hours of crunching with no problems. |
EvilFluffy Joined: 4 Oct 06 Posts: 1 Credit: 207,711 RAC: 0 |
Chu I tend to agree. A lot of posts here seem to be related to Linux users, but I'm running Windows XP on an Intel Pentium 4 D, and since the release of 5.4.1 I've been seeing large batches of WU with unrecoverable errors coming through: 2006-12-13 03:18:40 [---] Rescheduling CPU: files downloaded 2006-12-13 03:18:40 [rosetta@home] Starting task FRA_t369_test_LARS_constraints_newfrags_barcode_enforced_hom003_2_S_00001_0005069_0.pdbIGNORE_THE_REST_1440_3733_0 using rosetta version 541 2006-12-13 03:19:08 [rosetta@home] Unrecoverable error for result FRA_t369_test_LARS_constraints_newfrags_barcode_enforced_hom003_2_S_00001_0005069_0.pdbIGNORE_THE_REST_1440_3733_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 2006-12-13 03:19:08 [---] Rescheduling CPU: application exited 2006-12-13 03:19:08 [rosetta@home] Computation for task FRA_t369_test_LARS_constraints_newfrags_barcode_enforced_hom003_2_S_00001_0005069_0.pdbIGNORE_THE_REST_1440_3733_0 finished 2006-12-13 03:20:43 [rosetta@home] Sending scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi 2006-12-13 03:20:43 [rosetta@home] Reason: To fetch work 2006-12-13 03:20:43 [rosetta@home] Requesting 8640 seconds of new work, and reporting 1 completed tasks 2006-12-13 03:20:48 [rosetta@home] Scheduler request succeeded 2006-12-13 03:20:50 [rosetta@home] Started download of file hom002_aat369_03_05.200_v1_3.gz 2006-12-13 03:20:50 [rosetta@home] Started download of file hom002_aat369_09_05.200_v1_3.gz 2006-12-13 03:22:05 [rosetta@home] Finished download of file hom002_aat369_03_05.200_v1_3.gz 2006-12-13 03:22:05 [rosetta@home] Throughput 25530 bytes/sec 2006-12-13 03:22:05 [rosetta@home] Started download of file hom002_t369_.fasta.gz 2006-12-13 03:22:08 [rosetta@home] Finished download of file hom002_t369_.fasta.gz 2006-12-13 03:22:08 [rosetta@home] Throughput 60 bytes/sec 2006-12-13 03:22:08 [rosetta@home] Started download of file hom002_t369.pdb.gz 2006-12-13 03:22:15 [rosetta@home] Finished download of file 
hom002_t369.pdb.gz 2006-12-13 03:22:15 [rosetta@home] Throughput 3904 bytes/sec 2006-12-13 03:22:15 [rosetta@home] Started download of file S_00001_0002876_0.pdb.gz 2006-12-13 03:22:23 [rosetta@home] Finished download of file S_00001_0002876_0.pdb.gz 2006-12-13 03:22:23 [rosetta@home] Throughput 5684 bytes/sec 2006-12-13 03:22:23 [rosetta@home] Started download of file t369_MultipleLoopRebuild_loopfile.gz 2006-12-13 03:22:26 [rosetta@home] Finished download of file t369_MultipleLoopRebuild_loopfile.gz 2006-12-13 03:22:26 [rosetta@home] Throughput 31 bytes/sec 2006-12-13 03:22:26 [rosetta@home] Started download of file t369_MultipleLoop_barcode.txt.gz 2006-12-13 03:22:29 [rosetta@home] Finished download of file t369_MultipleLoop_barcode.txt.gz 2006-12-13 03:22:29 [rosetta@home] Throughput 70 bytes/sec 2006-12-13 03:22:50 [rosetta@home] Finished download of file hom002_aat369_09_05.200_v1_3.gz 2006-12-13 03:22:50 [rosetta@home] Throughput 37448 bytes/sec 2006-12-13 03:22:51 [---] Rescheduling CPU: files downloaded 2006-12-13 03:22:51 [rosetta@home] Starting task FRA_t369_test_LARS_constraints_newfrags_barcode_enforced_hom002_1_S_00001_0002876_0.pdbIGNORE_THE_REST_1436_4821_0 using rosetta version 541 2006-12-13 03:23:19 [rosetta@home] Unrecoverable error for result FRA_t369_test_LARS_constraints_newfrags_barcode_enforced_hom002_1_S_00001_0002876_0.pdbIGNORE_THE_REST_1436_4821_0 (Incorrect function. 
(0x1) - exit code 1 (0x1)) 2006-12-13 03:23:19 [---] Rescheduling CPU: application exited 2006-12-13 03:23:19 [rosetta@home] Computation for task FRA_t369_test_LARS_constraints_newfrags_barcode_enforced_hom002_1_S_00001_0002876_0.pdbIGNORE_THE_REST_1436_4821_0 finished 2006-12-13 03:24:53 [rosetta@home] Sending scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi 2006-12-13 03:24:53 [rosetta@home] Reason: To fetch work 2006-12-13 03:24:53 [rosetta@home] Requesting 8640 seconds of new work, and reporting 1 completed tasks 2006-12-13 03:24:58 [rosetta@home] Scheduler request succeeded 2006-12-13 03:25:00 [rosetta@home] Started download of file S_00001_0000793_0.pdb.gz 2006-12-13 03:25:04 [rosetta@home] Finished download of file S_00001_0000793_0.pdb.gz 2006-12-13 03:25:04 [rosetta@home] Throughput 21433 bytes/sec 2006-12-13 03:25:05 [---] Rescheduling CPU: files downloaded 2006-12-13 03:25:05 [rosetta@home] Starting task FRA_t369_test_LARS_constraints_newfrags_barcode_enforced_hom002_1_S_00001_0000793_0.pdbIGNORE_THE_REST_1436_4826_0 using rosetta version 541 2006-12-13 03:25:33 [rosetta@home] Unrecoverable error for result FRA_t369_test_LARS_constraints_newfrags_barcode_enforced_hom002_1_S_00001_0000793_0.pdbIGNORE_THE_REST_1436_4826_0 (Incorrect function. 
(0x1) - exit code 1 (0x1)) 2006-12-13 03:25:33 [---] Rescheduling CPU: application exited 2006-12-13 03:25:33 [rosetta@home] Computation for task FRA_t369_test_LARS_constraints_newfrags_barcode_enforced_hom002_1_S_00001_0000793_0.pdbIGNORE_THE_REST_1436_4826_0 finished 2006-12-13 03:29:04 [rosetta@home] Sending scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi 2006-12-13 03:29:04 [rosetta@home] Reason: To fetch work 2006-12-13 03:29:04 [rosetta@home] Requesting 8640 seconds of new work, and reporting 1 completed tasks 2006-12-13 03:29:09 [rosetta@home] Scheduler request succeeded 2006-12-13 03:29:11 [rosetta@home] Started download of file 1klfP.disulf 2006-12-13 03:29:11 [rosetta@home] Started download of file aa1klfP09_05.200_v1_3.gz 2006-12-13 03:29:12 [rosetta@home] Finished download of file 1klfP.disulf 2006-12-13 03:29:12 [rosetta@home] Throughput 9 bytes/sec 2006-12-13 03:29:12 [rosetta@home] Started download of file 1klfP.pdb.gz 2006-12-13 03:29:16 [rosetta@home] Finished download of file 1klfP.pdb.gz 2006-12-13 03:29:16 [rosetta@home] Throughput 14019 bytes/sec 2006-12-13 03:29:16 [rosetta@home] Started download of file 1klfP.fasta 2006-12-13 03:29:18 [rosetta@home] Finished download of file 1klfP.fasta 2006-12-13 03:29:18 [rosetta@home] Throughput 111 bytes/sec 2006-12-13 03:29:18 [rosetta@home] Started download of file aa1klfP03_05.200_v1_3.gz 2006-12-13 03:30:39 [rosetta@home] Finished download of file aa1klfP03_05.200_v1_3.gz 2006-12-13 03:30:39 [rosetta@home] Throughput 25314 bytes/sec 2006-12-13 03:30:39 [rosetta@home] Started download of file 1klfP.loop 2006-12-13 03:30:42 [rosetta@home] Finished download of file 1klfP.loop 2006-12-13 03:30:42 [rosetta@home] Throughput 17 bytes/sec 2006-12-13 03:31:19 [rosetta@home] Finished download of file aa1klfP09_05.200_v1_3.gz 2006-12-13 03:31:19 [rosetta@home] Throughput 37853 bytes/sec 2006-12-13 03:31:20 [---] Rescheduling CPU: files downloaded 2006-12-13 03:31:20 [rosetta@home] Starting task 
BAK_1klf_FimH_loop_model_1442_1960_0 using rosetta version 541 2006-12-13 03:31:49 [rosetta@home] Unrecoverable error for result BAK_1klf_FimH_loop_model_1442_1960_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 2006-12-13 03:31:49 [---] Rescheduling CPU: application exited I'd get so many that I'd hit the daily quota without ever having done any work: 2006-12-13 06:57:33 [rosetta@home] Message from server: No work sent 2006-12-13 06:57:33 [rosetta@home] Message from server: (reached daily quota of 66 results) 5.4.3 appears to have resolved my problem (I've successfully returned 4 results in 4 days). I've generally had the screensaver on, but it's usually only activated when I have the monitor turned off anyway (i.e. when I'm not home). Thought I would contribute this and let you know it seems to have fixed some (if not all) of the problems in 5.4.1. |
Chu Joined: 23 Feb 06 Posts: 120 Credit: 112,439 RAC: 0 |
Hi, if you keep getting errors like "Incorrect function (0x1)", it is most likely that the rosetta_database files on your computer have somehow become corrupted. You then need to force retrieval of a fresh set of those files, perhaps by resetting the project (anyone, please correct me if that is not the best procedure!). |
Chu Joined: 23 Feb 06 Posts: 120 Credit: 112,439 RAC: 0 |
Thanks for the effort. Glad to know it works out for you. Chu |
hedera Joined: 15 Jul 06 Posts: 76 Credit: 5,263,150 RAC: 59 |
For some reason - maybe I did something funny with my preferences - I now only run 1 WU at a time. I used to have 2 cranking along at once, all the time. Did Rosetta change, or have I set something weird? Here are my preferences: Processor usage: Do work while computer is running on batteries? (matters only for portable computers): no; Do work while computer is in use?: yes; Do work only between the hours of: (no restriction); Leave applications in memory while suspended? (suspended applications will consume swap space if 'yes'): yes; Switch between applications every (recommended: 60 minutes): 60 minutes; On multiprocessors, use at most: 1 processors; Use at most: 100 percent of CPU time. Disk and memory usage: Use no more than: 100 GB disk space; Leave at least (Values smaller than 0.001 are ignored): 0.001 GB disk space free; Use no more than: 50% of total disk space; Write to disk at most every: 60 seconds; Use no more than: 75% of total virtual memory. Network usage: Connect to network about every (determines size of work cache; maximum 10 days): 0.1 days; Confirm before connecting to Internet? (matters only if you have a modem, ISDN or VPN connection): no; Disconnect when done? (matters only if you have a modem, ISDN or VPN connection): no; Maximum download rate: no limit; Maximum upload rate: no limit; Use network only between the hours of (Enforced by versions 4.46 and greater): (no restriction); Skip image file verification? (Check this ONLY if your Internet provider modifies image files (UMTS does this, for example). Skipping verification reduces the security of BOINC.): no. --hedera Never be afraid to try something new. Remember that amateurs built the ark. Professionals built the Titanic. |
genes Joined: 8 Oct 05 Posts: 60 Credit: 704,566 RAC: 48 |
"On multiprocessors, use at most 1 processors" This line would appear to be the culprit. BOINC will run one WU per processor on a multiprocessor/hyperthreaded machine, up to the limit that you set in this preference. Since you set it to one, you will only run one WU at a time from any project. |
hedera Joined: 15 Jul 06 Posts: 76 Credit: 5,263,150 RAC: 59 |
What is the default for the number of processors? I think it was set to zero before I messed with it. Does zero mean "however many you can"? --hedera Never be afraid to try something new. Remember that amateurs built the ark. Professionals built the Titanic. |
genes Joined: 8 Oct 05 Posts: 60 Credit: 704,566 RAC: 48 |
I don't know what the default is. Perhaps it is the number of processors you have (the BOINC client tests your machine when you install it). I think you would have to create a new user account and then look at the setting to really know what the default is. Once you've changed it, however, your preference will be used for all projects you attach to, assuming that you use the same email/password for each. I tried setting it to zero for my "school" preferences, down from 2, but it went to 1 instead. So I guess zero is not a valid choice. If you have several machines with differing numbers of processors, it doesn't hurt to specify the largest number of processors you might want to use. This number is an "at most", not an "at least", so if you only have 2 processors, setting it to 4 won't cause a problem. |
hedera Joined: 15 Jul 06 Posts: 76 Credit: 5,263,150 RAC: 59 |
Thanks, genes; I've reset it to 2 CPUs. It's still only running 1 WU, but it hasn't requested work in a while; it just stopped and ran benchmarks, then went back to the single task it was working on. Maybe in a while I'll raise it to 4, if it doesn't get another task at the next request for work. --hedera Never be afraid to try something new. Remember that amateurs built the ark. Professionals built the Titanic. |
genes Joined: 8 Oct 05 Posts: 60 Credit: 704,566 RAC: 48 |
"Thanks, genes; I've reset to 2 CPU, it's still only running 1 WU but it hasn't done a request for work in a while, it just stopped and ran benchmarks, then went back to the single task it was working. Maybe in a while I'll raise it to 4, if it doesn't get another task at the next request for work." You can click the "Projects" tab in the manager, select Rosetta (although it looks like that's all you're running), and click the "Update" command. BOINC will then contact the Rosetta website and get your updated info. (It doesn't know you've changed the preferences until you "update".) It will (should) then get more work. |
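
For anyone following the processor-limit discussion above, here is a minimal Python sketch of the "at most" behaviour genes describes. It is only an illustration under stated assumptions, not BOINC's actual code: the function name `tasks_to_run` and its parameters are hypothetical, and the clamp to 1 simply mirrors genes's observation that a value of zero was stored as 1.

```python
def tasks_to_run(detected_cpus: int, max_cpus_pref: int) -> int:
    """Hypothetical helper: how many work units would run at once.

    detected_cpus -- processors the client found on this machine
    max_cpus_pref -- the "On multiprocessors, use at most N processors" preference
    """
    # Assumption from the thread: a preference of 0 ended up as 1,
    # so treat anything below 1 as 1.
    cap = max(1, max_cpus_pref)
    # The preference is an upper bound ("at most"), never a minimum,
    # so the result can not exceed the processors actually present.
    return min(detected_cpus, cap)


if __name__ == "__main__":
    print(tasks_to_run(2, 1))  # hedera's original setting on a 2-processor machine
    print(tasks_to_run(2, 4))  # a larger preference than the machine has
```

Under these assumptions, a preference of 1 holds a 2-processor machine to a single WU, while setting it to 4 is harmless because the machine's own processor count becomes the limit.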
©2024 University of Washington
https://www.bakerlab.org