Message boards : Number crunching : Minirosetta 3.73-3.78
Previous · 1 . . . 7 · 8 · 9 · 10 · 11 · 12 · 13 . . . 14 · Next
Author | Message |
---|---|
Sid Celery Send message Joined: 11 Feb 08 Posts: 2121 Credit: 41,179,074 RAC: 11,480 |
Hi Sid, That would kind of explain why the task runs for a reasonable while before crashing out, and I've seen occasional tasks using 1.2Gb, but I'm running with just short of 10Gb free of 16Gb total. I set Boinc to run 60% of memory when the computer is in use (90% when not in use). Do people routinely allocate more than that? What can I safely adjust that setting to, or is it just trial and error? |
robertmiles Send message Joined: 16 Jun 08 Posts: 1232 Credit: 14,269,631 RAC: 1,429 |
Hi Sid, I've found that 64-bit Windows Vista is rather inefficient at handling memory for running 32-bit applications, so I set that computer to use 30% to 40% of the memory for BOINC out of 8 GB. 64-bit Windows 7 and Windows 10 are more efficient, so I set that computer to use 70% out of 16 GB. 64-bit BOINC is not very good at giving up memory when the computer is in use, so these settings are the same for when the computer is in use as when not in use. |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2121 Credit: 41,179,074 RAC: 11,480 |
Hi Sid, Useful, thanks. I'll tweak my Min 60% Max 90% to 65% & 85% on both my Win7 machines and see how it goes for now |
Andy_Taximan Send message Joined: 20 Jan 14 Posts: 1 Credit: 736,798 RAC: 0 |
Not much of a problem but 3 hours to download minirosetta_database_d0bf94b.zip really is a pain ! lol and no its not my internet speed |
David Fickes Send message Joined: 12 Jul 15 Posts: 1 Credit: 1,113,855 RAC: 0 |
Just been having communications problem with the rosetta@home servers since moving to El Capitan. I had to update the BOINC software but other projects are still running the log follows:: Sat Jun 18 22:38:32 2016 | rosetta@home | Requesting new tasks for CPU and AMD/ATI GPU and Intel GPU Sat Jun 18 22:39:03 2016 | | Project communication failed: attempting access to reference site Sat Jun 18 22:39:03 2016 | rosetta@home | Scheduler request failed: Server returned nothing (no headers, no data) Sat Jun 18 22:39:04 2016 | | Internet access OK - project servers may be temporarily down. Sat Jun 18 22:40:08 2016 | World Community Grid | Sending scheduler request: To fetch work. Sat Jun 18 22:40:08 2016 | World Community Grid | Requesting new tasks for CPU and Intel GPU Sat Jun 18 22:40:10 2016 | World Community Grid | Scheduler request completed: got 2 new tasks Sat Jun 18 22:40:12 2016 | World Community Grid | Started download of fahb.FAH2_avx40811-ls_000076-in1.dms Sat Jun 18 22:40:12 2016 | World Community Grid | Started download of fahb.FAH2_avx40811-ls_000076-in2.dms |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2121 Credit: 41,179,074 RAC: 11,480 |
Not sure what's happening with this task atm 000096_C5_0052_0004_fragments_relax_SAVE_ALL_OUT_402757_2_1 CPU time at last checkpoint 07:09:30 CPU time 07:26:30 Elapsed time 07:59:48 62.135% complete (of 8 hour runtime - lagging behind what it should be) Only at Model 1 Step 10 Getting full CPU time according to Task Manager - heading for the watchdog at that rate It looks very complicated when I show graphics. Is all well with it? |
anarchic teapot Send message Joined: 25 Mar 06 Posts: 2 Credit: 509,115 RAC: 262 |
Rosetta Mini 3.73 is running well past the time it's supposed to take on my computer. One task has been running for over 2 days, is shown as being less than 50% done, but the remaining estimated time is blank. From my logs, I see I've already had trouble with a different Rosetta module this morning: it ended with an error message 05/08/2016 11:12:31 | rosetta@home | Aborting task fEbH1149_fold_SAVE_ALL_OUT_402410_390_0; not started and deadline has passed There's also this on my account: 851723397 769539264 22 Jul 2016 9:12:30 UTC 5 Aug 2016 9:12:30 UTC Over No reply New 0.00 --- --- 851723358 769539231 22 Jul 2016 9:12:30 UTC 5 Aug 2016 9:13:03 UTC Over Client error Aborted by user 0.00 0.00 --- 851723337 769539210 22 Jul 2016 9:12:30 UTC 5 Aug 2016 9:13:03 UTC Over Client error Aborted by user 0.00 0.00 --- 851723269 769539146 22 Jul 2016 9:12:30 UTC 5 Aug 2016 9:12:30 UTC Over No reply New 0.00 --- --- 851718775 769535427 22 Jul 2016 8:58:34 UTC 5 Aug 2016 8:58:34 UTC Over No reply New 0.00 --- --- No, I haven't (yet) aborted any tasks, so I don't know why that message appears. It does look as if Mini 3.73 tasks are overrunning to the extent of being rejected by the server. I'm going to terminate all 8 Mini 3.73 tasks currently in my queue & turn Rosetta off for a bit, to give the devs time to fix the problem. |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,551,716 RAC: 6,403 |
|
nanoprobe Send message Joined: 5 Apr 09 Posts: 8 Credit: 381,804 RAC: 0 |
I installed Android 5.1.1 on a Pine64 device and attached to Rosetta. I received 1 task which completed and validated. I'm not receiving any more tasks and the event logs says "Minirosetta is not available for your type of computer" every time I try to update. What's up with that? |
nanoprobe Send message Joined: 5 Apr 09 Posts: 8 Credit: 381,804 RAC: 0 |
Looking again there was an upload error. <message> upload failure: <file_xfer_error> <file_name>db_pred12_7mer_android_7res_t1c.2.86_0001_SAVE_ALL_OUT_344206_6803_3_0</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,551,716 RAC: 6,403 |
880278687 Unhandled Exception Detected... |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,551,716 RAC: 6,403 |
|
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,551,716 RAC: 6,403 |
|
Jesse Viviano Send message Joined: 14 Jan 10 Posts: 42 Credit: 2,700,472 RAC: 0 |
|
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,551,716 RAC: 6,403 |
Task 920456670 after 10 minutes: 1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION |
FurryGuy Send message Joined: 16 May 11 Posts: 2 Credit: 3,684,958 RAC: 0 |
Mini Rosetta 3.73 uploads are being rejected by the upload server: 9/5/2017 4:02:40 PM | Rosetta@home | Started upload of rb_09_04_77181_119995__t000__ab_robetta_IGNORE_THE_REST_514917_880_0_r297470578_0 |
kcirza Send message Joined: 30 May 10 Posts: 3 Credit: 12,413,356 RAC: 37 |
Ditto here, and most all day today, on all machines running the project. |
robertmiles Send message Joined: 16 Jun 08 Posts: 1232 Credit: 14,269,631 RAC: 1,429 |
I've just had four 3,78 compute errors. https://boinc.bakerlab.org/result.php?resultid=947115613 https://boinc.bakerlab.org/result.php?resultid=947116461 https://boinc.bakerlab.org/result.php?resultid=947115957 https://boinc.bakerlab.org/result.php?resultid=947119040 All look likely to be due to a missing input file with a name ending in .9mers . |
bcavnaugh Send message Joined: 7 Dec 13 Posts: 7 Credit: 2,389,640 RAC: 0 |
Seems all the RDKIBLER-2layer_2+1 I got today failed under Windows OS Intel(R) Xeon(R) CPU E5-2680 v4 @ 2.40GHz [Family 6 Model 79 Stepping 1] Microsoft Windows Server 2012 R2 Standard x64 Edition, (06.03.9600.00) https://boinc.bakerlab.org/results.php?hostid=3112116&offset=0&show_names=0&state=6&appid https://boinc.bakerlab.org/workunit.php?wuid=854588205 https://boinc.bakerlab.org/show_host_detail.php?hostid=3112116 Crunching@EVGA The Number One Team in the BOINC Community. Folding@EVGA The Number One Team in the Folding@Home Community. |
Mad_Max Send message Joined: 31 Dec 09 Posts: 209 Credit: 25,881,143 RAC: 10,303 |
Sometimes minirosetta lose calculated result somehow at task restarts (eg. computer or boinc reboot or just switch to another project if few are running on same CPU). I am not talking about checkpoints in the middle of model calculation but of entire models which was already successfully calculated but did not reported to the server. Here example: https://boinc.bakerlab.org/result.php?resultid=948310700 ====================================================== But after task restart (NOT crash/hang, just normal correct restart when taks unloaded from memory and loaded back from disk later ) only one last model (decoy) reported to server. ====================================================== All previous 133 calculated decoys lost. This is not happens often. But i see such task from time to time (may be 1-2 per week). Best way to track(search) for such task is to query databese for VALID task but with abnormal low credit compared to used CPU time - because credit calculated in proportion to number of decoys reported. And if many decoys were lost - granted CR will be abnormal low. |
Message boards :
Number crunching :
Minirosetta 3.73-3.78
©2024 University of Washington
https://www.bakerlab.org