Message boards : Number crunching : Discussion on increasing the default run time
Previous · 1 . . . 6 · 7 · 8 · 9 · 10 · Next
Author | Message |
---|---|
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1683 Credit: 17,915,997 RAC: 22,688 |
Do you think i should upgrade RAM to 32 because of higher usage? have two Ryzen 5 3600 12 Core with 16GB each. have always been enough so far.In the past (all of 3 weeks), the most RAM i have seen a single Task require has been 1.3GB. If you were to get nothing but those tasks then that system would be short of RAM for the system to function and all Tasks to be processed. There are plans to release some Tasks that may require as much as 4GB of RAM. So more RAM would allow you to run more Tasks -even those requiring huge amounts of RAM- at a given time. Otherwise cores will sit unused until there is enough RAM to start processing other Tasks again. If you can afford it, it certainly won't go to waste. Grant Darwin NT |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1683 Credit: 17,915,997 RAC: 22,688 |
Presently processing rb_04_16_21806_21365_ab_t000__robetta_cstwt_5.0_FT_IGNORE_THE_REST_03_09_918009_94_1 So far, 10hrs 11min Runtime, 10hrs 04min 36sec CPU Time and still no checkpoint. Grant Darwin NT |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1683 Credit: 17,915,997 RAC: 22,688 |
Presently processing rb_04_16_21806_21365_ab_t000__robetta_cstwt_5.0_FT_IGNORE_THE_REST_03_09_918009_94_1Finished. 12hr 13min 42 Sec Runtime. No checkpoints made. Grant Darwin NT |
xii5ku Send message Joined: 29 Nov 16 Posts: 22 Credit: 13,815,783 RAC: 105 |
On April 16 Mod.Sense wrote:
The abnormal results came from the Windows x86-32 application version. Could this have the same or similar bug as the Linux x86-32 v4.12 - v4.15 application (on x86-64 hosts at least)? On April 19 Grant (SSSF) wrote: Presently processing rb_04_16_21806_21365_ab_t000__robetta_cstwt_5.0_FT_IGNORE_THE_REST_03_09_918009_94_1Finished. This task on the other hand was run by the x86-64 application version; i.e. this one was not related to the specific problem of the Linux i686 application builds. |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2125 Credit: 41,244,751 RAC: 9,615 |
i think that might have caused my "problems". 2 days will definitely help you. 6 hours won't help or hinder you. 1.5 days and default 8hrs is more productive as the project has indicated a 2 day total turnaround of tasks is ideal (within the 3-day deadlines) On RAM, the way things are going, it may become essential before long. Start saving up |
Clive Send message Joined: 15 Jun 19 Posts: 4 Credit: 844,412 RAC: 0 |
Hi all: When I learned that I could assist in understanding COVID-19, I decided to pitch in. The hardware I bring with me is: 1. CPU - i7-8700K water cooled 2. GPU - NVIDIA Geoforce 1070 3. RAM - 16 GB 4. Fully patched Win 10 64 bit When I initially d/l the workunits, the estimated completion time was 6 hours, 40 minutes. Quite reasonable I thought. Today I have some workunits hitting 30 hours before they are forecast to be completed. Why is BIONIC so far off in the estimated completed times? Is it in my settings? Clive Hunt British Columbia, Canada [/list][/list] |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1683 Credit: 17,915,997 RAC: 22,688 |
3. RAM - 16 GBWon't be enough RAM if you are using all cores & threads. At present a Task can take up to 1.5GB of RAM, which would use more than the system has if you had nothing but them (extremely unlikely, but still...), and if you have allowed BOINC to use 100% of it. And there are set to be Tasks that may use up to 4GB of RAM coming out in the near future. I'd either take this as an excuse to upgrade to 32GB of RAM, or limit the number of cores/threads in use to 8 or less. That way your system can cope with several 1.3GB Tasks at a time, or one 4GB Task, and lots of other more regular RAM requirement Tasks (250MB to 850MB). I'd also allow BOINC to use more of the available RAM. In your Account page, Computing preferences. Memory When computer is in use, use at most 95 % When computer is not in use, use at most 95 % When I initially d/l the workunits, the estimated completion time was 6 hours, 40 minutes. Quite reasonable I thought. Today I have some workunits hitting 30 hours before they are forecast to be completed.All projects have the issue of Estimated completion time being out for a new Application or Task, as there is no history for the time it takes a system to process it. In the case of Rosetta, the run times of a Task are set in your preferences, but it still takes the Estimated completion times a while to settle down. And where in some projects the initial Estimated completion times are greater than the actual time, with Rosetta they tend to be less. And while the Target CPU time is set (the default is 8 hours), some Tasks do require more processing than others to give usable results. So there is a 10 hour extension for Tasks that don't end by their Target CPU Runtime after which the Tasks will be ended if it doesn't finish sooner. Given the short deadlines of most Rosetta Tasks, and the variability in processing time (even though there is a set Target time), a small cache is best. If you run more than one project, a very, very small cache is best. For Rosetta only, Other Store at least 1 days of work Store up to an additional 0.02 days of workis working pretty well for me. No missed deadlines yet, even with a few longer (way, way longer) than Target CPU Runtime tasks. Grant Darwin NT |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
Clive, welcome aboard! Can you link to the WUs that are taking so long? Also, be sure to look at the properties of the task and see the CPU time there, rather than going by elapsed time shown in BOINC Manager. Rosetta Moderator: Mod.Sense |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2125 Credit: 41,244,751 RAC: 9,615 |
When I learned that I could assist in understanding COVID-19, I decided to pitch in. The hardware I bring with me is: It is in your settings. Looking at this task of yours, go to Options/Computing Preferences and untick all the options under the Computing tab in the section "When to suspend" However you have them set at the moment is causing this: Run time 1 days 4 hours 16 min 30 sec CPU time 7 hours 51 min 19 sec By all means make the adjustments Grant has suggested too, but it actually looks like everything else you have set isn't causing a problem imo It does look like Boinc downloaded too many tasks for you on the assumption you'd completed them more quickly, but they'll be cancelled automatically if they pass deadline and the more tasks you complete before then, the more appropriate number will be called for next time. No need to intervene |
Clive Send message Joined: 15 Jun 19 Posts: 4 Credit: 844,412 RAC: 0 |
Thank you Sid, I have made yours and Grant's recommended changes to my BONIC settings. Hopefully these changes will speed things up. Clive Hunt British Columbia Canada |
Clive Send message Joined: 15 Jun 19 Posts: 4 Credit: 844,412 RAC: 0 |
Thank you Grant, I have made yours and Sid's recommended changes to my BONIC settings. Hopefully these changes will speed things up. Clive Hunt British Columbia Canada |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1683 Credit: 17,915,997 RAC: 22,688 |
Looking at this task of yours, go to Options/Computing Preferences and untick all the options under the Computing tab in the section "When to suspend"In addition to those changes, Suspend when non-BOINC CPU usage is above --- %is best left blank. Rosetta runs at Idle priority level, so pretty much everything else will run before Rosetta does anyway. No need to specifically suspend Rosetta. For a lightly used system, the difference between Run time & CPU time should only be 4min or so for a 8hr CPU time Task. A heavily used system will have a bigger discrepancy. A dedicated cruncher, less than 30sec. Grant Darwin NT |
MeeeK Send message Joined: 7 Feb 16 Posts: 31 Credit: 19,737,304 RAC: 0 |
https://boinc.bakerlab.org/rosetta/result.php?resultid=1156119366 I found a lot of these in my list. All with same error. And only at this client with upgraded RAM two days ago. The other client dont have this problems. |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1683 Credit: 17,915,997 RAC: 22,688 |
https://boinc.bakerlab.org/rosetta/result.php?resultid=1156119366If you can, swap the RAM between systems, and see if the errors change systems as well. Grant Darwin NT |
MeeeK Send message Joined: 7 Feb 16 Posts: 31 Credit: 19,737,304 RAC: 0 |
Thats not possible. This system have 4 slots the other one just 2. But the "new additional" ram in this "faulty" system have been in the secound system before. They are both 3200 kits with same timings. Both kits worked fine. I will check ram settings in bios. But there aren't any issues for sure. |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1683 Credit: 17,915,997 RAC: 22,688 |
Thats not possible.? So the system with 4 slots only gets 2 modules, the system with 2 slots can only have 2 modules. But the "new additional" ram in this "faulty" system have been in the secound system before. They are both 3200 kits with same timings. Both kits worked fine.If it is the only change you made, then it's very likely they are the cause of you problems & they don't work fine now. Are they the same brand & size as well as speed & timings? And even so, what works in one system may not work in another. Grant Darwin NT |
hnapel Send message Joined: 8 Apr 20 Posts: 8 Credit: 835,346 RAC: 0 |
Great I'm replying to a 12 year old post! But I don't see my question answered anywhere: If I increase the runtime from the (now) default 8 hours to for example 12 to help in reducing the load on the servers will the larger runtime also have an increased requirement on RAM usage? Will the workunits received be actually different or does it just try more science on the same units? |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
No, a 12hr runtime preference just runs along with the same memory footprint for a longer period of time. The workunit is the same, your system just computes more models from it until it gets near the runtime preference. Rosetta Moderator: Mod.Sense |
KWSN Ekky Ekky Ekky Send message Joined: 3 Apr 20 Posts: 9 Credit: 5,062,511 RAC: 0 |
The runtime limitations make things more than silly, to my very limited mind. Running Seti@home presented no such problems: if it took longer than expected, it was fine, but such short termism is driving me nuts. I may well join the great exodus if things are not managed better for us humble crunchers. |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1683 Credit: 17,915,997 RAC: 22,688 |
The runtime limitations make things more than silly, to my very limited mind. Running Seti@home presented no such problems: if it took longer than expected, it was fine, but such short termism is driving me nuts. I may well join the great exodus if things are not managed better for us humble crunchers.Looking at you results the problem isn't the Target CPU time, but a large cache setting and a system that is busy doing things other than processing BOINC work. Hence the huge difference between CPU time & Runtime on your system- your main system is taking 24 hours to process a Task with an 8 hour Target CPU time. For a 8 hour Target CPU time the difference between CPU time & Runtime on a lightly used system is about 4 minutes. Set you cache to Other Store at least 0.5 days of work Store up to an additional 0.02 days of workand that should stop you from missing deadlines. If you choose to determine what is using your system so heavily, that will also help get more work done by not taking 3 times longer to do it than than actual Target time. Recent changes to how work is sent out will also help stop systems from getting more work than they can handle when new applications are released in the future, or new systems are added to do Rosetta work. Grant Darwin NT |
Message boards :
Number crunching :
Discussion on increasing the default run time
©2024 University of Washington
https://www.bakerlab.org