Rosetta stops the use of my 2nd GPU

Message boards : Number crunching : Rosetta stops the use of my 2nd GPU


thimios

Joined: 23 Feb 20
Posts: 2
Credit: 10,685,941
RAC: 134
Message 106122 - Posted: 29 Apr 2022, 7:56:01 UTC

Hi all,

I have 2 GPUs in my PC: a 3070, which also drives the display, and a 3080.
When Rosetta is not running, all is fine, but whenever Rosetta runs, only the 3080 is used
by other GPU BOINC projects.

Any ideas what causes this and how to fix it?


Thanks!
Profile Grant (SSSF)

Joined: 28 Mar 20
Posts: 1675
Credit: 17,712,196
RAC: 20,695
Message 106123 - Posted: 29 Apr 2022, 8:43:08 UTC

With your systems hidden, only a Wild Arse Guess is possible.

Most likely you need to use app_config files for each of your GPU projects to reserve a CPU core/thread for each running GPU application, or just limit the number of cores/threads Rosetta can use so there are spare threads to support all running GPU applications.
Grant
Darwin NT
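
For readers following along, here is a minimal sketch of the first suggestion: an app_config.xml placed in a GPU project's folder under the BOINC data directory, asking the client to budget one full CPU thread for each GPU task. The application name `example_gpu_app` is a hypothetical placeholder; the real name has to match the `<name>` entry for that application in client_state.xml. The usage values are illustrative assumptions, not settings taken from this thread.

```xml
<app_config>
  <app>
    <!-- Hypothetical app name: replace with the name listed in client_state.xml -->
    <name>example_gpu_app</name>
    <gpu_versions>
      <!-- One task per GPU -->
      <gpu_usage>1.0</gpu_usage>
      <!-- Budget a full CPU thread for every running GPU task -->
      <cpu_usage>1.0</cpu_usage>
    </gpu_versions>
  </app>
</app_config>
```

After saving the file, the client should pick it up via Options → Read config files in BOINC Manager (Advanced view), or after a client restart.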
thimios

Joined: 23 Feb 20
Posts: 2
Credit: 10,685,941
RAC: 134
Message 106124 - Posted: 29 Apr 2022, 8:50:20 UTC

Never mind. I fixed it.

I set the Rosetta app_config.xml to run fewer instances than there are available CPUs.

Curious that this only happens with Rosetta though.
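
The poster did not share the file, so the following is only a sketch of what such an app_config.xml in the Rosetta project folder might look like. It uses the project-wide `project_max_concurrent` element, which avoids having to name individual applications. The value 14 is an assumption for a hypothetical 16-thread machine, chosen to leave two threads free for the two GPUs.

```xml
<app_config>
  <!-- Assumed value: 16 threads minus 2 kept free to feed the GPU tasks -->
  <project_max_concurrent>14</project_max_concurrent>
</app_config>
```

The right number depends on the thread count of the CPU and on how many GPU tasks run at once.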
Mad_Max

Joined: 31 Dec 09
Posts: 209
Credit: 25,867,393
RAC: 9,315
Message 106190 - Posted: 14 May 2022, 5:42:42 UTC

It is probably due to the combination of a large work cache in your BOINC settings and the short deadlines for R@H WUs (an R@H server-side setting).
This can drive BOINC into "panic" (high priority) mode, where it allocates all CPU resources to a single project to try to avoid missing deadlines,
including the CPU core(s) allocated for GPU support.

It's a stupid design decision (because it greatly decreases total performance by cutting down GPU computations), but it has been this way in the BOINC scheduler for years.

Possible workarounds for this:
1 - reduce the cache size setting and/or abort excess WUs from the "hoarding" project in the BOINC WU queue.
2 - use app_config.xml to limit the maximum number of running instances for one project.
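
As an illustration of workaround 1, the cache size can be lowered in the computing preferences, or locally with a global_prefs_override.xml file in the BOINC data directory. The sketch below is an assumption about reasonable values, not something recommended in this thread; it keeps roughly a tenth of a day of work plus a tenth of a day extra.

```xml
<global_preferences>
  <!-- Store at least this many days of work (illustrative value) -->
  <work_buf_min_days>0.1</work_buf_min_days>
  <!-- Store up to this many additional days of work (illustrative value) -->
  <work_buf_additional_days>0.1</work_buf_additional_days>
</global_preferences>
```

A smaller cache means fewer queued WUs sitting close to their deadlines, which should make high priority mode less likely to trigger.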




©2024 University of Washington
https://www.bakerlab.org