Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 35 · 36 · 37 · 38 · 39 · 40 · 41 . . . 309 · Next

AuthorMessage
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 93090 - Posted: 2 Apr 2020, 17:12:16 UTC - in response to Message 93087.  

System has 6 Gb configured (running inside VM).
Just checked settings, it has:

When in use, use at most 50%
When not in use, use at most 90%

Should have been plenty start at least one task.

That means you have only 3 GB available. If you have "leave applications in memory" enabled, any suspended task will be taking up memory too.
It is a memory problem.
ID: 93090 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
BetelgeuseFive

Send message
Joined: 10 Aug 10
Posts: 4
Credit: 1,443,980
RAC: 382
Message 93096 - Posted: 2 Apr 2020, 17:58:38 UTC - in response to Message 93090.  

System has 6 Gb configured (running inside VM).
Just checked settings, it has:

When in use, use at most 50%
When not in use, use at most 90%

Should have been plenty start at least one task.

That means you have only 3 GB available. If you have "leave applications in memory" enabled, any suspended task will be taking up memory too.
It is a memory problem.


There were no other (suspended) tasks active and it didn't want to start even a single new v4.12 task while the system had been running v4.08 tasks for several days without any problems. I changed memory settings so it can always use 90% and enabled the v4.12 tasks again so I will find out if it helps.
Did anything change in v4.12 that will cause tasks to not even start ?

Thanks for your feedback, it is appreciated.

Tom
ID: 93096 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
vowelmarauder

Send message
Joined: 22 Mar 20
Posts: 2
Credit: 2,114,237
RAC: 0
Message 93105 - Posted: 2 Apr 2020, 19:51:31 UTC - in response to Message 92985.  
Last modified: 2 Apr 2020, 20:30:31 UTC

I just noticed that my tasks are taking almost twice as long as the ETA says. The time is either standing still with 1-2 seconds either way or counting *up*... I don't think I've tinkered with any settings and boinc is using all its cores fully. Is this normal? What's going on?

https://i.imgur.com/3uwyfAU.jpg

Sure enough all the new tasks are running like this as well (~16 hours) and I saw others report the same?

they're all "conducting_fiber_XXXX_fold_and_dock_XXX"

As suggested above, is this a different batch and nothing to worry about?

edit: thank you for the explanation 🙏🏻
I will reply only here so others can see your post
https://boinc.bakerlab.org/rosetta/forum_thread.php?id=6893&postid=93107#93107
ID: 93105 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
JoshuaScholar

Send message
Joined: 26 Mar 20
Posts: 18
Credit: 232,183
RAC: 0
Message 93106 - Posted: 2 Apr 2020, 20:03:30 UTC
Last modified: 2 Apr 2020, 20:06:51 UTC

Bitdefender thinks that rosetta_4.12_windows_intelx86.exe "exhibits ransomeware behavior"
I thinks that it encrypted
boinc_checkpoint_count.txt
boinc_init_count.txt
chk_S_00000023_ClassicAbinito_stage4_kk_1.rng.state.gz
[a bunch similar like it]
I'm guessing that rng means random number generator and that it reinitialized a bunch of random number files, the program detected the maximum entropy and assumed that the files are encrypted.

I can make that program an exception, but I don't know what's ruined because the damn program restored some of the files to their previous state.
ID: 93106 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 93107 - Posted: 2 Apr 2020, 20:14:44 UTC - in response to Message 93105.  

ID: 93107 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Admin
Project administrator

Send message
Joined: 1 Jul 05
Posts: 5144
Credit: 0
RAC: 0
Message 93109 - Posted: 2 Apr 2020, 21:13:49 UTC - in response to Message 93107.  

Sorry, we changed that back to 8 hours
ID: 93109 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1725
Credit: 18,382,444
RAC: 19,446
Message 93163 - Posted: 3 Apr 2020, 6:13:57 UTC - in response to Message 93059.  

Hi especially @Grant (SSSF)

Where I am wrong?
I need 2x more time to finish the tasks and 50% GFLOPS on similar i7-8700K CPU

Compare:
- https://boinc.bakerlab.org/rosetta/host_app_versions.php?hostid=3933928
- https://boinc.bakerlab.org/rosetta/host_app_versions.php?hostid=3914491

Thanks in advance.

Could be the Tasks in question?
On my system all Tasks are running to the Target time (other than the odd one that bails out early), and apart from a glitch with some Tasks a few days back that paid out bugger all Credit (and the few early exits), Credit has generally been inline with Runtime.
Grant
Darwin NT
ID: 93163 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
JoshuaScholar

Send message
Joined: 26 Mar 20
Posts: 18
Credit: 232,183
RAC: 0
Message 93167 - Posted: 3 Apr 2020, 6:42:53 UTC - in response to Message 93106.  

What do I do to clean my system since the damn antivirus program "restored" some of rosetta's files to a previous state, assuming that Rosetta 4.12 is a ransomware program?

I tried aborting the WU's currently being calculated but one finished.
ID: 93167 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Stephen "Heretic"

Send message
Joined: 2 Apr 20
Posts: 21
Credit: 11,028
RAC: 0
Message 93169 - Posted: 3 Apr 2020, 6:50:56 UTC
Last modified: 3 Apr 2020, 6:57:07 UTC

. . OK, I am totally new to this project. I started cautiously giving it one core of my i5-6400 with the other 3 cores idle as backup and support for E@H on the GPU. One task ran and was looking good, pretty much on target (8 hours) after 6 hours runtime with CPU utilisation remaining under 50% on all 4 cores. To try and improve CPU usage I increased it to 2 cores but it remained at one task running. I then increased commitment to 3 cores and it started a 2nd task, but soon crashed BOINC requiring me to go to task manager to kill all Rosetta functions and E@H before I could get BOINC to launch again. I reduced CPU commitment back to 1 core and left it running, but upon returning to this machine about 8 hours later it had crashed the boinc-client several times and despite trying to kill off still active app components I could not get BOINC to restart, so I had to reboot the machine. I suspended the idle Rosetta tasks but now the one running task has gone to 'waiting to run". This machine has 8GB RAM. If I cannot get Rosetta to play nice with E@H it may have to go.

. . I increased CPU commitment back to 2 cores and the stalled task has resumed, but I am now waiting for the other shoe to drop. Will it crash BOINC yet again?

Stephen

? ?
ID: 93169 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1725
Credit: 18,382,444
RAC: 19,446
Message 93171 - Posted: 3 Apr 2020, 7:03:45 UTC - in response to Message 93169.  
Last modified: 3 Apr 2020, 7:05:11 UTC

. . I increased CPU commitment back to 2 cores and the stalled task has resumed, but I am now waiting for the other shoe to drop. Will it crash BOINC yet again?
Settings that are working for me (keep in mind 6c/12t), 32GB of RAM.

Other
     Store at least	        1    days of work
     Store up to an additional  0.02 days of work

Disk
     Use no more than  12 GB
     Leave at least    2 GB free
     Use no more than  40% of total

Memory
     When computer is in use, use at most          95 %
     When computer is not in use, use at most      95 %
     Leave non-GPU tasks in memory while suspended (not selected)
     Page/swap file: use at most                   75 %


Running more than one project i'd suggest "Store at least x days of work" to be 0.5 or less.
Grant
Darwin NT
ID: 93171 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
MarkJ

Send message
Joined: 28 Mar 20
Posts: 72
Credit: 25,238,680
RAC: 0
Message 93197 - Posted: 3 Apr 2020, 9:33:28 UTC - in response to Message 93106.  

Bitdefender thinks that rosetta_4.12_windows_intelx86.exe "exhibits ransomware behavior"

<snip>

I can make that program an exception, but I don't know what's ruined because the damn program restored some of the files to their previous state.

Set Rosetta to No New Tasks in BOINC. Make the BOINC folders and program an exception in Bitdefender. On the projects tab in BOINC reset the project and then set it to Allow New Tasks. What that will do is clean out the project folder and download the apps again. It will get rid of any running task (if you have any) .
BOINC blog
ID: 93197 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
RT

Send message
Joined: 14 Mar 20
Posts: 6
Credit: 1,155,031
RAC: 0
Message 93198 - Posted: 3 Apr 2020, 9:50:55 UTC

For some reason since v4.12 was released, one of my machines has failed computation on every Rosetta task, my other machines seem to be fine at the moment but the following host:
https://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=3849302 seems to fail every task it gets after 2-3 seconds of computation.
ID: 93198 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
JoshuaScholar

Send message
Joined: 26 Mar 20
Posts: 18
Credit: 232,183
RAC: 0
Message 93200 - Posted: 3 Apr 2020, 9:55:28 UTC - in response to Message 93197.  

Sadly exceptions for ransomware are by program, not by folder. It seems my choices are:
1) turn off ransomware protection altogether
or
2) except Rosetta_4.12_windows_intelx86.exe and know that I'm going to go through the same sh_tshow next time you update the client.
ID: 93200 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
JoshuaScholar

Send message
Joined: 26 Mar 20
Posts: 18
Credit: 232,183
RAC: 0
Message 93208 - Posted: 3 Apr 2020, 11:22:56 UTC
Last modified: 3 Apr 2020, 11:31:30 UTC

Even though I have 64 bit windows, it was using the 32 bit version of 4.12. when the virus program went nuts on it.
*Since I had to reset everything, I can see that it's stuck while trying to load the 64 bit version.
Progress is stuck at 99.63% 85.77/86.09 MB I think that happened last time too.
Never mind, it finished downloading and the task list on the website shows that the files it started after that point are running in 64 bits.
Why I was having problems with a 32 bit version before, I don't know.

*correction, after retry it started working.
ID: 93208 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1234
Credit: 14,338,560
RAC: 2,014
Message 93221 - Posted: 3 Apr 2020, 12:33:24 UTC - in response to Message 93198.  

For some reason since v4.12 was released, one of my machines has failed computation on every Rosetta task, my other machines seem to be fine at the moment but the following host:
https://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=3849302 seems to fail every task it gets after 2-3 seconds of computation.

That host has less memory per processor than the others, so it looks worth trying to have BOINC set a lower limit on the number of tasks it can run at once. The 4.12 application seems seems to use more memory than earlier versions.

Also, it appears to be the only one on your list with two GPUs, so it might be worthwhile to try running Rosetta tasks while doing as little as possible on the GPUs.
ID: 93221 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
RT

Send message
Joined: 14 Mar 20
Posts: 6
Credit: 1,155,031
RAC: 0
Message 93224 - Posted: 3 Apr 2020, 13:21:56 UTC - in response to Message 93221.  

Cheers, will try tweaking the config, for now both GPU's are idle on that machine at the moment so shouldn't factor into this.

Seeing a quite a few other Mac users reporting the same issues over at https://boinc.bakerlab.org/rosetta/forum_thread.php?id=12554&postid=93165
ID: 93224 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
RT

Send message
Joined: 14 Mar 20
Posts: 6
Credit: 1,155,031
RAC: 0
Message 93225 - Posted: 3 Apr 2020, 13:46:30 UTC - in response to Message 93224.  

Tried adjusting the resource share as suggested, still having the same issues, got 2 machines now exhibiting this problem, both of them Macs. Windows PC which is the least powerful in terms of specs is happy to crunch the work units without any problems. It has less RAM than either of the Macs.
ID: 93225 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2141
Credit: 41,525,460
RAC: 10,413
Message 93246 - Posted: 3 Apr 2020, 16:49:55 UTC - in response to Message 93079.  

Oh, you're not going to like this...
I've just checked my own PC to see how my dribble of tasks have performed on a mere FX8370
1 Apr - Mini & 4.12 tasks around 45/hr, 280-340/8hr task. Better than I usually get tbh
2 Apr - Mini only (4.12 not reported yet) 110-120/hr, 890-950/8hr task. Lol

Nothing I can say to that...

All problems now solved. I think my power supply failed overnight, so nothing running at all <sob>
ID: 93246 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2141
Credit: 41,525,460
RAC: 10,413
Message 93249 - Posted: 3 Apr 2020, 16:57:30 UTC

Front page says 560k tasks queued
Server status page says 15k ready to send
Lower on server status page, all 15k are Android tasks. No Rosetta nor MiniRosetta available to download

We're out for the moment
ID: 93249 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2141
Credit: 41,525,460
RAC: 10,413
Message 93250 - Posted: 3 Apr 2020, 16:59:43 UTC - in response to Message 93249.  

Front page says 560k tasks queued
Server status page says 15k ready to send
Lower on server status page, all 15k are Android tasks. No Rosetta nor MiniRosetta available to download

We're out for the moment

Oh, cancel that. Server status page just updated and tasks already coming down. Sorry
ID: 93250 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 35 · 36 · 37 · 38 · 39 · 40 · 41 . . . 309 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org