Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 292 · 293 · 294 · 295 · 296 · 297 · 298 . . . 301 · Next

AuthorMessage
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1679
Credit: 17,789,442
RAC: 22,628
Message 109802 - Posted: 2 Oct 2024, 10:32:01 UTC

The boinc-process host has died again, so Validations are building up.
Grant
Darwin NT
ID: 109802 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 259
Credit: 489,025
RAC: 575
Message 109803 - Posted: 2 Oct 2024, 11:48:48 UTC
Last modified: 2 Oct 2024, 11:54:44 UTC

Graphics process crashes

e:\programdata\BOINC\slots\5>E:\programdata\BOINC\projects\boinc.bakerlab.org_rosetta\rosetta_graphics_6.06_windows_x86_64.exe -database e:\programdata\BOINC\projects\boinc.bakerlab.org_rosetta\database_f5ae1de8e1\database
********  (C) Copyright Rosetta Commons Member Institutions.  ***************
* Use of Rosetta for commercial purposes may require purchase of a license. *
********  See LICENSE.md or email license@uw.edu for more details. **********
core.init: Checking for fconfig files in pwd and ./rosetta/flags
core.init: Rosetta version: 2024.24.post.dev+4.main.f5ae1de8e1 f5ae1de8e146ed3da2662da903342c9c1ad0b046 https://github.com/RosettaCommons/rosetta 2024-08-12T12:35:30
core.init: Rosetta extras: []
core.init: command: E:\programdata\BOINC\projects\boinc.bakerlab.org_rosetta\rosetta_graphics_6.06_windows_x86_64.exe -database e:\programdata\BOINC\projects\boinc.bakerlab.org_rosetta\database_f5ae1de8e1\database
basic.random.init_random_generator: 'RNG device' seed mode, using 'RtlGenRandom', seed=2029587963 seed_offset=0 real_seed=2029587963
basic.random.init_random_generator: RandomGenerator:init: Normal mode, seed=2029587963 RG_type=mt19937
Attached shared memory segment
core.chemical.GlobalResidueTypeSet: Finished initializing fa_standard residue type set.  Created 985 residue types
core.chemical.GlobalResidueTypeSet: Total time to initialize 1.737 seconds.

e:\programdata\BOINC\slots\5>cat stderrgfx.txt
14:43:58 (40372): Starting graphics application.
Opened semaphore

ERROR: The residue SER:NtermTruncation could not be generated.  Has a suitable params file been loaded? (Note that custom params files not in the Rosetta database can be loaded with the -extra_res or -extra_res_fa command-line flags.)
ERROR:: Exit from: src/core/chemical/ResidueTypeSet.cc line: 116
14:44:00 (40372): called boinc_finish(0)

e:\programdata\BOINC\slots\5>
ID: 109803 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2122
Credit: 41,184,189
RAC: 10,001
Message 109804 - Posted: 2 Oct 2024, 18:05:49 UTC - in response to Message 109802.  

The boinc-process host has died again, so Validations are building up.

Just as well tasks ran out to download at about the same time... <sigh>
ID: 109804 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kasdashdfjsah

Send message
Joined: 15 Jan 24
Posts: 6
Credit: 0
RAC: 0
Message 109806 - Posted: 2 Oct 2024, 20:43:29 UTC - in response to Message 109804.  
Last modified: 2 Oct 2024, 20:44:21 UTC

Yeah, no tasks for me either.

Also, please have the Ralph@home project removed from the website, since it's not active anymore, so people don't waste time reading about it and trying to create and account which doesn't work.
ID: 109806 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1679
Credit: 17,789,442
RAC: 22,628
Message 109808 - Posted: 3 Oct 2024, 10:42:01 UTC - in response to Message 109802.  

The boinc-process host has died again, so Validations are building up.
Still dead, backlog up to 109,000.
Grant
Darwin NT
ID: 109808 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2122
Credit: 41,184,189
RAC: 10,001
Message 109809 - Posted: 3 Oct 2024, 23:02:26 UTC - in response to Message 109808.  

The boinc-process host has died again, so Validations are building up.
Still dead, backlog up to 109,000.

Midnight in the UK, I'm back from work and all servers running and the backlog fully cleared.
No new tasks yet, but credits rolling in at least
ID: 109809 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1679
Credit: 17,789,442
RAC: 22,628
Message 109813 - Posted: 6 Oct 2024, 3:30:50 UTC

Web site took ages to come up, forums extremely sluggish, Server Status page showing all green however i'm unable to upload any results.
Grant
Darwin NT
ID: 109813 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1679
Credit: 17,789,442
RAC: 22,628
Message 109814 - Posted: 6 Oct 2024, 10:34:29 UTC

And now it's all working again.
Grant
Darwin NT
ID: 109814 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bill Swisher

Send message
Joined: 10 Jun 13
Posts: 35
Credit: 33,075,573
RAC: 44,296
Message 109830 - Posted: 9 Oct 2024, 18:20:55 UTC

Four days, maybe it's been five as I've lost count, and no new work.
ID: 109830 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2122
Credit: 41,184,189
RAC: 10,001
Message 109831 - Posted: 9 Oct 2024, 18:32:52 UTC - in response to Message 109830.  
Last modified: 9 Oct 2024, 18:35:34 UTC

Four days, maybe it's been five as I've lost count, and no new work

I think it's now 7 days since the final tasks of the last batch were all picked up, going by the comments above
Not to mention WCG has been down for maintenance the last few days and I've got 150 tasks waiting to upload there too
With the weather turning colder recently I could do with some tasks from somewhere to warm the place up a little
ID: 109831 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bill Swisher

Send message
Joined: 10 Jun 13
Posts: 35
Credit: 33,075,573
RAC: 44,296
Message 109833 - Posted: 9 Oct 2024, 21:32:36 UTC - in response to Message 109831.  

With the weather turning colder recently I could do with some tasks from somewhere to warm the place up a little


I understand this...it's currently 40F (lets call it 4.5F) outside, and I too generate a lot of heat with the computers. Although it's not the most cost effective heating method I consider it my little contribution to the greater good. WCG is at least accepting results now.
ID: 109833 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2122
Credit: 41,184,189
RAC: 10,001
Message 109834 - Posted: 9 Oct 2024, 22:04:27 UTC - in response to Message 109833.  
Last modified: 9 Oct 2024, 22:05:13 UTC

With the weather turning colder recently I could do with some tasks from somewhere to warm the place up a little

I understand this... it's currently 40F (lets call it 4.5F) outside, and I too generate a lot of heat with the computers. Although it's not the most cost effective heating method I consider it my little contribution to the greater good. WCG is at least accepting results now.

I just remembered - you're in Alaska. It's 11C here right now, but going down as low as 1C overnight in the next couple of days. I need work!
And yes, my WCG results have been accepted too, now that you mention it. No new tasks just yet though.
ID: 109834 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1232
Credit: 14,274,116
RAC: 1,462
Message 109836 - Posted: 9 Oct 2024, 23:22:33 UTC
Last modified: 9 Oct 2024, 23:26:48 UTC

The SiDock@home project has a rather steady supply of tasks. You might want to look there to see if you like those tasks, and if so, check how much they warm up your computer.

The weather report here in Alabama has recently gone as low as 51 F.
ID: 109836 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2122
Credit: 41,184,189
RAC: 10,001
Message 109838 - Posted: 10 Oct 2024, 0:38:55 UTC - in response to Message 109836.  

The SiDock@home project has a rather steady supply of tasks. You might want to look there to see if you like those tasks, and if so, check how much they warm up your computer.

The weather report here in Alabama has recently gone as low as 51 F

I've known the name of that project for a while, but for some reason I've only just checked it out.
Seems ok, so I've joined and grabbed tasks already.
Let's see how it goes. Ta.
ID: 109838 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1679
Credit: 17,789,442
RAC: 22,628
Message 109844 - Posted: 12 Oct 2024, 3:42:54 UTC
Last modified: 12 Oct 2024, 4:01:52 UTC

There is a new batch of Beta work out.
Just be warned- you need roughly 2.2GB of RAM per Task (although it looks like it will drop down after a while to 1-2GB).
My system that picked up some work a while ago is using 70% of available RAM. The one that only just picked up some work, is using 98% of available RAM (both systems 6c/12t with 32GB RAM).


Edit- the longer running Tasks have reduced their need for RAM even further, now down to around 1GB per Task.
Grant
Darwin NT
ID: 109844 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bill Swisher

Send message
Joined: 10 Jun 13
Posts: 35
Credit: 33,075,573
RAC: 44,296
Message 109845 - Posted: 12 Oct 2024, 4:38:49 UTC - in response to Message 109844.  

Yep, they got me again. 2 computers, each running 32 threads, each with 32GB of memory, swapping to beat the dickens. Reboot, suspend all the processes, abort all the beta jobs, stop getting Rosetta work for a while and let those things filter out, and resume processing. I'll ask for more early next week. Not looking good to allow these computers to run Rosetta for 5 months when I'm 4,000 miles away.
ID: 109845 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2122
Credit: 41,184,189
RAC: 10,001
Message 109847 - Posted: 12 Oct 2024, 10:54:21 UTC - in response to Message 109844.  

There is a new batch of Beta work out.
Just be warned- you need roughly 2.2GB of RAM per Task (although it looks like it will drop down after a while to 1-2GB).
My system that picked up some work a while ago is using 70% of available RAM. The one that only just picked up some work, is using 98% of available RAM (both systems 6c/12t with 32GB RAM).

Edit- the longer running Tasks have reduced their need for RAM even further, now down to around 1GB per Task.

You made me look.
All less than 400Mb here for some reason
I'm more annoyed I now have tasks for 2 other projects getting in the way of grabbing more from here.
I can't win
ID: 109847 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Klimax

Send message
Joined: 27 Apr 07
Posts: 44
Credit: 2,800,788
RAC: 1,094
Message 109850 - Posted: 12 Oct 2024, 14:50:47 UTC

All of my tasks are using after several hours about 2GBs. Also only three days of deadline? Will be tight.
ID: 109850 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1679
Credit: 17,789,442
RAC: 22,628
Message 109851 - Posted: 12 Oct 2024, 21:43:35 UTC - in response to Message 109850.  
Last modified: 12 Oct 2024, 21:50:26 UTC

Also only three days of deadline?
Which has been the case for years now, and which is why you don't return a large amount of the work you download- you miss the deadline almost 50% of the time.

Your initial Estimated completion times are set by the project at 8 hours, but your actual Target CPU time appears to be 12 hours. So even with a small cache, you would miss deadlines. You need no cache, or at the very most a very small cache to avoid missing deadlines (1 day or less).
Running more than one project there is no need for a cache at all. If one project doesn't have work, the other will make up that gap, till the first project has work again. No missed deadlines, no not getting work from one project because the other has filled the cache with work.

Ideally
           Store at least 0.02 days of work
Store up to an additional 0.01 days of work

But if you really feel the need for a cache
           Store at least 0.35 days of work
Store up to an additional 0.01 days of work
would be plenty.


If all of your projects have server issues/lots of periods with no work, then "Store at least xx days of work could be set to 1 day (or 1.5 days if you go with the default Target CPU time of 8 hours).
Grant
Darwin NT
ID: 109851 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1679
Credit: 17,789,442
RAC: 22,628
Message 109852 - Posted: 12 Oct 2024, 21:56:06 UTC - in response to Message 109847.  
Last modified: 12 Oct 2024, 21:56:41 UTC

You made me look.
All less than 400Mb here for some reason
After about half an hour mine end up down around 700-800MB. But when they first start after 5min or so they're still up around 2GB+ before dropping down again.
I'd check what application is actually running those other Tasks- probably your other projects, or they're resends and not one of the latest Beta batch.
Grant
Darwin NT
ID: 109852 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 292 · 293 · 294 · 295 · 296 · 297 · 298 . . . 301 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org