Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 281 · 282 · 283 · 284 · 285 · 286 · Next

AuthorMessage
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2048
Credit: 40,342,779
RAC: 13,849
Message 109410 - Posted: 22 Jun 2024, 11:33:54 UTC - in response to Message 109406.  

And now everything is finally back. Currently

As of 22 Jun 2024, 11:02:26 UTC [ Scheduler running ]
Total queued jobs: 1,336,930
In progress: 153,424
Successes last 24h: 91,239

and

Tasks ready to send 4785
Tasks in progress 153988
Workunits waiting for validation 0
Workunits waiting for assimilation 0
ID: 109410 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1563
Credit: 16,362,682
RAC: 12,621
Message 109411 - Posted: 22 Jun 2024, 21:06:58 UTC - in response to Message 109410.  

And now everything is finally back. Currently

As of 22 Jun 2024, 11:02:26 UTC [ Scheduler running ]
Total queued jobs: 1,336,930
In progress: 153,424
Successes last 24h: 91,239
At last!
And plenty of work as well.

Now things just need to stop falling over in the first place.
Grant
Darwin NT
ID: 109411 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2048
Credit: 40,342,779
RAC: 13,849
Message 109412 - Posted: 23 Jun 2024, 16:01:19 UTC - in response to Message 109411.  

Now things just need to stop falling over in the first place.

Yes, but also I'd remind everyone of my view
Rosetta Beta 6.04 tasks wrongly default to 3hrs CPU runtime while Rosetta v4.20 rightly default to 8hrs.

So set the Rosetta@home Target CPU Runtime explicitly to 8hrs so that CPU runtime matches what Boinc is told to assume, and not to 'not selected'.

Do more work, get more credits, Boinc schedules more correctly and sooner, batches of tasks issued by Rosetta last longer. Rosetta tasks run out less often. <Everyone> wins.

The alternative is what we have now - no new tasks. Everyone loses.

The more people make this change, the better for everyone, whether that boinc-process server goes down or not
ID: 109412 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1563
Credit: 16,362,682
RAC: 12,621
Message 109416 - Posted: 26 Jun 2024, 8:50:24 UTC

boinc-process server is dead again, Validation backlog continues to grow.
Grant
Darwin NT
ID: 109416 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1563
Credit: 16,362,682
RAC: 12,621
Message 109418 - Posted: 26 Jun 2024, 10:06:36 UTC - in response to Message 109416.  

boinc-process server is dead again, Validation backlog continues to grow.
And it's back again.
Grant
Darwin NT
ID: 109418 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2048
Credit: 40,342,779
RAC: 13,849
Message 109419 - Posted: 26 Jun 2024, 23:18:31 UTC - in response to Message 109418.  
Last modified: 26 Jun 2024, 23:26:47 UTC

boinc-process server is dead again, Validation backlog continues to grow.
And it's back again.

This is getting like my home-life...
"I've lost my xyz"
"You could at least help to look"
"Oh, there it is"
Me: "What was that you said?"

If I play dumb long enough before paying any attention, most things right themselves on their own

Edit: I just reached 40,000,000 on Rosetta
Edit2: And 100,000,000 for my team across all projects
ID: 109419 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2048
Credit: 40,342,779
RAC: 13,849
Message 109425 - Posted: 2 Jul 2024, 1:35:11 UTC - in response to Message 109412.  

Now things just need to stop falling over in the first place.

Yes, but also I'd remind everyone of my view
Rosetta Beta 6.04 tasks wrongly default to 3hrs CPU runtime while Rosetta v4.20 rightly default to 8hrs.

So set the Rosetta@home Target CPU Runtime explicitly to 8hrs so that CPU runtime matches what Boinc is told to assume, and not to 'not selected'.

Do more work, get more credits, Boinc schedules more correctly and sooner, batches of tasks issued by Rosetta last longer. Rosetta tasks run out less often. <Everyone> wins.

The alternative is what we have now - no new tasks. Everyone loses.

The more people make this change, the better for everyone, whether that boinc-process server goes down or not

Queued jobs down to 153k 3hrs ago, so another shout out for this.
I'm estimating we only have another 12-13hrs of tasks unless more get queued up.
ID: 109425 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2048
Credit: 40,342,779
RAC: 13,849
Message 109426 - Posted: 2 Jul 2024, 20:29:38 UTC - in response to Message 109425.  

Queued jobs down to 153k 3hrs ago, so another shout out for this.
I'm estimating we only have another 12-13hrs of tasks unless more get queued up.

I think we had a few extra Rosetta 4.20 tasks but not many and we're out anyway now
Fingers crossed for another batch
ID: 109426 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1563
Credit: 16,362,682
RAC: 12,621
Message 109428 - Posted: 3 Jul 2024, 10:16:45 UTC

boinc-process server has died, again.
Grant
Darwin NT
ID: 109428 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1928
Credit: 9,048,244
RAC: 6,318
Message 109429 - Posted: 3 Jul 2024, 12:21:46 UTC - in response to Message 109428.  

boinc-process server has died, again.


Once a week, approximately....
ID: 109429 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile rilian
Avatar

Send message
Joined: 16 Jun 07
Posts: 17
Credit: 2,273,987
RAC: 7,902
Message 109430 - Posted: 3 Jul 2024, 17:24:15 UTC - in response to Message 109429.  

While there are no Rosetta tasks, you can crunch some Ralph nvidia GPU tasks (1000 available at this moment https://ralph.bakerlab.org/server_status.php) and help accelerate release of GPU app to Rosetta!
i crunch for Ukraine. Join our team forums about Rosetta@home
ID: 109430 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2048
Credit: 40,342,779
RAC: 13,849
Message 109431 - Posted: 3 Jul 2024, 18:16:28 UTC - in response to Message 109428.  

boinc-process server has died, again.

I didn't notice again and, now I look, it's back.
Maybe I should look more often.
Or you should look less often...

The last of my Rosetta tasks are running now, showing the benefit of ensuring all my runtimes are at least 8hrs rather than the 3hr mistake Rosetta Beta tasks are set to
ID: 109431 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2048
Credit: 40,342,779
RAC: 13,849
Message 109432 - Posted: 3 Jul 2024, 18:36:22 UTC - in response to Message 109430.  

While there are no Rosetta tasks, you can crunch some Ralph nvidia GPU tasks (1000 available at this moment https://ralph.bakerlab.org/server_status.php) and help accelerate release of GPU app to Rosetta!

I just did.
And then remembered the minimum 5Gb (6Gb) req't for RAM on my Video Card, which only has 4Gb... <sigh>
ID: 109432 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1928
Credit: 9,048,244
RAC: 6,318
Message 109433 - Posted: 3 Jul 2024, 18:56:42 UTC - in response to Message 109432.  

And then remembered the minimum 5Gb (6Gb) req't for RAM on my Video Card, which only has 4Gb... <sigh>


I also have a 4gb gpu....and it's AMD :-(
ID: 109433 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile rilian
Avatar

Send message
Joined: 16 Jun 07
Posts: 17
Credit: 2,273,987
RAC: 7,902
Message 109434 - Posted: 4 Jul 2024, 15:34:06 UTC - in response to Message 109432.  

While there are no Rosetta tasks, you can crunch some Ralph nvidia GPU tasks (1000 available at this moment https://ralph.bakerlab.org/server_status.php) and help accelerate release of GPU app to Rosetta!

I just did.
And then remembered the minimum 5Gb (6Gb) req't for RAM on my Video Card, which only has 4Gb... <sigh>

prev batch required 6gb, current batch 5gb, who knows maybe next batch will be 4gb :) so keep the project active :)
i crunch for Ukraine. Join our team forums about Rosetta@home
ID: 109434 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2048
Credit: 40,342,779
RAC: 13,849
Message 109435 - Posted: 7 Jul 2024, 19:02:39 UTC

New tasks came down about an hour ago
ID: 109435 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 381
Credit: 11,643,982
RAC: 7,911
Message 109440 - Posted: 8 Jul 2024, 18:00:14 UTC - in response to Message 109435.  

New tasks came down about an hour ago


Sadly, still with the lower connect error
ID: 109440 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Landjunge

Send message
Joined: 15 Jan 08
Posts: 1
Credit: 11,463,688
RAC: 13,985
Message 109441 - Posted: 8 Jul 2024, 21:01:29 UTC - in response to Message 109434.  
Last modified: 8 Jul 2024, 21:02:03 UTC

While there are no Rosetta tasks, you can crunch some Ralph nvidia GPU tasks (1000 available at this moment https://ralph.bakerlab.org/server_status.php) and help accelerate release of GPU app to Rosetta!

I just did.
And then remembered the minimum 5Gb (6Gb) req't for RAM on my Video Card, which only has 4Gb... <sigh>

prev batch required 6gb, current batch 5gb, who knows maybe next batch will be 4gb :) so keep the project active :)


i had no problem running two ralph's in parallel on a 8gb rtx3070.
ID: 109441 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1928
Credit: 9,048,244
RAC: 6,318
Message 109442 - Posted: 9 Jul 2024, 4:51:27 UTC - in response to Message 109440.  

Sadly, still with the lower connect error


+1
ID: 109442 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2048
Credit: 40,342,779
RAC: 13,849
Message 109444 - Posted: 9 Jul 2024, 7:45:12 UTC - in response to Message 109442.  
Last modified: 9 Jul 2024, 7:49:06 UTC

Sadly, still with the lower connect error

+1

I've had one.
CPU runtime 2 seconds
Even clicking reply, typing +1, then clicking send takes more time, let alone the time taken checking if I had any
I can't bring myself to care, let alone mention it

In the meantime, the whole site went down for a few hours, in which time Boinc decided to bring down 21 WCG tasks I didn't really want to have in my cache, which I consider a waste of time even of it will keep my PC occupied
ID: 109444 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 281 · 282 · 283 · 284 · 285 · 286 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org