Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 306 · 307 · 308 · 309 · 310 · Next

AuthorMessage
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 272
Credit: 507,897
RAC: 248
Message 111787 - Posted: 20 Dec 2024, 14:04:13 UTC - in response to Message 111785.  

I used this batch script
cd /d c:\Program Files\BOINC
:loop
boinccmd.exe --project https://www.gpugrid.net update

TIMEOUT /T 600 
goto loop
ID: 111787 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1725
Credit: 18,451,410
RAC: 20,088
Message 111790 - Posted: 20 Dec 2024, 20:33:27 UTC - in response to Message 111785.  

I have had to add

watch -n 300 ./boinccmd --project https://boinc.bakerlab.org/rosetta/ update

to my Linux boxhttps://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=6306257 to reliably get tasks.
That just makes it much more likely you'll pick up resends when they become available.


But several top performing Windows system running 256 cores are clearly not getting tasks reliably. The reason I am claiming this is my smaller 128 core system is currently parked at #1. That doesn't seem reasonable given the specifications of the other systems.
Even for systems that run 24/7/365 Resource share & the number of other projects they do will have a massive effect on just how high a RAC a particular system will have for any given project they are doing, along with the size of their cache & work availability- with sporadic work available in small batches and as large as possible cache, that will keep a system crunching for as long as possible when it does have work. However, the large cache will also mean that if the system is out of Rosetta work, then it will fill up on other projects, and won't be able to get any Rosetta work during the short time it is available.
Hence huge core count systems with relatively low RACs for their processing capacity.
Grant
Darwin NT
ID: 111790 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1725
Credit: 18,451,410
RAC: 20,088
Message 111791 - Posted: 21 Dec 2024, 8:16:10 UTC

On the main page in the Server Status section, we've got that error again
Notice: Undefined variable: stats in /projects/boinc/rosetta/html/user/index.php on line 81

Grant
Darwin NT
ID: 111791 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Tom M

Send message
Joined: 20 Jun 17
Posts: 94
Credit: 16,528,639
RAC: 50,393
Message 111792 - Posted: 21 Dec 2024, 16:31:55 UTC - in response to Message 111787.  

Thank you.
Help, my tagline is missing..... Help, my tagline is......... Help, m........ Hel.....
ID: 111792 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
EvolvingDoor
New member

Send message
Joined: 13 Dec 24
Posts: 2
Credit: 21,040
RAC: 1,759
Message 111793 - Posted: 21 Dec 2024, 21:44:06 UTC

I'm new to Rosetta. I signed up on Dec. 13 using BOINC since World Community Grid is hibernating until January sometime. According to my stats on BOINC, it ran some tasks on the 13th but hasn't done anything since. I'm seeing reports in this thread of recent problems with accounts. I don't see anything to suggest my account was dropped (other than no tasks). I've requested new tasks a number of times but can't get anything to download. Can someone please help me get my account working? Thanks in advance.
ID: 111793 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 402
Credit: 12,294,748
RAC: 4,622
Message 111794 - Posted: 21 Dec 2024, 21:52:51 UTC - in response to Message 111793.  

I'm new to Rosetta. I signed up on Dec. 13 using BOINC since World Community Grid is hibernating until January sometime. According to my stats on BOINC, it ran some tasks on the 13th but hasn't done anything since. I'm seeing reports in this thread of recent problems with accounts. I don't see anything to suggest my account was dropped (other than no tasks). I've requested new tasks a number of times but can't get anything to download. Can someone please help me get my account working? Thanks in advance.


Your account is working fine, Rosetta currently has no tasks to send - see either the front page or the server status page under computing.
ID: 111794 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2141
Credit: 41,550,899
RAC: 9,975
Message 111795 - Posted: 22 Dec 2024, 0:52:28 UTC

Arrive home to find 16 Robetta tasks sneaked onto my main machine.
Grateful for small mercies - I don't think there were many available at all (100s not even 1000s)
ID: 111795 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1234
Credit: 14,338,560
RAC: 1,496
Message 111796 - Posted: 22 Dec 2024, 1:19:55 UTC - in response to Message 111793.  
Last modified: 22 Dec 2024, 1:21:17 UTC

I'm new to Rosetta. I signed up on Dec. 13 using BOINC since World Community Grid is hibernating until January sometime. According to my stats on BOINC, it ran some tasks on the 13th but hasn't done anything since. I'm seeing reports in this thread of recent problems with accounts. I don't see anything to suggest my account was dropped (other than no tasks). I've requested new tasks a number of times but can't get anything to download. Can someone please help me get my account working? Thanks in advance.

I've found that PrimeGrid is currently often about the only source of CPU workunits in the English language. Their automatic workunit generating programs should keep SOMETHING available for at least a few weeks,
ID: 111796 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Lem Novantotto

Send message
Joined: 13 Sep 23
Posts: 1
Credit: 452,488
RAC: 5,773
Message 111797 - Posted: 22 Dec 2024, 9:10:21 UTC - in response to Message 111796.  

I've found that PrimeGrid is currently often about the only source of CPU workunits in the English language. Their automatic workunit generating programs should keep SOMETHING available for at least a few weeks,


A possible alternative would be to switch to folding@home (not a BOINC project).
https://foldingathome.org/
--
Bye, Lem
ID: 111797 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
seanr22a

Send message
Joined: 10 Jan 19
Posts: 1
Credit: 3,765,804
RAC: 3,749
Message 111798 - Posted: 22 Dec 2024, 10:05:15 UTC
Last modified: 22 Dec 2024, 10:14:08 UTC

Is the download issue back again or is it just me? The other projects I'm running works just fine only Rosetta has problem.

I have 100++ jobs saying download but nothing happens. The logs are filled with transient HTTP error on all my servers.

I tried what is mentioned here earlier adding ip to the host file and flushing DNS caches. I tried reboot as well but still the same.

Dec 22 08:10:04 pm108 boinc[793256]: 22-Dec-2024 08:10:04 [Rosetta@home] Started download of rosetta_4.20_x86_64-pc-linux-gnu
Dec 22 08:10:04 pm108 boinc[793256]: 22-Dec-2024 08:10:04 [Rosetta@home] Started download of database_357d5d93529_n_methyl.zip
Dec 22 08:10:05 pm108 boinc[793256]: 22-Dec-2024 08:10:05 [Rosetta@home] Temporarily failed download of rosetta_4.20_x86_64-pc-linux-gnu: transient HTTP error
Dec 22 08:10:05 pm108 boinc[793256]: 22-Dec-2024 08:10:05 [Rosetta@home] Backing off 00:49:02 on download of rosetta_4.20_x86_64-pc-linux-gnu
Dec 22 08:10:05 pm108 boinc[793256]: 22-Dec-2024 08:10:05 [Rosetta@home] Temporarily failed download of database_357d5d93529_n_methyl.zip: transient HTTP error
Dec 22 08:10:05 pm108 boinc[793256]: 22-Dec-2024 08:10:05 [Rosetta@home] Backing off 00:31:40 on download of database_357d5d93529_n_methyl.zip
Dec 22 08:35:01 pm108 boinc[793256]: 22-Dec-2024 08:35:01 [Rosetta@home] update requested by user
Dec 22 08:37:28 pm108 boinc[793256]: 22-Dec-2024 08:37:28 [Rosetta@home] Sending scheduler request: Requested by user.
Dec 22 08:37:28 pm108 boinc[793256]: 22-Dec-2024 08:37:28 [Rosetta@home] Not requesting tasks: "no new tasks" requested via Manager
Dec 22 08:37:31 pm108 boinc[793256]: 22-Dec-2024 08:37:31 [Rosetta@home] Scheduler request completed
Dec 22 08:37:31 pm108 boinc[793256]: 22-Dec-2024 08:37:31 [Rosetta@home] Project requested delay of 31 seconds
Dec 22 08:46:40 pm108 boinc[793256]: 22-Dec-2024 08:46:40 [Rosetta@home] Started download of rosetta_graphics_4.20_x86_64-pc-linux-gnu
Dec 22 08:46:40 pm108 boinc[793256]: 22-Dec-2024 08:46:40 [Rosetta@home] Started download of database_357d5d93529_n_methyl.zip
Dec 22 08:46:42 pm108 boinc[793256]: 22-Dec-2024 08:46:42 [Rosetta@home] Temporarily failed download of rosetta_graphics_4.20_x86_64-pc-linux-gnu: transient HTTP error
Dec 22 08:46:42 pm108 boinc[793256]: 22-Dec-2024 08:46:42 [Rosetta@home] Backing off 01:11:40 on download of rosetta_graphics_4.20_x86_64-pc-linux-gnu
Dec 22 08:46:42 pm108 boinc[793256]: 22-Dec-2024 08:46:42 [Rosetta@home] Temporarily failed download of database_357d5d93529_n_methyl.zip: transient HTTP error

[EDIT]
Classical ... trying to find what is wrong for hours and finaly write a post here to see if it's just me. 5 minutes later it starts working by itself :)
ID: 111798 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jean-David Beyer

Send message
Joined: 2 Nov 05
Posts: 196
Credit: 6,613,600
RAC: 5,017
Message 111799 - Posted: 22 Dec 2024, 14:32:36 UTC - in response to Message 111798.  

Is the download issue back again or is it just me? The other projects I'm running works just fine only Rosetta has problem.

I have 100++ jobs saying download but nothing happens. The logs are filled with transient HTTP error on all my servers.

I tried what is mentioned here earlier adding ip to the host file and flushing DNS caches. I tried reboot as well but still the same.


I guess it is partly you and partly the server,

I do not get that at all. All I get is:

Sat 21 Dec 2024 09:24:22 PM EST | Rosetta@home | Sending scheduler request: To fetch work.
Sat 21 Dec 2024 09:24:22 PM EST | Rosetta@home | Requesting new tasks for CPU
Sat 21 Dec 2024 09:24:25 PM EST | Rosetta@home | Scheduler request completed: got 0 new tasks
Sat 21 Dec 2024 09:24:25 PM EST | Rosetta@home | No tasks sent
Sat 21 Dec 2024 09:24:25 PM EST | Rosetta@home | Project requested delay of 31 seconds

ID: 111799 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
EvolvingDoor
New member

Send message
Joined: 13 Dec 24
Posts: 2
Credit: 21,040
RAC: 1,759
Message 111800 - Posted: 22 Dec 2024, 18:09:40 UTC - in response to Message 111794.  
Last modified: 22 Dec 2024, 18:10:02 UTC

Thanks Bryn Mawr. I was checking the server status page and it said (and still says) all are running (see below).

In any case, like Sid Celery I got up today to find that I had about 150 new tasks on my main computer. Yay! But my secondary computer still has no Rosetta tasks. Why does Rosetta have so few tasks to run?


Upload server boinc.bakerlab.org Running
Scheduler bwsrv1 Running
Download server boinc-files.bakerlab.org Running
feeder bwsrv1 Running
rah_make_work_rosetta bwsrv1 Running
rah_make_work_rosetta_python_projects bwsrv1 Running
transitioner1 bwsrv1 Running
transitioner2 bwsrv1 Running
rah_assimilator_rosetta1 (rosetta) boinc-process Running
rah_assimilator_rosetta2 (rosetta) boinc-process Running
rah_assimilator_rosetta3 (rosetta) boinc-process Running
rah_assimilator_rosetta4 (rosetta) boinc-process Running
rah_assimilator_rosetta5 (rosetta) boinc-process Running
rah_assimilator_mini1 (minirosetta) boinc-process Running
rah_validator_rosetta1 (rosetta) boinc-process Running
rah_validator_rosetta2 (rosetta) boinc-process Running
rah_validator_mini1 (minirosetta) boinc-process Running
file_deleter1 bwsrv1 Running
file_deleter2 bwsrv2 Running
db_purge bwsrv2 Running
rah_validator_rosetta_python_projects (rosetta_python_projects) boinc-process Running
rah_assimilator_rosetta_python_projects (rosetta_python_projects) boinc-process Running
rah_make_work_rosetta_beta bwsrv1 Running
rah_assimilator_rosetta_beta (rosetta_beta) boinc-process Running
rah_validator_rosetta_beta (rosetta_beta) boinc-process Running
ID: 111800 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 402
Credit: 12,294,748
RAC: 4,622
Message 111801 - Posted: 22 Dec 2024, 23:36:55 UTC - in response to Message 111800.  

To the right, “Computing Status” > Tasks Ready to Send = 0
ID: 111801 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
StarCastle

Send message
Joined: 25 Apr 20
Posts: 9
Credit: 1,030,088
RAC: 511
Message 111802 - Posted: 23 Dec 2024, 0:51:51 UTC

I am also having the download issue.

Messages are:

Temporarily failed download of rb_***********: Transient HTTP error

It then backs off for a period of time and fails again.

I noticed this about a week or so ago. Typically do not have any issues with Rosetta.
ID: 111802 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
skydivingnerd

Send message
Joined: 28 Jan 13
Posts: 7
Credit: 88,002,466
RAC: 585
Message 111803 - Posted: 23 Dec 2024, 2:29:23 UTC

I'm also having issues downloading eleven tasks I have at queue. I also run LHC@Home and have all my boinc traffic running through Squid Proxy. I'm getting the following in my Squid logs
TCP_TUNNEL_ABORTED/200 boinc-files.bakerlab.org:443 - 128.95.160.134

I've been having download issues for some time now, not sure when it started.

Scott
ID: 111803 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
MStenholm

Send message
Joined: 18 Apr 20
Posts: 19
Credit: 26,638,392
RAC: 18,133
Message 111806 - Posted: 23 Dec 2024, 13:24:55 UTC - in response to Message 111803.  

I'm also having issues downloading eleven tasks I have at queue. I also run LHC@Home and have all my boinc traffic running through Squid Proxy. I'm getting the following in my Squid logs
TCP_TUNNEL_ABORTED/200 boinc-files.bakerlab.org:443 - 128.95.160.134

I've been having download issues for some time now, not sure when it started.

Scott

Your 11 task is on your Linux box so the 128.95.160.134 is not working for you, SSL certificate problems. You need to add 128.95.160.156 in your ect/hosts. Various solutions suggested in this thread.
ID: 111806 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Opolis

Send message
Joined: 1 May 12
Posts: 2
Credit: 4,065,667
RAC: 2,550
Message 111807 - Posted: 23 Dec 2024, 17:50:28 UTC

Thanks for the tip. After my account was restored, I had to add the ip to etc/hosts for downloads to work again.
ID: 111807 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
skydivingnerd

Send message
Joined: 28 Jan 13
Posts: 7
Credit: 88,002,466
RAC: 585
Message 111808 - Posted: 23 Dec 2024, 18:51:29 UTC - in response to Message 111806.  

Thanks. I've put the DNS host override in my firewall. Got 17 tasks through 128.95.160.156.
ID: 111808 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
StarCastle

Send message
Joined: 25 Apr 20
Posts: 9
Credit: 1,030,088
RAC: 511
Message 111809 - Posted: 23 Dec 2024, 19:00:19 UTC - in response to Message 111802.  

The host files fix worked for me as well.

Thanks
ID: 111809 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Tom M

Send message
Joined: 20 Jun 17
Posts: 94
Credit: 16,528,639
RAC: 50,393
Message 111810 - Posted: 23 Dec 2024, 23:42:17 UTC - in response to Message 111809.  

The host files fix worked for me as well.

Thanks

Me Too.
Help, my tagline is missing..... Help, my tagline is......... Help, m........ Hel.....
ID: 111810 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 306 · 307 · 308 · 309 · 310 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org