Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 75 · 76 · 77 · 78 · 79 · 80 · 81 . . . 309 · Next

AuthorMessage
Falconet

Send message
Joined: 9 Mar 09
Posts: 354
Credit: 1,276,393
RAC: 2,018
Message 100088 - Posted: 21 Dec 2020, 12:09:27 UTC
Last modified: 21 Dec 2020, 12:20:47 UTC

I've only got one horns5 but it's running fine under Ubuntu 18.04 at Google Colab.
ID: 100088 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Brian Nixon

Send message
Joined: 12 Apr 20
Posts: 293
Credit: 8,432,366
RAC: 0
Message 100093 - Posted: 21 Dec 2020, 18:18:42 UTC

The strangest part is: some work units that have failed on my machines have succeeded elsewhere (example), and some that have failed elsewhere have succeeded on mine (example).
ID: 100093 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 399
Credit: 12,294,748
RAC: 6,222
Message 100094 - Posted: 21 Dec 2020, 18:34:50 UTC

No problem this end, two running at the moment and no failures.
ID: 100094 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1234
Credit: 14,338,560
RAC: 2,014
Message 100095 - Posted: 21 Dec 2020, 19:54:08 UTC

I just had one horns5 fail on my computer after about 20 minutes while another is past that point and about half finished.

The error message for the one that failed looks likely to mean a problem in an input file.
ID: 100095 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1234
Credit: 14,338,560
RAC: 2,014
Message 100096 - Posted: 21 Dec 2020, 20:06:44 UTC - in response to Message 100093.  

The strangest part is: some work units that have failed on my machines have succeeded elsewhere (example), and some that have failed elsewhere have succeeded on mine (example).

This could mean that the application is picking up a random number from somewhere and using it as part of its input.

If this in not deliberate, it could be the application program using the contents of some memory location without first setting it to a known value.
ID: 100096 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1234
Credit: 14,338,560
RAC: 2,014
Message 100097 - Posted: 21 Dec 2020, 20:18:16 UTC - in response to Message 100095.  
Last modified: 21 Dec 2020, 20:18:38 UTC

I just had one horns5 fail on my computer after about 20 minutes while another is past that point and about half finished.

The error message for the one that failed looks likely to mean a problem in an input file.

I now have three horns5 tasks running at once on my computer, all of them well past 20 minutes.
ID: 100097 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Brian Nixon

Send message
Joined: 12 Apr 20
Posts: 293
Credit: 8,432,366
RAC: 0
Message 100098 - Posted: 21 Dec 2020, 20:50:15 UTC - in response to Message 100095.  

I just had one horns5 fail
That’s the same error Grant reported this morning. Interesting that those ones detected a problem and exited, while the others have just fallen over in a heap. Of course if, as you suggest, they’re using uninitialised data somewhere, anything could happen…
ID: 100098 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2141
Credit: 41,518,559
RAC: 10,612
Message 100101 - Posted: 22 Dec 2020, 3:33:11 UTC - in response to Message 100094.  

No problem this end, two running at the moment and no failures.

Same
ID: 100101 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1725
Credit: 18,380,064
RAC: 20,136
Message 100102 - Posted: 22 Dec 2020, 6:18:56 UTC

At least this time around the with the horns5 Tasks i've had more Valid ones than errors. Last time it was easily 90% were errors.
Grant
Darwin NT
ID: 100102 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
tom

Send message
Joined: 29 Nov 08
Posts: 10
Credit: 6,044,733
RAC: 0
Message 100118 - Posted: 23 Dec 2020, 23:29:15 UTC - in response to Message 99767.  

for 5 months i have been producing exactly one error-free task a day. it's hard to do more than that when there's a limit, wouldn't you think?

as for why i'm supposedly "producing errors", i'm running the same software that the project provided, on the same computer that has run it for years. and just coincidentally, these "errors" only started with the switchover to secure http.

not interested anymore.
ID: 100118 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Brian Nixon

Send message
Joined: 12 Apr 20
Posts: 293
Credit: 8,432,366
RAC: 0
Message 100119 - Posted: 24 Dec 2020, 4:24:50 UTC - in response to Message 100118.  

As we have tried to explain to you before: the limit is there to protect the project from hosts that fail to perform useful work, and if the problem were related to SSL you wouldn’t be able to download any tasks in the first place. It genuinely is coincidence that your trouble started around the same time as the switch.

You’re not alone in finding that application version 4.20 doesn’t work on older versions of Mac OS, though the only resolution seems to be “try a different project”.
ID: 100119 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 2002
Credit: 9,780,807
RAC: 5,492
Message 100123 - Posted: 24 Dec 2020, 14:24:56 UTC - in response to Message 100119.  

You’re not alone in finding that application version 4.20 doesn’t work on older versions of Mac OS, though the only resolution seems to be “try a different project”.

Or waiting a bugfixed version for Mac....
ID: 100123 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1725
Credit: 18,380,064
RAC: 20,136
Message 100127 - Posted: 25 Dec 2020, 10:27:18 UTC
Last modified: 25 Dec 2020, 10:30:24 UTC

Plenty of WUs ready to go (11 million queued jobs), but all i get is No Tasks sent when requesting new work to replace returned work (Ready to send is zero).
In progress has fallen from 550k down to 400k.

Someone needs to give the servers a kick.
Grant
Darwin NT
ID: 100127 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Brian Nixon

Send message
Joined: 12 Apr 20
Posts: 293
Credit: 8,432,366
RAC: 0
Message 100128 - Posted: 25 Dec 2020, 10:39:32 UTC - in response to Message 100127.  

Yes, something’s clogged up. Tasks are trickling through now and again, so it’s not completely stopped. But I’m not sure we’ll get much attention from the server admins at 02:30 on Christmas morning… :-⁠)
ID: 100128 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 2002
Credit: 9,780,807
RAC: 5,492
Message 100131 - Posted: 25 Dec 2020, 14:44:54 UTC - in response to Message 100127.  
Last modified: 25 Dec 2020, 14:45:12 UTC

Someone needs to give the servers a kick.

I think it's difficult during Christmas's week

P.S.
Mary Christmas to all of you!!
ID: 100131 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 100132 - Posted: 25 Dec 2020, 15:18:28 UTC - in response to Message 100131.  

buon natale a tutti
ID: 100132 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Link
Avatar

Send message
Joined: 4 May 07
Posts: 356
Credit: 382,349
RAC: 0
Message 100133 - Posted: 25 Dec 2020, 15:24:32 UTC - in response to Message 100127.  

11 million queued jobs

What matters are "Tasks ready to send" on the Server Status Page and that is near 0.
.
ID: 100133 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2141
Credit: 41,518,559
RAC: 10,612
Message 100134 - Posted: 25 Dec 2020, 15:27:42 UTC - in response to Message 100127.  

Plenty of WUs ready to go (11 million queued jobs), but all i get is No Tasks sent when requesting new work to replace returned work (Ready to send is zero).
In progress has fallen from 550k down to 400k.

Someone needs to give the servers a kick.

Everyone says ditto.

Usually I send a post pre-Xmas msg suggesting a reboot of to make sure everything's ok for the holiday period. This year I was a bit distracted.

The week of Monday 4th January is favourite now - and not the beginning of that week either.
At least I get time to set up my new PC properly now I'm home. Lots of tweaking to do

Merry Christmas.
ID: 100134 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Jo

Send message
Joined: 16 May 20
Posts: 10
Credit: 3,813,274
RAC: 0
Message 100138 - Posted: 25 Dec 2020, 18:13:30 UTC - in response to Message 100134.  

Are you saying that no one can remote into and reboot the servers? Is so, that is not good. There is a _lot_ of computing and good will lost if there are millions of jobs available and no one can download them.
ID: 100138 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2141
Credit: 41,518,559
RAC: 10,612
Message 100139 - Posted: 25 Dec 2020, 19:29:50 UTC - in response to Message 100138.  

Are you saying that no one can remote into and reboot the servers? If so, that is not good.
There is a _lot_ of computing and good will lost if there are millions of jobs available and no one can download them.

In the past, some holidays have been out of all and any contact.
That may not be the case this time, but it's a possibility.
Point being, it'll be fixed when it's fixed and complaining repeatedly in the forums has rarely ever solved it
ID: 100139 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 75 · 76 · 77 · 78 · 79 · 80 · 81 . . . 309 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org