Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 . . . 28 · Next

AuthorMessage
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1016
Credit: 3,958,237
RAC: 710
Message 80668 - Posted: 22 Sep 2016, 19:46:39 UTC
Last modified: 22 Sep 2016, 19:47:28 UTC

We are still working on reducing a backlog of jobs so you should still be getting work. There are 270,000+ jobs and many many researchers waiting to submit more jobs when this backlog subsides which should be any day now. So there are plenty of jobs. Not sure what is going on with the mac clients though. My mac seems to be fine. When I get a chance, I'll look into this in more detail. I sent a note to the researcher with the large memory jobs (haven't heard back since he's on vacation in France I think).
ID: 80668 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Admpicard999

Send message
Joined: 12 Oct 05
Posts: 1
Credit: 16,117
RAC: 0
Message 80671 - Posted: 24 Sep 2016, 3:15:16 UTC

Work unit generator for Android appears to be down despite it being up for other platforms. Any chance of a resolution soon?
ID: 80671 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1016
Credit: 3,958,237
RAC: 710
Message 80673 - Posted: 24 Sep 2016, 18:14:16 UTC

We don't have any android jobs lined up as of yet. I'll see if Vikram has some more peptide design jobs to run. We are limiting the android jobs to smaller proteins with low memory requirements. I'll let you know as soon as I do. thanks.
ID: 80673 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 930
Credit: 3,552,372
RAC: 3,389
Message 80681 - Posted: 27 Sep 2016, 21:24:15 UTC
Last modified: 27 Sep 2016, 21:25:27 UTC

Again, another down of servers.
Are these downs scheduled (as, for example, Seti) or are the usual problems??
ID: 80681 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile JonPer

Send message
Joined: 4 May 06
Posts: 14
Credit: 505,062
RAC: 188
Message 80682 - Posted: 27 Sep 2016, 22:25:58 UTC - in response to Message 80657.  

My mac is running well. Are others having mac client issues?


My iMac hasn\'t received any work for several days. I\'ve tried manual Updates and Reset Project from the BOINC Manager, but still get no new work.


All of my other cross projects are working just fine, epecially Poem... But i just cant get Rosetta to let me download work packages to my MBP - to saad...
ID: 80682 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Killersocke@rosetta

Send message
Joined: 13 Nov 06
Posts: 26
Credit: 1,251,405
RAC: 1,956
Message 80683 - Posted: 28 Sep 2016, 10:13:43 UTC

Again killed Rosetta my PC :-(

Task ID 878041758
Name SS_Mk1_0005_fold_SAVE_ALL_OUT_440853_1113_1
Received 28 Sep 2016 8:55:09 UTC
ERROR: unrecognized residue HCY
ERROR:: Exit from: ..\..\..\src\core\io\pose_from_sfr\PoseFromSFRBuilder.cc line: 1030
BOINC:: Error reading and gzipping output datafile: default.out
called boinc_finish

Task ID 878045826
Name rb_09_24_68009_112766__t000__2_C1_SAVE_ALL_OUT_IGNORE_THE_REST_440797_28_0
Received 28 Sep 2016 9:13:30 UTC
ERROR: Unable to open atomset parameter file: minirosetta_database\chemical/atom_type_sets/fa_standard//
</stderr_txt>

Think i will stop and leave this Project now.
ID: 80683 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 979
Credit: 21,679,601
RAC: 13,648
Message 80684 - Posted: 28 Sep 2016, 11:43:24 UTC - in response to Message 80683.  

Again killed Rosetta my PC :-(

Think i will stop and leave this Project now.

Because of 20 cumulative seconds on 2 out of 8 cores?

Tough crowd...
ID: 80684 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 4871
Credit: 3,629,339
RAC: 1,187
Message 80685 - Posted: 28 Sep 2016, 12:50:01 UTC - in response to Message 80684.  

Again killed Rosetta my PC :-(

Think i will stop and leave this Project now.

Because of 20 cumulative seconds on 2 out of 8 cores?

Tough crowd...


In the time I have been on Rosie I have seen server failures, bad work units. Files that are not formatted correctly and everything else under the moon. I don't quit. Rosie gets what she gives. If there is a lot of work, well a lot of work is returned. If not, oh well, other projects take up the slack until Rosie's handlers get the problems solved. 20 seconds is a microsecond of time compared to all the other stuff I have seen. If you quit you quit, but it is really a lame excuse that you lost a little bit of time due to some file compatibility problems.

ID: 80685 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1016
Credit: 3,958,237
RAC: 710
Message 80686 - Posted: 28 Sep 2016, 17:33:39 UTC - in response to Message 80682.  

My mac is running well. Are others having mac client issues?


My iMac hasn't received any work for several days. I've tried manual Updates and Reset Project from the BOINC Manager, but still get no new work.


All of my other cross projects are working just fine, epecially Poem... But i just cant get Rosetta to let me download work packages to my MBP - to saad...



I'm wondering if there's an issue with new OSX versions. Who else is having issues and can you provide as much info as possible to help us solve this? OSX version, any log info, etc....
ID: 80686 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1016
Credit: 3,958,237
RAC: 710
Message 80687 - Posted: 28 Sep 2016, 17:38:00 UTC - in response to Message 80683.  

Again killed Rosetta my PC :-(

Task ID 878041758
Name SS_Mk1_0005_fold_SAVE_ALL_OUT_440853_1113_1
Received 28 Sep 2016 8:55:09 UTC
ERROR: unrecognized residue HCY
ERROR:: Exit from: ......srccoreiopose_from_sfrPoseFromSFRBuilder.cc line: 1030
BOINC:: Error reading and gzipping output datafile: default.out
called boinc_finish

Task ID 878045826
Name rb_09_24_68009_112766__t000__2_C1_SAVE_ALL_OUT_IGNORE_THE_REST_440797_28_0
Received 28 Sep 2016 9:13:30 UTC
ERROR: Unable to open atomset parameter file: minirosetta_databasechemical/atom_type_sets/fa_standard//


Think i will stop and leave this Project now.



The first task we just recognized as a bad job (all will fail and we canceled it). Sorry about that. The researcher is looking into this. He though it was just a standard forward folding type job but obviously there's a special residue type that is not being recognized. They are supposed to test stuff locally and for the most part do, but this was thought of as a standard job mistakingly.

The second task is a Robetta job which is definitely a standard job. This looks like there was an issue extracting the database or reading a standard database file. Can it have run out of disk space? Not sure what caused this error. If it continues, please let us know.
ID: 80687 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Dr. Merkwürdigliebe
Avatar

Send message
Joined: 5 Dec 10
Posts: 81
Credit: 2,657,273
RAC: 0
Message 80693 - Posted: 29 Sep 2016, 18:13:34 UTC

More validate errors from the rb_*_jobs

https://boinc.bakerlab.org/rosetta/result.php?resultid=878146671

https://boinc.bakerlab.org/rosetta/result.php?resultid=878146673

https://boinc.bakerlab.org/rosetta/result.php?resultid=878146677
ID: 80693 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Darrell

Send message
Joined: 28 Sep 06
Posts: 25
Credit: 44,240,683
RAC: 41,164
Message 80696 - Posted: 2 Oct 2016, 0:42:30 UTC

I am also seeing invalids on 5 of my computers:

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=793157085 Octopus
http://boinc.bakerlab.org/rosetta/workunit.php?wuid=793156528 BigMax
http://boinc.bakerlab.org/rosetta/workunit.php?wuid=793109268 Cruncher
http://boinc.bakerlab.org/rosetta/workunit.php?wuid=793109067
http://boinc.bakerlab.org/rosetta/workunit.php?wuid=793109039 DDW3770k
http://boinc.bakerlab.org/rosetta/workunit.php?wuid=793108991
http://boinc.bakerlab.org/rosetta/workunit.php?wuid=793108774 WinPro7
http://boinc.bakerlab.org/rosetta/workunit.php?wuid=793107947

plus 10 more all finished on October 1. I didn\'t look further back
than this.



ID: 80696 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Dr. Merkwürdigliebe
Avatar

Send message
Joined: 5 Dec 10
Posts: 81
Credit: 2,657,273
RAC: 0
Message 80697 - Posted: 2 Oct 2016, 11:46:11 UTC

Compute error on some rb_10_* job

https://boinc.bakerlab.org/rosetta/result.php?resultid=878604899
ID: 80697 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Extra Ball

Send message
Joined: 26 Sep 07
Posts: 1
Credit: 251,134
RAC: 0
Message 80698 - Posted: 2 Oct 2016, 18:48:39 UTC

Dunno if this can help with the rb_* problematic tasks but just got this message from the BoincManager :

02/10/2016 20:41:43 | rosetta@home | Task rb_10_01_69170_112916_ab_stage0_t000___robetta_IGNORE_THE_REST_05_10_441827_9_0 exited with zero status but no 'finished' file

Note that this task didn't finish :) and is still in progress atm (26%)
ID: 80698 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Voivodul

Send message
Joined: 9 Sep 06
Posts: 2
Credit: 763,367
RAC: 1,427
Message 80699 - Posted: 3 Oct 2016, 11:57:17 UTC

Hi, had lot of validate errors from the rb_*_jobs

http://boinc.bakerlab.org/rosetta/result.php?resultid=878191424
http://boinc.bakerlab.org/rosetta/result.php?resultid=878191312
http://boinc.bakerlab.org/rosetta/result.php?resultid=878173536
http://boinc.bakerlab.org/rosetta/result.php?resultid=878173302
http://boinc.bakerlab.org/rosetta/result.php?resultid=878171363
http://boinc.bakerlab.org/rosetta/result.php?resultid=878161616
http://boinc.bakerlab.org/rosetta/result.php?resultid=878158332
http://boinc.bakerlab.org/rosetta/result.php?resultid=878158417
http://boinc.bakerlab.org/rosetta/result.php?resultid=878157450
http://boinc.bakerlab.org/rosetta/result.php?resultid=878157270
http://boinc.bakerlab.org/rosetta/result.php?resultid=878157430
http://boinc.bakerlab.org/rosetta/result.php?resultid=878224361
http://boinc.bakerlab.org/rosetta/result.php?resultid=878224361
ID: 80699 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 979
Credit: 21,679,601
RAC: 13,648
Message 80700 - Posted: 3 Oct 2016, 20:23:36 UTC - in response to Message 80699.  

Hi, had lot of validate errors from the rb_*_jobs

Same here. 14 of my last 110 completed tasks came up as a validate error.

I know these get picked up and sorted out quickly enough, but it seems like a duplication of effort somewhere down the line
ID: 80700 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1016
Credit: 3,958,237
RAC: 710
Message 80702 - Posted: 4 Oct 2016, 17:58:20 UTC

These errors are related to the issues we had with the database server a few weeks back. Things should stabilize as new jobs get pushed through and the old jobs get processed. If there are still more than normal validation issues after a week from now, please let me know. Thanks!
ID: 80702 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile JonPer

Send message
Joined: 4 May 06
Posts: 14
Credit: 505,062
RAC: 188
Message 80703 - Posted: 4 Oct 2016, 19:05:19 UTC

Yet another week without being assigned work from server, last assignment from Rosetta back on the 15th of september. Seti@home, Lattice & Poem are workning around the clock - whats happening with Rosetta!?
ID: 80703 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1016
Credit: 3,958,237
RAC: 710
Message 80704 - Posted: 4 Oct 2016, 19:08:48 UTC - in response to Message 80703.  

Yet another week without being assigned work from server, last assignment from Rosetta back on the 15th of september. Seti@home, Lattice & Poem are workning around the clock - whats happening with Rosetta!?


Can you check if there's any useful log information from your BOINC client. I'm not sure why you are not getting work. There's plenty of work available. Anyone else on this forum have any suggestions to help find a solution? I wonder if it's an OSX version incompatibility with our current app version.
ID: 80704 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
matteo

Send message
Joined: 20 Jul 16
Posts: 2
Credit: 248,948
RAC: 20
Message 80714 - Posted: 7 Oct 2016, 6:30:12 UTC

Hello,
i had an error while subscribing with a new computer but I got these errors...

07/10/2016 08.20.08 | rosetta@home | Started download of minirosetta_database_d0bf94b.zip
07/10/2016 08.20.31 | rosetta@home | Temporarily failed download of minirosetta_database_d0bf94b.zip: connect() failed
07/10/2016 08.20.32 | rosetta@home | Started download of minirosetta_database_d0bf94b.zip
07/10/2016 08.20.35 | | Project communication failed: attempting access to reference site
07/10/2016 08.20.36 | | Internet access OK - project servers may be temporarily down.

ID: 80714 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 . . . 28 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2019 University of Washington
http://www.bakerlab.org