Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 22 · 23 · 24 · 25 · 26 · 27 · 28 · Next

AuthorMessage
Jim1348

Send message
Joined: 19 Jan 06
Posts: 302
Credit: 9,695,446
RAC: 19,242
Message 90007 - Posted: 16 Dec 2018, 19:50:52 UTC - in response to Message 90006.  

According to Rosetta I currently have a total of 1709 tasks in progress. For example host 1770544 it is not running any Rosetta tasks but yet the In progress count is 216.

This is interesting. I have 8 in progress, but Rosetta "In progress" shows 11.
https://boinc.bakerlab.org/rosetta/results.php?hostid=3510039&offset=0&show_names=0&state=1&appid=

It is the oldest three that are missing. That isn't a big difference, so I thought I would take a look in the BOINC log. I see the following curious entry for the oldest one (but it is the only one I see):

43	Rosetta@home	12/14/2018 3:52:05 PM	[error] Can't parse file info in scheduler reply: file name is empty or has '..'	
44	Rosetta@home	12/14/2018 3:52:05 PM	[error] Can't parse file info in scheduler reply: file name is empty or has '..'	
46	Rosetta@home	12/14/2018 3:52:05 PM	[error] State file error: missing file r1_r1_ems_3hC_984_0002_000000007_0001_0001_0001_23_41_H_.._EHEE_10482_0001_0001_0001_0001_15_38_H_.._DHR70_DHR15_l2_t3_t2_D20_D25_ct21_nTerm_3x_r8_0001_0001_0001_0001_0002_0001_0001_0001_0001_fragments_data.zip	
47	Rosetta@home	12/14/2018 3:52:05 PM	[error] State file error: missing input file r1_r1_ems_3hC_984_0002_000000007_0001_0001_0001_23_41_H_.._EHEE_10482_0001_0001_0001_0001_15_38_H_.._DHR70_DHR15_l2_t3_t2_D20_D25_ct21_nTerm_3x_r8_0001_0001_0001_0001_0002_0001_0001_0001_0001_fragments_data.zip	
48	Rosetta@home	12/14/2018 3:52:05 PM	[error] Can't handle task r1_r1_ems_3hC_984_0002_000000007_0001_0001_0001_23_41_H_.._EHEE_10482_0001_0001_0001_0001_15_38_H_.._DHR70_DHR15_l2_t3_t2_D20_D25_ct21_nTerm_3x_r8_0001_0001_0001_0001_0002_0001_0001_0001_0001_fragment_706193_213 in scheduler repl	
49	Rosetta@home	12/14/2018 3:52:05 PM	[error] State file error: missing task r1_r1_ems_3hC_984_0002_000000007_0001_0001_0001_23_41_H_.._EHEE_10482_0001_0001_0001_0001_15_38_H_.._DHR70_DHR15_l2_t3_t2_D20_D25_ct21_nTerm_3x_r8_0001_0001_0001_0001_0002_0001_0001_0001_0001_fragment_706193_213	
50	Rosetta@home	12/14/2018 3:52:05 PM	[error] Can't handle task r1_r1_ems_3hC_984_0002_000000007_0001_0001_0001_23_41_H_.._EHEE_10482_0001_0001_0001_0001_15_38_H_.._DHR70_DHR15_l2_t3_t2_D20_D25_ct21_nTerm_3x_r8_0001_0001_0001_0001_0002_0001_0001_0001_0001_fragment_706193_213_1 in scheduler re	


Maybe someone can figure it out.
ID: 90007 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Killersocke@rosetta

Send message
Joined: 13 Nov 06
Posts: 26
Credit: 1,322,550
RAC: 2,845
Message 90008 - Posted: 16 Dec 2018, 22:44:45 UTC

I'm scared
I see 27 tasks with Status given up
They are all from December 14th
ID: 90008 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Killersocke@rosetta

Send message
Joined: 13 Nov 06
Posts: 26
Credit: 1,322,550
RAC: 2,845
Message 90009 - Posted: 17 Dec 2018, 0:01:36 UTC

Sorry Guys
these are my time, my money and my costs
So i will stop Rosetta now
ID: 90009 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 302
Credit: 9,695,446
RAC: 19,242
Message 90010 - Posted: 17 Dec 2018, 2:01:19 UTC - in response to Message 90009.  

I don't see a problem with your completion rate. Everything looks pretty good.
You may just see a status problem.
ID: 90010 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Killersocke@rosetta

Send message
Joined: 13 Nov 06
Posts: 26
Credit: 1,322,550
RAC: 2,845
Message 90011 - Posted: 17 Dec 2018, 8:20:28 UTC - in response to Message 90010.  

I don't see a problem with your completion rate. Everything looks pretty good.
You may just see a status problem.


Sorry but this not my Problem
ID: 90011 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 980
Credit: 22,034,754
RAC: 14,250
Message 90016 - Posted: 17 Dec 2018, 16:20:32 UTC - in response to Message 90009.  

Sorry Guys
these are my time, my money and my costs
So i will stop Rosetta now

I've got a similar problem - just posted somewhere else.
Having evaluated what's happened, no time was involved, no download took place and no costs were incurred.
Maybe 7 seconds of processing time were affected per download - once every few hours - but I'm not sure it was in place of anything else.
The only problem for users seems to be a mismatch between the online list of your tasks and what shows in your offline task list.
I suspect you wasted more energy clicking reply, typing 17 words and clicking Post reply.
ID: 90016 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
jjch

Send message
Joined: 10 Nov 13
Posts: 12
Credit: 367,655,442
RAC: 282,542
Message 90025 - Posted: 18 Dec 2018, 19:10:02 UTC
Last modified: 18 Dec 2018, 19:11:04 UTC

From what I can tell these work units were cancelled but the status remained In progress.

If you check the Workunit under errors you will see WU cancelled.

For example: https://boinc.bakerlab.org/workunit.php?wuid=942284714

I don't think there is anything major to worry about just an annoyance. It's not likely you lost any compute cycles either.

The Rosetta programming team should clean this up if possible however I think they will disappear after the deadline expires.

For now I have stopped all Rosetta computing until after Dec 23rd to see if this is true. FYI, I am giving WCG cycles in the meantime.
ID: 90025 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
fcbrants
Avatar

Send message
Joined: 25 Mar 13
Posts: 13
Credit: 3,933,177
RAC: 0
Message 90035 - Posted: 19 Dec 2018, 20:48:49 UTC - in response to Message 90009.  
Last modified: 19 Dec 2018, 20:55:39 UTC

I saw a similar group of tasks that timed out. The system is robust & performed as expected. After the tasks timed out, they were re-assigned.

My thoughts:

1. Don't worry about micro-managing Rosetta - there WILL be errors every now & again.
2. Keep an eye on the prize: understanding how proteins work, how to synthesize them to help cure Cancer, AIDS, etc.

Cheers,

Franko

Sorry Guys
these are my time, my money and my costs
So i will stop Rosetta now

ID: 90035 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile shanen
Avatar

Send message
Joined: 16 Apr 14
Posts: 187
Credit: 11,462,427
RAC: 6,572
Message 90037 - Posted: 19 Dec 2018, 23:15:17 UTC

Since this is the preeminent and locked-at-the-top thread and it has such a broad Subject, I was hoping to see something about the current lack of tasks... Server statuses appear to be nominal.

However I'll mention excessive memory use as an annoying problem on one of my machines with a relatively small SSD. However mostly I blame that on Microsoft for another horrendous update.
#1 Freedom = (Meaningful - Constrained) Choice{5} != (Beer^3 | Speech)
ID: 90037 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 980
Credit: 22,034,754
RAC: 14,250
Message 90038 - Posted: 20 Dec 2018, 0:36:19 UTC - in response to Message 90037.  

Since this is the preeminent and locked-at-the-top thread and it has such a broad Subject, I was hoping to see something about the current lack of tasks... Server statuses appear to be nominal.

I've just arrived at one of my machines to find no Rosetta tasks available at all and my backup project running.

There have been very few Rosetta 4.07 tasks recently and no Android tasks for months.

Now Mini-Rosetta 3.83 tasks have dropped to zero available and with the Christmas break almost upon us it's not looking good for the holiday period.

This is an urgent issue I hope there's time to address
ID: 90038 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
porbund

Send message
Joined: 27 Mar 15
Posts: 2
Credit: 3,836,661
RAC: 24,978
Message 90047 - Posted: 21 Dec 2018, 12:27:51 UTC - in response to Message 90038.  

Same here. I have multiple of my computers that have been crunching nonstop for weeks, and they are suddenly not getting new tasks from rosetta@home.
ID: 90047 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
porbund

Send message
Joined: 27 Mar 15
Posts: 2
Credit: 3,836,661
RAC: 24,978
Message 90048 - Posted: 21 Dec 2018, 12:31:51 UTC - in response to Message 90038.  

Actually, taking a look at this page:

https://boinc.bakerlab.org/rosetta/server_status.php

It looks like only ~8000 tasks are ready to send (vs 389000 in progress). They may be legitimately out of tasks for the moment...
ID: 90048 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Spectre1966

Send message
Joined: 23 Nov 18
Posts: 1
Credit: 4,148,528
RAC: 5,303
Message 90050 - Posted: 21 Dec 2018, 13:01:27 UTC

Guess I’ll search for aliens until Rosetta has some work to crunch....
ID: 90050 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
John Newbould

Send message
Joined: 8 Aug 07
Posts: 8
Credit: 5,420,944
RAC: 2,561
Message 90055 - Posted: 22 Dec 2018, 13:00:23 UTC

Almost all downloads are in error status.
Have reset project and only 1 of 9 downloads worked.
Willing to reset again if that will help. should I detach erase Boinc and start over or is there some problem with downloads?
ID: 90055 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
John Newbould

Send message
Joined: 8 Aug 07
Posts: 8
Credit: 5,420,944
RAC: 2,561
Message 90056 - Posted: 22 Dec 2018, 13:08:57 UTC - in response to Message 90055.  

OK after several try's its OK now.
ID: 90056 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
jjch

Send message
Joined: 10 Nov 13
Posts: 12
Credit: 367,655,442
RAC: 282,542
Message 90066 - Posted: 23 Dec 2018, 18:49:32 UTC

The tasks that I had piling up with the In progress status are now cleared. They have gone to the error status list with the Timed out - no response status.
See here: https://boinc.bakerlab.org/rosetta/results.php?userid=486414

A sample work unit show Too many total results WU cancelled https://boinc.bakerlab.org/workunit.php?wuid=942284714
Looks like something was crossed up with these tasks on the processing side but I didn't see any loss of compute cycles.

I think the rewards are better with Rosetta so I may go back to that project after I let the WCG clear out.
ID: 90066 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
amgthis

Send message
Joined: 25 Mar 06
Posts: 66
Credit: 175,651,807
RAC: 108,063
Message 90072 - Posted: 24 Dec 2018, 16:24:58 UTC

No work is being sent out apparently.
-----------------------------------------------------
Mon 24 Dec 2018 07:31:30 AM PST | Rosetta@home | Requesting new tasks for CPU
Mon 24 Dec 2018 07:31:38 AM PST | Rosetta@home | Scheduler request completed: got 0 new tasks
Mon 24 Dec 2018 07:31:38 AM PST | Rosetta@home | No tasks sent
-------------------------------------------------------

same with a few boxes here. I remember back in the old days when you could
pretty much count on the project running out of work over Christmas and New Year
holidays. It's been awhile though, with the newer hardware upgrades this has not
been the case for a few years.

Happy Holidays everyone.

/Mike
ID: 90072 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 4871
Credit: 3,652,148
RAC: 888
Message 90180 - Posted: 10 Jan 2019, 20:41:37 UTC

Hey Mod,

Ask the guys at the lab why some of us are getting no work at all!
I am almost down to 0 credits with this project because it does not send me any work.
My cache fills up with the other projects in the meantime and when Rosie's turn does come she does not send out any tasks.

What's going on?
ID: 90180 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 4871
Credit: 3,652,148
RAC: 888
Message 90182 - Posted: 10 Jan 2019, 22:26:19 UTC - in response to Message 90180.  
Last modified: 10 Jan 2019, 22:27:40 UTC

Hey Mod,

Ask the guys at the lab why some of us are getting no work at all!
I am almost down to 0 credits with this project because it does not send me any work.
My cache fills up with the other projects in the meantime and when Rosie's turn does come she does not send out any tasks.

What's going on?



1/10/2019 11:24:51 PM | Rosetta@home | update requested by user
1/10/2019 11:24:54 PM | Rosetta@home | Sending scheduler request: Requested by user.
1/10/2019 11:24:54 PM | Rosetta@home | Requesting new tasks for CPU
1/10/2019 11:24:56 PM | Rosetta@home | Scheduler request completed: got 0 new tasks
1/10/2019 11:24:56 PM | Rosetta@home | No tasks sent

I guess when credit reaches 0 I will give up my first ever BOINC project because no one seems to know how or be willing to fix this problem.
ID: 90182 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 302
Credit: 9,695,446
RAC: 19,242
Message 90185 - Posted: 11 Jan 2019, 1:20:49 UTC - in response to Message 90182.  

If you should trouble yourself to read the forum, you find out the reason.

It has something to do with last December 25, I believe.
ID: 90185 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 22 · 23 · 24 · 25 · 26 · 27 · 28 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2019 University of Washington
http://www.bakerlab.org