Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 91 · 92 · 93 · 94 · 95 · 96 · 97 . . . 309 · Next

AuthorMessage
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 2002
Credit: 9,780,807
RAC: 5,492
Message 101093 - Posted: 6 Apr 2021, 7:50:02 UTC - in response to Message 101090.  

No wonder that last batch of new work went so quickly.

Wus with bugs, empty queues, lack of communications.
This is why i reduce my cpu time in this project...
ID: 101093 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 101094 - Posted: 6 Apr 2021, 11:44:59 UTC - in response to Message 101093.  

This is why i reduce my cpu time in this project...
I used to run Rosetta by itself. Now I run SiDock too (both at 100%) and don't have to worry about switching anything.
Of course, Rosetta then gets less total CPU time, but it appears that they don't need it/can't use it anyway.
ID: 101094 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 2002
Credit: 9,780,807
RAC: 5,492
Message 101096 - Posted: 6 Apr 2021, 12:01:23 UTC - in response to Message 101094.  

Of course, Rosetta then gets less total CPU time, but it appears that they don't need it/can't use it anyway.

It doesn't take much to keep the volunteers...
ID: 101096 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Tom

Send message
Joined: 17 May 20
Posts: 2
Credit: 82,463
RAC: 0
Message 101097 - Posted: 6 Apr 2021, 13:11:21 UTC

I must agree with Brian, pretty disappointing that I have yet to see a project admin come in even acknowledging there is an issue. I'll give it to the end of the week. If it doesn't appear anyone is working on the problem I'll most likely drop Rosetta and look for a different project to donate processing to.
ID: 101097 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Greger

Send message
Joined: 3 May 14
Posts: 10
Credit: 116,447,255
RAC: 27,086
Message 101100 - Posted: 6 Apr 2021, 18:11:25 UTC

<message>
process exited with code 1 (0x1, -255)</message>
ERROR: ERROR: FragmentIO: could not open file 00001.500.6mers
ERROR:: Exit from: src/core/fragment/FragmentIO.cc line: 233
BOINC:: Error reading and gzipping output datafile: default.out
ID: 101100 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2140
Credit: 41,518,559
RAC: 10,612
Message 101101 - Posted: 6 Apr 2021, 20:13:08 UTC - in response to Message 101076.  

Returning to my anecdote about a remote PC I have being unable to download any Rosetta tasks, so running its backup project, WCG, 24/7, my local laptop is also doing weird things. It refuses to run a particular Rosetta task, so it's running those it has room for - a combination of WCG and later Rosetta tasks, but only 3 on 4 cores. Now I know it's definitely happening, I've set NNT and suspended all running tasks except for the one problem Rosetta task. It still refuses to run, even as the only task. No tasks are running in my experiment!

So, maintaining NNT, I've found some combination of WCG and Rosetta tasks that'll run together on all 4 cores. I'll work my way through my small cache until all are completed bar the problem task and see if it runs then. If not, I'll finally abort it and just grab fresh tasks.

Bit of a weird one. Even attempting to micromanage tasks doesn't entirely work. No wonder that graph is running so much lower than it was, if I'm any example

Finally got to the end of this.
Last night I had 3 WCG tasks running (2 of which were Africa Rainfall project that use slightly more RAM, but in fact were only using 300Mb each) and my one weird Rosetta nip* task reporting "waiting for memory" on my 4-core laptop.
Looking at my Event log, it was only when the last ARP was wrapping up that sufficient RAM was available for the Rosetta task to begin running. The last ARP task completed 3 minutes later and now the Rosetta task is the only task running.
Looking at the task's properties, it's only using between 271Mb & 292Mb RAM, while earlier complaining that it needed something like 6.6Gb RAM to begin.
I'm going to wait for completion before dragging any more tasks down. Hopefully there are some new tasks available to download at that time.

The task in question is this one

With 30 minutes to go, I've allowed new tasks and 11have come down. Stage one successful.
3 of the new tasks attempt to start. Stage two successful.
2 of the new tasks are waiting for memory... Oh

I'm going out for a while. When I return the older task will have completed and I'll see if the new tasks all run ok.
The journey continues...

And the answer is... no.
Still two new Rosetta tasks running and two more new ones waiting for memory...

It seems we're a way from having a solution or correction

The saga continues after a few tasks completed.
The same two tasks refuse to run and other tasks run in their place - 2 out of 4 cores.
I suspended all tasks except the ones waiting for memory and still they refused to run [Leave non-GPU tasks in memory while suspended - checked]

I decide to abort the 2 tasks. Two new tasks run in their place and all 4 cores now in use.
That took more messing around than I expected. Let's see how it goes from now on
ID: 101101 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2140
Credit: 41,518,559
RAC: 10,612
Message 101102 - Posted: 6 Apr 2021, 20:44:38 UTC

Seeing as it seems to be whining season for the excessively-entitled it might be worth making myself as popular as usual by stating the obvious that I can't be the only one aware of.

1) Looking at my main PC, I've downloaded 54 tasks today across 10 separate downloads. Even my weirdly-running laptop managed 6 more. And my Android 4 more.
For all the comments about it (none) it seems like I'm the only one.
Of course, I'm not.

2) There are people here from all over the world, and a similar range of nationalities and creeds at UW from what I've observed, so some people may not be aware that UW is in the USA, which is a largely Christian country.
Traditionally their institutions and work-places close for the Easter holidays which have been taking place over the last4 days, from "Good Friday" to "Easter Monday" even if some of the people working there don't personally celebrate.

But, of course, during this holiday period people here expect to demand the creation and issue of sufficient work to serve at least a third of a million tasks per day to the world at large, with no respite.
And the amazing thing is, a fair few do seem to have come down. Maybe my caches aren't quite completely full, but near enough.

And as thanks, I see the usual levels of appreciation here, to wit, "if you don't supply what I need 24hrs a day, even during the holiday season, and provide chapter and verse on progress to tell us what we already know, loyalty will be shown by reducing contribution levels or leaving altogether"

Maybe you should have the occasional day off from your disgusting levels or personal entitlement too.
Though tbf, there's never any shortage of that.

Yeah, save it. I heard last time too. And the time before that. And the time before that etc
ID: 101102 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mrhastyrib

Send message
Joined: 18 Feb 21
Posts: 90
Credit: 2,541,890
RAC: 0
Message 101103 - Posted: 6 Apr 2021, 21:53:59 UTC
Last modified: 6 Apr 2021, 22:18:15 UTC

We had an interruption in power here last night (it happens), and when I got up this AM I found one of my hosts was completely dead. So, I restarted it and when I got into BOINC, I am finding that every task that I download aborts for "Computational Error" within about 20 seconds.

While this was going on, I tried to add SiDock as another project, with the intent to diversify as some have suggested (currently running Rosetta only). That produced an error that said that I could not join because I am required to agree to the terms of service. Which I did. I tried it a couple more times to confirm it.

Now I'm wondering if the power failure messed up something in BOINC, or if the problems are coincidental and not related to me.

Any suggestions?

{EDIT} I fixed the problems with joining SiDock. But all of the tasks from that project error out just like the ones for Rosetta. So I've suspended both projects until I can figure out a solution.
ID: 101103 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mrhastyrib

Send message
Joined: 18 Feb 21
Posts: 90
Credit: 2,541,890
RAC: 0
Message 101104 - Posted: 6 Apr 2021, 22:04:55 UTC - in response to Message 101102.  
Last modified: 6 Apr 2021, 22:06:45 UTC

Maybe you should have the occasional day off from your disgusting levels or personal entitlement too.


You said it yourself: there's a lot of diversity here. Some people are easygoing; others are more anal for whatever reason.

Me, I have obsessive-compulsive tendencies, so it rubs me the wrong way when I can't get the idealized version of Rosetta participation that I envision. Some people might find that disgusting. I know that others dislike the use of "@" and the slag use of "dude." People have bad days sometimes, and it's easy to take it out on an online project that seems to be more machine than human.

Speaking of Easter and bad days, just for kicks, I bought some cool egg decorating kits, which contained markers, stickers, stencils, etc. I was trying to do an elaborate floral pattern on my eggs, to give it some "wow" cred, but the stencil kept slipping. I became frustrated, and then distraught.

More distraught than I am now, with one of my hosts not producing anything. More distraught than I had any right to be. But no matter how carefully I worked at it, I couldn't get the stencil to stay in the right place.

I guess that you could say I was having an eggs 'n stencil crisis.
ID: 101104 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Falconet

Send message
Joined: 9 Mar 09
Posts: 354
Credit: 1,276,393
RAC: 2,018
Message 101105 - Posted: 6 Apr 2021, 22:11:45 UTC - in response to Message 101103.  

I received 5 Rosetta work units on my laptop today. They all errored out.

The desktop also received 8 work units, 6 of which errored out.
ID: 101105 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 398
Credit: 12,294,748
RAC: 6,222
Message 101106 - Posted: 6 Apr 2021, 22:17:47 UTC - in response to Message 101103.  

We had an interruption in power here last night (it happens), and when I got up this AM I found one of my hosts was completely dead. So, I restarted it and when I got into BOINC, I am finding that every task that I download aborts for "Computational Error" within about 20 seconds.

While this was going on, I tried to add SiDock as another project, with the intent to diversify as some have suggested (currently running Rosetta only). That produced an error that said that I could not join because I am required to agree to the terms of service. Which I did. I tried it a couple more times to confirm it.

Now I'm wondering if the power failure messed up something in BOINC, or if the problems are coincidental and not related to me.

Any suggestions?


At the moment every Rosetta task I get is bombing Computation Error within 25 seconds of starting.
ID: 101106 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Brian Nixon

Send message
Joined: 12 Apr 20
Posts: 293
Credit: 8,432,366
RAC: 0
Message 101107 - Posted: 6 Apr 2021, 22:18:06 UTC - in response to Message 101103.  
Last modified: 6 Apr 2021, 23:10:02 UTC

Ignore the …_abinitio_1_abinitio_… tasks; lots of people are reporting problems with those.

For the others, this is a clue that all is not well:
couldn't start app: Input file rosetta_4.20_x86_64-pc-linux-gnu missing or invalid: file missing
Try Reset project under Project commands (Simple view) or on the Projects tab (Advanced view).

Not sure about the SiDock issue; try the forums over there.
{EDIT, though we’re way off topic here} You’re missing some prerequisite system libraries (glibc-⁠2.27) which are newer than your version of Linux. It might be possible to install those alongside your system glibc, or it might be easier to update the whole OS. SiDock uses much more modern software than Rosetta; I have the same problem that I can’t run it on the ancient version of Windows on my crunchers…
ID: 101107 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mrhastyrib

Send message
Joined: 18 Feb 21
Posts: 90
Credit: 2,541,890
RAC: 0
Message 101108 - Posted: 6 Apr 2021, 23:39:21 UTC - in response to Message 101107.  

You’re missing some prerequisite system libraries (glibc-⁠2.27)


Woah, dude, you can tell that even though my system was turned off?

Did you find the horse porn folder too? That was from...uh, my kid brother.
ID: 101108 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1233
Credit: 14,338,560
RAC: 2,014
Message 101109 - Posted: 7 Apr 2021, 0:59:18 UTC - in response to Message 101106.  

We had an interruption in power here last night (it happens), and when I got up this AM I found one of my hosts was completely dead. So, I restarted it and when I got into BOINC, I am finding that every task that I download aborts for "Computational Error" within about 20 seconds.

While this was going on, I tried to add SiDock as another project, with the intent to diversify as some have suggested (currently running Rosetta only). That produced an error that said that I could not join because I am required to agree to the terms of service. Which I did. I tried it a couple more times to confirm it.

Now I'm wondering if the power failure messed up something in BOINC, or if the problems are coincidental and not related to me.

Any suggestions?


At the moment every Rosetta task I get is bombing Computation Error within 25 seconds of starting.

The last dozen I received also failed within 25 seconds. All of them had log files mentioning a missing file with 6mers in its name.

It looks like the project staff contains some students who still need lessons on how to read the task log files and correct any errors they find.
ID: 101109 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
DizzyD

Send message
Joined: 23 Nov 20
Posts: 6
Credit: 1,438,330
RAC: 0
Message 101111 - Posted: 7 Apr 2021, 5:57:09 UTC - in response to Message 101102.  



But, of course, during this holiday period people here expect to demand the creation and issue of sufficient work to serve at least a third of a million tasks per day to the world at large, with no respite.
And the amazing thing is, a fair few do seem to have come down. Maybe my caches aren't quite completely full, but near enough.



Maybe we are all just waiting for the resurrection of this project? Isn't that what Easter is all about?
ID: 101111 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1725
Credit: 18,380,064
RAC: 20,136
Message 101113 - Posted: 7 Apr 2021, 8:38:02 UTC - in response to Message 101109.  
Last modified: 7 Apr 2021, 8:47:08 UTC

The last dozen I received also failed within 25 seconds. All of them had log files mentioning a missing file with 6mers in its name.

It looks like the project staff contains some students who still need lessons on how to read the task log files and correct any errors they find.
And only being able to get 2 new Tasks at a time it's going to take a long time for all of these duds to be cleared out.
My maximum Tasks per day & the number of errors i'm returning are rapidly approaching each other.


At this rate no one will be able to get more than a couple of Tasks (when some that can be processed without error are eventually released) because they will have used up all of their Maximum Tasks per day credit.
It'll take 8 hours to return a valid result (with luck) & then bump up the limit to a couple of Tasks, Another 8 hours to return some more Valid results, and bump up the limit by a few more Tasks.
It's leading to an extremely ugly mess.
Grant
Darwin NT
ID: 101113 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1725
Credit: 18,380,064
RAC: 20,136
Message 101114 - Posted: 7 Apr 2021, 9:07:04 UTC - in response to Message 101107.  

Ignore the …_abinitio_1_abinitio_… tasks; lots of people are reporting problems with those.
Of the ones i have had, i have 6 running, and 475 - so far - that errored out in seconds.
Grant
Darwin NT
ID: 101114 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 2002
Credit: 9,780,807
RAC: 5,492
Message 101116 - Posted: 7 Apr 2021, 10:16:59 UTC - in response to Message 101114.  

Ignore the …_abinitio_1_abinitio_… tasks; lots of people are reporting problems with those.
Of the ones i have had, i have 6 running, and 475 - so far - that errored out in seconds.


So errors are in:
- _abinitio_1_abinitio_
- 6mers

Practicaly, all wus!!
ID: 101116 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2140
Credit: 41,518,559
RAC: 10,612
Message 101121 - Posted: 7 Apr 2021, 12:46:44 UTC - in response to Message 101111.  

But, of course, during this holiday period people here expect to demand the creation and issue of sufficient work to serve at least a third of a million tasks per day to the world at large, with no respite.
And the amazing thing is, a fair few do seem to have come down. Maybe my caches aren't quite completely full, but near enough.

Maybe we are all just waiting for the resurrection of this project? Isn't that what Easter is all about?

Everything running great last night. Got up to see 3 remaining running tasks and the entire rest of my cache with the Computation errors we've all now seen.
So much for rushing through fresh tasks. Slower and more sure may be the better policy.

Took WCG off NNT. 217 tasks came down...
That should certainly provide some breathing space
ID: 101121 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 101122 - Posted: 7 Apr 2021, 16:53:50 UTC - in response to Message 101088.  

Unless you have set it to use local settings, it will use whatever you have set in your account's Computing preferences section.
What causes it to switch to local settings?
ID: 101122 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 91 · 92 · 93 · 94 · 95 · 96 · 97 . . . 309 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org