Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

robertmiles
Joined: 16 Jun 08
Posts: 1234
Credit: 14,338,560
RAC: 2,014
Message 101146 - Posted: 8 Apr 2021, 1:04:35 UTC - in response to Message 101138.  

Hello. After approx. 15 years with Baker Lab, I've experienced an interesting problem, under the general category, computer errors.
Sorry to copy the entire err. report, but perhaps it will clarify.

Any ideas? The past three downloads gave basically the same results. If computing requirements have changed recently, then I'll
have to change. Otherwise, perhaps there are some new problems at UW's end?

[snip]

This line seems to be the important one:

ERROR: ERROR: FragmentIO: could not open file 00001.500.6mers

I've seen such errors in most of my recent failed tasks from R@H, so it is probably an error in what was included in the workunit.
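
For anyone who wants to check their own failed tasks for the same thing, here is a rough sketch in Python. It assumes you have saved the task's stderr output as text, and the pattern is based only on the single error line quoted above, so the exact wording may differ in other Rosetta versions. The function name is made up for illustration.

```python
import re

# Pattern taken from the error line quoted above; other log formats
# are not covered (assumption).
MISSING_FILE_RE = re.compile(r"ERROR: FragmentIO: could not open file (\S+)")

def find_missing_fragment_files(stderr_text):
    """Return file names from 'could not open file' lines, in order, without duplicates."""
    seen = []
    for match in MISSING_FILE_RE.finditer(stderr_text):
        name = match.group(1)
        if name not in seen:
            seen.append(name)
    return seen

sample = "ERROR: ERROR: FragmentIO: could not open file 00001.500.6mers\n"
print(find_missing_fragment_files(sample))
```

If the same missing file name shows up across many failed tasks, that would fit the idea that the problem is in the workunit rather than on the host.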

robertmiles
Joined: 16 Jun 08
Posts: 1234
Credit: 14,338,560
RAC: 2,014
Message 101147 - Posted: 8 Apr 2021, 1:13:48 UTC - in response to Message 101144.  

[snip]
Have you considered the possibility that many of those creating work-units are not yet very good at estimating how much RAM they will need to run?

I suspect that many of them are also not yet very good at reading the task log files, recognizing the problems they show, and correcting them.

I hadn't considered it because if someone can code for the kind of work we're getting I wouldn't be so grossly insulting as to suggest they're a bit thick.
I can easily imagine either the slip of a finger or maybe some kind of test that they didn't want to be limited by RAM or disk space to have accidentally been left in.

Honestly, of all the things to suggest... have a word with yourself

Aside from that, it would be nice if we could have a few more of those tasks that it seems I was lucky to pick up. They seem fine on my main PC with plenty of RAM

Edit again: Miraculously picked up 4 tasks on my laptop immediately after posting. None again when I tried on the desktop. They're trying, but hand to mouth.

You appear to be assuming that those who write the code for the application are the same ones creating the workunits.

They have to be taught at some time. Which activity do you think they are allowed to do first? Or do you think that they can start both at the same time, with no tests of whether what they are doing works properly?

.clair.
Joined: 2 Jan 07
Posts: 274
Credit: 26,399,595
RAC: 0
Message 101148 - Posted: 8 Apr 2021, 1:50:59 UTC - in response to Message 101138.  

Name ajzjTxIe_YBAABB_ABYBB_AAAAAAXB_AAY_CGGGGGGCCGGGGGCGGGGGGGGCGGGC_1-4_2-5_3-6.pdb_0001_abinitio_1_abinitio_SAVE_ALL_OUT_1389656_916_1

It seems to be the work units that have names that look like someone fell asleep at the keyboard
or a tin o fizzy stuff gon sticky keys that are the worst
with over 800 of the whack job units in my Error list,
that's a lot of duff guff to clog up the database and turn it into more of a septic tank.

mikey
Joined: 5 Jan 06
Posts: 1895
Credit: 9,214,047
RAC: 1,450
Message 101151 - Posted: 8 Apr 2021, 2:34:43 UTC - in response to Message 101148.  

Name ajzjTxIe_YBAABB_ABYBB_AAAAAAXB_AAY_CGGGGGGCCGGGGGCGGGGGGGGCGGGC_1-4_2-5_3-6.pdb_0001_abinitio_1_abinitio_SAVE_ALL_OUT_1389656_916_1

It seems to be the work units that have names that look like someone fell asleep at the keyboard
or a tin o fizzy stuff gon sticky keys that are the worst
with over 800 of the whack job units in my Error list,
that's a lot of duff guff to clog up the database and turn it into more of a septic tank.


Hopefully it means something to someone somewhere.

Sid Celery
Joined: 11 Feb 08
Posts: 2141
Credit: 41,518,559
RAC: 10,612
Message 101153 - Posted: 8 Apr 2021, 3:02:41 UTC - in response to Message 101147.  
Last modified: 8 Apr 2021, 3:03:53 UTC

Have you considered the possibility that many of those creating work-units are not yet very good at estimating how much RAM they will need to run?

I suspect that many of them are also not yet very good at reading the task log files, recognizing the problems they show, and correcting them.

I hadn't considered it because if someone can code for the kind of work we're getting I wouldn't be so grossly insulting as to suggest they're a bit thick.
I can easily imagine either the slip of a finger or maybe some kind of test that they didn't want to be limited by RAM or disk space to have accidentally been left in.

Honestly, of all the things to suggest... have a word with yourself

Aside from that, it would be nice if we could have a few more of those tasks that it seems I was lucky to pick up. They seem fine on my main PC with plenty of RAM

Edit again: Miraculously picked up 4 tasks on my laptop immediately after posting. None again when I tried on the desktop. They're trying, but hand to mouth.

You appear to be assuming that those who write the code for the application are the same ones creating the workunits.

They have to be taught at some time. Which activity do you think they are allowed to do first? Or do you think that they can start both at the same time, with no tests of whether what they are doing works properly?

Christ...

Brian Nixon
Joined: 12 Apr 20
Posts: 293
Credit: 8,432,366
RAC: 0
Message 101154 - Posted: 8 Apr 2021, 7:39:04 UTC - in response to Message 101151.  
Last modified: 8 Apr 2021, 7:42:39 UTC

Hopefully it means something to someone somewhere.
Ah: divided by a common language… :-⁠)
Perfectly comprehensible this side of the pond.

tin o[f] fizzy stuff = can of soda
duff = defective
guff = nonsense

Also “septic tank” is occasionally used as rhyming slang for “Yank”, but I don’t think that’s what’s meant here.

[VENETO] boboviz
Joined: 1 Dec 05
Posts: 2002
Credit: 9,780,807
RAC: 5,492
Message 101156 - Posted: 8 Apr 2021, 8:09:25 UTC - in response to Message 101147.  

You appear to be assuming that those who write the code for the application are the same ones creating the workunits.

I hope they work in the same office/building... or that they keep in touch constantly!

Or do you think that they can start both at the same time, with no tests of whether what they are doing works properly?

Ralph@home exists for that.

Brian Nixon
Joined: 12 Apr 20
Posts: 293
Credit: 8,432,366
RAC: 0
Message 101158 - Posted: 8 Apr 2021, 8:24:58 UTC - in response to Message 101145.  

will just run SiDock@home for a while. Natalia has a cheerful and informative approach to running things.
Yeah – it’s a new project: they’re still keen. I dare say it was like that here in the early days, too? And that 15 years from now users will be moaning that the SiDock admins don’t talk to them any more…

[VENETO] boboviz
Joined: 1 Dec 05
Posts: 2002
Credit: 9,780,807
RAC: 5,492
Message 101159 - Posted: 8 Apr 2021, 8:33:54 UTC - in response to Message 101158.  

And that 15 years from now users will be moaning that the SiDock admins don’t talk to them any more…

There are "old" projects that have active admins on forums.

Dougga
Joined: 27 Nov 06
Posts: 28
Credit: 5,248,050
RAC: 0
Message 101165 - Posted: 8 Apr 2021, 18:06:28 UTC - in response to Message 80621.  

My linux client appears to be stuck with issues with messaging from the server.
I removed Rosetta and added it back.
I'm not getting any work units.
It says ...

Requesting new tasks from CPU
Scheduler requests complete: got 0 tasks
No tasks sent
Project requested delay of 31s

After the 31 seconds, no further requests are made
Manual requests rerun the loop

I've uninstalled the program, rebooted etc...

Brian Nixon
Joined: 12 Apr 20
Posts: 293
Credit: 8,432,366
RAC: 0
Message 101167 - Posted: 8 Apr 2021, 18:17:56 UTC - in response to Message 101165.  

I'm not getting any work units.
Neither is anybody else. There is no work that needs doing at the moment. Try again in a couple of days’ time.

PorkyPies
Joined: 6 Apr 20
Posts: 45
Credit: 1,650,779
RAC: 0
Message 101173 - Posted: 8 Apr 2021, 23:27:08 UTC

I'm getting new work. My larger x64 machines are only getting 11 per request so it takes a few goes to get enough for all cores.

On the Pi4 4GB I'm still getting:
Rosetta@home 9/04/2021 9:16:50 AM Message from server: Rosetta needs 6675.72 MB RAM but only 3460.72 MB is available for use.

The 6.6GB free memory requirement hasn't changed.
MarksRpiCluster
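
To put a number on the gap in that server message, here is a minimal sketch in Python. It assumes the message always follows the wording quoted above ("needs X MB RAM but only Y MB is available"); the function name is hypothetical and nothing here is part of BOINC itself.

```python
import re

# Wording copied from the server message quoted above; other phrasings
# would need a different pattern (assumption).
MSG_RE = re.compile(r"needs ([\d.]+) MB RAM but only ([\d.]+) MB is available")

def ram_shortfall_mb(message):
    """Return how many MB short the host is, or None if the message does not match."""
    m = MSG_RE.search(message)
    if m is None:
        return None
    needed = float(m.group(1))
    available = float(m.group(2))
    return needed - available

msg = ("Message from server: Rosetta needs 6675.72 MB RAM "
       "but only 3460.72 MB is available for use.")
print(round(ram_shortfall_mb(msg), 2))
```

For the message above, that works out to roughly 3215 MB short, so no change of settings on a 4GB Pi can close the gap while the requirement stays where it is.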

Sid Celery
Joined: 11 Feb 08
Posts: 2141
Credit: 41,518,559
RAC: 10,612
Message 101174 - Posted: 8 Apr 2021, 23:53:25 UTC - in response to Message 101154.  

Hopefully it means something to someone somewhere.
Ah: divided by a common language… :-⁠)
Perfectly comprehensible this side of the pond.

tin o[f] fizzy stuff = can of soda
duff = defective
guff = nonsense

Also “septic tank” is occasionally used as rhyming slang for “Yank”, but I don’t think that’s what’s meant here.

Lol! Too much information.

Wrt whether that's what was meant, I didn't think it was wrt the slang. I thought it was wrt the CGCCGGG stuff in the task names

Isn't that genetic code? I've assumed it was. Whether it is or not, it's only important it means something to them, not to us.

Unless they're too dumb to understand what they're naming means - far be it from me to exclude all possibilities...

Mr P Hucker
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 101181 - Posted: 9 Apr 2021, 16:47:09 UTC - in response to Message 101133.  

Maybe it has something to do with the recent SSL post they posted on the Forum News.
Maybe every client instance has to manually reset the URL like months ago.
I don't really know, but you are right, the project is at risk of losing a lot of WUs if they don't attend to this issue.
Keep the faith in this project!
What post do you refer to? I can't see one in the news section of the forums, apart from the 200 post long thread from last year.

Mr P Hucker
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 101182 - Posted: 9 Apr 2021, 16:50:10 UTC - in response to Message 101134.  

Unless you have set it to use local settings, it will use whatever you have set in your account's Computing preferences section.
What causes it to switch to local settings?

My bet is, accidentally pressing the button to see what it does, not noticing anything immediate, then shrugging, followed by not remembering some years later whether you pressed the button or not for that host (puts hand up)
Where is this button? I've never seen it. I assumed that it changed mine to local because I changed a local setting - eg the buffer size.

It's at the very top of Computing Preferences - above all the tabs.
I know what you mean - it's so obvious I go blind to it
The reason I didn't see it is I don't use that horrid black and white neanderthal Boinc Manager, I use Boinctasks. If Boinctasks didn't exist, I wouldn't be running Boinc.

I never changed it to local, I would have remembered doing that, especially since I've installed Boinc on new machines recently. So I guess it changes to local the first time you input a local setting.

And yes, I subconsciously ignore anything bright, larger, flashing, etc, as it's probably an advert.

Mr P Hucker
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 101183 - Posted: 9 Apr 2021, 16:52:07 UTC - in response to Message 101135.  

I allocate 28GB from 32GB total
They don't need the RAM. If they run, they generally use 300MB, not 5 or 6GB each. It's more than a bit crackers.
Once you've started them, I wonder if Boinc adjusts to what they're actually using, or leaves the requested 6.5GB there just in case? You could see what happens if you try to run only Rosettas.

Mr P Hucker
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 101184 - Posted: 9 Apr 2021, 16:54:10 UTC - in response to Message 101142.  

When I was a teenager,


If you're THAT old, you shouldn't be getting hot flashes every time someone says "dude."
Why not? It's a word for teenagers to use, no matter when they were born. I'm no longer a teenager unfortunately.

Mr P Hucker
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 101185 - Posted: 9 Apr 2021, 16:55:00 UTC - in response to Message 101143.  

There’s nothing we can do about it

We could ritualistically sacrifice a chicken, and then sprinkle its blood and entrails on @Peter Hucker.

The best part is, none of the staff of the nursing home would believe that it happened, when it's reported by the other residents.
Stating "when I was a teenager" only implies I'm at least 20.

Mr P Hucker
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 101186 - Posted: 9 Apr 2021, 16:56:35 UTC - in response to Message 101144.  
Last modified: 9 Apr 2021, 17:02:29 UTC

They have to be taught at some time. Which activity do you think they are allowed to do first? Or do you think that they can start both at the same time, with no tests of whether what they are doing works properly?

Christ...
What? It's a valid point. Why would you think a biologist knows about the code?

Mr P Hucker
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 101187 - Posted: 9 Apr 2021, 17:00:36 UTC - in response to Message 101148.  

Name ajzjTxIe_YBAABB_ABYBB_AAAAAAXB_AAY_CGGGGGGCCGGGGGCGGGGGGGGCGGGC_1-4_2-5_3-6.pdb_0001_abinitio_1_abinitio_SAVE_ALL_OUT_1389656_916_1

It seems to be the work units that have names that look like someone fell asleep at the keyboard
or a tin o fizzy stuff gon sticky keys that are the worst
with over 800 of the whack job units in my Error list,
that's a lot of duff guff to clog up the database and turn it into more of a septic tank.
I sent an email like that, or rather my cat did. The cat I found a new home for when it decided the monitor was a toilet. Cost me money and you leave.



©2024 University of Washington
https://www.bakerlab.org