Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 204 · 205 · 206 · 207 · 208 · 209 · 210 . . . 309 · Next

AuthorMessage
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 106031 - Posted: 25 Apr 2022, 16:39:39 UTC - in response to Message 106026.  

but Boinc does not do all that much disk IO as to slow me down much.
It depends on the application.
In the case of Rosetta, the Rosetta 4.20 Tasks don't require much disk I/O, however the Python Tasks require massive amounts of disk I/O when starting up & ending. And apparently they also require quite a bit during processing. The more cores & threads a system has & uses, then the higher the disk I/O requirements will be.
LHC aswell.
ID: 106031 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 106032 - Posted: 25 Apr 2022, 16:43:02 UTC - in response to Message 106027.  

Looking at this message was a reminder to do all this.
No new .tmp files, freed up a few Gb here too, grabbed Treeview but it's not telling me anything I expect to find useful so removed again.
I've got BoincTasks but hadn't set it up to run at startup, which I've now done. Yes, very useful in finding tasks that are very far behind in CPU time compared to Elapsed time.
More useful when running VBox tasks compared to running plain Rosetta tasks - I'll keep this going now.
All good, ta
I assume you used treeSIZE. That one is great, it's like a windows explorer tree, but with sizes. I removed a whole load of stuff *I* had put there that I didn't need. Games I no longer played, films that can be archived onto the rust spinner, etc.

No idea how people can manage with just the plain Boinc Manager, it's absolutely horrid, especially if you have a lot of tasks. No colour coding, no grouping of queued tasks, etc. And with me having 7 computers, I really need a central controller. At least Folding at Home supplies such a thing, but I don't think the Boinc Manager will look at many computers easily.
ID: 106032 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 106033 - Posted: 25 Apr 2022, 16:44:52 UTC - in response to Message 106028.  

Update: The first task to be postponed reached the end of its one day postponement, and now appears to be computing successfully (in VBox 6.1.34). Haven't tried reverting to previous version to see what happens, but whatever the problem was it seems to have resolved.


I may have spoken too soon. The tasks were running for exceptionally long times (18-26 hours) - although unlike the normal "not doing anything" vbox tasks, they were showing significant CPU time utilised (rather than the tasks that "run" for 18 hours but have only consumed 10-20 seconds of CPU). I shut down BOINC, rolled VBox back to version 6.1.12 (BOINC recommended version, not 6.1.32 which I had been running), restarted, and all the vbox tasks came up with computation errors.

Oh well, will see what happens with the next tasks to run.
I'm on VB version 5 (or Cosmology breaks completely) and it seems to run Rosetta Python just as well as 6. LHC is also happy with it. Kryptos at Home also hates 6. Seems like VB screwed up when they made the new one.
ID: 106033 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 272
Credit: 507,897
RAC: 334
Message 106034 - Posted: 25 Apr 2022, 16:46:50 UTC - in response to Message 106033.  

Does QuChempedia hate vbox 6 too?
ID: 106034 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 106035 - Posted: 25 Apr 2022, 17:02:34 UTC - in response to Message 106034.  

Does QuChempedia hate vbox 6 too?
Never heard of it, let alone used it. Can't test here as I have VB 5 on everything.
ID: 106035 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 272
Credit: 507,897
RAC: 334
Message 106036 - Posted: 25 Apr 2022, 17:05:05 UTC - in response to Message 106035.  

https://boinc.berkeley.edu/projects.php#:~:text=Details-,QuChemPedIA%40home,-%D0%A4%D0%B8%D0%B7%D0%B8%D0%BA%D0%B0
ID: 106036 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 106037 - Posted: 25 Apr 2022, 17:09:38 UTC - in response to Message 106034.  

Does QuChempedia hate vbox 6 too?
Thanks for telling me about that one, I was looking for a chemistry project! If you use windows you have to turn on beta tests, as only linux is non-beta.
ID: 106037 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 106038 - Posted: 25 Apr 2022, 17:10:25 UTC - in response to Message 106035.  
Last modified: 25 Apr 2022, 17:29:45 UTC

Does QuChempedia hate vbox 6 too?
Never heard of it, let alone used it. Can't test here as I have VB 5 on everything.


Vbox per project is not relevant.
It is a matter of does the project run their tasks with Vbox
In this case yes it does use Vbox
| QuChemPedIA@home | Finished download of vboxwrapper_26200_windows_x86_64.exe
You can also check this page [url]https://boinc.berkeley.edu/projects.php [/url] to see if your project choice uses Vbox or not. The blue box symbol at the end of the icon list is for Vbox.


Have to try QuChem again, last time it was not stable enough to run on my system with other projects.
ID: 106038 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 106039 - Posted: 25 Apr 2022, 17:10:44 UTC - in response to Message 106036.  

https://boinc.berkeley.edu/projects.php#:~:text=Details-,QuChemPedIA%40home,-%D0%A4%D0%B8%D0%B7%D0%B8%D0%BA%D0%B0
I think I hadn't looked through that list since I set up Boinc originally.
ID: 106039 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 106040 - Posted: 25 Apr 2022, 17:11:38 UTC - in response to Message 106038.  

Does QuChempedia hate vbox 6 too?
Never heard of it, let alone used it. Can't test here as I have VB 5 on everything.


Vbox per project is not relevant.
We were discussing VB version 6 messing things up. It breaks Cosmology, I have to use 5.
ID: 106040 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 106041 - Posted: 25 Apr 2022, 17:31:31 UTC - in response to Message 106040.  

Does QuChempedia hate vbox 6 too?
Never heard of it, let alone used it. Can't test here as I have VB 5 on everything.


Vbox per project is not relevant.
We were discussing VB version 6 messing things up. It breaks Cosmology, I have to use 5.


That's strange. How vbox processes tasks is above my head.
Computermeze or whatever his name is knows more about that kind of stuff.
Have you asked in Cosmo forum at all if anyone knows why 6 does not work?
Maybe post in Github and see what the experts say.
ID: 106041 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 106042 - Posted: 25 Apr 2022, 17:38:48 UTC

Vbox projects listed:

BOINC@TACC (not worth the time, its leftovers)
Cosmology (as you already know)
LHC@home
QuChemPedIA@home
RNA World (also leftovers or stuff that can not run on the supercomputer they have) so 0 tasks available.
Here at Rosetta

And that's all that is on the list.
ID: 106042 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 272
Credit: 507,897
RAC: 334
Message 106043 - Posted: 25 Apr 2022, 17:40:26 UTC - in response to Message 106042.  

nanoHUB@Home too.
ID: 106043 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
tullio

Send message
Joined: 10 May 20
Posts: 63
Credit: 630,125
RAC: 0
Message 106044 - Posted: 25 Apr 2022, 19:35:23 UTC

I am using VitualBox to build a Linux Virtual Machine that runs OpenSuSE Tumbleweed, a development versio with kernel 5.17.3, frequently updated so I have to reboot it frequently. It runs Einstein@home CPU tasks not being able to use the nVidia GTX1060 board of its Windows 10 host and QuChem, which being a Linux project does not need VirtualBox.
Tullio
ID: 106044 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 106045 - Posted: 25 Apr 2022, 21:11:02 UTC
Last modified: 25 Apr 2022, 21:13:04 UTC

I can't run Quchem.
It goes to 60% and becomes unstable.
VmJob unmanagable message.
2022-04-25 20:38:52 (2024): VM state change detected. (old = 'running', new = 'paused')
2022-04-25 21:05:27 (2024): VM state change detected. (old = 'paused', new = 'running')
2022-04-25 21:06:03 (2024): Creating new snapshot for VM.
2022-04-25 21:06:11 (2024): Deleting stale snapshot.
2022-04-25 21:06:12 (2024): Checkpoint completed.
2022-04-25 21:10:16 (2024): VM state change detected. (old = 'running', new = 'paused')
2022-04-25 21:14:52 (2024): VM state change detected. (old = 'paused', new = 'running')
2022-04-25 21:16:38 (2024): VM state change detected. (old = 'running', new = 'paused')
2022-04-25 21:32:41 (2024): ERROR: Vboxwrapper lost communication with VirtualBox, rescheduling task for a later time.

And GPU gives me the exit child error on their new pythons.
ID: 106045 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 272
Credit: 507,897
RAC: 334
Message 106046 - Posted: 25 Apr 2022, 21:16:57 UTC - in response to Message 106045.  

Maybe there isn't enough ram for python gpu?
ID: 106046 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2141
Credit: 41,538,222
RAC: 10,691
Message 106047 - Posted: 25 Apr 2022, 21:51:42 UTC - in response to Message 106032.  

Looking at this message was a reminder to do all this.
No new .tmp files, freed up a few Gb here too, grabbed Treeview but it's not telling me anything I expect to find useful so removed again.
I've got BoincTasks but hadn't set it up to run at startup, which I've now done. Yes, very useful in finding tasks that are very far behind in CPU time compared to Elapsed time.
More useful when running VBox tasks compared to running plain Rosetta tasks - I'll keep this going now.
All good, ta
I assume you used treeSIZE. That one is great, it's like a windows explorer tree, but with sizes. I removed a whole load of stuff *I* had put there that I didn't need. Games I no longer played, films that can be archived onto the rust spinner, etc.

No idea how people can manage with just the plain Boinc Manager, it's absolutely horrid, especially if you have a lot of tasks. No colour coding, no grouping of queued tasks, etc. And with me having 7 computers, I really need a central controller. At least Folding at Home supplies such a thing, but I don't think the Boinc Manager will look at many computers easily.

Oops, yes Treesize. I can see how useful it might be, but I keep a pretty tight ship at the best of times, so no need for it here.
I'd used Boinctasks before, but prior to installing VirtualBox, and I didn't have the kind of problems that BoincTasks would solve back then, so it just seemed an unnecessary duplication.
Nothing against it - just not enough going for it with my limited uses. Until now.
ID: 106047 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 106048 - Posted: 26 Apr 2022, 6:08:05 UTC - in response to Message 106046.  

Maybe there isn't enough ram for python gpu?

48 gigs? not enough?
I got a ACEMD 3 and running ATLAS and Prime Grid and only using 32% of my total RAM
I don't think its RAM.

The STDERR goes on about memory leaks in its setup, but this is fresh RAM (well half new and half less new but not ancient)
ID: 106048 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 272
Credit: 507,897
RAC: 334
Message 106050 - Posted: 26 Apr 2022, 9:02:10 UTC - in response to Message 106048.  

I have only 16 gb ram.
ID: 106050 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
tullio

Send message
Joined: 10 May 20
Posts: 63
Credit: 630,125
RAC: 0
Message 106056 - Posted: 26 Apr 2022, 11:33:39 UTC
Last modified: 26 Apr 2022, 11:35:43 UTC

I have only 12 GB RAM on thjis Windows 11 PC and can run both rosetta python and QuChem, but not at the same time. QuChem runs also on my Linux Virtual Machine with 8 GB RAM. I could not run rosetta python on it. Now I am running Rosetta 4.20 on the Windows 11 PC.
Tullio
ID: 106056 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 204 · 205 · 206 · 207 · 208 · 209 · 210 . . . 309 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org