Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 37 · 38 · 39 · 40 · 41 · 42 · 43 . . . 299 · Next

AuthorMessage
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1231
Credit: 14,237,719
RAC: 3,838
Message 93329 - Posted: 4 Apr 2020, 1:19:11 UTC - in response to Message 93327.  

same issue - added a few machines today, some got WU's and some didn't. I thought maybe there was an account based throttle, where because I have a bunch downloaded already maybe I couldn't get more until some were reported back. But just totally guessing, no idea how the allocations work.

In the mean time I added some Milkyway@Home work.

Your computers are hidden, so I can't see enough information to tell how many of them are suitable for Rosetta work.

However, there are so many new users here that there are usually very few if any tasks available for download.

Milkyway@Home looks like a good source of short tasks, to fill in the times between any tasks you get here.
ID: 93329 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
dduggan47

Send message
Joined: 18 Sep 05
Posts: 12
Credit: 3,926,507
RAC: 11,166
Message 93334 - Posted: 4 Apr 2020, 3:01:06 UTC - in response to Message 93254.  

I'm replying to myself because, while there seem to be a lot of people having trouble getting work, nobody else seems to be getting the 400 error. This is happening on only one of the two machines I have and that machine is having no trouble with the other projects I've tested. (I'm running Rosetta exclusively now except for those tests.) This problem started suddenly yesterday or possibly the day before. It had been working fine.

I've rebooted to no avail. The only other thing I can think of to try is to reset the project. I'm reluctant to do that for fear of losing the 41 (at this point) tasks listed as "Ready to Report".

If anybody has any advice I'd appreciate it. If not, tomorrow morning (it's late where I am now) I'll go for the nuclear option and reset.

Thanks.
ID: 93334 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1663
Credit: 17,328,604
RAC: 24,398
Message 93335 - Posted: 4 Apr 2020, 3:13:16 UTC - in response to Message 93334.  

With your computers hidden it's pretty much impossible to offer any useful advice.
A quick Google comes up with that in the past that error has been associated with older BOINC Manager versions & invalid Certificates, in those cases upgrading to a more recent BOINC version sorted it out. In others it has been due to an incorrectly set clock (wrong year), or the proxy they are using causing issues, but whether or not any of that applies to you, i've not got the slightest idea.
Grant
Darwin NT
ID: 93335 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
dduggan47

Send message
Joined: 18 Sep 05
Posts: 12
Credit: 3,926,507
RAC: 11,166
Message 93338 - Posted: 4 Apr 2020, 3:50:54 UTC - in response to Message 93335.  

With your computers hidden it's pretty much impossible to offer any useful advice.
A quick Google comes up with that in the past that error has been associated with older BOINC Manager versions & invalid Certificates, in those cases upgrading to a more recent BOINC version sorted it out. In others it has been due to an incorrectly set clock (wrong year), or the proxy they are using causing issues, but whether or not any of that applies to you, i've not got the slightest idea.


Thank you, Grant.

The BOINC version is the latest and greatest and my clock is fine. I don't use a proxy ... or didn't think I did. A bank I have an account with though suggested a a day and a half ago that I download a security app so, foolishly, I did. I uninstalled it and all seems to be fine.

If you're ever in my neighborhood (MA, USA), PM me and I owe you a beer!

- Dick
ID: 93338 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1663
Credit: 17,328,604
RAC: 24,398
Message 93339 - Posted: 4 Apr 2020, 3:56:59 UTC - in response to Message 93338.  

A bank I have an account with though suggested a a day and a half ago that I download a security app
Yeah, that'll do it too. Some AV programmes also have a nasty habit of deleting BOINC Tasks as they download because they don't like the look of them, or think the programme's behaviour is supicious.
Glad you got it sorted.


If you're ever in my neighborhood (MA, USA), PM me and I owe you a beer!
I'm one of those very rare Aussies that don't drink. But a Coke (original i think they are over your way) would hit the spot.
:-)

Although it's going to be quite a while before international travel is on the cards for the average person again.
Grant
Darwin NT
ID: 93339 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1663
Credit: 17,328,604
RAC: 24,398
Message 93340 - Posted: 4 Apr 2020, 4:10:32 UTC - in response to Message 93163.  

Hi especially @Grant (SSSF)

Where I am wrong?
I need 2x more time to finish the tasks and 50% GFLOPS on similar i7-8700K CPU

Compare:
- https://boinc.bakerlab.org/rosetta/host_app_versions.php?hostid=3933928
- https://boinc.bakerlab.org/rosetta/host_app_versions.php?hostid=3914491

Thanks in advance.

Could be the Tasks in question?
On my system all Tasks are running to the Target time (other than the odd one that bails out early), and apart from a glitch with some Tasks a few days back that paid out bugger all Credit (and the few early exits), Credit has generally been inline with Runtime.
That was then.

As time has gone on, it looks like the amount of Credit per Task has dropped quite a bit.
I was getting around 400 per 8 hour Task.
Now it's probably around 280 (a 216 low up to 340 high, so far), FLOPs per Application has dropped by around 3% for the current applications with the present Tasks, which is tracking pretty closely with the drop in Credit awarded.
Grant
Darwin NT
ID: 93340 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1231
Credit: 14,237,719
RAC: 3,838
Message 93342 - Posted: 4 Apr 2020, 4:21:33 UTC - in response to Message 93339.  
Last modified: 4 Apr 2020, 4:22:17 UTC

A bank I have an account with though suggested a a day and a half ago that I download a security app
Yeah, that'll do it too. Some AV programmes also have a nasty habit of deleting BOINC Tasks as they download because they don't like the look of them, or think the programme's behaviour is supicious.

I use the Windows Defender AV program, available free from Microsoft if you have a sufficiently recent version of Windows.

It does not have that nasty habit.

However, a Google search for where to download it gave only three sites, none of which will currently respond.
ID: 93342 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Ezzz

Send message
Joined: 19 Mar 20
Posts: 8
Credit: 32,322
RAC: 0
Message 93343 - Posted: 4 Apr 2020, 4:22:17 UTC - in response to Message 92853.  

BTW, we posted the application update and plan to make an official announcement tomorrow. Our site is getting quite a bit of traffic now but hopefully it will settle down soon.


This may be a dumb question, but is there anything different that I need to do to utilize this update? Thanks.
ID: 93343 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1663
Credit: 17,328,604
RAC: 24,398
Message 93345 - Posted: 4 Apr 2020, 4:50:15 UTC - in response to Message 93343.  

This may be a dumb question, but is there anything different that I need to do to utilize this update? Thanks.
Nope.
As you finish any existing work and download new work (although there's none available at the moment), the new Application will be downloaded automatically by the Manager.
Grant
Darwin NT
ID: 93345 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
KWSN Ekky Ekky Ekky

Send message
Joined: 3 Apr 20
Posts: 9
Credit: 5,062,511
RAC: 0
Message 93353 - Posted: 4 Apr 2020, 7:28:31 UTC - in response to Message 93328.  

Many thanks everyone. Got some tasks now, though nothing for NVIDIA yet.
ID: 93353 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
[DPC] BlueTooth76

Send message
Joined: 23 Mar 20
Posts: 4
Credit: 47,577,853
RAC: 0
Message 93354 - Posted: 4 Apr 2020, 7:48:52 UTC - in response to Message 93353.  

Many thanks everyone. Got some tasks now, though nothing for NVIDIA yet.


It's CPU only, so that may explain it ;)
ID: 93354 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
strongboes

Send message
Joined: 3 Mar 20
Posts: 27
Credit: 5,394,270
RAC: 0
Message 93358 - Posted: 4 Apr 2020, 8:36:01 UTC

4.12 is running very poorly on AMD.

Average processing rate is reasonable measure to go on. 4.07 my 3990x was regularly hitting over 30 GFLOPS on x86_64. If you look now its been left on 19 but that's an anomaly.

4.12 I can't get it over 11.5 no matter how many cores I run. Yet 2 of the 3 old laptops I have running this with vastly slower chips are hitting over 12 running the same work units.

Something is clearly wrong and I did notice immediately but have had no response to other posts.
ID: 93358 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1231
Credit: 14,237,719
RAC: 3,838
Message 93369 - Posted: 4 Apr 2020, 12:08:17 UTC - in response to Message 93358.  
Last modified: 4 Apr 2020, 12:16:16 UTC

4.12 is running very poorly on AMD.

Average processing rate is reasonable measure to go on. 4.07 my 3990x was regularly hitting over 30 GFLOPS on x86_64. If you look now its been left on 19 but that's an anomaly.

4.12 I can't get it over 11.5 no matter how many cores I run. Yet 2 of the 3 old laptops I have running this with vastly slower chips are hitting over 12 running the same work units.

Something is clearly wrong and I did notice immediately but have had no response to other posts.

On CPUs with very high numbers of cores, the speed of the path from the CPU to the memory is usually inadequate for effective use of all the cores at once, especially for programs with heavy use of the L3 cache in the CPU. The 4.12 program is significantly larger than the 4.07 program, and therefore likely to have heavier L3 use.

You may need to experiment with various numbers of cores in use, and draw a graph of total throughput per day versus number of cores in use, so you can get the maximum total throughput in a day.

Also, Rosetta work seems to run best when the computer has at least 2 GB of memory per processor in use, and your computer has considerably less.

What kind of drive are you running BOINC from? An SSD drive instead of a hard drive helps the speed considerably.

Whether the CPU was made by AMD or by Intel is not the major factor in this problem.
ID: 93369 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
strongboes

Send message
Joined: 3 Mar 20
Posts: 27
Credit: 5,394,270
RAC: 0
Message 93370 - Posted: 4 Apr 2020, 12:13:40 UTC - in response to Message 93369.  
Last modified: 4 Apr 2020, 12:32:29 UTC

4.12 is running very poorly on AMD.

Average processing rate is reasonable measure to go on. 4.07 my 3990x was regularly hitting over 30 GFLOPS on x86_64. If you look now its been left on 19 but that's an anomaly.

4.12 I can't get it over 11.5 no matter how many cores I run. Yet 2 of the 3 old laptops I have running this with vastly slower chips are hitting over 12 running the same work units.

Something is clearly wrong and I did notice immediately but have had no response to other posts.

On CPUs with very high numbers of cores, the speed of the path from the CPU to the memory is usually inadequate for effective use of all the cores at once, especially for programs with heavy use of the L3 cache in the CPU. The 4.12 program is significantly larger than the 4.07 program, and therefore likely to have heavier L3 use.

You may need to experiment with various numbers of cores in use, and draw a graph of total throughput per day versus number of cores in use, so you can get the maximum total throughput in a day.

Whether the CPU was made by AMD or by Intel is not the major factor in this problem.


Well each core has 4.2mb of L3, more than the Intel chips. I've tried with 8mb of L3 per core, no difference.

The i5 processor has only 3MB in total 1.5 per core. It clearly isn't down to L3 cache levels.

I'm only running it on the 64 cores, so has 2gb per core, although not running the full 64. I've tried on just 32 cores, makes no difference. Like I said, was running at 3 times the speed of the intel laptops which are slower and have considerably less cache on the 4.07 tasks, now regardless it's running marginally slower than them. That is an enormous drop.

ssd drive

There are other posts from people running AMD chips noticing their temps and power usage is reduced, all indicating less work being done, for whatever reason.

My clock speed is also up depending on wu between 10% and 30%, which again suggests that the chip is running faster but doing less per cycle.
ID: 93370 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1231
Credit: 14,237,719
RAC: 3,838
Message 93372 - Posted: 4 Apr 2020, 12:49:55 UTC - in response to Message 93370.  
Last modified: 4 Apr 2020, 13:00:21 UTC

4.12 is running very poorly on AMD.
[snip]

On CPUs with very high numbers of cores, the speed of the path from the CPU to the memory is usually inadequate for effective use of all the cores at once, especially for programs with heavy use of the L3 cache in the CPU. The 4.12 program is significantly larger than the 4.07 program, and therefore likely to have heavier L3 use.

You may need to experiment with various numbers of cores in use, and draw a graph of total throughput per day versus number of cores in use, so you can get the maximum total throughput in a day.

Whether the CPU was made by AMD or by Intel is not the major factor in this problem.


Well each core has 4.2mb of L3, more than the Intel chips. I've tried with 8mb of L3 per core, no difference.

The i5 processor has only 3MB in total 1.5 per core. It clearly isn't down to L3 cache levels.

I'm only running it on the 64 cores, so has 2gb per core, although not running the full 64. I've tried on just 32 cores, makes no difference. Like I said, was running at 3 times the speed of the intel laptops which are slower and have considerably less cache on the 4.07 tasks, now regardless it's running marginally slower than them. That is an enormous drop.

ssd drive

There are other posts from people running AMD chips noticing their temps and power usage is reduced, all indicating less work being done, for whatever reason.

My clock speed is also up depending on wu between 10% and 30%, which again suggests that the chip is running faster but doing less per cycle.

I misread your total amount of main memory.

All of the cores must share the same path to main memory. With today's memory speeds, that means that each core in use spends most of its cycles waiting for access to the main memory, rather than doing any useful work, if many cores are in use. This reduces the power used, and therefore the amount of heat generated.

If you are using less than half of the total number of processors on the CPU chip, turning off hyperthreading, or AMD's equivalent, often helps somewhat.

I'm not saying that which company made the CPU has no effect, I just expect it to be much less than many cores using the same path to main memory. Recompiling 4.12 to take advantage of the things the AMD CPU has but Intel CPUs don't would offer some help, but that's not something I can do.
ID: 93372 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
strongboes

Send message
Joined: 3 Mar 20
Posts: 27
Credit: 5,394,270
RAC: 0
Message 93373 - Posted: 4 Apr 2020, 13:04:32 UTC - in response to Message 93372.  

4.12 is running very poorly on AMD.
[snip]

On CPUs with very high numbers of cores, the speed of the path from the CPU to the memory is usually inadequate for effective use of all the cores at once, especially for programs with heavy use of the L3 cache in the CPU. The 4.12 program is significantly larger than the 4.07 program, and therefore likely to have heavier L3 use.

You may need to experiment with various numbers of cores in use, and draw a graph of total throughput per day versus number of cores in use, so you can get the maximum total throughput in a day.

Whether the CPU was made by AMD or by Intel is not the major factor in this problem.


Well each core has 4.2mb of L3, more than the Intel chips. I've tried with 8mb of L3 per core, no difference.

The i5 processor has only 3MB in total 1.5 per core. It clearly isn't down to L3 cache levels.

I'm only running it on the 64 cores, so has 2gb per core, although not running the full 64. I've tried on just 32 cores, makes no difference. Like I said, was running at 3 times the speed of the intel laptops which are slower and have considerably less cache on the 4.07 tasks, now regardless it's running marginally slower than them. That is an enormous drop.

ssd drive

There are other posts from people running AMD chips noticing their temps and power usage is reduced, all indicating less work being done, for whatever reason.

My clock speed is also up depending on wu between 10% and 30%, which again suggests that the chip is running faster but doing less per cycle.

I misread your total amount of main memory.

All of the cores must share the same path to main memory. With today's memory speeds, that means that each core in use spends most of its cycles waiting for access to the main memory, rather than doing any useful work, if many cores are in use. This reduces the power used, and therefore the amount of heat generated.

If you are using less than half of the total number of processors on the CPU chip, turning off hyperthreading, or AMD's equivalent, often helps somewhat.

I'm not saying that which company made the CPU has no effect, I just expect it to be much less than many cores using the same path to main memory. Recompiling 4.12 to take advantage of the things the AMD CPU has but Intel CPUs don't would offer some help, but that's not something I can do.



Yes, smt/hyper threading is off. The point is though, on my Intel laptops, the speed has remained the same between 4.07 to 4.12. On my tr it's dropped over 60% regardless of what I try, I can't put it much simpler than that.
ID: 93373 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
KWSN Ekky Ekky Ekky

Send message
Joined: 3 Apr 20
Posts: 9
Credit: 5,062,511
RAC: 0
Message 93375 - Posted: 4 Apr 2020, 13:14:30 UTC - in response to Message 93354.  

Many thanks everyone. Got some tasks now, though nothing for NVIDIA yet.


It's CPU only, so that may explain it ;)


Thank you very much. Newbie ignorance!
ID: 93375 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1231
Credit: 14,237,719
RAC: 3,838
Message 93378 - Posted: 4 Apr 2020, 13:33:38 UTC - in response to Message 93373.  

[snip]

Yes, smt/hyper threading is off. The point is though, on my Intel laptops, the speed has remained the same between 4.07 to 4.12. On my tr it's dropped over 60% regardless of what I try, I can't put it much simpler than that.

Note that all of your Intel laptops have a much lower number of cores, and will therefore have much less of a problem with too many cores trying to share the limited speed path to main memory. I can't put it much simpler than that, either.
ID: 93378 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
strongboes

Send message
Joined: 3 Mar 20
Posts: 27
Credit: 5,394,270
RAC: 0
Message 93381 - Posted: 4 Apr 2020, 13:52:07 UTC - in response to Message 93378.  

[snip]

Yes, smt/hyper threading is off. The point is though, on my Intel laptops, the speed has remained the same between 4.07 to 4.12. On my tr it's dropped over 60% regardless of what I try, I can't put it much simpler than that.

Note that all of your Intel laptops have a much lower number of cores, and will therefore have much less of a problem with too many cores trying to share the limited speed path to main memory. I can't put it much simpler than that, either.



4.07 is the reference yes. 4.12 is 60% slower than that, I don't know what is so difficult to understand, it's has nothing to do with memory or anything else. You dont have a 64/128 chip to only run it on 5 cores to keep the same productivity because of a software change. Is there a mod/developer that can possibly comment on this issue?
ID: 93381 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Luigi R.

Send message
Joined: 7 Feb 14
Posts: 39
Credit: 2,045,527
RAC: 0
Message 93382 - Posted: 4 Apr 2020, 14:26:37 UTC - in response to Message 92757.  
Last modified: 4 Apr 2020, 14:33:34 UTC

@bcov:

Why is low-credit granting retroactive?
I downloaded a bunch of covid-19 tasks on the 29th of March. I'm reporting now and getting 2cr. instead of ~200cr.

I thought issue would affect only new tasks downloaded after the 31st of March.

My app version is 4.08.
ID: 93382 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 37 · 38 · 39 · 40 · 41 · 42 · 43 . . . 299 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org