Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 346 · 347 · 348 · 349 · 350 · 351 · 352 · Next

AuthorMessage
GroovyG

Send message
Joined: 3 Aug 13
Posts: 1
Credit: 2,469,848
RAC: 0
Message 113013 - Posted: 18 Aug 2025, 11:46:48 UTC

Hoping this is a sensible place to put this: have had no tasks for a couple of weeks; and wonder what might be broken. Just updated from 8.0.2 to 8.2.4 but no tasks coming up.
ID: 113013 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bonny

Send message
Joined: 1 Apr 20
Posts: 1
Credit: 5,092,930
RAC: 47
Message 113014 - Posted: 18 Aug 2025, 13:36:28 UTC - in response to Message 113013.  

Rosetta has run out of work for the moment.
ID: 113014 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2475
Credit: 46,506,558
RAC: 3,757
Message 113015 - Posted: 18 Aug 2025, 16:02:50 UTC
Last modified: 18 Aug 2025, 16:12:45 UTC

Unrelated to here, but for those who've had server feeder errors on WCG, it's just cleared
39 tasks uploaded, 169 tasks came down
Back working at last

Edit: This might be intermittent for the moment. I just tried updating my laptop and still got the "server error: feeder not running" message. It might be everyone piling in all at once, but hopefully it's fixed for everyone soon.
ID: 113015 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5770
Credit: 6,139,760
RAC: 1
Message 113016 - Posted: 18 Aug 2025, 19:42:24 UTC - in response to Message 113014.  

Rosetta has run out of work for the moment.



Nothing new. It is hit an miss these days. Short runs gobbled up by the massive amount of people here.
The AI took away our old work that was steady. Now we run refinements of its work if needed.
If you get some work, be glad, if you don't oh well..better luck next time.
ID: 113016 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Tom M

Send message
Joined: 20 Jun 17
Posts: 178
Credit: 36,299,045
RAC: 19
Message 113017 - Posted: 20 Aug 2025, 14:12:22 UTC - in response to Message 113015.  
Last modified: 20 Aug 2025, 14:14:32 UTC

Unrelated to here, but for those who've had server feeder errors on WCG, it's just cleared
39 tasks uploaded, 169 tasks came down
Back working at last

Edit: This might be intermittent for the moment. I just tried updating my laptop and still got the "server error: feeder not running" message. It might be everyone piling in all at once, but hopefully it's fixed for everyone soon.


I have been getting WCG pretty steadily. But I have Milkyway@Home setup as a "0" (zero resource) alternative so my CPU (usually) has work.

While I am polling Rosetta on this "new" system, I have not had any luck pulling any down.
https://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=6320307

I am about to RMA my new-to-me Server MB.
Proud member of the O.F.A. (Old Farts Association)
ID: 113017 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2475
Credit: 46,506,558
RAC: 3,757
Message 113018 - Posted: 20 Aug 2025, 18:17:08 UTC - in response to Message 113017.  

Unrelated to here, but for those who've had server feeder errors on WCG, it's just cleared
39 tasks uploaded, 169 tasks came down
Back working at last

Edit: This might be intermittent for the moment. I just tried updating my laptop and still got the "server error: feeder not running" message. It might be everyone piling in all at once, but hopefully it's fixed for everyone soon.

I have been getting WCG pretty steadily. But I have Milkyway@Home setup as a "0" (zero resource) alternative so my CPU (usually) has work.

While I am polling Rosetta on this "new" system, I have not had any luck pulling any down
https://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=6320307

Yup, WCG has been good for a few days now.
On Rosetta, the assumption should be for nothing in the short term.
I picked up 2 Rosetta tasks on my phone today, which was something of a shock tbh. The exception rather than the rule.
ID: 113018 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Tigers_Dave
Avatar

Send message
Joined: 9 Dec 05
Posts: 10
Credit: 127,094,801
RAC: 615
Message 113019 - Posted: 21 Aug 2025, 3:11:31 UTC - in response to Message 113016.  

Rosetta has run out of work for the moment.



Nothing new. It is hit an miss these days. Short runs gobbled up by the massive amount of people here.
The AI took away our old work that was steady. Now we run refinements of its work if needed.
If you get some work, be glad, if you don't oh well..better luck next time.


I think this is a great attitude to take. I have resumed Einstein CPU crunching at a resource share of 1. So, if I don't pick up sufficient Rosetta tasks, I'll crunch Einstein tasks instead.
ID: 113019 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
JLDun
Avatar

Send message
Joined: 31 May 08
Posts: 12
Credit: 75,322
RAC: 0
Message 113020 - Posted: 21 Aug 2025, 19:08:47 UTC

I was lucky enough to download 17 tasks from the recent batch last night, but 8 of them crashed within 3 seconds, all with "process exited with code 2 (0x2, -254)" and "no such file or directory".

The only noticable difference (on my end) is the ones that crashed were on an Android 15 phone and tablet, and the ones that are still running are on Android 12 & 9 phones.
ID: 113020 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
JLDun
Avatar

Send message
Joined: 31 May 08
Posts: 12
Credit: 75,322
RAC: 0
Message 113021 - Posted: 21 Aug 2025, 19:08:49 UTC

I was lucky enough to download 17 tasks from the recent batch last night, but 8 of them crashed within 3 seconds, all with "process exited with code 2 (0x2, -254)" and "no such file or directory".

The only noticable difference (on my end) is the ones that crashed were on an Android 15 phone and tablet, and the ones that are still running are on Android 12 & 9 phones.
ID: 113021 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
JLDun
Avatar

Send message
Joined: 31 May 08
Posts: 12
Credit: 75,322
RAC: 0
Message 113022 - Posted: 21 Aug 2025, 19:08:51 UTC
Last modified: 21 Aug 2025, 19:10:21 UTC

I was lucky enough to download 17 tasks from the recent batch last night, but 8 of them crashed within 3 seconds, all with "process exited with code 2 (0x2, -254)" and "no such file or directory".

The only noticable difference (on my end) is the ones that crashed were on an Android 15 phone and tablet, and the ones that are still running are on Android 12 & 9 phones.

(EDIT: This is with the default/manufacturer-installed OS on each one.)
ID: 113022 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
TLD

Send message
Joined: 3 Nov 09
Posts: 3
Credit: 2,920,756
RAC: 3,042
Message 113023 - Posted: 21 Aug 2025, 21:00:20 UTC - in response to Message 113022.  

I got 28, so far 8 have validated and no errors.
ID: 113023 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2475
Credit: 46,506,558
RAC: 3,757
Message 113024 - Posted: 22 Aug 2025, 2:02:55 UTC - in response to Message 113018.  

On Rosetta, the assumption should be for nothing in the short term.
I picked up 2 Rosetta tasks on my phone today, which was something of a shock tbh. The exception rather than the rule

Like others I got tasks on a variety of machines today.
I didn't realise until I saw my credit go up - by the time I got back from work they'd all been completed and I thought it was another idle day.
Shows how much I know.
Still, if I expect nothing, everything I do get is a pleasant surprise
ID: 113024 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2475
Credit: 46,506,558
RAC: 3,757
Message 113025 - Posted: 22 Aug 2025, 2:14:39 UTC - in response to Message 113011.  

Taken a little while to get this right. Just need to get another HDMI cable for it to be ideal, but getting by with DVI for now.

Just as I was packing up to go home last week, I found another HDMI cable I didn't realise I had.
Now connected and looking good.
Boinc running at 100% on all cores (from 70%) , temperatures staying down and no heat-related crashes of the PC or the graphics card, possibly due to the much lower power-draw.
One of the best upgrades of 2 PCs I've ever done, for only 95GBP and a little bit of work.
Can hardly believe it tbh
ID: 113025 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 2124
Credit: 12,426,657
RAC: 2,579
Message 113026 - Posted: 22 Aug 2025, 14:12:25 UTC - in response to Message 113016.  

Nothing new. It is hit an miss these days. Short runs gobbled up by the massive amount of people here.
The AI took away our old work that was steady. Now we run refinements of its work if needed.


This is not a problem. Today also home cpus have AI inside... :-)
ID: 113026 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 2124
Credit: 12,426,657
RAC: 2,579
Message 113027 - Posted: 22 Aug 2025, 18:36:06 UTC - in response to Message 113023.  

I got 28, so far 8 have validated and no errors.


Lucky guy. All errors on my wus:

-1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION
<![CDATA[
<message>
(unknown error) (317) - exit code 3221225477 (0xc0000005)</message>
<stderr_txt>
command: projects/boinc.bakerlab.org_rosetta/rosetta_4.20_windows_x86_64.exe -run:protocol jd2_scripting @flags_rb_08_01_685031_676925__t000__0_C1_robetta -silent_gz -mute all -out:file:silent default.out -in:file:boinc_wu_zip input_rb_08_01_685031_676925__t000__0_C1_robetta.zip -frag_weight_aligned 0.2 -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 2227079
Using database: database_357d5d93529_n_methylminirosetta_database


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x000002BA80C97E30

ID: 113027 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2475
Credit: 46,506,558
RAC: 3,757
Message 113029 - Posted: 23 Aug 2025, 4:22:09 UTC - in response to Message 113027.  

I got 28, so far 8 have validated and no errors.

Lucky guy. All errors on my wus:

I got at least 80% successes
They look like coding errors rather than problems at our end
A small batch came down a week or two back and they were like 100% failures, so this is actually an improvement...
ID: 113029 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
MacStevins

Send message
Joined: 25 Jul 20
Posts: 1
Credit: 21,639
RAC: 83
Message 113033 - Posted: 25 Aug 2025, 11:36:23 UTC - in response to Message 113022.  

...all with "process exited with code 2 (0x2, -254)" and "no such file or directory."


its because theres no aarch64/ARM64 build and it seems that newer Android versions will crash/error, it happened to myself through Einstein@Home until i had to enable the beta test to my phone
ID: 113033 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
crystalsys
Avatar

Send message
Joined: 11 Aug 09
Posts: 11
Credit: 1,677,398
RAC: 0
Message 113037 - Posted: 27 Aug 2025, 3:10:19 UTC

Assigned tasks will never complete by due date.

This is the only project doing this, Running three projects, but lately I get Rosetta tasks that will never finish by the due date.

I can abort them, but I don't understand why this keeps happening. Will ditch the project if there is no explanation and
it continues this way.
ID: 113037 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1895
Credit: 18,534,891
RAC: 0
Message 113038 - Posted: 27 Aug 2025, 4:58:09 UTC - in response to Message 113037.  
Last modified: 27 Aug 2025, 5:13:22 UTC

Assigned tasks will never complete by due date.

This is the only project doing this, Running three projects, but lately I get Rosetta tasks that will never finish by the due date.

I can abort them, but I don't understand why this keeps happening.
There is either something seriously wrong with your system, or your system is busy doing a huge amount of other CPU work (eg Folding@home) while it's trying to process BOINC work, or (given that it is a laptop) you have set "Use at most xxx % of CPU time" to some exceptionally ridiculous value.

1 day and 23 hours to do only 6 hrs and 8 minutes of actual processing is why the system is struggling to complete anything.
Same with your Einstein processing times- 78,152 seconds (21 hrs 40min) to do only 13,754 seconds (3 hrs 48 min) of actual work.


Make sure "Use at most xxx % of CPU time" is set to 100%. Being a laptop (and a basic one at that, even with it's very, very, very low clock speeds), it's cooling isn't going to be all that great, so you'll need to limit the number of cores/threads being used.
Set "Use at most xxx % of the CPUs" to 25% (2 cores/threads) If it still can't deal with it, set it to 12% or 13% (1 core/thread).
(If that isn't the cause, then check in Task Manager and see what it is that's using all of your CPU time, leaving none available for BOINC).

If you're not using local settings, then after making the changes on the project web site, select that project in the BOINC Manager, and then Update, for the changes to take effect.
Grant
Darwin NT
ID: 113038 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
crystalsys
Avatar

Send message
Joined: 11 Aug 09
Posts: 11
Credit: 1,677,398
RAC: 0
Message 113039 - Posted: 27 Aug 2025, 13:41:42 UTC - in response to Message 113038.  

Thanks for the reply. I've made a few small changes, but the fact is that the laptop is not heavily used and was given very nearly free reign for BOINC when not in use, and did not run until the laptop had been unused for 5 minutes.

At my last job, I had access to multiple systems running lab machines, and let's face it, most of the time they were entirely idle. I had BOINC on all of them, running when the lab system was idle, with no impact on lab work. I did have another laptop here running BOINC, but when it started making noises like the fan was going to fail, it's shut down for now.

PrimeGrid sends a lot of relatively small jobs, always complete early. I observe there that the estimated time to completion drops as soon as the task is begun.
Einstein sends a number of jobs, fewer and larger than PrimeGrid and always complete early.

Right now I've put both those projects on 'no new tasks' and I'll see what happens after that, and update later.
ID: 113039 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 346 · 347 · 348 · 349 · 350 · 351 · 352 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2025 University of Washington
https://www.bakerlab.org