Compute errors

Questions and Answers : Windows : Compute errors

To post messages, you must log in.

AuthorMessage
Bruce Downing

Send message
Joined: 19 Jul 08
Posts: 16
Credit: 7,615,560
RAC: 2,742
Message 56583 - Posted: 1 Nov 2008, 4:29:01 UTC

I am getting "client error" dozens of times in the last two days. Only three tasks have been successful. I tried resetting the project but it is still not working.
Any advice to fix this would be appreciated. Thanks!

Bruce Downing
Sarasota
ID: 56583 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
LittleNew

Send message
Joined: 27 May 08
Posts: 5
Credit: 47,758
RAC: 0
Message 56587 - Posted: 1 Nov 2008, 6:08:35 UTC

seems like you have the same problem as I do.
ID: 56587 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bruce Downing

Send message
Joined: 19 Jul 08
Posts: 16
Credit: 7,615,560
RAC: 2,742
Message 56600 - Posted: 1 Nov 2008, 15:34:48 UTC - in response to Message 56587.  

seems like you have the same problem as I do.


I am also running SETI and Einstein, no problems with those.
ID: 56600 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
LittleNew

Send message
Joined: 27 May 08
Posts: 5
Credit: 47,758
RAC: 0
Message 56602 - Posted: 1 Nov 2008, 16:43:38 UTC

Me too. Just only Rosetta.
ID: 56602 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bruce Downing

Send message
Joined: 19 Jul 08
Posts: 16
Credit: 7,615,560
RAC: 2,742
Message 56607 - Posted: 1 Nov 2008, 19:53:56 UTC - in response to Message 56602.  

Me too. Just only Rosetta.


I have stopped new Rosetta tasks.
ID: 56607 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
morjar

Send message
Joined: 29 Apr 06
Posts: 2
Credit: 1,239,975
RAC: 0
Message 56622 - Posted: 2 Nov 2008, 8:49:00 UTC - in response to Message 56583.  

I have also started to receive Client Error/Compute Error since around Oct-30 or Oct-31.
Common to these errors are that the CPU time is always very low, something like 0.06 seconds.
The application is always "Rosetta Mini with new score terms" and the WU name is something like "1fna__BOINC_CASP8_ABRELAX_SPLIT_SPLIT_IGNORE_THE_REST-S25-9-S3-3--1fna_-_4662_210"

Since these erroneous WU's don't consume CPU at all, the break at the very beginning, they shouldn't harm at all I guess?

ID: 56622 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bruce Downing

Send message
Joined: 19 Jul 08
Posts: 16
Credit: 7,615,560
RAC: 2,742
Message 56643 - Posted: 2 Nov 2008, 20:23:28 UTC - in response to Message 56622.  

I have also started to receive Client Error/Compute Error since around Oct-30 or Oct-31.
Common to these errors are that the CPU time is always very low, something like 0.06 seconds.
The application is always "Rosetta Mini with new score terms" and the WU name is something like "1fna__BOINC_CASP8_ABRELAX_SPLIT_SPLIT_IGNORE_THE_REST-S25-9-S3-3--1fna_-_4662_210"

Since these erroneous WU's don't consume CPU at all, the break at the very beginning, they shouldn't harm at all I guess?


Same here exactly. I was worried that it is my computer, but now I don't think so. As to harm, I guess not, but I sure would like this resolved.

Bruce in Sarasota, Florida
ID: 56643 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bruce Downing

Send message
Joined: 19 Jul 08
Posts: 16
Credit: 7,615,560
RAC: 2,742
Message 56672 - Posted: 3 Nov 2008, 17:25:14 UTC - in response to Message 56622.  

I have also started to receive Client Error/Compute Error since around Oct-30 or Oct-31.
Common to these errors are that the CPU time is always very low, something like 0.06 seconds.
The application is always "Rosetta Mini with new score terms" and the WU name is something like "1fna__BOINC_CASP8_ABRELAX_SPLIT_SPLIT_IGNORE_THE_REST-S25-9-S3-3--1fna_-_4662_210"

Since these erroneous WU's don't consume CPU at all, the break at the very beginning, they shouldn't harm at all I guess?


I have run 7 new tasks, all were normal. Will come back here if any more problems.
Bruce.
ID: 56672 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Stephenish

Send message
Joined: 26 Feb 06
Posts: 3
Credit: 757,327
RAC: 0
Message 57430 - Posted: 2 Dec 2008, 0:30:32 UTC

2 Rosetta Mini 1.40 WUs failed as "Computation error."

h010__BOINC_ABRELAX_RANGE_yebf_IGNORE_THE_REST-S25-5-S3-7--h010_-_4675_452_0
h011__BOINC_ABRELAX_RANGE_yebf_IGNORE_THE_REST-S25-10-S3-5--h011_-_4675_475_0

The first WU ran for 00:53:51 and the second WU ran for 00:19:45 before aborting.
ID: 57430 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Scott

Send message
Joined: 26 Nov 08
Posts: 1
Credit: 89,739
RAC: 0
Message 57518 - Posted: 2 Dec 2008, 23:17:50 UTC - in response to Message 57430.  


What is the resolution to this COMPUTE ERROR. Over half of my WU are ending with this message (and, consequently, no credits but gobs of my CPU being ripped.

It's obvious others are experiencing it too so why no fix/work-around.

If there's not something constructive available shortly, I will discontinue my contributions to rosetta in favour of another project that can better use my resources.

Thanks,
ID: 57518 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Matthias Lehmkuhl

Send message
Joined: 20 Nov 05
Posts: 10
Credit: 2,115,357
RAC: 361
Message 57540 - Posted: 3 Dec 2008, 10:06:22 UTC

see also
Thread: download error MD5 check failed

I got these messages, some of my wingman end with compute errors.

I've set my computers to "no new work" till any update from Project administrator
Matthias

ID: 57540 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mfbabb2

Send message
Joined: 10 Oct 08
Posts: 4
Credit: 10,345
RAC: 0
Message 57553 - Posted: 3 Dec 2008, 16:36:49 UTC

After getting the Compute Errors, I now get:
12/3/2008 9:45:26 AM|rosetta@home|Scheduler request succeeded: got 0 new tasks
12/3/2008 9:45:26 AM|rosetta@home|Message from server: Server error: can't attach shared memory
Sure looks like something broke.

ID: 57553 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 57568 - Posted: 3 Dec 2008, 20:46:57 UTC

Please review the Number Crunching board. Several threads there already discuss the shared memory issue.
Rosetta Moderator: Mod.Sense
ID: 57568 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
vakobo

Send message
Joined: 3 Aug 08
Posts: 18
Credit: 13,636,264
RAC: 2,319
Message 57620 - Posted: 5 Dec 2008, 12:32:34 UTC

WUs named like cs_vanilla_abrelax_... finish with error 'Output file ... absent'
and Windows shows message that Rosetta mini 1.4 causes an error and will be terminated (or something like this).
ID: 57620 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1223
Credit: 13,824,497
RAC: 2,340
Message 57643 - Posted: 6 Dec 2008, 2:25:32 UTC - in response to Message 57620.  

WUs named like cs_vanilla_abrelax_... finish with error 'Output file ... absent'
and Windows shows message that Rosetta mini 1.4 causes an error and will be terminated (or something like this).


Which complete version number of minirosetta? I've had workunits from both version 1.40 and version 1.45 lately. There's already separate threads for reporting problems in each of them - rather long for 1.40, and rather short for the newer 1.45. You might get a more useful response if you report the problem in the thread specific to that version, although similar problems have been reported in the thread for 1.40 before, mostly with more complete error reports.

ID: 57643 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
vakobo

Send message
Joined: 3 Aug 08
Posts: 18
Credit: 13,636,264
RAC: 2,319
Message 57651 - Posted: 6 Dec 2008, 15:00:02 UTC - in response to Message 57643.  

WUs named like cs_vanilla_abrelax_... finish with error 'Output file ... absent'
and Windows shows message that Rosetta mini 1.4 causes an error and will be terminated (or something like this).


Which complete version number of minirosetta? I've had workunits from both version 1.40 and version 1.45 lately. There's already separate threads for reporting problems in each of them - rather long for 1.40, and rather short for the newer 1.45. You might get a more useful response if you report the problem in the thread specific to that version, although similar problems have been reported in the thread for 1.40 before, mostly with more complete error reports.


When i get this problem i was already runing version 1.45 but in the message generated by Windows was 1.4
ID: 57651 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
vakobo

Send message
Joined: 3 Aug 08
Posts: 18
Credit: 13,636,264
RAC: 2,319
Message 57716 - Posted: 8 Dec 2008, 21:30:53 UTC
Last modified: 8 Dec 2008, 21:33:54 UTC

Again same problem with WU named "cs_valilla_abrelax_...".
Application: Rosetta Mini 1.45
In Windows message appear minirosetta_1.4.exe instead of minirosetta_1.45_windows_intelx86.exe

With other WUs no Windows messages was displayed.
ID: 57716 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1223
Credit: 13,824,497
RAC: 2,340
Message 57752 - Posted: 9 Dec 2008, 20:20:52 UTC - in response to Message 57716.  

Again same problem with WU named "cs_valilla_abrelax_...".
Application: Rosetta Mini 1.45
In Windows message appear minirosetta_1.4.exe instead of minirosetta_1.45_windows_intelx86.exe

With other WUs no Windows messages was displayed.


There have been a number of problem reports for the workunits with names beginning with cs_vanilla over in the minirosetta 1.45 thread. For now, expect a much higher percentage of those workunits to fail than workunits with other names.

ID: 57752 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Questions and Answers : Windows : Compute errors



©2024 University of Washington
https://www.bakerlab.org