exited with zero status but no 'finished' file

Message boards : Number crunching : exited with zero status but no 'finished' file

To post messages, you must log in.

AuthorMessage
googloo
Avatar

Send message
Joined: 15 Sep 06
Posts: 133
Credit: 21,719,780
RAC: 5,970
Message 73715 - Posted: 28 Aug 2012, 22:20:29 UTC

I have a new computer (Windows 7 Professional, SSD). I am running BOINC 7.0.28 (x64). I was getting this message only on the rb tasks in the beginning, but when it started to happen on other tasks, I did as the messages suggested, and reset the project. Still happening. Any suggestions?

Here are the entire 2 lines: 8/28/2012 5:40:18 PM | rosetta@home | Task sfr_sc1_1014116m10_abinitio_SAVE_ALL_OUT_56042_170_0 exited with zero status but no 'finished' file
8/28/2012 5:40:18 PM | rosetta@home | If this happens repeatedly you may need to reset the project.
ID: 73715 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Wissi

Send message
Joined: 19 Nov 08
Posts: 14
Credit: 485,807
RAC: 0
Message 73815 - Posted: 11 Sep 2012, 20:41:35 UTC - in response to Message 73715.  

I have the same problem here, even after resetting the project. Whats going on here?? I run Boinc Manager 7.0.28 on a 64bit Windows 7 Ultimate, Rosetta has currently Mini Rosetta 3.41 running here...

I have no idea what to do...
ID: 73815 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Snags

Send message
Joined: 22 Feb 07
Posts: 198
Credit: 2,815,019
RAC: 764
Message 73818 - Posted: 12 Sep 2012, 10:44:22 UTC

On Rosetta this is usually solved by increasing the "use at most xxx% of CPU time" setting to 100. You may then want to reduce the "on multiprocessors, use at most xxx% of the processors" to something less than currently set. Most people find this handles the temperature regulation concerns (that the cpu throttling was designed to address) perfectly.

Another possible cause are virus scanners; most folks exclude BOINC from those scans or set it to run only when BOINC isn't active.

An explanation and more possible causes can be found here: BOINC FAQ Service

Please know that this only becomes a fatal error when it occurs 100 times to a particular task; at that point BOINC assumes the task will never be able to finish and gives up on it, ending it as a client error. If you see this message only occasionally it is safe to ignore it.


Best,
Snags
ID: 73818 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 1990
Credit: 38,522,839
RAC: 15,277
Message 73822 - Posted: 13 Sep 2012, 1:12:43 UTC
Last modified: 13 Sep 2012, 1:13:21 UTC

Sorry if I'm bit confused here, but there don't appear to be any errors in your finished tasks and credits are fully awarded. Or did I miss something?

Attempting to micro-manage every last warning message doesn't lead anywhere. On the rare occasion I ever see this kind of thing I've usually had an issue with another program I'm running which has a knock-on effect with Boinc, but not sufficiently that it actually breaks anything. Looks like nothing to me.
ID: 73822 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Alan Roberts

Send message
Joined: 7 Jun 06
Posts: 61
Credit: 6,901,926
RAC: 0
Message 73861 - Posted: 19 Sep 2012, 15:02:53 UTC - in response to Message 73818.  

Seeing the same issue on a new Win7 machine that I cranked up with BOINC 7.0.28. Per your note, I've just switched from 75% CPU on four jobs (one per core) to 100% CPU, only two concurrent jobs. Waiting to see if that reduces the problem, and how the temperature settles out.

Question: Any correlation between this error and the, "mismatch" between Rosetta and BOINC 7.x? Should I be planning my retreat to BOINC 6 because I'm seeing this failure?

...
Please know that this only becomes a fatal error when it occurs 100 times to a particular task; at that point BOINC assumes the task will never be able to finish and gives up on it, ending it as a client error. If you see this message only occasionally it is safe to ignore it.


Best,
Snags


Understood, but does the restart imply loss-of-work back to the previous checkpoint for the job?

ID: 73861 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Snags

Send message
Joined: 22 Feb 07
Posts: 198
Credit: 2,815,019
RAC: 764
Message 73870 - Posted: 20 Sep 2012, 20:21:25 UTC - in response to Message 73861.  

Seeing the same issue on a new Win7 machine that I cranked up with BOINC 7.0.28. Per your note, I've just switched from 75% CPU on four jobs (one per core) to 100% CPU, only two concurrent jobs. Waiting to see if that reduces the problem, and how the temperature settles out.

Question: Any correlation between this error and the, "mismatch" between Rosetta and BOINC 7.x? Should I be planning my retreat to BOINC 6 because I'm seeing this failure?


I shouldn't think so. This error message was first added for BOINC 5. We saw quite a spate of posts about it a while ago well before BOINC 7 was released. I haven't noticed any of the posts citing problems with BOINC 7 listing this as a symptom.

...
Please know that this only becomes a fatal error when it occurs 100 times to a particular task; at that point BOINC assumes the task will never be able to finish and gives up on it, ending it as a client error. If you see this message only occasionally it is safe to ignore it.


Best,
Snags


Understood, but does the restart imply loss-of-work back to the previous checkpoint for the job?


Yes but as Sid notes most of the time it isn't worth fretting over as a rare occurrence it would be difficult to track down the conflict and may be impossible to avoid. If it continues to happen frequently click through to the BOINC FAQ Service and check out Jord's list of suggestions. The link in my previous post takes you straight to the relevant page.


Best,
Snags
ID: 73870 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mikey
Avatar

Send message
Joined: 5 Jan 06
Posts: 1894
Credit: 8,767,285
RAC: 7,348
Message 73880 - Posted: 22 Sep 2012, 11:49:40 UTC - in response to Message 73861.  

Question: Any correlation between this error and the, "mismatch" between Rosetta and BOINC 7.x? Should I be planning my retreat to BOINC 6 because I'm seeing this failure?


Rosetta OFFICIALLY does NOT support Version 7 of Boinc! They are happy with version 6 and are NOT upgrading any of their systems to support version 7. IF it works or not is not their concern. It DOES work for some people, but most find reverting to version 6 works better. Just be aware that downgrading means you WILL lose all existing units on your system, from EVERY Project!! So plan wisely!
ID: 73880 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Alan Roberts

Send message
Joined: 7 Jun 06
Posts: 61
Credit: 6,901,926
RAC: 0
Message 73885 - Posted: 24 Sep 2012, 19:10:27 UTC - in response to Message 73870.  


...

Yes but as Sid notes most of the time it isn't worth fretting over as a rare occurrence it would be difficult to track down the conflict and may be impossible to avoid. If it continues to happen frequently click through to the BOINC FAQ Service and check out Jord's list of suggestions. The link in my previous post takes you straight to the relevant page.

Best,
Snags


It was definitely happening frequently, but I did try the suggestion of not using BOINC's CPU throttling (switched from using four cores at 75% to just two at 100%), and that seems to have cleaned the problem up for me, at the "cost" of a wider spread in core temperatures and somewhat less crunching that I was hoping to get done.

Thanks,
Alan


ID: 73885 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Link
Avatar

Send message
Joined: 4 May 07
Posts: 352
Credit: 382,349
RAC: 0
Message 73886 - Posted: 24 Sep 2012, 21:56:17 UTC - in response to Message 73885.  

at the "cost" of a wider spread in core temperatures and somewhat less crunching that I was hoping to get done.

To get about the same job done you should use 3 cores @100%, that's 75% out of 4. Best performance you get of course with all 4 cores and since you apparently have enough RAM to run 4 tasks you could consider that, BOINC tasks are running at very low priority, so usually there should not be any performance issues with other applications.
.
ID: 73886 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
googloo
Avatar

Send message
Joined: 15 Sep 06
Posts: 133
Credit: 21,719,780
RAC: 5,970
Message 73894 - Posted: 27 Sep 2012, 21:51:29 UTC - in response to Message 73880.  

Question: Any correlation between this error and the, "mismatch" between Rosetta and BOINC 7.x? Should I be planning my retreat to BOINC 6 because I'm seeing this failure?


Rosetta OFFICIALLY does NOT support Version 7 of Boinc! They are happy with version 6 and are NOT upgrading any of their systems to support version 7. IF it works or not is not their concern. It DOES work for some people, but most find reverting to version 6 works better. Just be aware that downgrading means you WILL lose all existing units on your system, from EVERY Project!! So plan wisely!


Thanks, mikey. Reverting to version 6.12.34 (x64) seems to have solved the problem.
ID: 73894 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Number crunching : exited with zero status but no 'finished' file



©2024 University of Washington
https://www.bakerlab.org