Rosetta@home

Minirosetta v1.40 bug thread

Sarel

Joined: May 11 06
Posts: 51
ID: 81994
Credit: 81,712
RAC: 0
Message 56741 - Posted 6 Nov 2008 23:00:25 UTC

Please report any bugs in this version here.

Sarel.
____________

Mod.Sense
Forum moderator
Project administrator

Joined: Aug 22 06
Posts: 3381
ID: 106194
Credit: 0
RAC: 0
Message 56743 - Posted 6 Nov 2008 23:13:16 UTC

The link on the homepage to the bugs thread leads you to the v1.39 thread.
____________
Rosetta Moderator: Mod.Sense

Chu

Joined: Feb 23 06
Posts: 120
ID: 61076
Credit: 112,439
RAC: 0
Message 56745 - Posted 6 Nov 2008 23:43:09 UTC

We have also located the graphics problem that occurred when a non-protein ligand was displayed, and we have implemented a fix for it. Please let us know if you still observe such problems.
____________

Sarel

Joined: May 11 06
Posts: 51
ID: 81994
Credit: 81,712
RAC: 0
Message 56749 - Posted 7 Nov 2008 1:05:07 UTC

Thanks! Fixed... Sarel
____________

Naesbye

Joined: Jul 30 08
Posts: 5
ID: 271478
Credit: 200,568
RAC: 0
Message 56760 - Posted 7 Nov 2008 14:28:32 UTC

My first 1.40 unit ended with a computation error.

Odd Braathun

Joined: Sep 2 08
Posts: 9
ID: 276375
Credit: 16,125
RAC: 0
Message 56772 - Posted 8 Nov 2008 15:23:11 UTC

Problem with this task:

Task ID 206078107
Name 1vcc__BOINC_ABRELAX_SPLIT_CONTROL_IGNORE_THE_REST-S25-9-S3-3--1vcc_-_4677_199_0
Workunit 188017112

Exiting numerous times but no "finished" file. BOINC said to reset the project.

Odd

P . P . L .

Joined: Aug 20 06
Posts: 581
ID: 105843
Credit: 4,864,105
RAC: 0
Message 56779 - Posted 9 Nov 2008 6:48:34 UTC

I have this task running now and it is very slow to progress. I watched, and it is only gaining 0.001% every 20 sec. It has been running for 8 hrs 20 min and is at 98.050%; my run time is 6 hrs. I haven't had this big a margin to finish before.

Could it be the new mini app 1.40 or the task?

1hzh_2fiw_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_76

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=188078859

I'll let it run to end.

pete.
____________


Odd Braathun

Joined: Sep 2 08
Posts: 9
ID: 276375
Credit: 16,125
RAC: 0
Message 56780 - Posted 9 Nov 2008 9:41:54 UTC

I have had one of these, too, but have now aborted it.

Task ID 206030023
Name 1hzh_1juv_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_27_0
Workunit 187974469

I had also
Task ID 206101035
Name IL23p40_p40BrubYhbond_design_jecorn_SAVE_ALL_OUT_IGNORE_THE_REST_ip40_2osa_4683_197_0
Workunit 188036989

This task ran smoothly for 2 hours, but ended up with a validate error.

Odd

Aegis Maelstrom

Joined: Oct 29 08
Posts: 61
ID: 285843
Credit: 792,303
RAC: 34
Message 56781 - Posted 9 Nov 2008 11:48:34 UTC

Hi,
I'm just having a similar problem as above.

Task IL23p40_p40BrubYhbond_design_jecorn_SAVE_ALL_OUT_IGNORE_THE_REST_ip40_1wr2_4683_55_1

restarted twice so far, now processing:

2008-11-09 07:36:00|rosetta@home|Starting IL23p40_p40BrubYhbond_design_jecorn_SAVE_ALL_OUT_IGNORE_THE_REST_ip40_1wr2_4683_55_1
2008-11-09 07:36:35|rosetta@home|Starting task IL23p40_p40BrubYhbond_design_jecorn_SAVE_ALL_OUT_IGNORE_THE_REST_ip40_1wr2_4683_55_1 using minirosetta version 140
2008-11-09 09:36:44|rosetta@home|Task IL23p40_p40BrubYhbond_design_jecorn_SAVE_ALL_OUT_IGNORE_THE_REST_ip40_1wr2_4683_55_1 exited with zero status but no 'finished' file
2008-11-09 09:36:45|rosetta@home|If this happens repeatedly you may need to reset the project.
2008-11-09 09:38:42|rosetta@home|Restarting task IL23p40_p40BrubYhbond_design_jecorn_SAVE_ALL_OUT_IGNORE_THE_REST_ip40_1wr2_4683_55_1 using minirosetta version 140
2008-11-09 12:16:02|rosetta@home|Task IL23p40_p40BrubYhbond_design_jecorn_SAVE_ALL_OUT_IGNORE_THE_REST_ip40_1wr2_4683_55_1 exited with zero status but no 'finished' file
2008-11-09 12:16:03|rosetta@home|If this happens repeatedly you may need to reset the project.
2008-11-09 12:16:48|rosetta@home|Restarting task IL23p40_p40BrubYhbond_design_jecorn_SAVE_ALL_OUT_IGNORE_THE_REST_ip40_1wr2_4683_55_1 using minirosetta version 140

Just before the second restart, it had progressed to nearly 60% and was on "model 1", step 11000+, I guess unfolding/testing a beautifully folded protein (the step around 10000 had a lower "low energy" than 11000). I've made a snapshot.

When it restarted, it began from something like 18% with a protein that was not yet well folded. The elapsed time had been reduced as well.

First, I would like to ask for more checkpoints; they would help with both processing and bug testing. Now I am waiting to see whether this workunit can be finished.
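
For anyone who wants to tally these restarts from their own client log, here is a minimal sketch. It assumes the `timestamp|project|message` line layout shown in the excerpt above; the function name `count_silent_exits` and the variable names are made up for illustration and are not part of BOINC.

```python
import re
from collections import Counter

# Message text as it appears in the client log lines quoted above.
NO_FINISHED = re.compile(r"Task (\S+) exited with zero status but no 'finished' file")


def count_silent_exits(log_lines):
    """Count, per task, how often it exited without writing a 'finished' file."""
    counts = Counter()
    for line in log_lines:
        # Assumed layout of each line: timestamp|project|message
        fields = line.split("|", 2)
        if len(fields) != 3:
            continue  # skip anything that is not a client log line
        match = NO_FINISHED.search(fields[2])
        if match:
            counts[match.group(1)] += 1
    return counts


# Sample lines copied from the log excerpt in this post.
TASK = "IL23p40_p40BrubYhbond_design_jecorn_SAVE_ALL_OUT_IGNORE_THE_REST_ip40_1wr2_4683_55_1"
sample_log = [
    f"2008-11-09 07:36:35|rosetta@home|Starting task {TASK} using minirosetta version 140",
    f"2008-11-09 09:36:44|rosetta@home|Task {TASK} exited with zero status but no 'finished' file",
    "2008-11-09 09:36:45|rosetta@home|If this happens repeatedly you may need to reset the project.",
    f"2008-11-09 09:38:42|rosetta@home|Restarting task {TASK} using minirosetta version 140",
    f"2008-11-09 12:16:02|rosetta@home|Task {TASK} exited with zero status but no 'finished' file",
]

for task, n in count_silent_exits(sample_log).items():
    print(f"{task}: {n} exit(s) without a 'finished' file")
```

A task that shows up here more than once is a reasonable candidate to report in this thread along with its name.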

Aegis Maelstrom

Joined: Oct 29 08
Posts: 61
ID: 285843
Credit: 792,303
RAC: 34
Message 56782 - Posted 9 Nov 2008 13:39:32 UTC - in response to Message ID 56781.


Task IL23p40_p40BrubYhbond_design_jecorn_SAVE_ALL_OUT_IGNORE_THE_REST_ip40_1wr2_4683_55_1

restarted twice so far, now processing:

(...)

Now I am waiting to check if this workunit is endable.


The workunit restarted a third time, seemingly at the same place as the previous time (the percentage "completed" was higher, but I had checked a couple of minutes earlier and it was once again at step 10000, so by then it was probably at 11000).

The WU started for the fourth time, now at 24%, but I guess it was at the same point as before. When I restarted the WU after temporarily halting it once again, it went back to 17%. Now I can see 18.23% and step 523.

Now I am halting this task and my business with Rosetta.

When BOINC tried to download a different task, I got the following log:
2008-11-09 14:29:23|rosetta@home|Message from server: No work sent
2008-11-09 14:29:23|rosetta@home|Message from server: Your preferences limit memory usage to 452 MB, and 488 MB is needed

The problem seems to be higher memory usage, although one of the mods recently assured us that there is no increase in memory requirements.
I could increase the amount of memory dedicated to BOINC; however, I would like to have this problem explained and ironed out.

Frankly speaking, as this is yet another computational problem within a few days, any explanation from the Rosetta developers/maintainers would be highly appreciated. Thanks for your co-operation and good luck.

Path7

Joined: Aug 25 07
Posts: 128
ID: 201002
Credit: 61,751
RAC: 0
Message 56783 - Posted 9 Nov 2008 14:29:56 UTC

Hello all,
Just saw an error from this WU:
loopbuild_boinc4_hombench_loopbuild_t308__IGNORE_THE_REST_1UKVY_1_4693_12_0

<core_client_version>6.2.25</core_client_version>
<![CDATA[
<stderr_txt>
# cpu_run_time_pref: 21600
# cpu_run_time_pref: 21600
# cpu_run_time_pref: 21600
# cpu_run_time_pref: 21600
Too many restarts with no progress. Keep application in memory while preempted.
======================================================
DONE :: 1 starting structures 24.3206 cpu seconds
This process generated 0 decoys from 0 attempts
======================================================

BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...
called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
<file_name>loopbuild_boinc4_hombench_loopbuild_t308__IGNORE_THE_REST_1UKVY_1_4693_12_0_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>
</message>

Well, looks like 2 errors: Too many restarts & file_xfer error.

Be aware: I'm running WCG's (beta-)BOINC 6.2.25, which seems to be pretty stable (so far).

Have a nice day,
Path7.

Path7

Joined: Aug 25 07
Posts: 128
ID: 201002
Credit: 61,751
RAC: 0
Message 56786 - Posted 9 Nov 2008 16:27:54 UTC

And another error:
oopbuild_boinc4_hombench_loopbuild_t326__IGNORE_THE_REST_1I1QB_3_4700_8_0
failed with:

ERROR: NANs occured in hbonding!
ERROR:: Exit from: ..\..\src\core\scoring\hbonds\hbonds_geom.cc line: 763
called boinc_finish

Have a nice day,
Path7.

Neil Hunter

Joined: May 9 06
Posts: 9
ID: 81689
Credit: 177,620
RAC: 0
Message 56792 - Posted 9 Nov 2008 19:21:12 UTC - in response to Message ID 56779.

I have this task running now and it is very slow to progress. I watched, and it is only gaining 0.001% every 20 sec. It has been running for 8 hrs 20 min and is at 98.050%; my run time is 6 hrs. I haven't had this big a margin to finish before.

Could it be the new mini app 1.40 or the task?

1hzh_2fiw_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_76

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=188078859

I'll let it run to end.

pete.




I grabbed a few WUs on both an XP and Linux m/c.
Both have the same problem for me, in that they get to around 98% complete, then seem to just hang there. Completion does not take place and I have aborted all 1.40 WUs on both PCs for now.

Neil, UK.

____________

Neil Hunter

Joined: May 9 06
Posts: 9
ID: 81689
Credit: 177,620
RAC: 0
Message 56793 - Posted 9 Nov 2008 19:30:28 UTC - in response to Message ID 56792.



I grabbed a few WUs on both an XP and Linux m/c.
Both have the same problem for me, in that they get to around 98% complete, then seem to just hang there. Completion does not take place and I have aborted all 1.40 WUs on both PCs for now.

Neil, UK.


......they all seem to finally stick with 9m 53s to the end of the WU.
____________

P . P . L .

Joined: Aug 20 06
Posts: 581
ID: 105843
Credit: 4,864,105
RAC: 0
Message 56794 - Posted 9 Nov 2008 21:25:25 UTC - in response to Message ID 56779.
Last modified: 9 Nov 2008 21:59:01 UTC

I have this task running now and it is very slow to progress. I watched, and it is only gaining 0.001% every 20 sec. It has been running for 8 hrs 20 min and is at 98.050%; my run time is 6 hrs. I haven't had this big a margin to finish before.

Could it be the new mini app 1.40 or the task?

1hzh_2fiw_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_76

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=188078859

I'll let it run to end.

pete.


Well, it finally finished after 11 hrs. Not very happy; something needs to be fixed.

<core_client_version>5.10.45</core_client_version>
<![CDATA[
<stderr_txt>
# cpu_run_time_pref: 21600
======================================================
DONE :: 1 starting structures 39696.6 cpu seconds
This process generated 1 decoys from 1 attempts
======================================================

BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...
called boinc_finish

</stderr_txt>

Server state: Over _ Outcome: Success _ Client state: Done _ CPU time: 39,697.10 _ Claimed credit: 278.57 _ Granted credit: 16.10

b.t.w the credit is a bad joke.

pete.
____________


Allan Hojgaard

Joined: May 4 08
Posts: 9
ID: 256652
Credit: 425,302
RAC: 0
Message 56798 - Posted 10 Nov 2008 0:12:58 UTC
Last modified: 10 Nov 2008 0:22:29 UTC

Adding my share of long working WUs:

1hzh_2pww_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_86

Result:

<core_client_version>6.2.18</core_client_version>
<![CDATA[
<stderr_txt>
# cpu_run_time_pref: 21600
# cpu_run_time_pref: 21600
======================================================
DONE :: 1 starting structures 39652.9 cpu seconds
This process generated 1 decoys from 1 attempts
======================================================

BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...
called boinc_finish

</stderr_txt>
]]>

As many have said before I do not mind crunching large WUs, but I would like to be credited/warned about it beforehand. Currently one of my cores is working on
1hzh_1a58_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_87_0 and it has now been working on it for 14 hours and 24 minutes and it has reached 98.840%. I am sure that I will get very low credit for it like the others in this thread.

This is what the graphics show me:
http://www.home.no/kalumba/rosetta.png


Until the mess has been sorted out/properly explained I'm crunching for another project. I'm going to visit the forum frequently as Rosetta@Home is my favourite project.

P . P . L .

Joined: Aug 20 06
Posts: 581
ID: 105843
Credit: 4,864,105
RAC: 0
Message 56799 - Posted 10 Nov 2008 1:47:26 UTC

Looks like I have another runaway task. It's at 6 hrs 45 min and 97.655%, and as slow as wet cement: about 0.001% every 10 sec, better than the last one but not much.

I bet I don't get much for it if and when it finishes.

1hzh_2fe5_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_76

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=188078846

pete.

____________


AMD_is_logical

Joined: Dec 20 05
Posts: 299
ID: 41207
Credit: 31,460,681
RAC: 0
Message 56800 - Posted 10 Nov 2008 6:16:10 UTC - in response to Message ID 56786.

And another error:
oopbuild_boinc4_hombench_loopbuild_t326__IGNORE_THE_REST_1I1QB_3_4700_8_0
failed with:

ERROR: NANs occured in hbonding!
ERROR:: Exit from: ..\..\src\core\scoring\hbonds\hbonds_geom.cc line: 763
called boinc_finish

Have a nice day,
Path7.


I got the same error on one of my Linux nodes:
h005__BOINC_ABRELAX_RANGE_yebf_IGNORE_THE_REST-S25-7-S3-8--h005_-_4675_19_0

robertmiles

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 56802 - Posted 10 Nov 2008 13:06:47 UTC

I've got another one of those workunits that are running longer than expected:

11/9/2008 5:57:49 PM|rosetta@home|Starting 1hzh_1o9g_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_155_1
11/9/2008 5:57:54 PM|rosetta@home|Starting task 1hzh_1o9g_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_155_1 using minirosetta version 140

Last night, it had accumulated about 6 CPU hours and claimed that it would finish in another 10 CPU minutes. This morning, it has accumulated over 12 CPU hours and claims that it will finish in another 9 CPU minutes and 56 seconds.

Also, it's currently the most memory hungry process on my machine. The Windows Task Manager recently said it was using over 256,000K of memory - over 10 times as much as the next process - but then dropped that to a little over 200,000K and is now 223,132K.

Since it hasn't let any other process take a turn in its CPU core for much longer than the 2 hours I've tried to set it for, I'll suspend it for a while and see if that helps.

The other person with a similar workunit had a compute error after about 6 CPU hours.

caesar1987

Joined: Nov 28 06
Posts: 13
ID: 131900
Credit: 22,268
RAC: 0
Message 56804 - Posted 10 Nov 2008 13:49:00 UTC - in response to Message ID 56802.


Last night, it had accumulated about 6 CPU hours and claimed that it would finish in another 10 CPU minutes. This morning, it has accumulated over 12 CPU hours and claims that it will finish in another 9 CPU minutes and 56 seconds.

Also, it's currently the most memory hungry process on my machine. The Windows Task Manager recently said it was using over 256,000K of memory - over 10 times as much as the next process - but then dropped that to a little over 200,000K and is now 223,132K.

Same here.
It says that it will finish in 9 minutes and 51 sec. Mine has accumulated only 5 hours 5 min, but the estimate has stayed the same for the last hour.

"mini"rosetta memory usage: approx. 290,000 K; VM size: 320,000 K!!!
What is "mini" about this?

____________

ramostol

Joined: Feb 6 07
Posts: 64
ID: 145835
Credit: 584,052
RAC: 0
Message 56806 - Posted 10 Nov 2008 17:32:14 UTC

My MacBook refuses to compute any loopbuild_boinc4_hombench_ task; cf. this result:

<core_client_version>6.2.18</core_client_version>
<![CDATA[
<stderr_txt>
# cpu_run_time_pref: 3600
# cpu_run_time_pref: 3600
# cpu_run_time_pref: 3600
# cpu_run_time_pref: 3600
# cpu_run_time_pref: 3600
Too many restarts with no progress. Keep application in memory while preempted.
======================================================
DONE :: 1 starting structures 671.186 cpu seconds
This process generated 1 decoys from 1 attempts
======================================================

BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...
called boinc_finish

</stderr_txt>
]]>

Other tasks complete as expected.

svincent

Joined: Dec 30 05
Posts: 202
ID: 44923
Credit: 4,102,500
RAC: 5,735
Message 56807 - Posted 10 Nov 2008 17:50:39 UTC

Many problems on an iMac2 on OSX 10.4.11

a) Tasks partially completed ; either waiting to run or waiting for memory

b) Mon Nov 10 08:06:41 2008|rosetta@home|Task 1hzh_1nio_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_160_0 exited with zero status but no 'finished' file
Mon Nov 10 08:06:41 2008|rosetta@home|If this happens repeatedly you may need to reset the project.
Mon Nov 10 08:06:42 2008|rosetta@home|Restarting task 1hzh_1nio_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_160_0 using minirosetta version 140

Believe, but can't be certain, that this was a task that had yet to complete after 12 hours work: it appears to now be starting again.

c) Mon Nov 10 08:16:01 2008|rosetta@home|Resuming task 1hzh_2a1i_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_200_0 using minirosetta version 140

This task now stuck after 1:05 minutes of processing




____________

DJStarfox

Joined: Jul 19 07
Posts: 140
ID: 191721
Credit: 560,560
RAC: 21
Message 56808 - Posted 10 Nov 2008 18:10:13 UTC
Last modified: 10 Nov 2008 18:11:30 UTC

App: Rosetta Mini 1.40
Name: 2ci2l_BOINC_ABRELAX_SPLIT_SPLIT_IGNORE_THE_REST-S25-9-S3-3--2ci2l-_4678_394_0
BOINC: 5.10.45 x86_64
OS: Fedora 8 x86_64
Problem: Program WILL NOT STOP CRUNCHING even if I tell BOINC to Suspend all processing. Killing it and BOINC is only way.

Edit: It is behaving better since restarting BOINC daemon. But that was really weird. Note: Other projects/apps were suspending fine before the restart.

robertmiles

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 56810 - Posted 10 Nov 2008 19:14:32 UTC - in response to Message ID 56802.

I've got another one of those workunits that are running longer than expected:

11/9/2008 5:57:49 PM|rosetta@home|Starting 1hzh_1o9g_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_155_1
11/9/2008 5:57:54 PM|rosetta@home|Starting task 1hzh_1o9g_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_155_1 using minirosetta version 140

Last night, it had accumulated about 6 CPU hours and claimed that it would finish in another 10 CPU minutes. This morning, it has accumulated over 12 CPU hours and claims that it will finish in another 9 CPU minutes and 56 seconds.

Also, it's currently the most memory hungry process on my machine. The Windows Task Manager recently said it was using over 256,000K of memory - over 10 times as much as the next process - but then dropped that to a little over 200,000K and is now 223,132K.

Since it hasn't let any other process take a turn in its CPU core for much longer than the 2 hours I've tried to set it for, I'll suspend it for a while and see if that helps.

The other person with a similar workunit had a compute error after about 6 CPU hours.


I told it to suspend, which apparently worked. Windows Task Manager now says it's using only 97,000K of memory, but I suspect that it doesn't include any part of it that's been moved to the swapfile.

The workunits already on my machine from other BOINC projects are now catching up with their CPU time allotments, and haven't given this workunit another chance yet, even though I had increased Rosetta@home's share of my machine's CPU time shortly before this problem started. I had also increased the upper limit on virtual memory size to 7 GB.

P . P . L .

Joined: Aug 20 06
Posts: 581
ID: 105843
Credit: 4,864,105
RAC: 0
Message 56813 - Posted 10 Nov 2008 22:30:44 UTC - in response to Message ID 56799.

Looks like I have another runaway task. It's at 6 hrs 45 min and 97.655%, and as slow as wet cement: about 0.001% every 10 sec, better than the last one but not much.

I bet I don't get much for it if and when it finishes.

1hzh_2fe5_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_76

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=188078846

pete.


Note to Sarel.

This task restarted after it had run nonstop all day yesterday for over 16 hrs and was at 99.001%. It then went back to 2 hrs 30 min at 41.64%. I have aborted it; I'm not going to waste more time. Can someone please fix this?

pete.


____________


robertmiles

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 56815 - Posted 11 Nov 2008 0:08:47 UTC - in response to Message ID 56810.
Last modified: 11 Nov 2008 0:15:53 UTC

I've got another one of those workunits that are running longer than expected:

11/9/2008 5:57:49 PM|rosetta@home|Starting 1hzh_1o9g_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_155_1
11/9/2008 5:57:54 PM|rosetta@home|Starting task 1hzh_1o9g_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_155_1 using minirosetta version 140


It's now running again, and at over 16.5 CPU hours. It's back up to 228,832K.

I've noticed that a significant fraction of the minirosetta v1.40 workunits that have performed poorly on my machine lately or have been mentioned in this thread as having problems for other people have 4704 as part of their name. Is this significant, or just an indication of the current group of workunits?

kcolagio

Joined: Oct 7 05
Posts: 1
ID: 3281
Credit: 62,988
RAC: 0
Message 56817 - Posted 11 Nov 2008 2:00:28 UTC


Running under Windows XP, I have it go inactive when I'm using the system (about 6 hours out of the day). Often I'll see notices that Windows is running out of virtual memory.

The system is a 2.4 GHz Quad Core system with 4 Gig of memory (which Windows only sees 3 Gig of *sigh* ).

Looking in the task manager, I see that there are 4 instances of Minirosetta_1.40_windows_intelx86 running and that they are using between 207 Meg and 290 Meg of memory each.

There are also (if it's related) 2 instances of rosetta_beta_5.98_windows_intelx86 running that are taking 215 Meg each.

While paused, they are using 0% of the CPU (which is right in my book), but they have used up to 1 hour 4 minutes of CPU time...I have no idea if this is "normal" or not.

No idea if any of this helps, but it seems out of the ordinary to me...and I hate just killing the processes that are acting badly.

Let me know if you need more info.

____________

Adam Gajdacs (Mr. Fusion)

Joined: Nov 26 05
Posts: 13
ID: 20993
Credit: 1,495,337
RAC: 347
Message 56820 - Posted 11 Nov 2008 9:04:04 UTC

1hzh_1u9p_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_97_0 using minirosetta version 140 (Wu ID: 188064180)

Yesterday this task had been running for over 13 hours on a 4 hours target CPU time. It was stuck on model 1, step 79500, where step did not change for over an hour (the protein display did, however, once in every 15-20 seconds or so). Progress was increasing at the rate of roughly 0.001% per 15-20 seconds at 98.6% or so.

I don't run my system 24/7 (that's why I have a relatively short runtime specified), so I had shut it down yesterday for the night, and today it's started over from 0%; looks like it didn't checkpoint even once in all those 13+ hours. So I'm considering aborting this (and any similar) WU at this point.

In general, the memory use of the 1.40 has skyrocketed again, it fluctuates between 100-350 Mbytes of physical and commits about 300-350Mbytes virtual memory. Once again, this tends to fill up all available PM+VM on multi-core systems as the Rosetta WUs started in parallel will hit the combined memory limit within seconds, thus they get suspended to the "Waiting for memory" state, and then a new WU gets started only to hit the memory limit again. I usually have at least 3-4 "stuck" Rosetta WUs in memory, each holding 200-300Mbytes of VM (and a similar amount of PM until the system is forced to completely page them out).
____________

AMD_is_logical

Joined: Dec 20 05
Posts: 299
ID: 41207
Credit: 31,460,681
RAC: 0
Message 56828 - Posted 11 Nov 2008 13:08:08 UTC

These two CAPRI_comp_ems.1b.pdb.gz_docksim.protocol_8_12_4682_ WUs were ended by the watchdog because they ran over 48 hours (3x my 16 hour setting):
http://boinc.bakerlab.org/rosetta/result.php?resultid=205806719
http://boinc.bakerlab.org/rosetta/result.php?resultid=205765025
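
To put a number on how far a task has overrun, here is a rough sketch of the arithmetic. It assumes the watchdog cutoff is 3x the runtime preference, which is only what was observed above and is not taken from the Rosetta source; `WATCHDOG_FACTOR` and both function names are hypothetical. The `cpu_run_time_pref` value is the one printed in seconds at the top of stderr.

```python
# Assumption: cutoff factor as observed in this post (48 h on a 16 h setting).
WATCHDOG_FACTOR = 3


def watchdog_limit_seconds(cpu_run_time_pref):
    """CPU seconds after which the watchdog would end the task, under the 3x assumption."""
    return WATCHDOG_FACTOR * cpu_run_time_pref


def overrun_ratio(cpu_seconds, cpu_run_time_pref):
    """How many times the preferred run time a task has actually used."""
    return cpu_seconds / cpu_run_time_pref


# A 16-hour preference, as in this post.
pref = 16 * 3600
print("watchdog limit (hours):", watchdog_limit_seconds(pref) / 3600)

# Figures reported earlier in the thread: 39696.6 CPU seconds on a 21600 s preference.
print("overrun ratio:", round(overrun_ratio(39696.6, 21600), 2))
```

By this measure the 1hzh_..._4704 task reported earlier used a bit under twice its preferred run time, still well short of the assumed cutoff.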

AMD_is_logical

Joined: Dec 20 05
Posts: 299
ID: 41207
Credit: 31,460,681
RAC: 0
Message 56829 - Posted 11 Nov 2008 13:20:02 UTC

This WU bombed out on both machines (one Linux and the other Windows) with a file xfer error:
IL23p40_p40BrubYhbond_design_jecorn_SAVE_ALL_OUT_IGNORE_THE_REST_ip40_1ukf_4683_83

<file_xfer_error>
<file_name>IL23p40_p40BrubYhbond_design_jecorn_SAVE_ALL_OUT_IGNORE_THE_REST_ip40_1ukf_4683_83_0_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>

Warren B. Rogers

Joined: Oct 3 05
Posts: 5
ID: 2517
Credit: 821,633
RAC: 0
Message 56830 - Posted 11 Nov 2008 13:29:04 UTC

Hello everyone,

I've also had trouble with this version of Minirosetta. The WU will get to about 98% completion and show approximately 9 minutes to completion, and then it seems to get stuck at that point. I've stopped the WU and let other projects get a chance to complete, and when BOINC returns to the WU it will start from the beginning and sometimes complete in approximately 2 hours, or it will do the same thing and get stuck at 98% and run for over 6 hours. I've had 2 end with Compute Errors and 1 with a Validate Error. And I've seen that even the WUs that complete are getting shut down by the watchdog because of too many restarts. I hope this information helps.


Warren Rogers
____________

adrianxw

Joined: Sep 18 05
Posts: 535
ID: 402
Credit: 1,057,641
RAC: 1,674
Message 56840 - Posted 11 Nov 2008 17:20:45 UTC
Last modified: 11 Nov 2008 17:55:50 UTC

188575665 is doing the same thing. It has been running for 04:43:43, is 96.592% complete, and the time to completion flips between 00:09:52 and 00:09:53 every few seconds.

It is also a 1hzh_2he4_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_262 wu.

Aborted. And yet again, I suppose I have to suspend Rosetta on my remote systems. Getting to be a habit that.
____________
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.

Greg_BE

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 56842 - Posted 11 Nov 2008 17:34:43 UTC

IL23p40_p40BrubYhbond_design_jecorn_SAVE_ALL_OUT_IGNORE_THE_REST_ip40_1lb4_4683_127_0 ran 5 hrs and 13 mins and then died with a huge debug output.

exit status is -1073741819 (0xc0000005)
Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x0083A59D read attempt to address 0xFFFFFFCC

Engaging BOINC Windows Runtime Debugger...


3 call stacks and a bunch of other stuff...

that is just annoying as hell to run 5 hrs out of 6 and then die and get no credit. LAME!
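
In case the two numbers in the exit status line look unrelated: the negative decimal value is the same 32-bit code as the hex one, just read as a signed integer. A small sketch of the conversion follows; the function name `to_hex32` is made up for illustration.

```python
def to_hex32(exit_status):
    """Show a signed 32-bit exit status as the unsigned hex code Windows reports."""
    # Masking with 0xFFFFFFFF reinterprets the negative value as unsigned 32-bit.
    return f"0x{exit_status & 0xFFFFFFFF:08x}"


# The exit status from the debug output in this post.
print(to_hex32(-1073741819))
```

For -1073741819 this gives 0xc0000005, the access violation code named in the exception record above.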

Sarel

Joined: May 11 06
Posts: 51
ID: 81994
Credit: 81,712
RAC: 0
Message 56848 - Posted 11 Nov 2008 19:16:26 UTC

I'm so sorry for this mess. The jobs labeled with the words design, jacob, or sarel, are related to a new mode that we've put into v1.40. You can read more about this new mode and why we're excited about running it on Rosetta @ Home on

http://boinc.bakerlab.org/forum_thread.php?id=4477

As far as I can tell from the messages here, people are seeing two major problems:
1. long run times with relatively low credit
2. larger than anticipated memory requirements

Please let me know if you see any other type of problem.

Since this is a departure from previous simulations on Rosetta @ Home, we expected to run into some trouble, but obviously, after the extensive testing that we had carried out (with no glitches), we didn't expect this much! We're currently looking into ways of fixing this immediately as well as in the longer term. My colleagues and I will post new messages to this thread once we've figured this out.

By the way, I should mention that even this early, we're seeing that from the simulations that ran well we've gotten a huge amount of very useful output! Much more than on any other platform that I had worked with before!

Thank you very much for your patience and for providing all this feedback!
____________

DaBrat and DaBear

Joined: Aug 9 08
Posts: 16
ID: 272936
Credit: 213,180
RAC: 0
Message 56850 - Posted 11 Nov 2008 19:36:06 UTC

Nothing but the following... 8 plus hours run for 9 credits

http://boinc.bakerlab.org/rosetta/result.php?resultid=206158806

mikus

Joined: Nov 7 05
Posts: 58
ID: 10139
Credit: 700,115
RAC: 0
Message 56851 - Posted 11 Nov 2008 19:55:43 UTC

Rosetta/BOINC does not validate against partial results. It should.

The typical Rosetta task runs multiple decoys (each of which I believe is an *independent* simulation). I had such a task terminate because, while calculating decoy 7, it came up with a NaN. The results from the correctly completed previous 6 decoys were discarded.

Looked in the 'Workunit Details' page and saw that another system was identified as successfully completing that same task. The catch -- it did only 5 decoys.

There is something fundamentally unfair when ALL the work from a system that did more crunching gets discarded, while accepting work from a system that crunched less.
.

adrianxw

Joined: Sep 18 05
Posts: 535
ID: 402
Credit: 1,057,641
RAC: 1,674
Message 56852 - Posted 11 Nov 2008 19:55:57 UTC

1. long run times with relatively low credit

That is not specific to this version. It was mentioned many times in this thread.
____________
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.

rochester new york

Joined: Jul 2 06
Posts: 2562
ID: 98229
Credit: 958,139
RAC: 127
Message 56854 - Posted 11 Nov 2008 21:14:23 UTC
Last modified: 11 Nov 2008 21:44:26 UTC

the memory in this computer is small but did complete some work units http://boinc.bakerlab.org/rosetta/results.php?hostid=439347





http://boinc.bakerlab.org/rosetta/results.php?hostid=267483


http://boinc.bakerlab.org/rosetta/result.php?resultid=204400732

robertmiles

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 56857 - Posted 11 Nov 2008 21:58:44 UTC - in response to Message ID 56854.
Last modified: 11 Nov 2008 22:02:05 UTC

the memory in this computer is small but did complete some work units

http://boinc.bakerlab.org/rosetta/results.php?hostid=439347

http://boinc.bakerlab.org/rosetta/results.php?hostid=267483

http://boinc.bakerlab.org/rosetta/result.php?resultid=204400732


Rochester, it looks like the memory and time estimates for the problem workunits are now accurate enough that the server doesn't send you any of the memory-hungry workunits with design, jacob, or sarel in their names, or the workunits with seriously underestimated run times that often have 4704 in their names. They are still not accurate enough, though, for those of us whose machines can handle a little more, but not the maximum required.

Path7

Joined: Aug 25 07
Posts: 128
ID: 201002
Credit: 61,751
RAC: 0
Message 56859 - Posted 11 Nov 2008 23:20:26 UTC - in response to Message ID 56848.
Last modified: 11 Nov 2008 23:21:34 UTC

Hello Sarel,

Thanks for your reaction, and good to read you are still excited about the new mode you put into 1.40 : )

@ Please let me know if you see any other type of problem.

1hzh_1mve_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_147_0
This WU was running for more than 15 hours (runtime preference = 6 hours) when I restarted my computer (Windows update).
The WU started again with 38 minutes processor time!
If possible more checkpoints will be welcome.

Have a nice day,
Path7.

DaBrat and DaBear

Joined: Aug 9 08
Posts: 16
ID: 272936
Credit: 213,180
RAC: 0
Message 56863 - Posted 12 Nov 2008 1:03:31 UTC

This appeared to run smoothly but was marked invalid.

http://boinc.bakerlab.org/rosetta/result.php?resultid=205965356


Server state Over
Outcome Validate error
Client state Done
Exit status 0 (0x0)
Computer ID 871503
Report deadline 18 Nov 2008 2:48:02 UTC
CPU time 9476.889
stderr out <core_client_version>6.2.18</core_client_version>
<![CDATA[
<stderr_txt>

======================================================
DONE :: 1 starting structures 9476.44 cpu seconds
This process generated 7 decoys from 7 attempts
======================================================

BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...
called boinc_finish

</stderr_txt>
]]>


Validate state Invalid
Claimed credit 30.7157035506969
Granted credit 0
application version 1.40

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 56864 - Posted 12 Nov 2008 1:03:47 UTC - in response to Message ID 56859.
Last modified: 12 Nov 2008 1:05:10 UTC


1hzh_1mve_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_147_0
This WU was running for more than 15 hours (runtime preference = 6 hours) when I restarted my computer (Windows update).
The WU started again with 38 minutes processor time!
If possible more checkpoints will be welcome.

Have a nice day,
Path7.


Path7,

Are you running Vista SP1? If so, I've found that when you are applying a Definition Update for Windows Defender, you don't have to shut down the computer. Suspending all the workunits under BOINC is enough, and allows you to resume them in a few minutes without losing anything except those few minutes of CPU time.

Most other types of Vista updates seem to require a BOINC shutdown, though, and often a reboot.

DaBrat and DaBear

Joined: Aug 9 08
Posts: 16
ID: 272936
Credit: 213,180
RAC: 0
Message 56865 - Posted 12 Nov 2008 1:21:25 UTC - in response to Message ID 56851.

Rosetta/BOINC does not validate against partial results. It should.

The typical Rosetta task runs multiple decoys (each of which I believe is an *independent* simulation). I had such a task terminate because it came up with a NAN while calculating decoy 7. The results from the correctly completed previous 6 decoys were discarded.

Looked in the 'Workunit Details' page and saw that another system was identified as successfully completing that same task. The catch -- it did only 5 decoys.

There is something fundamentally unfair when ALL the work from a system that did more crunching gets discarded, while accepting work from a system that crunched less.
.


I got the same thing on either of the machines that returned 7 decoys: either a NAN or a validate error, though no errors were reported. One ran for over 9 hours sometime today on a machine with 4G of memory and a 3G dual core that wasn't being used for anything else. Hope I get more than 9 credits for this one.
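
The partial-result idea raised in the quoted post can be sketched as follows. This is only an illustration in Python, not how Rosetta or the BOINC validator actually works: `simulate` and `fake_simulate` are hypothetical stand-ins for one independent decoy run, and the scores are made up.

```python
import math


def run_decoys(simulate, n_attempts):
    """Run independent decoy simulations and keep the ones that finish cleanly.

    `simulate` is a hypothetical callable that takes a decoy index and
    returns a score; it stands in for one independent trajectory.
    """
    completed, failed = [], []
    for i in range(1, n_attempts + 1):
        score = simulate(i)
        if math.isnan(score):
            # Record the failure but do not throw away earlier decoys.
            failed.append(i)
            continue
        completed.append((i, score))
    return completed, failed


def fake_simulate(i):
    # Made-up scores; decoy 7 produces a NaN, as in the report above.
    return float("nan") if i == 7 else -100.0 - i


completed, failed = run_decoys(fake_simulate, 7)
print(len(completed), "decoys kept; failed:", failed)
```

Under this assumption the six finished decoys would still be available to report, and only decoy 7 would be flagged.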

AMD_is_logical

Joined: Dec 20 05
Posts: 299
ID: 41207
Credit: 31,460,681
RAC: 0
Message 56868 - Posted 12 Nov 2008 2:44:28 UTC

A number of my machines are diskless, swapless, single-core Linux boxes with 512 MB of installed memory. That has worked fine for quite some time, but now these machines are getting WUs that use too much memory, which stops crunching on those machines (as they don't have any swap disk). The problem is with WUs starting with "1hzh_". For example:

1hzh_2fzp_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_136_0

The only thing you can see in the stderr is that it was restarted several times. That's because it kept running out of memory. I eventually just aborted it. Crunching had stopped on several other machines due to 1hzh_ WUs, so I went through and aborted these WUs on all my 512 MB machines.

mmadden

Joined: Nov 30 05
Posts: 1
ID: 24350
Credit: 25,938
RAC: 0
Message 56869 - Posted 12 Nov 2008 3:51:34 UTC
Last modified: 12 Nov 2008 3:53:06 UTC

Greetings partners. I don't know if the following is an issue with the new version. I'm getting extremely high temperatures in my cores; as a matter of fact I had to shut one down because both cores crunching Rosetta WUs gave a dangerous 80 Celsius. Other projects using both my cores give me 73 at most.

I aborted some of the units.

Edit: One more thing: The graphics are frozen.
____________

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 56870 - Posted 12 Nov 2008 5:21:56 UTC - in response to Message ID 56869.
Last modified: 12 Nov 2008 5:27:48 UTC

Greetings partners. I don't know if the following is an issue with the new version. I'm getting extremely high temperatures in my cores; as a matter of fact I had to shut one down because both cores crunching Rosetta WUs gave a dangerous 80 Celsius. Other projects using both my cores give me 73 at most.

I aborted some of the units.

Edit: One more thing: The graphics are frozen.


mmadden,

Do you have the option of decreasing the percentage of time BOINC projects use the CPU instead, and then checking whether Minirosetta v1.40 actually obeys this decrease?

Also, can you check if it is one of the most memory-hungry processes on your machine while it is running?

You might also want to check for signs that your machine has so little free memory that the application has shut down graphics, if it can do that.

dazman

Joined: May 28 06
Posts: 1
ID: 85084
Credit: 43,508,714
RAC: 2,423
Message 56873 - Posted 12 Nov 2008 8:10:49 UTC - in response to Message ID 56848.

I'm so sorry for this mess. The jobs labeled with the words design, jacob, or sarel, are related to a new mode that we've put into v1.40. You can read more about this new mode and why we're excited about running it on Rosetta @ Home on

http://boinc.bakerlab.org/forum_thread.php?id=4477

As far as I can tell from the messages here, people are seeing two major problems:
1. long run times with relatively low credit
2. larger than anticipated memory requirements



Yes, I just started having problems with memory, and found it was these new units. They are using way too much memory. I have an 8-core 2.8 GHz Mac Pro with only 2 GB of RAM. I run it on 7 cores (since running on 8 slows Spotlight search to a crawl), and it's been running fine for months, until now. Even though I have BOINC set to use only 45% of memory, it's not obeying that rule. I'm going to have to stop running Rosetta@Home until this is resolved. Time to move on to a new project.
____________

svincent

Joined: Dec 30 05
Posts: 202
ID: 44923
Credit: 4,102,500
RAC: 5,735
Message 56887 - Posted 12 Nov 2008 23:44:14 UTC

As others have reported already, I'm seeing tasks fail apparently as a result of a numerical error in a routine that calculates hydrogen bonding. The tasks end up being resent to other computers, which fail in the same way. Bit of a waste.

----

Task ID 206696346
Name loopbuild_boinc4_grow10_hombench_loopbuild_t293__IGNORE_THE_REST_1VQ1A_3_4710_10_1
Workunit 188513139

NANs occured in hbonding!
ERROR:: Exit from: src/core/scoring/hbonds/hbonds_geom.cc line: 763

----

Task ID 206661889
Name loopbuild_boinc4_grow10_hombench_loopbuild_t293__IGNORE_THE_REST_1WY7A_9_4710_13_0
Workunit 188528726

ERROR: NANs occured in hbonding!
ERROR:: Exit from: src/core/scoring/hbonds/hbonds_geom.cc line: 763

------

Mac OS X 10.4.11 : Boinc 6.2.18


____________

rochester new york Profile
Avatar

Joined: Jul 2 06
Posts: 2562
ID: 98229
Credit: 958,139
RAC: 127
Message 56890 - Posted 13 Nov 2008 0:26:33 UTC

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=188579652

RC

Joined: Sep 27 05
Posts: 13
ID: 1401
Credit: 245,498
RAC: 0
Message 56899 - Posted 13 Nov 2008 11:15:06 UTC

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=188277752

This WU ran 19.5 hours before I killed it (my run time preference is 6 hours). The next computer to pick it up had a compute error.
____________

googloo
Avatar

Joined: Sep 15 06
Posts: 105
ID: 112667
Credit: 5,953,021
RAC: 7,157
Message 56910 - Posted 13 Nov 2008 17:00:30 UTC - in response to Message ID 56848.
Last modified: 13 Nov 2008 17:01:32 UTC

I'm so sorry for this mess. The jobs labeled with the words design, jacob, or sarel, are related to a new mode that we've put into v1.40. You can read more about this new mode and why we're excited about running it on Rosetta @ Home on

http://boinc.bakerlab.org/forum_thread.php?id=4477

As far as I can tell from the messages here, people are seeing two major problems:
1. long run times with relatively low credit
2. larger than anticipated memory requirements

Please let me know if you see any other type of problem.


Do you still want feedback on the work units that have problems 1 and/or 2?

Sarel Profile

Joined: May 11 06
Posts: 51
ID: 81994
Credit: 81,712
RAC: 0
Message 56912 - Posted 13 Nov 2008 18:10:48 UTC - in response to Message ID 56910.

I'm so sorry for this mess. The jobs labeled with the words design, jacob, or sarel, are related to a new mode that we've put into v1.40. You can read more about this new mode and why we're excited about running it on Rosetta @ Home on

http://boinc.bakerlab.org/forum_thread.php?id=4477

As far as I can tell from the messages here, people are seeing two major problems:
1. long run times with relatively low credit
2. larger than anticipated memory requirements

Please let me know if you see any other type of problem.


Do you still want feedback on the work units that have problems 1 and/or 2?


Thanks for the offer to help! I'm now in the process of finding out what went wrong. The user reports on specific workunit failures are invaluable in figuring this out, but for the time being I have quite a few to work with :) I'll let you know once this is resolved.
____________

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 56914 - Posted 13 Nov 2008 19:34:31 UTC - in response to Message ID 56912.

I'm so sorry for this mess. The jobs labeled with the words design, jacob, or sarel, are related to a new mode that we've put into v1.40. You can read more about this new mode and why we're excited about running it on Rosetta @ Home on

http://boinc.bakerlab.org/forum_thread.php?id=4477

As far as I can tell from the messages here, people are seeing two major problems:
1. long run times with relatively low credit
2. larger than anticipated memory requirements

Please let me know if you see any other type of problem.


Do you still want feedback on the work units that have problems 1 and/or 2?


Thanks for the offer to help! I'm now in the process of finding out what went wrong. The user reports on specific workunit failures are invaluable in figuring this out, but for the time being I have quite a few to work with :) I'll let you know once this is resolved.


You might want to check if you have the capability to give workunits that use the new mode different time and memory estimates (possibly even sometimes different from each other) so they can be directed to suitable machines.

Mod.Sense
Forum moderator
Project administrator

Joined: Aug 22 06
Posts: 3381
ID: 106194
Credit: 0
RAC: 0
Message 56915 - Posted 13 Nov 2008 19:45:38 UTC - in response to Message ID 56914.
Last modified: 13 Nov 2008 19:47:12 UTC

You might want to check if you have the capability to give workunits that use the new mode different time and memory estimates (possibly even sometimes different from each other) so they can be directed to suitable machines.


The memory part is already done: large-memory tasks are being aligned with large-memory machines. The runtime is defined by the users here at Rosetta@home. And the fact that some of (ok, many of) Sarel's tasks exceed the runtime target is exactly what he's working to correct. Until then, there's no better estimate of how long they will take anyway. So Sarel is working to make sure the models each complete in the more normal hour or two of CPU time. Then the user's runtime preference will be the best approximation of runtime available, which is how the project works.

So, no grander change is required. Correcting (improving) the long-running models is the solution. I don't know if you've seen the graphic, but the proteins Sarel is tackling are absolutely huge! So, they are bound to turn up some behaviors in the program that smaller proteins do not run across.
____________
Rosetta Moderator: Mod.Sense

Gavin Shaw Profile
Avatar

Joined: Feb 1 07
Posts: 10
ID: 144828
Credit: 506,456
RAC: 0
Message 56922 - Posted 14 Nov 2008 0:43:53 UTC

Had one unit go funny.

Long Unit

My preference is set to 4 hours and this one went way longer than that and didn't finish? When I last looked it was still on the first model/decoy. I also notice that it was a resend as well.

____________
Never surrender and never give up. In the darkest hour there is always hope.

funkydude

Joined: Jun 15 08
Posts: 12
ID: 264493
Credit: 146,106
RAC: 0
Message 56923 - Posted 14 Nov 2008 1:36:02 UTC - in response to Message ID 56922.

Rosetta Mini doesn't always respect BOINC's "Snooze" setting on making projects suspend. The weird thing is I had 2 Mini's running and when I hit "Snooze" 1 suspended and 1 continued.

Evan

Joined: Dec 23 05
Posts: 268
ID: 42505
Credit: 402,585
RAC: 0
Message 56924 - Posted 14 Nov 2008 16:51:35 UTC

Rosetta Mini doesn't always respect BOINC's "Snooze" setting on making projects suspend.
I find it better to use 'suspend', which you can find on the activity list.
____________

AMD_is_logical

Joined: Dec 20 05
Posts: 299
ID: 41207
Credit: 31,460,681
RAC: 0
Message 56926 - Posted 14 Nov 2008 17:59:35 UTC

I seem to be having a lot of WUs bomb out with the message:

ERROR: NANs occured in hbonding!
ERROR:: Exit from: src/core/scoring/hbonds/hbonds_geom.cc line: 763
called boinc_finish

Here are some examples:

h011__BOINC_ABRELAX_RANGE_yebf_IGNORE_THE_REST-S25-14-S3-7--h011_-_4675_56_0
h010__BOINC_ABRELAX_RANGE_yebf_IGNORE_THE_REST-S25-6-S3-4--h010_-_4675_56_0
foldcst_minimalist_core3_homo_bench_foldcst_cheat_chunk_t293__olange_IGNORE_THE_REST_1NV8A_12_4735_30_0

Mattia Verga

Joined: Jul 15 06
Posts: 3
ID: 100179
Credit: 124,357
RAC: 0
Message 56929 - Posted 14 Nov 2008 19:23:53 UTC

"Too many restarts with no progress. Keep application in memory while preempted."

206344438
____________

(_KoDAk_) Profile

Joined: Jul 18 06
Posts: 109
ID: 100677
Credit: 1,859,263
RAC: 0
Message 56931 - Posted 14 Nov 2008 19:54:19 UTC

http://boinc.bakerlab.org/rosetta/result.php?resultid=205975695
____________

Rabinovitch Profile
Avatar

Joined: Apr 28 07
Posts: 28
ID: 170444
Credit: 1,377,008
RAC: 1,448
Message 56944 - Posted 15 Nov 2008 3:37:37 UTC

15.11.2008 8:23:48|rosetta@home|Computation for task 1ail__BOINC_ABRELAX_SPLIT_SPLIT2_IGNORE_THE_REST-S25-9-S3-3--1ail_-_4768_650_0 finished
15.11.2008 8:23:48|rosetta@home|Output file 1ail__BOINC_ABRELAX_SPLIT_SPLIT2_IGNORE_THE_REST-S25-9-S3-3--1ail_-_4768_650_0_0 for task 1ail__BOINC_ABRELAX_SPLIT_SPLIT2_IGNORE_THE_REST-S25-9-S3-3--1ail_-_4768_650_0 absent

http://boinc.bakerlab.org/rosetta/result.php?resultid=207358562

P . P . L .
Avatar

Joined: Aug 20 06
Posts: 581
ID: 105843
Credit: 4,864,105
RAC: 0
Message 56947 - Posted 15 Nov 2008 4:26:54 UTC

Hi.

I just noticed on my quad that I had six tasks running. Four were running normally; the two Rosetta Mini 1.40 tasks were marked as waiting to run, but their time and percentage were going up. I tried suspending them and it didn't work. I don't know whether BOINC version 6.2.14 or Rosetta is the problem.

Any ideas?

pete.


____________


robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 56956 - Posted 15 Nov 2008 13:41:08 UTC

I've found that just increasing the minimum and maximum virtual memory sizes is not enough to make minirosetta v1.40 start using more virtual memory, at least on my Vista SP1 machine. I then increased the maximum amount of disk space BOINC is allowed to use, and the maximum percentage of virtual memory BOINC is allowed to use. Since this, I've been seeing two of the more memory-hungry workunits run at the same time on my dual-core machine more often, such as two minirosetta v1.40 workunits, and have started seeing a higher total size of virtual memory in use. However, I've also stopped seeing workunits with the known tags for use of the new mode of minirosetta v1.40, so it's hard to tell which is responsible for the improvement.

Mod.Sense
Forum moderator
Project administrator

Joined: Aug 22 06
Posts: 3381
ID: 106194
Credit: 0
RAC: 0
Message 56959 - Posted 15 Nov 2008 15:12:53 UTC - in response to Message ID 56956.

...so it's hard to tell which is responsible for the improvement.


Yes, it is always hard to say for certain what is cause and what is effect. Keep in mind that CPU time is the main contributor here, not the amount of virtual memory utilized. It will actually run faster if it has real memory than virtual. And you cannot force an application to use more memory. It either requires it, or it doesn't.

It is sort of like consuming water as your objective and someone suggests you use a larger glass with the thought that it would help you consume water faster. As long as your prior glass was able to provide water at a rate similar to the rate of consumption, a larger glass will not help.

____________
Rosetta Moderator: Mod.Sense

Rabinovitch Profile
Avatar

Joined: Apr 28 07
Posts: 28
ID: 170444
Credit: 1,377,008
RAC: 1,448
Message 56964 - Posted 15 Nov 2008 16:10:03 UTC

Another one: http://boinc.bakerlab.org/rosetta/result.php?resultid=207358562

Saharak

Joined: Apr 28 07
Posts: 7
ID: 170710
Credit: 499,019
RAC: 1,482
Message 56965 - Posted 15 Nov 2008 19:50:30 UTC

1hzh_2exu_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_284_0
Had been running for 25 hours then it was suspended because of time of day. Next day it restarted at 50% (which means 12 hours of computing was lost). Therefore I canceled the unit.

P . P . L .
Avatar

Joined: Aug 20 06
Posts: 581
ID: 105843
Credit: 4,864,105
RAC: 0
Message 56967 - Posted 15 Nov 2008 20:47:40 UTC - in response to Message ID 56947.
Last modified: 15 Nov 2008 20:56:32 UTC

Hi.

I just noticed on my quad that I had six tasks running. Four were running normally; the two Rosetta Mini 1.40 tasks were marked as waiting to run, but their time and percentage were going up. I tried suspending them and it didn't work. I don't know whether BOINC version 6.2.14 or Rosetta is the problem.

Any ideas?

pete.



Well, it looks like the mini app could be the problem, as I have two Beta 5.98 tasks on now and they are suspending/waiting to run properly. I'll have to keep an eye on it when I get a couple of 1.40s running.

Edit// I forgot, these are the two tasks that were on at the time.

cs_jumping_abrelax_6PNAS_proteins3_homo_bench_cs_jumping_abrelax_cs_ccr19_olange_4727_24570_0
cs_jumping_abrelax_6PNAS_proteins3_homo_bench_cs_jumping_abrelax_cs_ccr19_olange_4727_26842_0

pete.
____________


robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 56968 - Posted 15 Nov 2008 21:44:58 UTC - in response to Message ID 56959.

...so it's hard to tell which is responsible for the improvement.


Yes, it is always hard to say for certain what is cause and what is effect. Keep in mind that CPU time is the main contributor here, not the amount of virtual memory utilized. It will actually run faster if it has real memory than virtual. And you cannot force an application to use more memory. It either requires it, or it doesn't.


I already got all the physical memory my motherboard can use when I saw a slowdown of the programs I normally use a few months ago. At least the greater use of virtual memory to swap out the workunits that aren't running allows me to use the browser and newsreader without the slowdowns I saw recently.

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 56969 - Posted 15 Nov 2008 21:58:25 UTC - in response to Message ID 56967.

Hi.

I just noticed on my quad that I had six tasks running. Four were running normally; the two Rosetta Mini 1.40 tasks were marked as waiting to run, but their time and percentage were going up. I tried suspending them and it didn't work. I don't know whether BOINC version 6.2.14 or Rosetta is the problem.

Any ideas?

pete.



Well, it looks like the mini app could be the problem, as I have two Beta 5.98 tasks on now and they are suspending/waiting to run properly. I'll have to keep an eye on it when I get a couple of 1.40s running.

Edit// I forgot, these are the two tasks that were on at the time.

cs_jumping_abrelax_6PNAS_proteins3_homo_bench_cs_jumping_abrelax_cs_ccr19_olange_4727_24570_0
cs_jumping_abrelax_6PNAS_proteins3_homo_bench_cs_jumping_abrelax_cs_ccr19_olange_4727_26842_0

pete.


Those workunit names look a lot like the name of at least one workunit I ran recently, but under 1.40 instead. Under 1.40, it seemed to run OK after I made the changes that let BOINC use more virtual memory to swap out workunits that weren't running.

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=189103816

softweir

Joined: Oct 30 05
Posts: 1
ID: 7764
Credit: 1,345,335
RAC: 957
Message 56972 - Posted 16 Nov 2008 0:24:02 UTC
Last modified: 16 Nov 2008 0:28:07 UTC

I keep getting compute errors on my minirosetta tasks. The following (from taskID 207659162) is typical:-

stderr out

Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x7C910193 write attempt to address 0x009254E6

Engaging BOINC Windows Runtime Debugger...



Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x7C910193 write attempt to address 0x00408E6E

Engaging BOINC Windows Runtime Debugger...

Sid Celery

Joined: Feb 11 08
Posts: 796
ID: 241409
Credit: 9,546,016
RAC: 7,460
Message 56976 - Posted 16 Nov 2008 3:10:01 UTC
Last modified: 16 Nov 2008 4:05:45 UTC

Several errors over the last week or so, which I'm only just catching up with:

These messages (to a greater or lesser degree) appear in the Task IDs listed beneath (note: I've seen this reported in the thread Rosetta Mini with new score terms bug thread too):

<core_client_version>6.2.19</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
recovering checkpoint of tag S_U9X3X_00000001 with id abrelax_rg_state
recovering checkpoint of tag S_U9X3X_00000001 with id stage_1
recovering checkpoint of tag S_U9X3X_00000001 with id stage_2
# cpu_run_time_pref: 7200
recovering checkpoint of tag S_U9X3X_00000001 with id stage_3_iter1_1
recovering checkpoint of tag S_U9X3X_00000001 with id stage_3_iter1_2
recovering checkpoint of tag S_U9X3X_00000001 with id stage_3_iter1_3
recovering checkpoint of tag S_U9X3X_00000001 with id stage_3_iter1_4
recovering checkpoint of tag S_U9X3X_00000001 with id stage_3_iter1_5
recovering checkpoint of tag S_U9X3X_00000001 with id stage_3_iter1_6
recovering checkpoint of tag S_U9X3X_00000001 with id stage_3_iter1_7
recovering checkpoint of tag S_U9X3X_00000001 with id stage_3_iter1_8
recovering checkpoint of tag S_U9X3X_00000001 with id stage_3_iter1_9
recovering checkpoint of tag S_U9X3X_00000001 with id stage_3_iter1_10
recovering checkpoint of tag S_U9X3X_00000001 with id stage4_kk_1
recovering checkpoint of tag S_U9X3X_00000001 with id stage4_kk_2
recovering checkpoint of tag S_U9X3X_00000001 with id stage4_kk_3
recovering checkpoint of tag S_U9X3X_00000001 with id abrelax_relax
recovering checkpoint of tag S_U9X3X_00000002 with id abrelax_rg_state
recovering checkpoint of tag S_U9X3X_00000002 with id stage_1
recovering checkpoint of tag S_U9X3X_00000002 with id stage_2
recovering checkpoint of tag S_U9X3X_00000002 with id stage_3_iter1_1
recovering checkpoint of tag S_U9X3X_00000002 with id stage_3_iter1_2
recovering checkpoint of tag S_U9X3X_00000002 with id stage_3_iter1_3
recovering checkpoint of tag S_U9X3X_00000002 with id stage_3_iter1_4
recovering checkpoint of tag S_U9X3X_00000002 with id stage_3_iter1_5
recovering checkpoint of tag S_U9X3X_00000002 with id stage_3_iter1_6
recovering checkpoint of tag S_U9X3X_00000002 with id stage_3_iter1_7
recovering checkpoint of tag S_U9X3X_00000002 with id stage_3_iter1_8
recovering checkpoint of tag S_U9X3X_00000002 with id stage_3_iter1_9
recovering checkpoint of tag S_U9X3X_00000002 with id stage_3_iter1_10
recovering checkpoint of tag S_U9X3X_00000002 with id stage4_kk_1
recovering checkpoint of tag S_U9X3X_00000002 with id stage4_kk_2
recovering checkpoint of tag S_U9X3X_00000002 with id stage4_kk_3
recovering checkpoint of tag S_U9X3X_00000002 with id abrelax_relax
recovering checkpoint of tag S_U9X3X_00000003 with id abrelax_rg_state
recovering checkpoint of tag S_U9X3X_00000003 with id stage_1
recovering checkpoint of tag S_U9X3X_00000003 with id stage_2
recovering checkpoint of tag S_U9X3X_00000003 with id stage_3_iter1_1
recovering checkpoint of tag S_U9X3X_00000003 with id stage_3_iter1_2
recovering checkpoint of tag S_U9X3X_00000003 with id stage_3_iter1_3
recovering checkpoint of tag S_U9X3X_00000003 with id stage_3_iter1_4
recovering checkpoint of tag S_U9X3X_00000003 with id stage_3_iter1_5
recovering checkpoint of tag S_U9X3X_00000003 with id stage_3_iter1_6
recovering checkpoint of tag S_U9X3X_00000003 with id stage_3_iter1_7
recovering checkpoint of tag S_U9X3X_00000003 with id stage_3_iter1_8
recovering checkpoint of tag S_U9X3X_00000003 with id stage_3_iter1_9
recovering checkpoint of tag S_U9X3X_00000003 with id stage_3_iter1_10
recovering checkpoint of tag S_U9X3X_00000003 with id stage4_kk_1
recovering checkpoint of tag S_U9X3X_00000003 with id stage4_kk_2
recovering checkpoint of tag S_U9X3X_00000003 with id stage4_kk_3
recovering checkpoint of tag S_U9X3X_00000003 with id abrelax_relax
recovering checkpoint of tag S_U9X3X_00000004 with id abrelax_rg_state
recovering checkpoint of tag S_U9X3X_00000004 with id stage_1
recovering checkpoint of tag S_U9X3X_00000004 with id stage_2
recovering checkpoint of tag S_U9X3X_00000004 with id stage_3_iter1_1
recovering checkpoint of tag S_U9X3X_00000004 with id stage_3_iter1_2
recovering checkpoint of tag S_U9X3X_00000004 with id stage_3_iter1_3
recovering checkpoint of tag S_U9X3X_00000004 with id stage_3_iter1_4


Task ID 205994161
Task ID 206059326
Task ID 206211546
Task ID 206290866
Task ID 206333375
Task ID 206790264
Task ID 206812382
Task ID 206932670
Task ID 207028575
Task ID 207063049
Task ID 207098754
Task ID 207231397
Task ID 207268928
Task ID 207273838
Task ID 207278136
Task ID 207281019
Task ID 207305018
Task ID 207461993
Task ID 207466440
Task ID 207471528


These messages appear in the Task IDs listed beneath:
ERROR: NANs occured in hbonding!
ERROR:: Exit from: ..\..\src\core\scoring\hbonds\hbonds_geom.cc line: 763
called boinc_finish
Can't set up shared mem: -1
Will run in standalone mode.


Task ID 206157249
Task ID 207109581

However, these and many more WUs error out with these following details:

Client state Compute error
Exit status -226 (0xffffff1e)
...
<core_client_version>6.2.19</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
...
Can't acquire lockfile - exiting
Can't acquire lockfile - exiting
Can't acquire lockfile - exiting
<repeat many times>


The computer is listed here
AMD Phenom(tm) 9850 Quad-Core, Vista Home Premium x64 Edition, SP1, 8Gb RAM, 330Gb free space, preferences set to 2 hour run time due to constant erroring out with "Can't acquire lockfile - exiting".

This problem never occurs with Rosetta Beta 5.98 (or earlier versions of the Beta) - only with all versions of MiniRosetta since I upgraded to this machine and 64-bit OS.

Of my last 162 WUs:
Beta 5.98 - 58 - 100% success
Mini 1.40 - 104 - 60% success (62) 40% failure (42)

Failure of Mini 1.40 WUs rises rapidly if runtime is increased above 2 hours (60% failure rate)
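
As a quick check of the Mini 1.40 percentages quoted above, the counts from this post can be plugged into a few lines of Python. Only the numbers 104, 62 and 42 come from the post; the variable names are just for illustration.

```python
# Counts reported for Mini 1.40 in the post above.
total_mini = 104
succeeded = 62
failed = 42

# The two outcomes should account for every workunit.
assert succeeded + failed == total_mini

success_pct = 100 * succeeded / total_mini
failure_pct = 100 * failed / total_mini
print(f"success {success_pct:.1f}%, failure {failure_pct:.1f}%")
```

The rounded figures agree with the roughly 60% / 40% split given in the post.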
____________

(_KoDAk_) Profile

Joined: Jul 18 06
Posts: 109
ID: 100677
Credit: 1,859,263
RAC: 0
Message 56988 - Posted 16 Nov 2008 8:30:41 UTC
Last modified: 16 Nov 2008 8:31:17 UTC

http://boinc.bakerlab.org/rosetta/result.php?resultid=207486683
http://boinc.bakerlab.org/rosetta/result.php?resultid=207628604
____________

Dave Mickey

Joined: Dec 29 07
Posts: 33
ID: 231007
Credit: 4,136,957
RAC: 0
Message 56991 - Posted 16 Nov 2008 13:28:05 UTC

*As far as I can tell from the messages here, people are seeing two major problems:
*1. long run times with relatively low credit
*2. larger than anticipated memory requirements
*
*Please let me know if you see any other type of problem.

I think I'm seeing something else. I have 2 machines sharing r@h with
seti. I'm seeing that each machine has gotten to a state where BOINC
has suspended a rosetta task in order to restart a seti task. But I see
that the seti task is only getting 50% of the CPU time, according to
boincview/boinc. When I look in Win task manager, I see this is because the
rosetta task is continuing to execute, and thus the rosetta and seti are
trying to share the processor, each getting about 50%. But boincview
has the rosetta task as waiting. But despite that, the cumulative CPU time reported
in the boincview is increasing, with both the rosetta and seti getting about
30 secs of CPU each minute. In the current case, they've been sharing the
CPU for about 2 hours, so this seems to be a steady state condition.

This rosetta app (or maybe something about the WU) has apparently made it
ignore BOINC's command to suspend and be preempted. The current problem case
is

2008-11-16 04:47:50 [rosetta@home] [cpu_sched] Preempting cs_jumping_abrelax_6PNAS_proteins3_homo_bench_cs_jumping_abrelax_cs_ccr19_olange_4727_614_0 (left in memory)

In the first instance, I shut down BOINC and restarted, and it properly
restarted with only the seti wu executing.

Dave

adrianxw Profile
Avatar

Joined: Sep 18 05
Posts: 535
ID: 402
Credit: 1,057,641
RAC: 1,674
Message 56997 - Posted 16 Nov 2008 14:38:45 UTC

Can't acquire lockfile - exiting

I posted this in another thread, but as it seems to be Mini Rosetta specific I'll copy and paste it here as well. I said...

That's familiar. Go to "Your Account" then "Computing Preferences" check that at the bottom of the first block "Use at most" is set to 100%. That lock file error is common on systems where this is not set to 100% at some projects.



____________
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 56999 - Posted 16 Nov 2008 15:46:50 UTC
Last modified: 16 Nov 2008 15:48:01 UTC

http://boinc.bakerlab.org/rosetta/result.php?resultid=207080587
h013__BOINC_ABRELAX_RANGE_yebf_IGNORE_THE_REST-S25-14-S3-7--h013_-_4675_237_1

It completed OK, but there were a lot of messages like this:
stderr out

<core_client_version>6.2.19</core_client_version>
<![CDATA[
<stderr_txt>
recovering checkpoint of tag S_U14X7X_00000001 with id abrelax_rg_state
recovering checkpoint of tag S_U14X7X_00000001 with id stage_1
recovering checkpoint of tag S_U14X7X_00000001 with id stage_2
# cpu_run_time_pref: 21600
recovering checkpoint of tag S_U14X7X_00000001 with id stage_3_iter1_1
recovering checkpoint of tag S_U14X7X_00000001 with id stage_3_iter1_2
recovering checkpoint of tag S_U14X7X_00000001 with id stage_3_iter1_3
recovering checkpoint of tag S_U14X7X_00000001 with id stage_3_iter1_4
recovering checkpoint of tag S_U14X7X_00000001 with id stage_3_iter1_5
recovering checkpoint of tag S_U14X7X_00000001 with id stage_3_iter1_6
recovering checkpoint of tag S_U14X7X_00000001 with id stage_3_iter1_7
recovering checkpoint of tag S_U14X7X_00000001 with id stage_3_iter1_8
recovering checkpoint of tag S_U14X7X_00000001 with id stage_3_iter1_9
recovering checkpoint of tag S_U14X7X_00000001 with id stage_3_iter1_10
recovering checkpoint of tag S_U14X7X_00000001 with id stage4_kk_1

and it goes on and on.....repeating recovering checkpoint of tag S_U14X7X_00000001 as the central theme
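
For anyone who wants to condense a long stderr dump like the one above before posting it, here is a rough Python sketch that counts the "recovering checkpoint" lines per tag. The line format is taken from the pasted output; the function name and the regular expression are my own and not part of BOINC or Rosetta.

```python
import re
from collections import Counter

# Matches lines such as:
#   recovering checkpoint of tag S_U14X7X_00000001 with id stage_1
LINE_RE = re.compile(r"recovering checkpoint of tag (\S+) with id (\S+)")


def count_recoveries_per_tag(stderr_text):
    """Return a Counter mapping each tag to its number of recovery lines."""
    per_tag = Counter()
    for line in stderr_text.splitlines():
        match = LINE_RE.search(line)
        if match:
            per_tag[match.group(1)] += 1
    return per_tag


# A few lines copied from the output above.
sample = """\
recovering checkpoint of tag S_U14X7X_00000001 with id abrelax_rg_state
recovering checkpoint of tag S_U14X7X_00000001 with id stage_1
recovering checkpoint of tag S_U14X7X_00000001 with id stage_2
# cpu_run_time_pref: 21600
recovering checkpoint of tag S_U14X7X_00000001 with id stage_3_iter1_1
"""

per_tag = count_recoveries_per_tag(sample)
for tag, n in per_tag.items():
    print(tag, n)
```

Lines that do not match the pattern, such as the `# cpu_run_time_pref` line, are simply skipped.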

P . P . L .
Avatar

Joined: Aug 20 06
Posts: 581
ID: 105843
Credit: 4,864,105
RAC: 0
Message 57004 - Posted 16 Nov 2008 20:37:57 UTC

Hi.

I got this one overnight; it ran for 3 hrs 47 min, then errored.

h001b_BOINC_ABRELAX_RANGE_yebf_IGNORE_THE_REST-S25-8-S3-3--h001b-_4769_1442_0

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=189422310

<core_client_version>6.2.14</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)
</message>
<stderr_txt

ERROR: NANs occured in hbonding!
ERROR:: Exit from: src/core/scoring/hbonds/hbonds_geom.cc line: 763
called boinc_finish

pete.

____________


(_KoDAk_) Profile

Joined: Jul 18 06
Posts: 109
ID: 100677
Credit: 1,859,263
RAC: 0
Message 57005 - Posted 16 Nov 2008 22:05:46 UTC

http://boinc.bakerlab.org/rosetta/result.php?resultid=206369194
http://boinc.bakerlab.org/rosetta/result.php?resultid=205975695
=====================
http://boinc.bakerlab.org/rosetta/result.php?resultid=206340374
Validate state Valid
Claimed credit 269.214089450125
Granted credit 81.1915060162041 WTF ??????

____________

(_KoDAk_) Profile

Joined: Jul 18 06
Posts: 109
ID: 100677
Credit: 1,859,263
RAC: 0
Message 57007 - Posted 16 Nov 2008 22:14:47 UTC

http://boinc.bakerlab.org/rosetta/result.php?resultid=207312755
http://boinc.bakerlab.org/rosetta/result.php?resultid=207413491
http://boinc.bakerlab.org/rosetta/result.php?resultid=207413498
http://boinc.bakerlab.org/rosetta/result.php?resultid=207417625
http://boinc.bakerlab.org/rosetta/result.php?resultid=207417646

____________

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57009 - Posted 16 Nov 2008 22:39:05 UTC - in response to Message ID 57005.

=====================
http://boinc.bakerlab.org/rosetta/result.php?resultid=206340374
Validate state Valid
Claimed credit 269.214089450125
Granted credit 81.1915060162041 WTF ??????


That claimed to granted credit ratio is what typically happens when you return a workunit that had a serious underestimate of the amount of CPU time it needed to run.

Mike Tyka

Joined: Oct 20 05
Posts: 96
ID: 5612
Credit: 2,190
RAC: 0
Message 57011 - Posted 16 Nov 2008 23:51:25 UTC - in response to Message ID 56783.

Hello all,
Just saw an error from this WU:
loopbuild_boinc4_hombench_loopbuild_t308__IGNORE_THE_REST_1UKVY_1_4693_12_0

<core_client_version>6.2.25</core_client_version>
<![CDATA[
<stderr_txt>
# cpu_run_time_pref: 21600
# cpu_run_time_pref: 21600
# cpu_run_time_pref: 21600
# cpu_run_time_pref: 21600
Too many restarts with no progress. Keep application in memory while preempted.
======================================================
DONE :: 1 starting structures 24.3206 cpu seconds
This process generated 0 decoys from 0 attempts
======================================================

BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...
called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
<file_name>loopbuild_boinc4_hombench_loopbuild_t308__IGNORE_THE_REST_1UKVY_1_4693_12_0_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>
</message>

Well, looks like 2 errors: Too many restarts & file_xfer error.

Be aware: I'm running WCG's (beta-)BOINC 6.2.25, which seems to be pretty stable (so far).

Have a nice day,
Path7.


I tried re-running this here locally in the lab and it runs just fine, so I'm not sure what went wrong there, I'm afraid :(
Thanks for posting anyway!



____________
http://beautifulproteins.blogspot.com/
http://www.miketyka.com/

Path7

Joined: Aug 25 07
Posts: 128
ID: 201002
Credit: 61,751
RAC: 0
Message 57015 - Posted 17 Nov 2008 0:42:50 UTC - in response to Message ID 57011.
Last modified: 17 Nov 2008 0:44:27 UTC


I tried re-running this here locally in the lab and it runs just fine, so I'm not sure what went wrong there, I'm afraid :(
Thanks for posting anyway!

Hello Mike Tyka,

Thanks for your reaction, and rerunning this WU.

Have had more errors on this new laptop (restarting & can't acquire lockfile).
These errors might occur due to throttling which I need to keep the fans “silent”.
I've changed some settings to keep the CPU running at a constant frequency.
Downgraded to BOINC 5.10.45, just in case.
Now this machine seems to crunch better, occasionally restarting, but valid WU's (so far).

Have a nice day,
Path7.

Cobra Profile

Joined: Nov 9 05
Posts: 6
ID: 10491
Credit: 8,472,001
RAC: 800
Message 57018 - Posted 17 Nov 2008 2:07:34 UTC

Add me to the list of folks seeing WUs seemingly hang at around 9ish minutes to go to completion. I've seen WUs run as long as 11 hrs without completing before manually aborting them. Behavior seen on multiple hardware platforms (at least an AMD 9950BE and Opteron 180 and an Intel Core2 Duo dual core laptop with installed memory ranging from 1GB to 3+ GB), but all running WinXP.

Seems to be happening on 5-20% of my Rosetta Mini 1.40 WUs.

Sid Celery

Joined: Feb 11 08
Posts: 796
ID: 241409
Credit: 9,546,016
RAC: 7,460
Message 57027 - Posted 17 Nov 2008 16:38:58 UTC - in response to Message ID 57018.

Add me to the list of folks seeing WUs seemingly hang at around 9ish minutes to go to completion. I've seen WUs run as long as 11 hrs without completing before manually aborting them. Behavior seen on multiple hardware platforms (at least an AMD 9950BE and Opteron 180 and an Intel Core2 Duo dual core laptop with installed memory ranging from 1GB to 3+ GB), but all running WinXP.

Seems to be happening on 5-20% of my Rosetta Mini 1.40 WUs.

I really don't understand why people keep going on about this. It seems quite obvious to me that once the counter gets to around 10 minutes it stops counting altogether. Every WU does this, Mini or Beta. Always has, likely always will.

If the estimate is 3 hours then even if the WU ends up running 3 hours exactly, the countdown still stops with 10 minutes to go. It ends when the model it's working on ends, then drops to zero as the WU finishes altogether. If the 1st model ends at 1h 31m then the WU ends because it'll assume the next model will take the same time and go over the 3 hours. If 2 models complete at 2h 1m it'll do the same, assuming another 1h 0m 30s for the next model. And so on for 3 models at 2h 16m, 4 models at 2h 25m etc.

To see how many models have been done, click "Show Graphics" in the Boinc Manager. It's shown at the bottom right.

An estimate is an estimate. It's not a set time frame. Don't expect it to be cast in stone because it's not.

Same with all the long-running WUs. They don't end earlier because the first model hasn't even been completed. Don't look at the clock ticking down. As long as the CPU time is clicking up then it's running just fine. If you abort the WU while CPU time is running then it's your look-out. I think my record is about 14 hours.
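
The stopping rule described in this post can be sketched in a few lines. This is only an illustration of the rule as worded above (assume the next model takes the average time of the completed ones, and stop if that would overrun the target); the function names are hypothetical and the real minirosetta logic may differ.

```python
def hm(hours, minutes):
    """Convert hours and minutes to seconds."""
    return hours * 3600 + minutes * 60


def should_start_next_model(elapsed_s, models_done, target_s):
    """Hypothetical stopping rule: start another model only if one more
    model of average length is expected to fit inside the target run time."""
    if models_done == 0:
        return True  # nothing finished yet, so the first model keeps running
    average_s = elapsed_s / models_done
    return elapsed_s + average_s <= target_s


TARGET = hm(3, 0)  # the 3-hour estimate used in the post
cases = [(1, hm(1, 31)), (2, hm(2, 1)), (3, hm(2, 16)), (4, hm(2, 25))]
for models_done, elapsed in cases:
    # Each of the four cases from the post would overrun 3 hours.
    print(models_done, should_start_next_model(elapsed, models_done, TARGET))
```

Under this assumption all four cases from the post decline to start another model, while a first model that finished at 1h 29m would leave room for a second.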
____________

DJStarfox

Joined: Jul 19 07
Posts: 140
ID: 191721
Credit: 560,560
RAC: 21
Message 57028 - Posted 17 Nov 2008 17:11:13 UTC - in response to Message ID 56923.

Rosetta Mini doesn't always respect BOINC's "Snooze" setting on making projects suspend. The weird thing is I had 2 Mini's running and when I hit "Snooze" 1 suspended and 1 continued.


Yes, I've had the same problem on occasion with Rosetta Mini 1.40. I understand there are times when the program is "right in the middle of something", but it should perform callback checks to the BOINC API to suspend/run appropriately within a few seconds of the API command.

Warren B. Rogers

Joined: Oct 3 05
Posts: 5
ID: 2517
Credit: 821,633
RAC: 0
Message 57032 - Posted 17 Nov 2008 19:35:54 UTC - in response to Message ID 57027.

Add me to the list of folks seeing WUs seemingly hang at around 9ish minutes to go to completion. I've seen WUs run as long as 11 hrs without completing before manually aborting them. Behavior seen on multiple hardware platforms (at least an AMD 9950BE and Opteron 180 and an Intel Core2 Duo dual core laptop with installed memory ranging from 1GB to 3+ GB), but all running WinXP.

Seems to be happening on 5-20% of my Rosetta Mini 1.40 WUs.

I really don't understand why people keep going on about this. It seems quite obvious to me that once the counter gets to around 10 minutes it stops counting altogether. Every WU does this, Mini or Beta. Always has, likely always will.

If the estimate is 3 hours then even if the WU ends up running 3 hours exactly, the countdown still stops with 10 minutes to go. It ends when the model it's working on ends, then drops to zero as the WU finishes altogether. If the 1st model ends at 1h 31m then the WU ends because it'll assume the next model will take the same time and go over the 3 hours. If 2 models complete at 2h 1m it'll do the same, assuming another 1h 0m 30s for the next model. And so on for 3 models at 2h 16m, 4 models at 2h 25m etc.

To see how many models have been done, click "Show Graphics" in the Boinc Manager. It's shown at the bottom right.

An estimate is an estimate. It's not a set time frame. Don't expect it to be cast in stone because it's not.

Same with all the long-running WUs. They don't end earlier because the first model hasn't even been completed. Don't look at the clock ticking down. As long as the CPU time is clicking up then it's running just fine. If you abort the WU while CPU time is running then it's your look-out. I think my record is about 14 hours.


Actually, the problem is that when the WU gets to 10ish minutes to go it isn't doing any work; it is just stuck. I've had a WU get to the 9 minute 56 second mark and just get stuck there. The longest I've had it get stuck is close to 18 hours, and if BOINC gets restarted it will go all the way back to 45 minutes and then only take about 2 1/2 hours to complete after that. And there are always a lot of lock file errors or watchdog reset errors. And not all have this problem. I've had several Rosetta Mini WUs finish without getting stuck at just under 10 minutes, and I can't remember seeing a beta have problems getting stuck.

Thanks for the info though,

Warren
____________

DALTON

Joined: Jun 9 08
Posts: 1
ID: 263682
Credit: 250,510
RAC: 0
Message 57034 - Posted 17 Nov 2008 21:18:29 UTC - in response to Message ID 57032.

Actually, the problem is that when the WU gets to 10ish minutes to go it isn't doing any work; it is just stuck. I've had a WU get to the 9 minute 56 second mark and just get stuck there. The longest I've had it get stuck is close to 18 hours, and if BOINC gets restarted it will go all the way back to 45 minutes and then only take about 2 1/2 hours to complete after that. And there are always a lot of lock file errors or watchdog reset errors. And not all have this problem. I've had several Rosetta Mini WUs finish without getting stuck at just under 10 minutes, and I can't remember seeing a beta have problems getting stuck.

The description by Sid Celery sounds very accurate to me. I've currently got a Mini work unit at 6 hours (3 hours default) and as he says it's still on the first model and ticking up nicely. No problem at all.

If you've got lock file errors I'd hazard a guess that it's not ticking up at all on the CPU Time side. That's the issue. Forget anything to do with the remaining time because that's only ever a complete guess - as likely to be wrong as right.

When you get other errors, the WU falls back to its last save position or the start of the current model within the WU. Maybe it sorts itself out by doing that and that's why it completes quickly after that.

Just my 2cents

Aegis Maelstrom

Joined: Oct 29 08
Posts: 61
ID: 285843
Credit: 792,303
RAC: 34
Message 57035 - Posted 17 Nov 2008 21:24:48 UTC - in response to Message ID 56782.
Last modified: 17 Nov 2008 21:26:08 UTC

I wrote:


Task IL23p40_p40BrubYhbond_design_jecorn_SAVE_ALL_OUT_IGNORE_THE_REST_ip40_1wr2_4683_55_1

restarted twice so far, now processing:

(...)

Now I am waiting to check if this workunit is endable.


The Workunit restarted third time, seemingly in the same place as the previous time (the percentage "completed" was higher but I was checking a couple minutes earlier and it was once again step 10000 then, so now it was probably 11000).

The WU started for the fourth time, now with 24% but I guess it was the same moment as before. When I restarted the WU after temporarily halting once again, it went back to 17%. Now I can see 18,23% and step 523.

Now I am halting this task and my business with Rosetta.

When BOINC tried to download a different task, I got the following log:
2008-11-09 14:29:23|rosetta@home|Message from server: No work sent
2008-11-09 14:29:23|rosetta@home|Message from server: Your preferences limit memory usage to 452 MB, and 488 MB is needed

The problem seems to be with a higher memory usage although one of the mods recently assured us that there is no increase in memory requirements.
I could increase amount of memory dedicated to BOINC, however I would like to have this problem explained and ironed out.


Hi,

as I have promised I have come back, increased the memory amount and started to crunch again.

To my surprise, the process has suddenly finished with a "success". The log says:
2008-11-17 21:54:33|rosetta@home|Restarting task IL23p40_p40BrubYhbond_design_jecorn_SAVE_ALL_OUT_IGNORE_THE_REST_ip40_1wr2_4683_55_1 using minirosetta version 140
2008-11-17 21:56:12|rosetta@home|Computation for task IL23p40_p40BrubYhbond_design_jecorn_SAVE_ALL_OUT_IGNORE_THE_REST_ip40_1wr2_4683_55_1 finished

As I wrote in the posts above, it is impossible to finish this task in such a short time. Last time I needed two and a half "physical" hours just to crash, probably due to memory limits that were too low.

I would like to notify you that this unit has not been computed properly and it is probably worth another try. I made a snapshot just before the crash and this protein looks far better (lower energy, RAC) than any from the old-fashioned ab initio process I have seen so far.

Frankly speaking, I would be more than happy to compute it by myself; unfortunately the client has sent it back. :(

If you could send it to me manually, that would be nice. :) If not, please consider a recomputation of this unit.

I wish you best luck with these units as the one I have seen so far signals a true breakthrough...

a.m.@Poland

Warren B. Rogers

Joined: Oct 3 05
Posts: 5
ID: 2517
Credit: 821,633
RAC: 0
Message 57037 - Posted 18 Nov 2008 1:13:13 UTC - in response to Message ID 57034.

Actually, the problem is that when the WU gets to 10ish minutes to go it isn't doing any work; it is just stuck. I've had a WU get to the 9 minute 56 second mark and just get stuck there. The longest I've had it get stuck is close to 18 hours, and if BOINC gets restarted it will go all the way back to 45 minutes and then only take about 2 1/2 hours to complete after that. And there are always a lot of lock file errors or watchdog reset errors. And not all have this problem. I've had several Rosetta Mini WUs finish without getting stuck at just under 10 minutes, and I can't remember seeing a beta have problems getting stuck.

The description by Sid Celery sounds very accurate to me. I've currently got a Mini work unit at 6 hours (3 hours default) and as he says it's still on the first model and ticking up nicely. No problem at all.

If you've got lock file errors I'd hazard a guess that it's not ticking up at all on the CPU Time side. That's the issue. Forget anything to do with the remaining time because that's only ever a complete guess - as likely to be wrong as right.

When you get other errors, the WU falls back to its last save position or the start of the current model within the WU. Maybe it sorts itself out by doing that and that's why it completes quickly after that.

Just my 2cents


Also, BOINC will restart the WU if it gets stuck for too long and will go back to almost the beginning of the WU. Then the WU completes in approximately 2 1/2 hours, like a WU that doesn't have any problems. The thing that sucks about that is I only get credit for the time it took to complete the WU, 2 1/2 hours, and the other 7 to 16 hours that my computer was stuck don't get credited. I don't have a problem with working on WUs that take a long time to complete, as most of the projects that I do work for take multiple hours; my longest is ClimatePrediction.net, which at the moment has been working for 339 hours and still has about 7 hours to go. I just don't like having a WU take up CPU cycles from another WU when it isn't necessary.
____________

Speedy

Joined: Sep 25 05
Posts: 159
ID: 1058
Credit: 507,926
RAC: 0
Message 57038 - Posted 18 Nov 2008 5:29:05 UTC
Last modified: 18 Nov 2008 5:30:35 UTC

Taskid 207716389, a 1d0qA model, does not display any graphics when clicking Show Graphics or in the screen saver. It has just finished and it's valid. I hope this is of help.
____________
Have a crunching good day!!

Greg_BE Profile

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57039 - Posted 18 Nov 2008 8:15:37 UTC

mod or team...what is this recovering checkpoint thing that is showing up in some tasks? see my thread further down the list showing 4 tasks that completed ok, but gave checkpoint messages. also the task of speedy showed the same thing. completed ok, but gives a recovering checkpoint message.

Conan Profile

Joined: Oct 11 05
Posts: 136
ID: 4053
Credit: 1,869,555
RAC: 1,082
Message 57040 - Posted 18 Nov 2008 10:46:16 UTC

This task ran for 20 hours and was terminated by Boinc Watchdog.
I know that Rosetta is a low-paying project at the best of times, so I will be satisfied (I have to, don't I?) with the 80 credits I received (4 cr/hr).

# cpu_run_time_pref: 21600
**********************************************************************
Rosetta is going too long. Watchdog is ending the run!
CPU time: 71942.5 seconds. Greater than 3X preferred time: 21600 seconds
**********************************************************************
called boinc_finish
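
For anyone checking the numbers in a log like this, here is a rough sketch of the comparison the watchdog message implies. It assumes the rule is simply "CPU time greater than 3 times the preferred run time", as the message text says; the function names are made up for illustration and are not taken from the Rosetta source.

```python
def watchdog_should_end(cpu_time_s, preferred_s, factor=3):
    """Assumed watchdog rule: end the run once CPU time exceeds
    factor * preferred run time (the log message says 3X)."""
    return cpu_time_s > factor * preferred_s


def credits_per_hour(credit, cpu_time_s):
    """Credit granted per hour of CPU time."""
    return credit / (cpu_time_s / 3600.0)


# Values taken from the stderr output above.
cpu_time = 71942.5
preferred = 21600

print(watchdog_should_end(cpu_time, preferred))  # 71942.5 > 64800
print(round(credits_per_hour(80, cpu_time), 1))  # about 4 credits per hour
```

The threshold works out to 64800 seconds (18 hours), so this task was already past it when the watchdog ended the run at roughly 20 hours.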
____________

mikylinux

Joined: Jul 25 07
Posts: 3
ID: 193561
Credit: 73,155
RAC: 0
Message 57042 - Posted 18 Nov 2008 11:30:19 UTC


The tasks

cs_jumping_abrelax_6PNAS_proteins3_homo_bench_cs_jumping_abrelax_cs_flua_olange_4728_19390_0

and

1bm8__BOINC_ABRELAX_SPLIT_SPLIT2_IGNORE_THE_REST-S25-9-S3-3--1bm8_-_4768_9_0

do not finish. They have been running for 14 hours; they usually take 4 hours.
Interrupting the work...

(_KoDAk_) Profile

Joined: Jul 18 06
Posts: 109
ID: 100677
Credit: 1,859,263
RAC: 0
Message 57043 - Posted 18 Nov 2008 12:36:16 UTC

http://boinc.bakerlab.org/rosetta/result.php?resultid=206369194
____________

ramostol

Joined: Feb 6 07
Posts: 64
ID: 145835
Credit: 584,052
RAC: 0
Message 57044 - Posted 18 Nov 2008 13:07:28 UTC - in response to Message ID 57042.


The tasks

cs_jumping_abrelax_6PNAS_proteins3_homo_bench_cs_jumping_abrelax_cs_flua_olange_4728_19390_0

and

1bm8__BOINC_ABRELAX_SPLIT_SPLIT2_IGNORE_THE_REST-S25-9-S3-3--1bm8_-_4768_9_0

do not finish. They have been running for 14 hours; they usually take 4 hours.
Interrupting the work...



I observed on my MacBook this morning (it's working by itself in peace and quietude) that the cs_jumping wus appear to complete normally, but seem (according to the message window) to restart once or twice in the computing process without an obvious explanation.

(_KoDAk_) Profile

Joined: Jul 18 06
Posts: 109
ID: 100677
Credit: 1,859,263
RAC: 0
Message 57046 - Posted 18 Nov 2008 21:36:58 UTC

http://boinc.bakerlab.org/rosetta/result.php?resultid=206938154
http://boinc.bakerlab.org/rosetta/result.php?resultid=207138889
http://boinc.bakerlab.org/rosetta/result.php?resultid=207121456
http://boinc.bakerlab.org/rosetta/result.php?resultid=207114809
http://boinc.bakerlab.org/rosetta/result.php?resultid=206990578
http://boinc.bakerlab.org/rosetta/result.php?resultid=206946754
http://boinc.bakerlab.org/rosetta/result.php?resultid=206944736
http://boinc.bakerlab.org/rosetta/result.php?resultid=206831871

____________

AMD_is_logical

Joined: Dec 20 05
Posts: 299
ID: 41207
Credit: 31,460,681
RAC: 0
Message 57049 - Posted 19 Nov 2008 0:14:20 UTC

Here's some more NANs in hbonding errors from h001b_BOINC_ABRELAX_RANGE_yebf_IGNORE_THE_REST-S25 WUS:

http://boinc.bakerlab.org/rosetta/result.php?resultid=208041354
http://boinc.bakerlab.org/rosetta/result.php?resultid=207922933
http://boinc.bakerlab.org/rosetta/result.php?resultid=207915448
http://boinc.bakerlab.org/rosetta/result.php?resultid=207873078

Alec Rosa

Joined: Nov 11 08
Posts: 18
ID: 287524
Credit: 2,635
RAC: 0
Message 57050 - Posted 19 Nov 2008 0:31:49 UTC

Hello, new here, sorry guys, I come to bitch.

Having to manually abort every Rosetta Mini 1.40 task, so that I'm not wasting CPU time and energy, is a bitch.

Just sayin'.

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57051 - Posted 19 Nov 2008 1:31:46 UTC - in response to Message ID 57049.
Last modified: 19 Nov 2008 1:38:35 UTC

Here's some more NANs in hbonding errors from h001b_BOINC_ABRELAX_RANGE_yebf_IGNORE_THE_REST-S25 WUS:

http://boinc.bakerlab.org/rosetta/result.php?resultid=208041354
http://boinc.bakerlab.org/rosetta/result.php?resultid=207922933
http://boinc.bakerlab.org/rosetta/result.php?resultid=207915448
http://boinc.bakerlab.org/rosetta/result.php?resultid=207873078


I bet you'd like it if, in addition to reporting the error for the tag with the error, v1.41 also had the capability of reporting the good results for the previous tags, with separate credit calculations for each tag.

That would, however, probably require adding a new outcome state indicating partially successful.

Alec Rosa

Joined: Nov 11 08
Posts: 18
ID: 287524
Credit: 2,635
RAC: 0
Message 57053 - Posted 19 Nov 2008 3:26:31 UTC
Last modified: 19 Nov 2008 3:31:16 UTC

P.S.:

19/11/2008 02:40:10|rosetta@home|Starting foldcst_minimalist_core3_homo_bench_foldcst_cheat_chunk_t312__olange_IGNORE_THE_REST_1XV2A_5_4741_186_0
19/11/2008 02:40:14|rosetta@home|Starting task foldcst_minimalist_core3_homo_bench_foldcst_cheat_chunk_t312__olange_IGNORE_THE_REST_1XV2A_5_4741_186_0 using minirosetta version 140
19/11/2008 02:56:51|rosetta@home|Restarting task foldcst_minimalist_core3_homo_bench_foldcst_cheat_chunk_t312__olange_IGNORE_THE_REST_1XV2A_5_4741_186_0 using minirosetta version 140
19/11/2008 02:57:32|rosetta@home|Task foldcst_minimalist_core3_homo_bench_foldcst_cheat_chunk_t312__olange_IGNORE_THE_REST_1XV2A_5_4741_186_0 exited with zero status but no 'finished' file
19/11/2008 02:57:32|rosetta@home|If this happens repeatedly you may need to reset the project.
.
.
.

Again. I should suspend the Rosetta project altogether until this stops happening, right?

Cobra Profile

Joined: Nov 9 05
Posts: 6
ID: 10491
Credit: 8,472,001
RAC: 800
Message 57054 - Posted 19 Nov 2008 4:31:18 UTC - in response to Message ID 57027.

Add me to the list of folks seeing WUs seemingly hang at around 9ish minutes to go to completion. I've seen WUs run as long as 11 hrs without completing before manually aborting them. Behavior seen on multiple hardware platforms (at least an AMD 9950BE and Opteron 180 and an Intel Core2 Duo dual core laptop with installed memory ranging from 1GB to 3+ GB), but all running WinXP.

I really don't understand why people keep going on about this. It seems quite obvious to me that once the counter gets to around 10 minutes it stops counting altogether. Every WU does this, Mini or Beta. Always has, likely always will.

I have not shared your experience. I've run Rosetta@Home for a couple of years, and have happened to catch a few WUs counting down their final couple of minutes, so I disagree with your "always has" comment.

I have also become accustomed over the years to workunits wrapping up in ~2.5-3 hrs nearly 100% of the time. The combination of a "stuck" countdown timer and WUs going ~3-4 times longer than I'm used to was behavior outside of my experience and seemed to indicate a problem, so I posted.
____________

Greg_BE Profile

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57058 - Posted 19 Nov 2008 7:55:45 UTC - in response to Message ID 57053.

P.S.:
19/11/2008 02:40:10|rosetta@home|Starting foldcst_minimalist_core3_homo_bench_foldcst_cheat_chunk_t312__olange_IGNORE_THE_REST_1XV2A_5_4741_186_0
19/11/2008 02:40:14|rosetta@home|Starting task foldcst_minimalist_core3_homo_bench_foldcst_cheat_chunk_t312__olange_IGNORE_THE_REST_1XV2A_5_4741_186_0 using minirosetta version 140
19/11/2008 02:56:51|rosetta@home|Restarting task foldcst_minimalist_core3_homo_bench_foldcst_cheat_chunk_t312__olange_IGNORE_THE_REST_1XV2A_5_4741_186_0 using minirosetta version 140
19/11/2008 02:57:32|rosetta@home|Task foldcst_minimalist_core3_homo_bench_foldcst_cheat_chunk_t312__olange_IGNORE_THE_REST_1XV2A_5_4741_186_0 exited with zero status but no 'finished' file
19/11/2008 02:57:32|rosetta@home|If this happens repeatedly you may need to reset the project.
.
.
.

Again. I should suspend the Rosetta project altogether until this stops happening, right?


no...if it continues on those tasks specifically, report it in the correct thread. it's just one of those bugs that shows up at random. I get those now and then. it's a pain in the backside, but that's just life in DC world.
keep on crunching, there will be others that are better.

Greg_BE Profile

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57059 - Posted 19 Nov 2008 8:02:32 UTC

http://boinc.bakerlab.org/rosetta/result.php?resultid=206368130
IL23p40_p40BrubYhbond_design_jecorn_SAVE_ALL_OUT_IGNORE_THE_REST_ip40_2a7m_4683_239_0

CPU time 15318.72
stderr out

<core_client_version>6.2.19</core_client_version>
<![CDATA[
<stderr_txt>
# cpu_run_time_pref: 21600
======================================================
DONE :: 1 starting structures 15318.6 cpu seconds
This process generated 0 decoys from 0 attempts
======================================================

BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...
called boinc_finish

</stderr_txt>
]]>

this is odd...it ran about 75% of its time and came up with 0 decoys? and then stopped? what's up with that?

Greg_BE Profile

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57060 - Posted 19 Nov 2008 8:05:49 UTC

http://boinc.bakerlab.org/rosetta/result.php?resultid=206127181
IL23p40_p40BrubYhbond_design_jecorn_SAVE_ALL_OUT_IGNORE_THE_REST_ip40_1w2l_4683_215_0

CPU time 12843.17
stderr out

<core_client_version>6.2.19</core_client_version>
<![CDATA[
<stderr_txt>
# cpu_run_time_pref: 21600
# cpu_run_time_pref: 21600
# cpu_run_time_pref: 21600
No heartbeat from core client for 30 sec - exiting
# cpu_run_time_pref: 21600
======================================================
DONE :: 1 starting structures 12842.9 cpu seconds
This process generated 1 decoys from 1 attempts
======================================================

BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...
called boinc_finish

</stderr_txt>
]]>

Greg_BE Profile

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57061 - Posted 19 Nov 2008 8:07:01 UTC

http://boinc.bakerlab.org/rosetta/result.php?resultid=206127181
IL23p40_p40BrubYhbond_design_jecorn_SAVE_ALL_OUT_IGNORE_THE_REST_ip40_1w2l_4683_215_0
CPU time 12843.17
stderr out

<core_client_version>6.2.19</core_client_version>
<![CDATA[
<stderr_txt>
# cpu_run_time_pref: 21600
# cpu_run_time_pref: 21600
# cpu_run_time_pref: 21600
No heartbeat from core client for 30 sec - exiting
# cpu_run_time_pref: 21600
======================================================
DONE :: 1 starting structures 12842.9 cpu seconds
This process generated 1 decoys from 1 attempts
======================================================

BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...
called boinc_finish

</stderr_txt>
]]>

Greg_BE Profile

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57062 - Posted 19 Nov 2008 8:10:30 UTC
Last modified: 19 Nov 2008 8:13:35 UTC

more of the recovering checkpoint blah blah....

http://boinc.bakerlab.org/rosetta/result.php?resultid=207631513
1xxxA_ZNMP_ABRELAX_tetraL_IGNORE_THE_REST_ZINC_METALLOPROTEIN-1xxxA-_4658_78912_0

CPU time 21412.55
stderr out

-----

http://boinc.bakerlab.org/rosetta/result.php?resultid=207390655
1xxxA_ZNMP_ABRELAX_tetraR_IGNORE_THE_REST_ZINC_METALLOPROTEIN-1xxxA-_4658_55517_0

-----

http://boinc.bakerlab.org/rosetta/result.php?resultid=207329937
1xxxA_ZNMP_ABRELAX_tetraL_IGNORE_THE_REST_ZINC_METALLOPROTEIN-1xxxA-_4658_46952_0

to name a few...i think it is all the 1xxxA that produce this message:
<core_client_version>6.2.19</core_client_version>
<![CDATA[
<stderr_txt>
# cpu_run_time_pref: 21600
recovering checkpoint of tag S_00000001 with id abrelax_rg_state
recovering checkpoint of tag S_00000001 with id stage_1
recovering checkpoint of tag S_00000001 with id stage_2
recovering checkpoint of tag S_00000001 with id stage_3_iter1_1
recovering checkpoint of tag S_00000001 with id stage_3_iter1_2
recovering checkpoint of tag S_00000001 with id stage_3_iter1_3
recovering checkpoint of tag S_00000001 with id stage_3_iter1_4
recovering checkpoint of tag S_00000001 with id stage_3_iter1_5
recovering checkpoint of tag S_00000001 with id stage_3_iter1_6
recovering checkpoint of tag S_00000001 with id stage_3_iter1_7
recovering checkpoint of tag S_00000001 with id stage_3_iter1_8
recovering checkpoint of tag S_00000001 with id stage_3_iter1_9
recovering checkpoint of tag S_00000001 with id stage_3_iter1_10
recovering checkpoint of tag S_00000001 with id stage4_kk_1
recovering checkpoint of tag S_00000001 with id stage4_kk_2
recovering checkpoint of tag S_00000001 with id stage4_kk_3
recovering checkpoint of tag S_00000001 with id abrelax_relax
recovering checkpoint of tag S_00000002 with id abrelax_rg_state
recovering checkpoint of tag S_00000002 with id stage_1
recovering checkpoint of tag S_00000002 with id stage_2
recovering checkpoint of tag S_00000002 with id stage_3_iter1_1
recovering checkpoint of tag S_00000002 with id stage_3_iter1_2

and so on.....
of course the end message varies, but they all complete within this time frame and give good credit.

DONE :: 1 starting structures 21412.3 cpu seconds
This process generated 18 decoys from 18 attempts
======================================================

BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...
called boinc_finish

</stderr_txt>
]]>
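
If you want to see at a glance how many "recovering checkpoint" lines a task produced per tag, a small script along these lines can summarise a saved stderr file. It is a sketch only: the regular expression is based on the line format shown in the logs above, and the function name is hypothetical.

```python
import re

# Matches lines such as:
#   recovering checkpoint of tag S_00000001 with id stage_1
CHECKPOINT_LINE = re.compile(r"recovering checkpoint of tag (\S+) with id (\S+)")


def summarize_checkpoints(stderr_text):
    """Group checkpoint ids by tag, keeping the order they appear in."""
    stages_by_tag = {}
    for line in stderr_text.splitlines():
        match = CHECKPOINT_LINE.search(line)
        if match:
            tag, stage_id = match.groups()
            stages_by_tag.setdefault(tag, []).append(stage_id)
    return stages_by_tag


SAMPLE = """# cpu_run_time_pref: 21600
recovering checkpoint of tag S_00000001 with id abrelax_rg_state
recovering checkpoint of tag S_00000001 with id stage_1
recovering checkpoint of tag S_00000001 with id stage_2
recovering checkpoint of tag S_00000002 with id abrelax_rg_state
recovering checkpoint of tag S_00000002 with id stage_1
"""

for tag, stages in summarize_checkpoints(SAMPLE).items():
    print(tag, len(stages), "checkpoints, last:", stages[-1])
```

This only counts the messages; it does not say whether they indicate a problem, which is still a question for the project team.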

Mod.Sense
Forum moderator
Project administrator

Joined: Aug 22 06
Posts: 3381
ID: 106194
Credit: 0
RAC: 0
Message 57068 - Posted 19 Nov 2008 14:21:27 UTC - in response to Message ID 57059.


this is odd...it ran about 75% of its time and came up with 0 decoys? and then stopped? what's up with that?


Boy, that *IS* odd. And it gave you credit too, that doesn't look like it was for an error. I'd have to guess that it did some work, then restarted the task and somehow the stderr info. got reset.
____________
Rosetta Moderator: Mod.Sense

Greg_BE Profile

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57069 - Posted 19 Nov 2008 15:42:51 UTC - in response to Message ID 57068.


this is odd...it ran about 75% of its time and came up with 0 decoys? and then stopped? what's up with that?


Boy, that *IS* odd. And it gave you credit too, that doesn't look like it was for an error. I'd have to guess that it did some work, then restarted the task and somehow the stderr info. got reset.


some more info behind this, at the time i was running rosie and einstein at 175/25 respectively. the cycle time is 60 min which i believe is the default?
so maybe it got interrupted and went to einstein and then came back and tripped up. still strange..no errors and no other info. maybe you guys can pull something on your end.

Greg_BE Profile

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57077 - Posted 19 Nov 2008 19:37:21 UTC

default cpu time 21600; this ran 3146.078
http://boinc.bakerlab.org/rosetta/result.php?resultid=207892330
h001b_BOINC_ABRELAX_RANGE_yebf_IGNORE_THE_REST-S25-11-S3-8--h001b-_4769_556_0
Client state Compute error
Exit status 1 (0x1)
Computer ID 871217
Report deadline 26 Nov 2008 22:35:22 UTC
CPU time 3146.078
stderr out

<core_client_version>6.2.19</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
recovering checkpoint of tag S_U11X8X_00000001 with id abrelax_rg_state
recovering checkpoint of tag S_U11X8X_00000001 with id stage_1
recovering checkpoint of tag S_U11X8X_00000001 with id stage_2
# cpu_run_time_pref: 21600
recovering checkpoint of tag S_U11X8X_00000001 with id stage_3_iter1_1
recovering checkpoint of tag S_U11X8X_00000001 with id stage_3_iter1_2
recovering checkpoint of tag S_U11X8X_00000001 with id stage_3_iter1_3
recovering checkpoint of tag S_U11X8X_00000001 with id stage_3_iter1_4
recovering checkpoint of tag S_U11X8X_00000001 with id stage_3_iter1_5
recovering checkpoint of tag S_U11X8X_00000001 with id stage_3_iter1_6
recovering checkpoint of tag S_U11X8X_00000001 with id stage_3_iter1_7
recovering checkpoint of tag S_U11X8X_00000001 with id stage_3_iter1_8
recovering checkpoint of tag S_U11X8X_00000001 with id stage_3_iter1_9
recovering checkpoint of tag S_U11X8X_00000001 with id stage_3_iter1_10

and this repeats

then this stderr:
ERROR: NANs occured in hbonding!
ERROR:: Exit from: ..\..\src\core\scoring\hbonds\hbonds_geom.cc line: 763
called boinc_finish

</stderr_txt>
]]>

Validate state Invalid
Claimed credit 21.0970375448934

Thomas

Joined: Feb 20 06
Posts: 1
ID: 60233
Credit: 52,742
RAC: 36
Message 57078 - Posted 19 Nov 2008 19:51:00 UTC

Another WU with extremely bad credit / CPU-time ratio:

http://boinc.bakerlab.org/rosetta/result.php?resultid=206839093

7.45 Credit for more than 7.5 hours of crunching!

I decided to wait until this has been sorted out before crunching more of these WU's, at least on this computer.
____________

Guido Platteau

Joined: Sep 11 06
Posts: 2
ID: 111809
Credit: 283,392
RAC: 0
Message 57079 - Posted 19 Nov 2008 20:06:30 UTC
Last modified: 19 Nov 2008 20:13:58 UTC

I tried another WU on our Windows Vista Home system PC and it failed (again!)
WU
and this WU also failed on another computer:
Details

19/11/2008 13:40:51|rosetta@home|Sending scheduler request: To fetch work. Requesting 24469 seconds of work, reporting 0 completed tasks
19/11/2008 13:40:56|rosetta@home|Scheduler request succeeded: got 1 new tasks
19/11/2008 13:40:58|rosetta@home|Started download of minirosetta_1.40_windows_intelx86.exe
19/11/2008 13:40:58|rosetta@home|Started download of minirosetta_graphics_1.40_windows_intelx86.exe
19/11/2008 13:41:06|rosetta@home|Finished download of minirosetta_graphics_1.40_windows_intelx86.exe
19/11/2008 13:41:06|rosetta@home|Started download of Helvetica.txf
19/11/2008 13:41:08|rosetta@home|Finished download of Helvetica.txf
19/11/2008 13:41:08|rosetta@home|Started download of minirosetta_database_rev25538.zip
19/11/2008 13:41:24|rosetta@home|Finished download of minirosetta_1.40_windows_intelx86.exe
19/11/2008 13:41:24|rosetta@home|Started download of boinc_yebf_aah012_05_05.200_v1_3.gz
19/11/2008 13:41:31|rosetta@home|Finished download of boinc_yebf_aah012_05_05.200_v1_3.gz
19/11/2008 13:41:31|rosetta@home|Started download of boinc_yebf_aah012_03_05.200_v1_3.gz
19/11/2008 13:41:47|rosetta@home|Finished download of boinc_yebf_aah012_03_05.200_v1_3.gz
19/11/2008 13:41:47|rosetta@home|Started download of yebf_h012_.psipred_ss2
19/11/2008 13:41:49|rosetta@home|Finished download of yebf_h012_.psipred_ss2
19/11/2008 13:41:49|rosetta@home|Started download of yebf_h012_.fasta.gz
19/11/2008 13:41:50|rosetta@home|Finished download of yebf_h012_.fasta.gz
19/11/2008 13:42:06||Suspending computation - user is active
19/11/2008 13:42:29||Resuming computation
19/11/2008 13:43:26|rosetta@home|Finished download of minirosetta_database_rev25538.zip
19/11/2008 14:44:08|rosetta@home|Starting h012__BOINC_ABRELAX_RANGE_yebf_IGNORE_THE_REST-S25-5-S3-3--h012_-_4675_98_1
19/11/2008 14:44:10|rosetta@home|Starting task h012__BOINC_ABRELAX_RANGE_yebf_IGNORE_THE_REST-S25-5-S3-3--h012_-_4675_98_1 using minirosetta version 140
19/11/2008 14:57:50|rosetta@home|Task h012__BOINC_ABRELAX_RANGE_yebf_IGNORE_THE_REST-S25-5-S3-3--h012_-_4675_98_1 exited with zero status but no 'finished' file
19/11/2008 14:57:50|rosetta@home|If this happens repeatedly you may need to reset the project.
19/11/2008 14:57:50|rosetta@home|Restarting task h012__BOINC_ABRELAX_RANGE_yebf_IGNORE_THE_REST-S25-5-S3-3--h012_-_4675_98_1 using minirosetta version 140
19/11/2008 14:57:53||Suspending computation - user is active
19/11/2008 14:58:13||Resuming computation
19/11/2008 14:58:54|rosetta@home|Task h012__BOINC_ABRELAX_RANGE_yebf_IGNORE_THE_REST-S25-5-S3-3--h012_-_4675_98_1 exited with zero status but no 'finished' file
19/11/2008 14:58:54|rosetta@home|If this happens repeatedly you may need to reset the project.
____________

Speedy
Avatar

Joined: Sep 25 05
Posts: 159
ID: 1058
Credit: 507,926
RAC: 0
Message 57080 - Posted 19 Nov 2008 20:23:31 UTC

No graphics again 12.9 credits per hour 6.22 hours 80.67 credits total
____________
Have a crunching good day!!

Sid Celery

Joined: Feb 11 08
Posts: 796
ID: 241409
Credit: 9,546,016
RAC: 7,460
Message 57082 - Posted 19 Nov 2008 20:54:49 UTC - in response to Message ID 57079.

I tried another WU on our Windows Vista Home system PC and it failed (again!)
WU
and this WU also failed on another computer:
Details


Outcome Client error
Client state Compute error
Exit status -226 (0xffffff1e)

CPU time 431.4052
stderr out <core_client_version>6.2.19</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
[...]
Can't acquire lockfile - exiting
[...]

This error is becoming so widespread now (on non-Vista64 systems too) that it really needs some dedicated attention.

Can we have some formal comment on it, even if it's just to say you haven't tracked down the source of the problem or a practical workaround? It's just frustrating otherwise.

Until it's solved I'm really struggling to see a reason why any Minis should be issued. I could double my output for the project (as could several others) if it was either solved or Beta 5.98 WUs were issued, which run 100% for me.
____________

Erwin Schlonz
Avatar

Joined: May 20 07
Posts: 5
ID: 178747
Credit: 203,397
RAC: 0
Message 57094 - Posted 20 Nov 2008 10:52:27 UTC

What's up with this compute error???
It seems to me that the file name is way too long to handle for WinXP! Isn't there a maximum file name length (including path) of 255 characters?
My disc space is definitely not full.

<core_client_version>6.2.19</core_client_version>
<![CDATA[
<stderr_txt>
# cpu_run_time_pref: 7200
WARNING! attempt to create gzipped file ../../projects/boinc.bakerlab.org_rosetta/loopbuild_minimalist_core_control_standardloopfile2_homo_bench_looprelax_cheat_chunk_control_standard_loopfiles_t288__olange_IGNORE_THE_REST_2FNEA_7_4818_50_0_0 failed.
======================================================
DONE :: 1 starting structures 7120.77 cpu seconds
This process generated 45 decoys from 45 attempts
======================================================

BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...
called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
<file_name>loopbuild_minimalist_core_control_standardloopfile2_homo_bench_looprelax_cheat_chunk_control_standard_loopfiles_t288__olange_IGNORE_THE_REST_2FNEA_7_4818_50_0_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>

</message>
]]>

WU affected so far:

http://boinc.bakerlab.org/rosetta/result.php?resultid=208686725
http://boinc.bakerlab.org/rosetta/result.php?resultid=208672659
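
On the path-length question above: Windows has two relevant limits, 255 characters for a single file name component on NTFS and MAX_PATH (260 characters, counting the terminating null) for a full path in the classic API. A minimal Python sketch of a length check follows. The data directory is a hypothetical placeholder and was not taken from the failing machine, so this only illustrates the check and does not confirm that path length caused the error.

```python
import ntpath

MAX_PATH = 260  # classic Windows API limit, counting the terminating null


def exceeds_max_path(directory: str, file_name: str) -> bool:
    """Return True if the joined Windows path would not fit in MAX_PATH."""
    full_path = ntpath.join(directory, file_name)
    # +1 accounts for the terminating null that the Windows API counts.
    return len(full_path) + 1 > MAX_PATH


# Output file name copied from the stderr above.
result_file = (
    "loopbuild_minimalist_core_control_standardloopfile2_homo_bench_"
    "looprelax_cheat_chunk_control_standard_loopfiles_t288__olange_"
    "IGNORE_THE_REST_2FNEA_7_4818_50_0_0"
)

# Hypothetical BOINC data directory, for illustration only.
data_dir = (
    r"C:\Documents and Settings\All Users\Application Data"
    r"\BOINC\projects\boinc.bakerlab.org_rosetta"
)

print(len(result_file), exceeds_max_path(data_dir, result_file))
```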

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57096 - Posted 20 Nov 2008 11:12:15 UTC - in response to Message ID 57094.

What's up with this compute error???
It seems to me that the file name is way too long to handle for WinXP! Isn't there a maximum file name length (including path) of 255 characters?
My disc space is definitely not full.

<core_client_version>6.2.19</core_client_version>
<![CDATA[
<stderr_txt>
# cpu_run_time_pref: 7200
WARNING! attempt to create gzipped file ../../projects/boinc.bakerlab.org_rosetta/loopbuild_minimalist_core_control_standardloopfile2_homo_bench_looprelax_cheat_chunk_control_standard_loopfiles_t288__olange_IGNORE_THE_REST_2FNEA_7_4818_50_0_0 failed.
======================================================
DONE :: 1 starting structures 7120.77 cpu seconds
This process generated 45 decoys from 45 attempts
======================================================

BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...
called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
<file_name>loopbuild_minimalist_core_control_standardloopfile2_homo_bench_looprelax_cheat_chunk_control_standard_loopfiles_t288__olange_IGNORE_THE_REST_2FNEA_7_4818_50_0_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>

</message>
]]>

WU affected so far:

http://boinc.bakerlab.org/rosetta/result.php?resultid=208686725
http://boinc.bakerlab.org/rosetta/result.php?resultid=208672659



read this thread over in ufluids which references Dr. David Anderson. The article says:
-161 means there's a "dangling references" in your client_state.xml file, for example there's

<file_ref>
<name>foobar</name>
</file_ref>

but there's not <file_info> with name foobar.

It looks like the problem is that the ufluids app sometimes doesn't create all of its output files. I.E., the app finishes successfully but some of the output files don't exist. BOINC treats this as an error; the app must create all the files, even if they're empty.

same would apply to Rosie.
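
The rule quoted above (every declared output file must exist, even if it is empty) can be sketched in Python roughly as follows. This is an illustration only, not Rosetta or BOINC code; `ensure_outputs_exist` and the file names are hypothetical.

```python
from pathlib import Path
import tempfile


def ensure_outputs_exist(output_dir, expected_names):
    """Create an empty file for every expected output that is missing.

    Returns the names that had to be created.
    """
    created = []
    for name in expected_names:
        target = Path(output_dir) / name
        if not target.exists():
            target.touch()  # an empty placeholder is enough for the file to exist
            created.append(name)
    return created


with tempfile.TemporaryDirectory() as tmp:
    out_dir = Path(tmp)
    # Pretend the application wrote only the first of its two outputs.
    (out_dir / "result_0.out").write_text("decoy data\n")
    missing = ensure_outputs_exist(out_dir, ["result_0.out", "result_1.out"])
    print(missing)
```

An application would call something like this just before finishing, so that a run producing no decoys still leaves every declared output file in place.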

Evan

Joined: Dec 23 05
Posts: 268
ID: 42505
Credit: 402,585
RAC: 0
Message 57099 - Posted 20 Nov 2008 18:33:37 UTC

The curse of the NANS strikes again: 208596316

Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
# cpu_run_time_pref: 21600
# cpu_run_time_pref: 21600
# cpu_run_time_pref: 21600

ERROR: NANs occured in hbonding!
ERROR:: Exit from: ..\..\src\core\scoring\hbonds\hbonds_geom.cc line: 763
called boinc_finish

____________

netwraith Profile
Avatar

Joined: Sep 3 06
Posts: 80
ID: 109740
Credit: 13,483,227
RAC: 0
Message 57100 - Posted 20 Nov 2008 19:26:55 UTC


I have several of these loopbuild_minimalist_core3_homo_bench- .... tasks and several of them are way overtime... get to just under 10 minutes to go and stay that way for hours....

What could be up with these ???

All of my machines are Linux 2.6 kernels... Fedora/RedHat EL/CentOS


____________
Looking for a team ??? Join BoincSynergy!!


Not2Nutz

Joined: Jan 21 08
Posts: 1
ID: 236992
Credit: 76,372
RAC: 0
Message 57106 - Posted 20 Nov 2008 21:51:43 UTC

It looks like this problem has been ongoing for several weeks. And not one word about it on the Rosetta project web site face page. I am glad my frustration and curiosity finally rose to the level that caused me to visit this forum.

I, too, have 8 WUs of Mini 1.40 in progress for 15+ hours and stuck at above 98% completion, and still showing 9 hours 57 minutes left to completion. In fact the time-to-completion hasn't changed in over 10 hours.

One WU did complete in a timely fashion with a computation error.

I don't think my problem is for lack of RAM as I have 24GB installed. I am running Vista X64 on twin dual-core Xeons at 3.0GHz.

I have suspended all but one WU and I have bumped the task priority by two levels, just to see if I could hasten this one WU along. It doesn't seem to be helping as my CPUs are hardly even taxed at this point. So the problem does not seem to be a shortage of compute power. And I have over 1 Terabyte of free disk space. So it can't be for a lack of disk space either.

I am really at a loss of what to do here. Should I just abort them all and wait for the detectives to do the forensic thing and a fix to be implemented?

Any suggestions would be appreciated.

n2n

Rifleman

Joined: Nov 19 08
Posts: 17
ID: 288725
Credit: 139,408
RAC: 0
Message 57107 - Posted 20 Nov 2008 22:05:32 UTC

I just started crunching Rosetta and have 3 tasks that have been running for over 3 hours now with 15 hours to go. I had to abort one this morning that ran for well over 18 hours.
Is this normal? I had one task finish all right, but it took almost 18? hours. My Task Manager shows minirosetta consuming 165000K for each of the 3 cores it is using.

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57108 - Posted 20 Nov 2008 23:52:18 UTC - in response to Message ID 57107.

I just started crunching Rosetta and have 3 tasks that have been running for over 3 hours now with 15 hours to go. I had to abort one this morning that ran for well over 18 hours.
Is this normal? I had one task finish all right, but it took almost 18? hours. My Task Manager shows minirosetta consuming 165000K for each of the 3 cores it is using.


your memory usage is in line with mine for a dual core.
time remaining, see my reply in your first question thread about newbie questions.
there are some things to check, if they are all ok, then let boinc manager learn its way around your system. it will settle down over time. also could you post links to the work units that you aborted. show what your cpu run time was set for and what the run time was when you aborted the task. also show what the stderr out message was or any other messages. people can comment on what they see from that data.

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57110 - Posted 20 Nov 2008 23:58:55 UTC

Rob,

Rosetta@home workunits seem to behave that way when they have a significant underestimate of the amount of CPU time they need to run. I had one about a week ago that needed about 19.5 hours to run, instead of the 6 hours length I was then asking for, but completed normally otherwise. Don't be surprised if the underestimate of the time it needs to run also gives you a rather poor ratio of credits received to credits requested. Also, these Minirosetta v1.40 workunits with such underestimates are poor at recovering from restarting your machine after a shutdown or reboot. Earlier in this thread, you should find about 5 items in workunit names that indicate they are likely to have these problems - for example, zinc as part of the name.

Robert

Christian

Joined: Jun 11 06
Posts: 1
ID: 94373
Credit: 215,203
RAC: 0
Message 57112 - Posted 21 Nov 2008 0:31:39 UTC

ALLCON,
For some reason Minirosetta v1.40 continues to lock up my machine. This condition has existed for about two weeks now. It gets a little exasperating when I need my machine to do something and have to reboot it... consequently I will be discontinuing running BOINC and Rosetta until someone cleans up the problem, never mind the CPU tasking!

Hardware stats:

EVGA nForce 590 SLI mobo
AMD Athlon 64 x2 5000+
Corsair XMS 4 gb ram (2x2gb)
EVGA/Nvidia 8800gt (512mb x 2 in SLI)
BFG 650w PS
etc...

Software stats:
Win XP sp2 (up to date)
Trend Micro IS 2009 (up to date)

This is a fairly new build (6 months) and has very little in the way of garbage on it. I've never had a problem with BOINC or Rosetta before these past few weeks with the introduction of Minirosetta v1.40.
____________

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57113 - Posted 21 Nov 2008 0:33:01 UTC - in response to Message ID 57106.

I, too, have 8 WUs of Mini 1.40 in progress for 15+ hours and stuck at above 98% completion, and still showing 9 hours 57 minutes left to completion. In fact the time-to-completion hasn't changed in over 10 hours.

I don't think my problem is for lack of RAM as I have 24GB installed. I am running Vista X64 on twin dual-core Xeons at 3.0GHz.

I have suspended all but one WU and I have bumped the task priority by two levels, just to see if I could hasten this one WU along. It doesn't seem to be helping as my CPUs are hardly even taxed at this point. So the problem does not seem to be a shortage of compute power. And I have over 1 Terabyte of free disk space. So it can't be for a lack of disk space either.

n2n


Are you sure that isn't 9 minutes 57 seconds to go? Rosetta@home workunits tend to stick at about that estimated time to go if they come with a serious underestimate of how much CPU time they need, until they finally reach a time when the actual time to go is less than that.

I've read of some of these workunits needing about 800 MB to run well, but if any one core on your machine can get this much, suspending jobs on other cores won't help it run any faster. Exception: If some of the reported cores on your machine are due to hyperthreading, telling it to use only as many cores as are available without hyperthreading often at least doubles the speed on that number of cores.

I've seen one of these workunits that seemed to get stuck actually take about 19.5 hours CPU time, but it seemed to complete normally otherwise. It got a bad credit granted to credit requested ratio, though.

Adjusting my settings so that BOINC is allowed to use more than the default of about 10 GB of disk space seemed to help my more recent jobs, though.

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57114 - Posted 21 Nov 2008 0:50:36 UTC - in response to Message ID 57112.

ALLCON,
For some reason Minirosetta v1.40 continues to lock up my machine. This condition has existed for about two weeks now. It gets a little exasperating when I need my machine to do something and have to reboot it... consequently I will be discontinuing running BOINC and Rosetta until someone cleans up the problem, never mind the CPU tasking!

Software stats:
Win XP sp2 (up to date)
Trend Micro IS 2009 (up to date)


Christian,

Who or what is ALLCON?

Is that a 32-bit version of Windows XP SP2, which is unlikely to be able to actually use more than about 3.5 GB of your RAM memory, or a 64-bit version, which can use more of it?

When I had a similar problem on my machine, I found that it was helpful to tell BOINC that it could make use of more of my disk space than the default of about 10 GB; I had significantly more free disk space than that.
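
On the 32-bit question above, the arithmetic behind the roughly 3.5 GB figure can be checked with a few lines of Python. This is only a sketch: the part of the address space reserved for devices varies from machine to machine, so the usable amount is approximate and is not computed here.

```python
ADDRESS_BITS = 32  # pointer width of a 32-bit Windows XP install

# Total bytes a 32-bit pointer can address, expressed in GiB.
address_space_bytes = 2 ** ADDRESS_BITS
address_space_gib = address_space_bytes / 2 ** 30

installed_gib = 4  # the 2 x 2 GB kit listed in the hardware stats

# Installed RAM equal to the whole address space cannot all be visible,
# because part of that space is reserved for device memory.
print(address_space_gib, installed_gib >= address_space_gib)
```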

Martin Johnson

Joined: Oct 18 05
Posts: 19
ID: 5371
Credit: 171,164
RAC: 0
Message 57115 - Posted 21 Nov 2008 0:53:58 UTC

1.4 units WILL NOT STOP / wait / suspend.
So my Rosetta RAC is rising, and the others are falling !!!
____________

Sarel Profile

Joined: May 11 06
Posts: 51
ID: 81994
Credit: 81,712
RAC: 0
Message 57116 - Posted 21 Nov 2008 1:08:03 UTC

Sorry for being away for a while. I was busy in the wet lab testing some of my older designs (some of which show promise! when I get verification on this, I'll post an update on the protein-protein interactions thread).

Using the information that you posted on this thread, I've been able to reproduce on the lab's machines the long run-time problems that you have reported. I now have a good idea of how to avoid such occurrences, so that future runs will not be poorly behaved. Also, we have found a way to lower the memory footprint of our runs, but for at least a while we'll keep the current 512 MB restriction, just in case. We will probably submit more protein-interface jobs to BOINC over the next week or so, and I will look for your messages to see whether we've completely resolved this issue.

So, I'm planning to sift through the 500 thousand designed models that you have produced over the next few days and am extremely excited about seeing all these new possibilities!
____________

Martin Johnson

Joined: Oct 18 05
Posts: 19
ID: 5371
Credit: 171,164
RAC: 0
Message 57117 - Posted 21 Nov 2008 1:57:09 UTC

What about this "refusal to stop" issue ?
____________

Jim_Clark Profile
Avatar

Joined: Sep 11 07
Posts: 7
ID: 204423
Credit: 38,439
RAC: 0
Message 57121 - Posted 21 Nov 2008 4:22:16 UTC
Last modified: 21 Nov 2008 4:26:23 UTC

On my AMD Athlon 64 X2 Dual Core with Windows XP Pro SP3, 2 GB RAM and 100 GB of available HD space, no Rosetta Mini WU of any version has ever completed successfully since Rosetta Mini came into existence. They fail with a compute error or sometimes lock up my computer after wasting time that could be applied to WUs that can complete OK.

So I abort all Rosetta Mini WUs until I finally get a Rosetta Beta WU. This is a lot of work, since I generally need to abort about 30 or more Rosetta Mini WUs to get one Rosetta Beta WU. About once a week, I allow one Rosetta Mini WU to run, to see if the problem is fixed yet -- which hasn't happened yet.

Other project sites such as World Community Grid and PrimeGrid allow me to choose which applications my computer will run. Why can't Rosetta provide this feature, too? I would like to run the Rosetta Beta WUs, but if I get tired of aborting hundreds of Rosetta Mini WUs, I may feel forced to abandon Rosetta altogether.

Speedy
Avatar

Joined: Sep 25 05
Posts: 159
ID: 1058
Credit: 507,926
RAC: 0
Message 57122 - Posted 21 Nov 2008 5:25:00 UTC

This isn't a bug. Is there a way to delete old database files without the project re-downloading them once you restart BOINC? I have database rev 23035 25/6, 23035 7/8 & 25538 11/11, in total 54.8MB. Do I need them all?
Thanks for any advice
____________
Have a crunching good day!!

Alec Rosa

Joined: Nov 11 08
Posts: 18
ID: 287524
Credit: 2,635
RAC: 0
Message 57125 - Posted 21 Nov 2008 6:23:21 UTC - in response to Message ID 57117.

What about this "refusal to stop" issue ?


Yes, and what about the recurring "exited with zero status but no 'finished' file" issue?

Alec Rosa

Joined: Nov 11 08
Posts: 18
ID: 287524
Credit: 2,635
RAC: 0
Message 57126 - Posted 21 Nov 2008 6:28:43 UTC - in response to Message ID 57121.

On my AMD Athlon 64 X2 Dual Core with Windows XP Pro SP3, 2 GB RAM and 100 GB of available HD space, no Rosetta Mini WU of any version has ever completed successfully since Rosetta Mini came into existence. They fail with a compute error or sometimes lock up my computer after wasting time that could be applied to WUs that can complete OK.

So I abort all Rosetta Mini WUs until I finally get a Rosetta Beta WU. This is a lot of work, since I generally need to abort about 30 or more Rosetta Mini WUs to get one Rosetta Beta WU. About once a week, I allow one Rosetta Mini WU to run, to see if the problem is fixed yet -- which hasn't happened yet.

Other project sites such as World Community Grid and PrimeGrid allow me to choose which applications my computer will run. Why can't Rosetta provide this feature, too? I would like to run the Rosetta Beta WUs, but if I get tired of aborting hundreds of Rosetta Mini WUs, I may feel forced to abandon Rosetta altogether.


Well, well, it seems I'm not alone here.

Martin Johnson

Joined: Oct 18 05
Posts: 19
ID: 5371
Credit: 171,164
RAC: 0
Message 57127 - Posted 21 Nov 2008 7:12:03 UTC

No, you're not. But no one has yet admitted this is a problem,
so I too will be forced to abort until it is settled.
____________

Alec Rosa

Joined: Nov 11 08
Posts: 18
ID: 287524
Credit: 2,635
RAC: 0
Message 57128 - Posted 21 Nov 2008 9:27:29 UTC

I generally need to abort about 30 or more Rosetta Mini WUs to get one Rosetta Beta WU. About once a week, I allow one Rosetta Mini WU to run, to see if the problem is fixed yet -- which hasn't happened yet.

Well, well, it seems I'm not alone here.

No, you're not. But no one has yet admitted this is a problem,
so I too will be forced to abort until it is settled.


Abort. Abort. Abort.

I too am starting to feel like an abort-robot.

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57129 - Posted 21 Nov 2008 10:56:50 UTC

A workunit that refused to give up its CPU core when its timeslot ended and it was time for a workunit from a different BOINC project to take over that CPU core:

11/20/2008 10:46:25 PM|rosetta@home|Starting loopbuild_minimalist_core_control_standardloopfile2_homo_bench_looprelax_cheat_chunk_control_standard_loopfiles_t326__olange_IGNORE_THE_REST_2GHRA_8_4830_404_0
11/20/2008 10:46:26 PM|rosetta@home|Starting task loopbuild_minimalist_core_control_standardloopfile2_homo_bench_looprelax_cheat_chunk_control_standard_loopfiles_t326__olange_IGNORE_THE_REST_2GHRA_8_4830_404_0 using minirosetta version 140

I've now told the BOINC interface to suspend both that task and the whole Rosetta@home project, but it's still taking about half the CPU time on that CPU core.

I'm using the leave-in-memory option, in case that matters. The BOINC version is 5.10.45, under 32-bit Windows Vista SP1.

A Few Good Men

Joined: Mar 25 07
Posts: 14
ID: 157915
Credit: 2,031,382
RAC: 23
Message 57133 - Posted 21 Nov 2008 14:57:58 UTC

72 hours of crunching on a QX6700 with BOINC 6.2.19 and Mini 1.40, and I have received 25 credits. "Computational errors." I have reset the project 3 times.

droople
Avatar

Joined: Aug 19 08
Posts: 18
ID: 274398
Credit: 341,093
RAC: 419
Message 57134 - Posted 21 Nov 2008 15:19:25 UTC

Hi

I ran a minirosetta and got an error message as follows
http://boinc.bakerlab.org/rosetta/result.php?resultid=206624050

<core_client_version>6.2.19</core_client_version>
<![CDATA[
<stderr_txt>
Too many restarts with no progress. Keep application in memory while preempted.
======================================================
DONE :: 1 starting structures 3.59375 cpu seconds
This process generated 0 decoys from 0 attempts
======================================================

BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...
called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
<file_name>1hzh_1xk5_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_206_0_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>

</message>
]]>


I did check keep application in memory while preempted, but got this error
any idea?

Cheers

Daniel Kohn

Joined: Dec 30 05
Posts: 18
ID: 44719
Credit: 2,293,134
RAC: 391
Message 57135 - Posted 21 Nov 2008 15:31:08 UTC - in response to Message ID 57129.

I noticed the other day that I "Snoozed" BOINC and one of my 2 Rosetta work units kept crunching anyway.

____________

craig_bye

Joined: Nov 30 06
Posts: 1
ID: 132428
Credit: 84,909
RAC: 0
Message 57136 - Posted 21 Nov 2008 15:46:11 UTC

I too keep seeing an issue that the Rosetta Mini 1.40 just keeps running although BOINC reports it as "Waiting to Run". I've seen this twice now and I end up having to kill off the minirosetta_1.40_windows_intelx86.exe process.

sarha1

Joined: Sep 23 05
Posts: 5
ID: 844
Credit: 6,339,735
RAC: 0
Message 57137 - Posted 21 Nov 2008 16:18:13 UTC

Really, "loopbuild_" WUs seem to ignore all the requests to suspend and use the full CPU.
____________

adrianxw Profile
Avatar

Joined: Sep 18 05
Posts: 535
ID: 402
Credit: 1,057,641
RAC: 1,674
Message 57138 - Posted 21 Nov 2008 16:37:49 UTC
Last modified: 21 Nov 2008 16:41:09 UTC

Crashed out wu's.

208927642 Watchdog after 20,991 seconds?
208802490 after 16,202 seconds NAN in HBonding.
208717837 after 7,255 seconds NAN in HBonding.

Machines are all set to 6 Hour wu time. Leave in memory. Core Client 6.2.19. All "loopbuild_....." wu's - man they have long names.
____________
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.

rochester new york Profile
Avatar

Joined: Jul 2 06
Posts: 2562
ID: 98229
Credit: 958,139
RAC: 127
Message 57141 - Posted 21 Nov 2008 17:33:51 UTC

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=190175120

Nothing But Idle Time

Joined: Sep 28 05
Posts: 209
ID: 1675
Credit: 139,545
RAC: 0
Message 57142 - Posted 21 Nov 2008 18:58:49 UTC

Ditto here, Mini 1.40 is not relinquishing control when BOINC switches to another task. I was able to suspend it manually, however. Not relinquishing control throws the whole concept of resource shares into the toilet. Fix the program and look for the boinc interrupt, please.

Mod.Sense
Forum moderator
Project administrator

Joined: Aug 22 06
Posts: 3381
ID: 106194
Credit: 0
RAC: 0
Message 57147 - Posted 21 Nov 2008 21:43:48 UTC - in response to Message ID 57122.

This isn't a bug. Is there a way to delete old database files without the project re-downloading them once you restart BOINC? I have database rev 23035 25/6, 23035 7/8 & 25538 11/11, in total 54.8MB. Do I need them all?
Thanks for any advice


BOINC will only download again files that are being used by tasks. At present, the Rosetta scheduler is explicitly sending a request to the BOINC client to delete the following:

minirosetta_database.zip
minirosetta_database_rev19451.zip
minirosetta_database_rev20412.zip
minirosetta_database_rev20139.zip
minirosetta_database_rev20940.zip
minirosetta_database_rev21566.zip
minirosetta_database_rev22619.zip

The numbers you cited were higher than those and so are still being used. This is why BOINC just downloads them again when it discovers you've deleted them.

So, yes, once they truly become "old", they should be deleted automatically for you.
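
As a rough illustration of the rule described above, the following Python sketch splits a set of local database archives into those on the delete list and those still in use. It is not BOINC code. The delete list is copied from this post; the rev23035 file name is an assumption based on the naming pattern, and the local file list is made up for the example.

```python
# File names the scheduler currently asks clients to delete (from the post above).
DELETE_REQUESTED = {
    "minirosetta_database.zip",
    "minirosetta_database_rev19451.zip",
    "minirosetta_database_rev20412.zip",
    "minirosetta_database_rev20139.zip",
    "minirosetta_database_rev20940.zip",
    "minirosetta_database_rev21566.zip",
    "minirosetta_database_rev22619.zip",
}


def split_local_files(local_files):
    """Partition local archives into (still_in_use, to_delete), keeping order."""
    keep = [name for name in local_files if name not in DELETE_REQUESTED]
    drop = [name for name in local_files if name in DELETE_REQUESTED]
    return keep, drop


# Hypothetical contents of a project directory.
local = [
    "minirosetta_database_rev23035.zip",  # assumed name for rev 23035
    "minirosetta_database_rev25538.zip",
    "minirosetta_database_rev22619.zip",
]

keep, drop = split_local_files(local)
print("still in use:", keep)
print("to delete:", drop)
```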
____________
Rosetta Moderator: Mod.Sense

Speedy
Avatar

Joined: Sep 25 05
Posts: 159
ID: 1058
Credit: 507,926
RAC: 0
Message 57148 - Posted 21 Nov 2008 22:07:31 UTC

At present, the Rosetta scheduler is explicitly sending a request to the BOINC client to delete the following:

minirosetta_database.zip
minirosetta_database_rev19451.zip
minirosetta_database_rev20412.zip
minirosetta_database_rev20139.zip
minirosetta_database_rev20940.zip
minirosetta_database_rev21566.zip
minirosetta_database_rev22619.zip

Thank you Mod Sense for explaining this. How often do the databases change? Would it be possible for someone to explain what the graphics are doing at each stage for the mini app?
____________
Have a crunching good day!!

A Few Good Men

Joined: Mar 25 07
Posts: 14
ID: 157915
Credit: 2,031,382
RAC: 23
Message 57149 - Posted 21 Nov 2008 23:17:27 UTC

This is what I have been getting for the last 3 days.

11/21/2008 4:08:56 PM|rosetta@home|Task loopbuild_minimalist_core3_homo_bench_looprelax_cheat_chunk_t293__olange_IGNORE_THE_REST_1T43A_16_4801_30_0 exited with zero status but no 'finished' file
11/21/2008 4:08:56 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/21/2008 4:08:56 PM|rosetta@home|Restarting task loopbuild_minimalist_core3_homo_bench_looprelax_cheat_chunk_t293__olange_IGNORE_THE_REST_1T43A_16_4801_30_0 using minirosetta version 140
11/21/2008 4:09:37 PM|rosetta@home|Task loopbuild_minimalist_core3_homo_bench_looprelax_cheat_chunk_t293__olange_IGNORE_THE_REST_1T43A_16_4801_30_0 exited with zero status but no 'finished' file
11/21/2008 4:09:37 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/21/2008 4:09:37 PM|rosetta@home|Restarting task loopbuild_minimalist_core3_homo_bench_looprelax_cheat_chunk_t293__olange_IGNORE_THE_REST_1T43A_16_4801_30_0 using minirosetta version 140
11/21/2008 4:10:18 PM|rosetta@home|Task loopbuild_minimalist_core3_homo_bench_looprelax_cheat_chunk_t293__olange_IGNORE_THE_REST_1T43A_16_4801_30_0 exited with zero status but no 'finished' file
11/21/2008 4:10:18 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/21/2008 4:10:19 PM|rosetta@home|Restarting task loopbuild_minimalist_core3_homo_bench_looprelax_cheat_chunk_t293__olange_IGNORE_THE_REST_1T43A_16_4801_30_0 using minirosetta version 140
11/21/2008 4:11:00 PM|rosetta@home|Task loopbuild_minimalist_core3_homo_bench_looprelax_cheat_chunk_t293__olange_IGNORE_THE_REST_1T43A_16_4801_30_0 exited with zero status but no 'finished' file
11/21/2008 4:11:00 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/21/2008 4:11:42 PM|rosetta@home|Task loopbuild_minimalist_core3_homo_bench_looprelax_cheat_chunk_t293__olange_IGNORE_THE_REST_1T43A_16_4801_30_0 exited with zero status but no 'finished' file
11/21/2008 4:11:42 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/21/2008 4:11:42 PM|rosetta@home|Restarting task loopbuild_minimalist_core3_homo_bench_looprelax_cheat_chunk_t293__olange_IGNORE_THE_REST_1T43A_16_4801_30_0 using minirosetta version 140
11/21/2008 4:12:23 PM|rosetta@home|Task loopbuild_minimalist_core3_homo_bench_looprelax_cheat_chunk_t293__olange_IGNORE_THE_REST_1T43A_16_4801_30_0 exited with zero status but no 'finished' file
11/21/2008 4:12:23 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/21/2008 4:12:23 PM|rosetta@home|Restarting task loopbuild_minimalist_core3_homo_bench_looprelax_cheat_chunk_t293__olange_IGNORE_THE_REST_1T43A_16_4801_30_0 using minirosetta version 140
11/21/2008 4:13:04 PM|rosetta@home|Task loopbuild_minimalist_core3_homo_bench_looprelax_cheat_chunk_t293__olange_IGNORE_THE_REST_1T43A_16_4801_30_0 exited with zero status but no 'finished' file
11/21/2008 4:13:04 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/21/2008 4:13:05 PM|rosetta@home|Restarting task loopbuild_minimalist_core3_homo_bench_looprelax_cheat_chunk_t293__olange_IGNORE_THE_REST_1T43A_16_4801_30_0 using minirosetta version 140
11/21/2008 4:13:46 PM|rosetta@home|Task loopbuild_minimalist_core3_homo_bench_looprelax_cheat_chunk_t293__olange_IGNORE_THE_REST_1T43A_16_4801_30_0 exited with zero status but no 'finished' file
11/21/2008 4:13:46 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/21/2008 4:13:46 PM|rosetta@home|Restarting task loopbuild_minimalist_core3_homo_bench_looprelax_cheat_chunk_t293__olange_IGNORE_THE_REST_1T43A_16_4801_30_0 using minirosetta version 140
11/21/2008 4:14:27 PM|rosetta@home|Task loopbuild_minimalist_core3_homo_bench_looprelax_cheat_chunk_t293__olange_IGNORE_THE_REST_1T43A_16_4801_30_0 exited with zero status but no 'finished' file
11/21/2008 4:14:27 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/21/2008 4:15:08 PM|rosetta@home|Task loopbuild_minimalist_core3_homo_bench_looprelax_cheat_chunk_t293__olange_IGNORE_THE_REST_1T43A_16_4801_30_0 exited with zero status but no 'finished' file
11/21/2008 4:15:08 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
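
For anyone who wants to quantify a restart loop like the one above rather than paste the whole log, here is a minimal sketch in Python. It assumes the message log has been saved as plain text with the `timestamp|project|message` layout shown in this post; the function name `count_no_finished` and the sample task name `demo_task_0` are made up for illustration and are not part of BOINC.

```python
from collections import Counter

# Message text copied from the log lines in this thread.
NO_FINISHED = "exited with zero status but no 'finished' file"


def count_no_finished(lines):
    """Count "no 'finished' file" exits per task name.

    Assumes each line looks like: timestamp|project|message
    Lines that do not match that layout are skipped.
    """
    counts = Counter()
    for line in lines:
        parts = line.strip().split("|", 2)
        if len(parts) != 3:
            continue
        _timestamp, _project, message = parts
        if message.startswith("Task ") and message.endswith(NO_FINISHED):
            # Strip the leading "Task " and the trailing status text.
            task = message[len("Task "):-len(NO_FINISHED)].strip()
            counts[task] += 1
    return counts


# Hypothetical sample lines in the same layout as the log above.
sample = [
    "11/21/2008 4:09:37 PM|rosetta@home|Task demo_task_0 exited with zero status but no 'finished' file",
    "11/21/2008 4:09:37 PM|rosetta@home|If this happens repeatedly you may need to reset the project.",
    "11/21/2008 4:09:37 PM|rosetta@home|Restarting task demo_task_0 using minirosetta version 140",
    "11/21/2008 4:10:18 PM|rosetta@home|Task demo_task_0 exited with zero status but no 'finished' file",
]

for task, n in count_no_finished(sample).items():
    print(task, n)
```

Reporting the task name with its count (plus the first and last timestamps) is probably enough for the project team to find the work unit.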

Mod.Sense
Forum moderator
Project administrator

Joined: Aug 22 06
Posts: 3381
ID: 106194
Credit: 0
RAC: 0
Message 57150 - Posted 21 Nov 2008 23:25:47 UTC - in response to Message ID 57148.

How often do the databases change?


"As needed" I believe is the most accurate description there.

We did just recently review and revise the system requirements page. It indicates 400MB of disk is the minimum, so the roughly 50MB of the files you mention is a part of that. But it is more space than was required before mini.

Are you having disk space problems?

____________
Rosetta Moderator: Mod.Sense

Speedy
Avatar

Joined: Sep 25 05
Posts: 159
ID: 1058
Credit: 507,926
RAC: 0
Message 57151 - Posted 21 Nov 2008 23:36:03 UTC

No space problems. I was just interested to see how often the databases came out. Thank you for taking the time to explain.
____________
Have a crunching good day!!

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57153 - Posted 22 Nov 2008 0:20:41 UTC - in response to Message ID 57134.

Hi

I ran a minirosetta and got an error message as follows
http://boinc.bakerlab.org/rosetta/result.php?resultid=206624050

</stderr_txt>
<message>
<file_xfer_error>
<file_name>1hzh_1xk5_fchbonds_20_30sarel_SAVE_ALL_OUT_4704_206_0_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>

</message>
]]>


I did check "keep application in memory while preempted", but still got this error.
Any idea?

Cheers


I've seen some of the workunits with 1hzh_, sarel, or 4704 as part of their workunit names take considerably longer to run than their initial estimates; for example 19.5 CPU hours compared to a requested time of 6 hours for one. It looks like you got one of those. Also, they don't seem to recover from some types of restarts well. Are you letting such workunits continue running overnight, so they can finish without such a restart?

P . P . L .
Avatar

Joined: Aug 20 06
Posts: 581
ID: 105843
Credit: 4,864,105
RAC: 0
Message 57156 - Posted 22 Nov 2008 1:58:41 UTC

Hi.

I just returned this one. It wasn't a problem for me but was for the other guy. I don't know if it could be part of the problems people are seeing, but the result file for it was 1.14 MB; that's the biggest I've seen in a while. It was one of the loopbuild_minimalist_core3 tasks.

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=190452868

pete.





____________


robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57157 - Posted 22 Nov 2008 14:56:37 UTC - in response to Message ID 57156.

Hi.

I just returned this one. It wasn't a problem for me but was for the other guy. I don't know if it could be part of the problems people are seeing, but the result file for it was 1.14 MB; that's the biggest I've seen in a while. It was one of the loopbuild_minimalist_core3 tasks.

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=190452868

pete.






I just noticed that a number of the problems with the loopbuild_ workunits seem to affect people running them under Windows, but not those running them under Linux. Could someone with more access to the results database check if this applies in general, or only to the workunits I've seen?

Path7

Joined: Aug 25 07
Posts: 128
ID: 201002
Credit: 61,751
RAC: 0
Message 57158 - Posted 22 Nov 2008 16:16:47 UTC

WUs won't wait

An increasing number of WUs keep running when they are supposed to be waiting to run.

Ubuntu 8.04 Boinc 5.10.45:
loopbuild_boinc4_tex_cst_hombench_loopbuild_tex_cst_t297_IGNORE_THE_REST_1S5PA_ 8_4778_11_0
1urnA_BOINC_ABRELAX_SPLIT_SPLIT2_IGNORE_THE_REST-S25-9-S3-3--1urnA-_4768_217_0

Windows XP-home SP3 Boinc 5.10.45:
1c8cA_BOINC_ABRELAX_SPLIT_CONTROL_IGNORE_THE_REST-S25-9-S3-3--1c8cA-_4677_4812_0

I have changed my resource share settings in order to correct this.

Have a nice day,
Path7.

Rifleman

Joined: Nov 19 08
Posts: 17
ID: 288725
Credit: 139,408
RAC: 0
Message 57159 - Posted 22 Nov 2008 18:09:31 UTC

Hi all. I am new, so forgive me if I am wrong here. I have 2 tasks showing up with error messages, and they will soon get aborted. Too bad, as they have been running for 15 hours.
Strangely, though, if I look at the graphics for these supposedly stalled tasks, they are still chugging away fine, with the progress percentage increasing.
I have had quite a few of these "error" tasks now, and maybe this can help fix the bug?
Here is one of the tasks in question: http://boinc.bakerlab.org/rosetta/result.php?resultid=209086501

Alec Rosa

Joined: Nov 11 08
Posts: 18
ID: 287524
Credit: 2,635
RAC: 0
Message 57160 - Posted 22 Nov 2008 18:38:02 UTC

My Rosetta had to be suspended 'until further notice'.

My other BOINC projects were being seriously undermined so it had to be done.

When or how will people know that Rosetta's WUs are good again?

AlexisValle

Joined: Jan 10 06
Posts: 1
ID: 49061
Credit: 649,761
RAC: 0
Message 57161 - Posted 22 Nov 2008 19:58:04 UTC
Last modified: 22 Nov 2008 20:02:16 UTC

Yup, I can't get 1.40 to suspend either... even if I suspend the whole project, even if I do File->Exit, they're still there.

I'm running a quad (q6600), and right now there are 3 mini 1.40 and 1 5.98 running. If I suspend activities, 5.98 will suspend fine, but the 1.40s keep running.

If activities are normal (all running), and I do File->Exit, no rosetta tasks will exit (neither 1.40 nor 5.98). In fact, the only way to get them to stop is to kill their processes in task manager. Very sloppy, and it usually results in the computation ending prematurely.

Edit: I'm running vista x64, and the boinc manager is 6.2.19
____________

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57166 - Posted 22 Nov 2008 22:40:11 UTC
Last modified: 22 Nov 2008 22:41:04 UTC

I'm holding off on recommending Rosetta@home to other people, even though I've found two groups of people likely to be helped by research that can be done with the recent new features of minirosetta (even though there don't seem to be any researchers here trying yet). I'm going to wait until there is a new version of minirosetta without the memory problems, underestimated CPU time problems, and problems with suspending minirosetta before recommending it again, and am also restricting how much it runs on my machine until then.

Bob and Claudia Weidman

Joined: Nov 10 08
Posts: 4
ID: 287397
Credit: 141,521
RAC: 0
Message 57167 - Posted 22 Nov 2008 22:41:52 UTC

Hi. I am new to Rosetta and I am getting WAY too many "client error" and "exited with zero status but no 'finished' file" messages. I have reset the project a few times and it makes no difference. I see no sense in continuing.


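(CLIENT ERROR MESSAGES):
(Columns: Task ID | Work unit ID | Sent | Time reported or deadline | Server state | Outcome | Client state | CPU time (sec) | Claimed credit | Granted credit)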
209367978 190936697 22 Nov 2008 21:37:40 UTC 2 Dec 2008 21:37:40 UTC In Progress Unknown New --- --- ---
209311335 190888846 22 Nov 2008 16:28:02 UTC 22 Nov 2008 21:37:40 UTC Over Client error Compute error 2,623.95 11.44 ---
209255024 190839795 22 Nov 2008 11:50:28 UTC 22 Nov 2008 19:34:32 UTC Over Success Done 9,732.72 42.45 63.00
209230367 190817080 22 Nov 2008 9:34:07 UTC 22 Nov 2008 11:50:28 UTC Over Client error Compute error 1,750.72 7.64 ---
209183329 190773742 22 Nov 2008 5:29:46 UTC 22 Nov 2008 16:28:02 UTC Over Success Done 10,482.81 45.72 68.04
209140195 190739048 22 Nov 2008 1:20:02 UTC 22 Nov 2008 5:29:46 UTC Over Client error Compute error 3,065.56 13.37 ---
209116709 190717634 21 Nov 2008 22:55:46 UTC 1 Dec 2008 22:55:46 UTC In Progress Unknown New --- --- ---
209069310 190673452 21 Nov 2008 19:16:04 UTC 1 Dec 2008 19:16:04 UTC In Progress Unknown New --- --- ---
209054639 190664040 21 Nov 2008 15:18:15 UTC 21 Nov 2008 19:16:04 UTC Over Client error Compute error 1,015.35 4.43 ---
209031432 190648610 21 Nov 2008 13:48:24 UTC 21 Nov 2008 15:18:15 UTC Over Client error Compute error 138.31 0.60 ---
208999907 190621568 21 Nov 2008 9:50:08 UTC 21 Nov 2008 18:13:28 UTC Over Success Done 10,556.84 46.04 69.86
208938126 190567817 21 Nov 2008 5:13:38 UTC 21 Nov 2008 9:50:08 UTC Over Client error Compute error 3,151.08 13.74 ---
208908749 190542121 21 Nov 2008 2:18:30 UTC 21 Nov 2008 9:50:08 UTC Over Client error Compute error 0.00 0.00 ---
208870073 190507400 20 Nov 2008 22:28:09 UTC 21 Nov 2008 5:47:06 UTC Over Success Done 10,234.51 44.64 64.57
208847563 190488763 20 Nov 2008 20:18:10 UTC 20 Nov 2008 22:28:09 UTC Over Client error Compute error 116.50 0.51 ---
208810362 190435490 20 Nov 2008 16:46:54 UTC 21 Nov 2008 0:26:05 UTC Over Success Done 10,110.36 44.10 66.44
208761828 190409852 20 Nov 2008 12:50:52 UTC 20 Nov 2008 16:46:54 UTC Over Client error Compute error 3,082.83 13.45 ---
208695780 190355622 20 Nov 2008 4:55:36 UTC 20 Nov 2008 12:50:52 UTC Over Success Done 10,320.32 45.01 53.84
208695211 189253986 20 Nov 2008 7:19:04 UTC 20 Nov 2008 12:50:52 UTC Over Client error Compute error 432.68 1.89 ---
208659532 190325857 20 Nov 2008 3:13:14 UTC 20 Nov 2008 4:55:36 UTC Over Client error Compute error 476.43 2.08 ---
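
To put a number on how much work is being lost, a list like the one above can be tallied with a short script. This is only a sketch: the function name `summarize` is made up, and it assumes the last three fields of each row are CPU time in seconds, claimed credit, and granted credit, with `---` meaning no value yet. The three sample rows are copied from the list above.

```python
def summarize(rows):
    """Tally outcomes and sum the CPU seconds spent on failed tasks.

    Assumption: each row ends with
    CPU time (sec), claimed credit, granted credit.
    """
    tally = {"Success": 0, "Client error": 0, "In Progress": 0}
    wasted_cpu_seconds = 0.0
    for row in rows:
        tokens = row.split()
        if len(tokens) < 3:
            continue  # not a result row
        for key in tally:
            if key in row:
                tally[key] += 1
                break
        cpu = tokens[-3]  # third field from the end
        if "Client error" in row and cpu != "---":
            wasted_cpu_seconds += float(cpu.replace(",", ""))
    return tally, wasted_cpu_seconds


rows = [
    "209367978 190936697 22 Nov 2008 21:37:40 UTC 2 Dec 2008 21:37:40 UTC In Progress Unknown New --- --- ---",
    "209311335 190888846 22 Nov 2008 16:28:02 UTC 22 Nov 2008 21:37:40 UTC Over Client error Compute error 2,623.95 11.44 ---",
    "209255024 190839795 22 Nov 2008 11:50:28 UTC 22 Nov 2008 19:34:32 UTC Over Success Done 9,732.72 42.45 63.00",
]

tally, wasted = summarize(rows)
print(tally)
print("CPU seconds spent on failed tasks:", wasted)
```

Matching on the outcome text rather than on column positions keeps the sketch tolerant of the date fields, which contain spaces.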

(ZERO STATUS BUT NO 'FINISHED' FILE):
11/22/2008 3:27:46 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:27:46 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:28:27 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:28:27 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:28:27 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:29:08 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:29:08 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:29:50 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:29:50 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:29:50 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:30:31 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:30:31 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:31:12 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:31:12 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:31:12 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:31:54 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:31:54 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:32:35 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:32:35 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:32:35 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:33:16 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:33:16 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:33:57 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:33:57 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:33:58 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:34:39 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:34:39 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:35:20 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:35:20 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:35:20 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:36:01 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:36:01 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:36:43 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:36:43 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:36:43 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:37:24 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:37:24 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:38:05 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:38:05 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:38:05 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:38:46 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:38:46 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:39:28 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:39:28 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:39:28 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:40:09 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:40:09 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:40:50 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:40:50 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:40:50 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:41:31 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:41:31 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:42:13 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:42:13 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:42:13 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:42:54 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:42:54 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:43:35 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:43:35 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:43:35 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:44:16 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:44:16 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:44:58 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:44:58 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:44:58 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:45:39 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:45:39 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:46:20 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:46:20 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:46:20 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:47:02 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:47:02 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:47:43 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:47:43 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:47:43 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:48:24 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:48:24 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:49:05 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:49:05 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:49:06 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:49:47 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:49:47 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:50:28 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:50:28 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:50:28 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:51:09 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:51:09 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:51:51 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:51:51 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:51:51 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:52:32 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:52:32 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:53:13 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:53:13 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:53:13 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:53:54 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:53:54 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:54:36 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:54:36 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:54:36 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:55:17 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:55:17 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:55:58 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:55:58 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:55:58 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:56:39 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:56:39 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:57:21 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:57:21 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:57:21 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:58:02 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:58:02 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:58:43 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:58:43 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:58:43 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:59:25 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:59:25 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:00:06 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:00:06 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:00:06 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:00:47 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:00:47 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:01:28 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:01:28 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:01:29 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:02:10 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:02:10 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:02:51 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:02:51 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:02:51 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:03:32 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:03:32 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:04:13 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:04:13 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:04:14 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:04:55 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:04:55 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:05:36 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:05:36 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:05:36 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:06:17 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:06:17 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:06:59 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:06:59 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:06:59 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:07:40 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:07:40 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:08:21 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:08:21 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:08:21 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:09:02 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:09:02 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:09:44 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:09:44 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:09:44 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:10:25 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:10:25 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:11:06 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:11:06 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:11:06 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:11:48 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:11:48 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:12:29 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:12:29 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:12:29 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:13:10 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:13:10 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:13:51 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:13:51 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:13:52 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:14:33 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:14:33 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:15:14 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:15:14 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:15:14 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:15:55 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:15:55 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:16:36 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:16:36 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:16:37 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:17:18 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:17:18 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:17:59 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:17:59 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:17:59 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:18:40 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:18:40 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:19:21 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:19:21 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:19:22 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:20:03 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:20:03 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:20:44 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:20:44 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:20:44 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:21:25 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:21:25 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:22:07 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:22:07 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:22:07 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:22:48 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:22:48 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:23:29 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:23:29 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:23:29 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:24:10 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:24:10 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:24:52 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:24:52 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:24:52 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:25:33 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:25:33 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:26:14 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:26:14 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:26:14 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:26:55 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:26:55 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:27:37 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:27:37 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:27:37 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:28:18 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:28:18 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:28:59 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:28:59 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:28:59 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:29:41 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:29:41 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:30:22 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:30:22 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:30:22 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:31:03 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:31:03 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:31:44 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:31:44 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:31:45 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:32:26 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:32:26 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:33:07 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:33:07 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:33:07 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:33:48 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:33:48 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:34:30 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:34:30 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:34:30 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:35:11 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:35:11 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:35:52 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:35:52 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:35:52 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:36:33 PM|rosetta@home|Computation for task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 finished
11/22/2008 4:36:33 PM|rosetta@home|Output file h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0_0 for task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 absent
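
For anyone who wants to condense a paste like the one above before posting, here is a minimal sketch that tallies the kinds of messages in it. It assumes only the `date time|project|message` layout visible in the log. The function name `summarize` and the category labels are made up for illustration and are not part of BOINC or minirosetta.

```python
from collections import Counter

# Five lines copied verbatim from the log above; a full log could be read from a file instead.
SAMPLE_LOG = """\
11/22/2008 4:35:52 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:35:52 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:35:52 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:36:33 PM|rosetta@home|Computation for task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 finished
11/22/2008 4:36:33 PM|rosetta@home|Output file h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0_0 for task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 absent
"""


def summarize(log_text):
    """Count message kinds in a pasted log (category labels are hypothetical)."""
    counts = Counter()
    for line in log_text.splitlines():
        # Split into timestamp, project, message; skip blank or malformed lines.
        parts = line.split("|", 2)
        if len(parts) != 3:
            continue
        message = parts[2].strip()
        if "exited with zero status but no 'finished' file" in message:
            counts["no_finished_file"] += 1
        elif message.startswith("Restarting task"):
            counts["restart"] += 1
        elif message.startswith("Computation for task") and message.endswith("finished"):
            counts["computation_finished"] += 1
        elif message.startswith("Output file") and message.endswith("absent"):
            counts["output_absent"] += 1
    return counts


if __name__ == "__main__":
    for kind, count in summarize(SAMPLE_LOG).items():
        print(kind, count)
```

Posting the tallies together with the first and last timestamps would probably tell the developers the same thing as the full paste, but that is a suggestion, not a project requirement.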

11/22/2008 3:27:46 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:28:27 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:28:27 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:28:27 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:29:08 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:29:08 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:29:50 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:29:50 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:29:50 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:30:31 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:30:31 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:31:12 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:31:12 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:31:12 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:31:54 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:31:54 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:32:35 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:32:35 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:32:35 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:33:16 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:33:16 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:33:57 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:33:57 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:33:58 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:34:39 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:34:39 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:35:20 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:35:20 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:35:20 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:36:01 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:36:01 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:36:43 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:36:43 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:36:43 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:37:24 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:37:24 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:38:05 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:38:05 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:38:05 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:38:46 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:38:46 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:39:28 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:39:28 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:39:28 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:40:09 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:40:09 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:40:50 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:40:50 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:40:50 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:41:31 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:41:31 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:42:13 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:42:13 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:42:13 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:42:54 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:42:54 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:43:35 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:43:35 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:43:35 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:44:16 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:44:16 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:44:58 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:44:58 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:44:58 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:45:39 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:45:39 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:46:20 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:46:20 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:46:20 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:47:02 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:47:02 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:47:43 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:47:43 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:47:43 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:48:24 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:48:24 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:49:05 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:49:05 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:49:06 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:49:47 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:49:47 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:50:28 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:50:28 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:50:28 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:51:09 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:51:09 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:51:51 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:51:51 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:51:51 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:52:32 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:52:32 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:53:13 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:53:13 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:53:13 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:53:54 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:53:54 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:54:36 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:54:36 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:54:36 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:55:17 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:55:17 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:55:58 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:55:58 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:55:58 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:56:39 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:56:39 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:57:21 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:57:21 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:57:21 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:58:02 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:58:02 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:58:43 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:58:43 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 3:58:43 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 3:59:25 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 3:59:25 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:00:06 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:00:06 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:00:06 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:00:47 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:00:47 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:01:28 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:01:28 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:01:29 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:02:10 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:02:10 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:02:51 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:02:51 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:02:51 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:03:32 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:03:32 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:04:13 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:04:13 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:04:14 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:04:55 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:04:55 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:05:36 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:05:36 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:05:36 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:06:17 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:06:17 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:06:59 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:06:59 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:06:59 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:07:40 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:07:40 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:08:21 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:08:21 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:08:21 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:09:02 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:09:02 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:09:44 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:09:44 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:09:44 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:10:25 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:10:25 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:11:06 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:11:06 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:11:06 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:11:48 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:11:48 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:12:29 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:12:29 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:12:29 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:13:10 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:13:10 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:13:51 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:13:51 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:13:52 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:14:33 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:14:33 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:15:14 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:15:14 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:15:14 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:15:55 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:15:55 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:16:36 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:16:36 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:16:37 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:17:18 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:17:18 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:17:59 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:17:59 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:17:59 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:18:40 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:18:40 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:19:21 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no 'finished' file
11/22/2008 4:19:21 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
11/22/2008 4:19:22 PM|rosetta@home|Restarting task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 using minirosetta version 140
11/22/2008 4:20:03 PM|rosetta@home|Task h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0 exited with zero status but no '
____________

Alec Rosa

Joined: Nov 11 08
Posts: 18
ID: 287524
Credit: 2,635
RAC: 0
Message 57170 - Posted 22 Nov 2008 23:05:27 UTC

Why isn't this v1.40 Minirosetta being worked as a Beta project?

'Cause as non-beta... This makes no sense to me.

JChojnacki

Joined: Sep 17 05
Posts: 71
ID: 105
Credit: 6,746,778
RAC: 1,867
Message 57174 - Posted 23 Nov 2008 2:01:19 UTC

too many normally harmless exit(s) -- 208604840

Also, I am seeing various 1.40 work units that do not suspend when they should.

____________



Bob and Claudia Weidman

Joined: Nov 10 08
Posts: 4
ID: 287397
Credit: 141,521
RAC: 0
Message 57175 - Posted 23 Nov 2008 2:35:43 UTC

Hello again. I have been advised to revise my recent post (Message 57167 - Posted 22 Nov 2008 22:41:52 UTC) with some pertinent information that I neglected to mention and to also make my computer viewable. My PC is now viewable. I am running Windows Vista 32-bit on a Dell XPS 420 with an Intel Core 2 Quad CPU Q6600 @2.4GHz and 3.00 GB RAM.
I am allowing BOINC to use just 2 of the processors (I've burnt up a PC in the past using BOINC.) I am allowing BOINC to use 100GB of disk space.
My BOINC Manager version is 6.2.19.
I also noticed that I made a typo in my first message: I didn't "rest" the project, I reset it. :-/

My thanks to Robertmiles for his assistance.
____________

Greg_BE

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57176 - Posted 23 Nov 2008 8:48:24 UTC
Last modified: 23 Nov 2008 8:52:42 UTC

http://boinc.bakerlab.org/rosetta/result.php?resultid=208586851
loopbuild_boinc4_tex_cst_hombench_loopbuild_tex_cst_t293__IGNORE_THE_REST_2AS0A_1_4777_8_0
Outcome Client error
Client state Compute error
Exit status 1 (0x1)
CPU time 5065.922
stderr out

<core_client_version>6.2.19</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
# cpu_run_time_pref: 21600

ERROR: NANs occured in hbonding!
ERROR:: Exit from: ..\..\src\core\scoring\hbonds\hbonds_geom.cc line: 763
called boinc_finish

</stderr_txt>
]]>
Validate state Invalid
Claimed credit 34.1688813911913
Granted credit 0
application version 1.40

Greg_BE

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57178 - Posted 23 Nov 2008 8:56:30 UTC

http://boinc.bakerlab.org/rosetta/result.php?resultid=208586847
Task ID 208586847
Name loopbuild_boinc4_tex_cst_hombench_loopbuild_tex_cst_t293__IGNORE_THE_REST_2AS0A_1_4777_6_0
Outcome Client error
Client state Compute error
Exit status 1 (0x1)
CPU time 7957.859
stderr out

<core_client_version>6.2.19</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
# cpu_run_time_pref: 21600
# cpu_run_time_pref: 21600

ERROR: NANs occured in hbonding!
ERROR:: Exit from: ..\..\src\core\scoring\hbonds\hbonds_geom.cc line: 763
called boinc_finish

</stderr_txt>
]]>

Validate state Invalid
Claimed credit 53.3639821072357
Granted credit 0

A Few Good Men

Joined: Mar 25 07
Posts: 14
ID: 157915
Credit: 2,031,382
RAC: 23
Message 57179 - Posted 23 Nov 2008 9:30:07 UTC

I uninstalled BOINC ver. 6.2.19 and reinstalled ver. 5.10.45.
The results after 12 hours are the same: computation errors with mini 1.40. Three of the tasks actually stalled before reaching 100 percent, and then the error message came up. Please let me know what I can do to get folding again.

Greg_BE

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57180 - Posted 23 Nov 2008 9:46:05 UTC
Last modified: 23 Nov 2008 10:00:42 UTC

Why do the mini 1.40 loopbuild tasks ignore the BOINC Manager command to suspend?
I have a few updates I need to apply to my system, so I selected "suspend activity".
The Einstein work stops, but Rosetta does not.

**edit** After rebooting the system, the mini 1.40 tasks respond to the suspend command. Before the reboot, after the system had been running for days on end without a stop, they would not respond to it.

adrianxw

Joined: Sep 18 05
Posts: 535
ID: 402
Credit: 1,057,641
RAC: 1,674
Message 57181 - Posted 23 Nov 2008 10:32:52 UTC
Last modified: 23 Nov 2008 10:34:16 UTC

I also see MiniRosetta 1.40 running when it is posted as being in the "Waiting to run" state.

The cynic in me caused the thought "Hmmm, that would be a good way to get more out of the crunchers" to cross my mind.
____________
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.

(_KoDAk_)

Joined: Jul 18 06
Posts: 109
ID: 100677
Credit: 1,859,263
RAC: 0
Message 57183 - Posted 23 Nov 2008 14:38:37 UTC

http://boinc.bakerlab.org/rosetta/result.php?resultid=207439900
http://boinc.bakerlab.org/rosetta/result.php?resultid=208373092
http://boinc.bakerlab.org/rosetta/result.php?resultid=208397860

____________

Greg_BE

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57184 - Posted 23 Nov 2008 14:48:24 UTC

Another question regarding 1.40 tasks:
What's with all the exit status 1 errors that repeat the "recovering checkpoint" message and then end with "NANs occured in hbonding!"?
This is showing up a lot lately, and we waste CPU cycles on these tasks and get no credit. Is there a specific task group or a particular protein that is causing this problem?
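
One rough way to check the task-group question from the task names alone is sketched below. It assumes (this is a guess, not something the project has confirmed) that the part of a task name before "_IGNORE_THE_REST" identifies the batch. The helper names batch_of and tally_batches are made up for illustration, and the sample names are ones already posted in this thread.

```python
from collections import Counter

# Assumption: everything before this marker names the batch / task group.
MARKER = "_IGNORE_THE_REST"


def batch_of(task_name: str) -> str:
    """Return the part of a task name before MARKER, or the whole name if absent."""
    head, sep, _ = task_name.partition(MARKER)
    # Some names have a doubled underscore before the marker; drop the leftover one.
    return head.rstrip("_") if sep else task_name


def tally_batches(task_names):
    """Count how many of the given (failed) task names fall into each batch."""
    return Counter(batch_of(name) for name in task_names)


# Task names copied from reports earlier in this thread.
failed = [
    "loopbuild_boinc4_tex_cst_hombench_loopbuild_tex_cst_t293__IGNORE_THE_REST_2AS0A_1_4777_8_0",
    "loopbuild_boinc4_tex_cst_hombench_loopbuild_tex_cst_t293__IGNORE_THE_REST_2AS0A_1_4777_6_0",
    "h001__BOINC_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-17-S3-3--h001_-_4841_276_0",
]

for batch, count in tally_batches(failed).most_common():
    print(count, batch)
```

With enough failed task names pasted in, a batch that dominates the tally would at least hint at a problem group, though a handful of reports from one thread is far too small a sample to prove anything.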

Greg_BE

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57185 - Posted 23 Nov 2008 15:02:53 UTC

from the mini 1+ thread

Alec Rosa

Joined: Nov 11 08
Posts: 18
ID: 287524
Credit: 2,635
RAC: 0
Message 57189 - Posted 23 Nov 2008 19:36:39 UTC

OK, here comes...

23/11/2008 10:07:16|rosetta@home|Scheduler request succeeded: got 1 new tasks
23/11/2008 10:07:18|rosetta@home|Started download of boinc_t482_aah001b03_05.200_v1_3.gz
23/11/2008 10:07:18|rosetta@home|Started download of boinc_t482_aah001b06_05.200_v1_3.gz
23/11/2008 10:07:23|rosetta@home|Finished download of boinc_t482_aah001b03_05.200_v1_3.gz
23/11/2008 10:07:23|rosetta@home|Started download of boinc_t482_aah001b04_05.200_v1_3.gz
23/11/2008 10:07:25|rosetta@home|Finished download of boinc_t482_aah001b06_05.200_v1_3.gz
23/11/2008 10:07:25|rosetta@home|Started download of t482_h001b.psipred_ss2
23/11/2008 10:07:26|rosetta@home|Finished download of t482_h001b.psipred_ss2
23/11/2008 10:07:26|rosetta@home|Started download of t482_minus3.pdb.gz
23/11/2008 10:07:27|rosetta@home|Finished download of t482_minus3.pdb.gz
23/11/2008 10:07:33|rosetta@home|Finished download of boinc_t482_aah001b04_05.200_v1_3.gz
23/11/2008 10:11:25|rosetta@home|Starting h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-4--h001b-_4842_169_0
23/11/2008 10:11:29|rosetta@home|Starting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-4--h001b-_4842_169_0 using minirosetta version 140
23/11/2008 10:11:29|rosetta@home|Sending scheduler request: To fetch work. Requesting 5601 seconds of work, reporting 0 completed tasks
23/11/2008 10:11:35|rosetta@home|Scheduler request succeeded: got 1 new tasks
23/11/2008 10:14:13|rosetta@home|Starting h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-6--h001b-_4842_423_0
23/11/2008 10:14:13|rosetta@home|Starting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-6--h001b-_4842_423_0 using minirosetta version 140
23/11/2008 10:14:17|rosetta@home|Computation for task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-4--h001b-_4842_169_0 finished
23/11/2008 10:14:19|rosetta@home|Computation for task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-6--h001b-_4842_423_0 finished
23/11/2008 10:15:41|rosetta@home|Sending scheduler request: To fetch work. Requesting 43200 seconds of work, reporting 2 completed tasks
23/11/2008 10:15:46|rosetta@home|Scheduler request succeeded: got 2 new tasks
23/11/2008 10:15:48|rosetta@home|Started download of boinc_t482_aah001b08_05.200_v1_3.gz
23/11/2008 10:15:56|rosetta@home|Finished download of boinc_t482_aah001b08_05.200_v1_3.gz
23/11/2008 10:15:58|rosetta@home|Starting h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0
23/11/2008 10:15:58|rosetta@home|Starting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:26:04|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:26:04|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:26:46|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:26:46|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:26:46|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:27:27|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:27:27|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:27:27|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:28:09|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:28:09|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:28:50|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:28:50|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:28:50|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:29:31|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:29:31|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:30:13|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:30:13|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:30:13|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:30:54|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:30:54|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:30:54|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:31:35|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:31:35|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:32:17|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:32:17|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:32:17|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:32:58|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:32:58|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:33:39|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:33:39|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:33:40|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:34:21|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:34:21|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:34:21|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:35:02|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:35:02|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:35:44|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:35:44|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:35:44|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:36:25|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:36:25|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:37:07|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:37:07|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:37:07|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:37:48|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:37:48|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:37:48|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:38:29|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:38:29|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:39:11|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:39:11|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:39:11|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:39:52|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:39:52|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:40:35|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:40:35|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:40:35|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:41:16|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:41:16|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:41:58|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:41:58|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:41:58|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:42:39|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:42:39|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:43:21|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:43:21|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:43:21|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:44:02|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:44:02|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:44:02|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:44:43|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:44:43|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:45:25|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:45:25|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:45:25|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:46:05|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:46:05|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:46:05|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:46:47|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:46:47|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:47:28|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:47:28|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:47:29|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:48:15|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:48:15|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:48:58|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:48:58|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:48:59|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:49:40|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:49:40|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:49:40|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:50:22|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:50:22|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:50:22|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:51:03|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:51:03|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:51:03|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:51:44|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:51:44|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:52:25|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:52:25|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:52:25|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:53:07|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:53:07|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:53:07|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:53:48|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:53:48|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:54:30|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:54:30|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:54:30|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:55:11|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:55:11|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:55:11|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:55:52|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:55:52|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:56:34|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:56:34|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:56:34|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:57:15|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:57:15|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:57:15|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:57:57|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:57:57|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:57:57|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:58:38|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:58:38|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 10:58:38|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 10:59:19|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 10:59:19|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:00:01|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:00:01|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:00:01|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:00:42|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:00:42|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:00:42|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:01:23|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:01:23|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:02:05|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:02:05|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:02:05|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:02:46|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:02:46|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:03:28|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:03:28|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:03:28|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:04:08|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:04:08|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:04:08|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:04:49|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:04:49|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:05:31|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:05:31|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:05:31|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:06:12|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:06:12|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:06:12|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:06:53|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:06:53|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:07:35|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:07:35|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:07:35|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:08:16|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:08:16|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:08:17|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:08:58|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:08:58|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:09:39|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:09:39|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:09:39|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:10:20|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:10:20|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:10:20|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:11:01|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:11:01|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:11:42|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:11:42|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:11:43|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:12:24|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:12:24|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:12:24|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:13:05|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:13:05|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:13:47|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:13:47|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:13:47|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:14:28|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:14:28|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:15:09|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:15:09|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:15:10|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:15:51|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:15:51|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:15:51|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:16:32|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:16:32|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:17:14|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:17:14|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:17:14|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:17:55|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:17:55|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:18:37|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:18:37|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:18:37|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:19:18|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:19:18|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:19:18|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:19:59|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:19:59|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:20:41|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:20:41|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:20:41|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:21:22|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:21:22|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:22:04|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:22:04|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:22:04|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:22:45|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:22:45|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:22:45|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:23:26|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:23:26|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:24:08|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:24:08|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:24:08|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:24:49|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:24:49|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:25:31|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:25:31|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:25:31|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:26:12|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:26:12|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:26:12|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:26:53|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:26:53|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:27:35|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:27:35|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:27:35|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:28:16|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:28:16|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:28:58|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:28:58|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:28:58|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:29:39|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:29:39|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:29:39|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:30:20|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:30:20|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:31:02|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:31:02|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:31:02|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:31:42|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:31:42|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:31:42|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:32:24|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:32:24|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:33:05|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:33:05|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:33:05|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:33:46|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:33:46|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:33:47|rosetta@home|Restarting task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 using minirosetta version 140
23/11/2008 11:34:28|rosetta@home|Task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 exited with zero status but no 'finished' file
23/11/2008 11:34:28|rosetta@home|If this happens repeatedly you may need to reset the project.
23/11/2008 11:35:11|rosetta@home|Computation for task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 finished
23/11/2008 11:35:11|rosetta@home|Output file h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0_0 for task h001b_BOINC_ABRELAX_RANGE_t482_IGNORE_THE_REST-S25-6-S3-8--h001b-_4842_246_0 absent

Nice. It doesn't even say 'go screw yourself'.
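
For anyone who wants to summarise a message log like the one above before posting it, here is a minimal sketch. It assumes only the `timestamp|project|message` layout visible in the pasted lines; the function name `count_no_finished_exits` is made up for illustration and is not part of BOINC.

```python
from collections import Counter

# Message text copied from the log lines in the post above.
NO_FINISHED = "exited with zero status but no 'finished' file"


def count_no_finished_exits(log_text):
    """Count 'no finished file' exits per task name in a BOINC message log."""
    counts = Counter()
    for line in log_text.splitlines():
        # Split into at most three fields: timestamp, project, message.
        parts = line.split("|", 2)
        if len(parts) != 3:
            continue  # not a "timestamp|project|message" line
        message = parts[2].strip()
        if message.startswith("Task ") and message.endswith(NO_FINISHED):
            # Whatever sits between "Task " and the error text is the task name.
            task = message[len("Task "):-len(NO_FINISHED)].strip()
            counts[task] += 1
    return counts


if __name__ == "__main__":
    import sys

    # Usage: python count_exits.py < saved_messages.txt
    for task, n in count_no_finished_exits(sys.stdin.read()).most_common():
        print(n, task)
```

Posting the per-task count together with the task link is usually easier to read than the full log.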

Greg_BE Profile

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57190 - Posted 23 Nov 2008 22:15:10 UTC

Nice, where's the link to that specific task?

Bob and Claudia Weidman

Joined: Nov 10 08
Posts: 4
ID: 287397
Credit: 141,521
RAC: 0
Message 57195 - Posted 24 Nov 2008 3:22:28 UTC

Okay.........this is nonsense. After hours and hours of work, I get nothing but error messages. This version needs to be in beta testing, NOT in general use. I'm outta here. Someone let me know if this bull gets fixed. I have other projects that need my CPU time, and their programs actually work.
____________

P . P . L .

Joined: Aug 20 06
Posts: 581
ID: 105843
Credit: 4,864,105
RAC: 0
Message 57197 - Posted 24 Nov 2008 6:30:04 UTC
Last modified: 24 Nov 2008 6:34:23 UTC

Hi.

I have another one of those tasks that don't want to stop when preempted.

It's a loopbuild_boinc4_tex_cst_hombench_loopbuild_tex_cst_t326__IGNORE_THE_REST_1JVNA_9_4790_24_0

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=191156664

Edit// Add to error list, it restarted and fell over.

Mon 24 Nov 2008 17:32:08 EST|rosetta@home|Output file loopbuild_boinc4_tex_cst_hombench_loopbuild_tex_cst_t326__IGNORE_THE_REST_1JVNA_9_4790_24_0_0 for task loopbuild_boinc4_tex_cst_hombench_loopbuild_tex_cst_t326__IGNORE_THE_REST_1JVNA_9_4790_24_0 absent

pete.
____________


Alec Rosa

Joined: Nov 11 08
Posts: 18
ID: 287524
Credit: 2,635
RAC: 0
Message 57198 - Posted 24 Nov 2008 7:38:45 UTC - in response to Message ID 57190.
Last modified: 24 Nov 2008 7:51:01 UTC

nice, wheres the link to that specific task?

Your wish is my command.

http://boinc.bakerlab.org/rosetta/result.php?resultid=209499254

This feels kind of pointless, though, because the same happens to ALL the tasks that come my way. Like, for example, these:

http://boinc.bakerlab.org/rosetta/result.php?resultid=206660099
http://boinc.bakerlab.org/rosetta/result.php?resultid=207304009
http://boinc.bakerlab.org/rosetta/result.php?resultid=208744272
http://boinc.bakerlab.org/rosetta/result.php?resultid=208868439
http://boinc.bakerlab.org/rosetta/result.php?resultid=209499253

If this really helps. Good day now

Greg_BE Profile

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57200 - Posted 24 Nov 2008 9:11:07 UTC

read this. It was posted in another new thread by peter leman.

within that wiki article is the link to "lockfile" and it mentions: Where this becomes problematical is when a process dies (crashes) and the Lock File is never closed. This is usually corrected with a reboot action, but not always.

If you are going to delete it then you can find the lockfile that is actually called boinc_lockfile and it is in boinc folder then subfolder projects and then subfolder slots.

see if the reboot of boinc helps and if not then follow the directions in the wiki article.
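
To see whether any lock files are actually left behind, here is a minimal sketch that only lists them and deletes nothing. The file name `boinc_lockfile` comes from the post above; the location of the BOINC folder varies by installation, so it is passed in as an argument, and the function name `find_lockfiles` is made up for illustration.

```python
from pathlib import Path


def find_lockfiles(boinc_dir):
    """Return every file named boinc_lockfile below boinc_dir, sorted by path."""
    root = Path(boinc_dir)
    # rglob searches all subfolders, so the exact folder layout does not matter.
    return sorted(p for p in root.rglob("boinc_lockfile") if p.is_file())


if __name__ == "__main__":
    import sys

    # Usage: python find_lockfiles.py <path to your BOINC folder>
    folder = sys.argv[1] if len(sys.argv) > 1 else "."
    for path in find_lockfiles(folder):
        print(path)
```

If you do decide to remove a file it reports, exit BOINC first, since a running task is expected to hold its own lock file.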

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57201 - Posted 24 Nov 2008 9:11:47 UTC - in response to Message ID 57198.

nice, wheres the link to that specific task?

Your wish is my command.

http://boinc.bakerlab.org/rosetta/result.php?resultid=209499254

This feels kind of pointless, though, because the same happens to ALL the tasks that come my way. Like, for example, these:

http://boinc.bakerlab.org/rosetta/result.php?resultid=206660099
http://boinc.bakerlab.org/rosetta/result.php?resultid=207304009
http://boinc.bakerlab.org/rosetta/result.php?resultid=208744272
http://boinc.bakerlab.org/rosetta/result.php?resultid=208868439
http://boinc.bakerlab.org/rosetta/result.php?resultid=209499253

If this really helps. Good day now


Alec, you might want to look through the earlier messages in this thread. I believe that at least one of them included a workaround for the Can't Acquire Lockfile error. Also, you might want to check for your setting for the maximum disk space BOINC can use, and if you have enough free disk space, at least double it.

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57202 - Posted 24 Nov 2008 9:20:03 UTC - in response to Message ID 57150.

How often do the databases change?


"As needed" I believe is the most accurate description there.

We did just recently review and revise the system requirements page. It indicates 400MB of disk is the minimum, so the roughly 50MB of the files you mention is a part of that. But it is more space than was required before mini.

Are you having disk space problems?


It may need revising again - it still doesn't say that it will run under Vista. You might also want separate entries for the 64-bit versions of XP and Vista

Rob Lilley

Joined: Jan 11 06
Posts: 11
ID: 49465
Credit: 95,495
RAC: 18
Message 57204 - Posted 24 Nov 2008 11:51:43 UTC

This Minirosetta v1.40 Work Unit is another one that won't suspend and continues running when pre-empted by a QMC WU.

Should I abort the Minirosetta Work Unit or suspend all the other projects that play nice until the Minirosetta WU finishes?
____________

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57205 - Posted 24 Nov 2008 12:29:01 UTC - in response to Message ID 57204.

This Minirosetta v1.40 Work Unit is another one that won't suspend and continues running when pre-empted by a QMC WU.

Should I abort the Minirosetta Work Unit or suspend all the other projects that play nice until the Minirosetta WU finishes?



try what i did, exit boinc manager and then reopen. however i did a complete reboot of the system after that so i have no idea if just the closing and reopening of boinc manager will solve that problem.

Greg_BE Profile

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57206 - Posted 24 Nov 2008 12:40:58 UTC - in response to Message ID 57201.

nice, wheres the link to that specific task?

Your wish is my command.

http://boinc.bakerlab.org/rosetta/result.php?resultid=209499254

This feels kind of pointless, though, because the same happens to ALL the tasks that come my way. Like, for example, these:

http://boinc.bakerlab.org/rosetta/result.php?resultid=206660099
http://boinc.bakerlab.org/rosetta/result.php?resultid=207304009
http://boinc.bakerlab.org/rosetta/result.php?resultid=208744272
http://boinc.bakerlab.org/rosetta/result.php?resultid=208868439
http://boinc.bakerlab.org/rosetta/result.php?resultid=209499253

If this really helps. Good day now


Alec, you might want to look through the earlier messages in this thread. I believe that at least one of them included a workaround for the Can't Acquire Lockfile error. Also, you might want to check for your setting for the maximum disk space BOINC can use, and if you have enough free disk space, at least double it.



the only other reference is here

rochester new york Profile

Joined: Jul 2 06
Posts: 2562
ID: 98229
Credit: 958,139
RAC: 127
Message 57207 - Posted 24 Nov 2008 12:59:11 UTC

http://boinc.bakerlab.org/rosetta/result.php?resultid=208491606

Rob Lilley

Joined: Jan 11 06
Posts: 11
ID: 49465
Credit: 95,495
RAC: 18
Message 57210 - Posted 24 Nov 2008 13:35:35 UTC - in response to Message ID 57205.

This Minirosetta v1.40 Work Unit is another one that won't suspend and continues running when pre-empted by a QMC WU.

Should I abort the Minirosetta Work Unit or suspend all the other projects that play nice until the Minirosetta WU finishes?



try what i did, exit boinc manager and then reopen. however i did a complete reboot of the system after that so i have no idea if just the closing and reopening of boinc manager will solve that problem.


I tried that, and it doesn't seem to work. I am running BOINC as a service, so there's a different method for stopping it anyway, according to a thread I found somewhere on the BOINC message boards, but that doesn't work either. If I do stop both the BOINC Manager and the BOINC Core Client, Minirosetta continues to run. The Windows Task Manager shows the minirosetta task hasn't unloaded, and the CPU usage stays at 100%. I could end the Minirosetta process, but I am reluctant to do that.
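
For reference, the same check can be made without Task Manager. This is a minimal sketch that assumes you have saved the output of the Windows command `tasklist /FO CSV /NH` to a text file. The function name `find_processes` and the executable name in the sample are made up for illustration; the real minirosetta image name on your machine may differ.

```python
import csv
import io


def find_processes(tasklist_csv, prefix="minirosetta"):
    """Return (image_name, pid) pairs whose image name starts with prefix."""
    matches = []
    for row in csv.reader(io.StringIO(tasklist_csv)):
        if len(row) < 2:
            continue  # blank or malformed line
        name, pid = row[0], row[1]
        if not pid.isdigit():
            continue  # skips a header row if /NH was left out
        if name.lower().startswith(prefix.lower()):
            matches.append((name, int(pid)))
    return matches


if __name__ == "__main__":
    # Hypothetical sample in the column order tasklist uses for CSV output:
    # image name, PID, session name, session number, memory usage.
    sample = (
        '"boinc.exe","1234","Services","0","10,000 K"\n'
        '"minirosetta_example.exe","5678","Services","0","150,000 K"\n'
    )
    for name, pid in find_processes(sample):
        print(name, pid)
```

An empty result after BOINC has been stopped would mean the science application did exit; anything listed is a leftover process like the one described above.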

Greg_BE Profile

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57212 - Posted 24 Nov 2008 14:14:39 UTC - in response to Message ID 57210.

This Minirosetta v1.40 Work Unit is another one that won't suspend and continues running when pre-empted by a QMC WU.

Should I abort the Minirosetta Work Unit or suspend all the other projects that play nice until the Minirosetta WU finishes?



try what i did, exit boinc manager and then reopen. however i did a complete reboot of the system after that so i have no idea if just the closing and reopening of boinc manager will solve that problem.


I tried that, and it doesn't seem to work. I am running BOINC as a service, so there's a different method for stopping it anyway, according to a thread I found somewhere on the BOINC message boards, but that doesn't work either. If I do stop both the BOINC Manager and the BOINC Core Client, Minirosetta continues to run. The Windows Task Manager shows the minirosetta task hasn't unloaded, and the CPU usage stays at 100%. I could end the Minirosetta process, but I am reluctant to do that.


Try exiting BOINC, and do not delete anything from your folders. Go to Add/Remove Software and uninstall BOINC, then reinstall. Kind of drastic, but that's all I can think of. Maybe someone else has a different idea.


Alec Rosa

Joined: Nov 11 08
Posts: 18
ID: 287524
Credit: 2,635
RAC: 0
Message 57213 - Posted 24 Nov 2008 15:57:28 UTC - in response to Message ID 57200.

read this. It was posted in another new thread by peter leman.

within that wiki article is the link to "lockfile" and it mentions: Where this becomes problematical is when a process dies (crashes) and the Lock File is never closed. This is usually corrected with a reboot action, but not always.

If you are going to delete it then you can find the lockfile that is actually called boinc_lockfile and it is in boinc folder then subfolder projects and then subfolder slots.

see if the reboot of boinc helps and if not then follow the directions in the wiki article.


Thank you!

It worked out, rebooting the computer. The 'slots' disappeared and, with them, the lock file. Now to see if the error won't happen again after I resume Rosetta's tasks.

Greg_BE Profile

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57214 - Posted 24 Nov 2008 16:15:46 UTC - in response to Message ID 57213.

read this. It was posted in another new thread by peter leman.

within that wiki article is the link to "lockfile" and it mentions: Where this becomes problematical is when a process dies (crashes) and the Lock File is never closed. This is usually corrected with a reboot action, but not always.

If you are going to delete it then you can find the lockfile that is actually called boinc_lockfile and it is in boinc folder then subfolder projects and then subfolder slots.

see if the reboot of boinc helps and if not then follow the directions in the wiki article.


Thank you!

It worked out, rebooting the computer. The 'slots' disappeared and, with them, the lock file. Now to see if the error won't happen again after I resume Rosetta's tasks.


glad to help, but also thanks to peter leman for creating the original thread with the lockfile topic.

Evan

Joined: Dec 23 05
Posts: 268
ID: 42505
Credit: 402,585
RAC: 0
Message 57221 - Posted 24 Nov 2008 22:30:28 UTC

4 lockfiles

208603902
208601704
208601702
208596319

and

1 NAN

208596316

____________

Greg_BE Profile

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57225 - Posted 24 Nov 2008 22:46:03 UTC - in response to Message ID 57221.

4 lockfiles <--- see discussion below

208603902
208601704
208601702
208596319

and

1 NAN

208596316

Evan

Joined: Dec 23 05
Posts: 268
ID: 42505
Credit: 402,585
RAC: 0
Message 57228 - Posted 25 Nov 2008 9:57:36 UTC

4 lockfiles <--- see discussion below

Yes, sorry, I didn't see that until later. Strange that all the errors appear to be on the loopbuild models, and yet some of these are not affected. In the past few days I have had BOINC stop completely, which in the past has meant that a model has jammed up the works. Restart BOINC and everything starts working again with no apparent model failure. Strange.
____________

Rob Lilley

Joined: Jan 11 06
Posts: 11
ID: 49465
Credit: 95,495
RAC: 18
Message 57229 - Posted 25 Nov 2008 11:12:58 UTC - in response to Message ID 57212.
Last modified: 25 Nov 2008 11:13:47 UTC

This Minirosetta v1.40 Work Unit is another one that won't suspend and continues running when pre-empted by a QMC WU.

Should I abort the Minirosetta Work Unit or suspend all the other projects that play nice until the Minirosetta WU finishes?



try what i did, exit boinc manager and then reopen. however i did a complete reboot of the system after that so i have no idea if just the closing and reopening of boinc manager will solve that problem.


I tried that, and it doesn't seem to work. I am running BOINC as a service, so there's a different method for stopping it anyway, according to a thread I found somewhere on the BOINC message boards, but that doesn't work either. If I do stop both the BOINC Manager and the BOINC Core Client, Minirosetta continues to run. The Windows Task Manager shows the minirosetta task hasn't unloaded, and the CPU usage stays at 100%. I could end the Minirosetta process, but I am reluctant to do that.


try exiting boinc, do not delete anything from your folders. go to add/remove software and uninstall boinc. then reinstall. kind of drastic, but that's all i can think of. maybe someone else has a different idea.



Didn't do that, just suspended all other projects then restarted the computer. Turns out it was probably a bad WU anyway, as it worked for a while then came to a dead stop and wouldn't restart. After I tried restarting BOINC, it then errored out and came up with the lockfile error others are experiencing, as you will see here.

Ah well, some other poor sap will get that nasty WU now :(

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57230 - Posted 25 Nov 2008 11:38:03 UTC

I think the "rule" for loopbuild is to not do anything to it or it crashes and burns badly.

rochester new york Profile
Avatar

Joined: Jul 2 06
Posts: 2562
ID: 98229
Credit: 958,139
RAC: 127
Message 57232 - Posted 25 Nov 2008 18:51:30 UTC

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=189333988

rochester new york Profile
Avatar

Joined: Jul 2 06
Posts: 2562
ID: 98229
Credit: 958,139
RAC: 127
Message 57233 - Posted 25 Nov 2008 20:48:03 UTC - in response to Message ID 57232.

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=189333988

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=189333988

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57235 - Posted 25 Nov 2008 22:09:43 UTC - in response to Message ID 57233.

rochester..have a look at this morning's discussion down below on lockfile issues.
it will save you more errors and loss of credit.

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=189333988

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=189333988


P . P . L .
Avatar

Joined: Aug 20 06
Posts: 581
ID: 105843
Credit: 4,864,105
RAC: 0
Message 57240 - Posted 26 Nov 2008 3:30:48 UTC

Hi.

I have another task that doesn't want to stop when preempted; time & percentage are ticking up, and it is currently running.

1lis__BOINC_ABRELAX_SPLIT_SPLIT2_IGNORE_THE_REST-S25-9-S3-3--1lis_-_4768_2176_0

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=191579804

pete.

____________


Stacey Baird Profile
Avatar

Joined: Apr 11 06
Posts: 19
ID: 75056
Credit: 74,745
RAC: 0
Message 57247 - Posted 26 Nov 2008 13:27:41 UTC

Probable Problem
11/26/2008 9:15:00 PM|rosetta@home|Restarting task 1acf__BOINC_ABRELAX_SPLIT_SPLIT2_IGNORE_THE_REST-S25-9-S3-3--1acf_-_4768_1359_0 using minirosetta version 140

The above is stuck on 00.9:59.00, nine minutes 59 seconds remaining.
CPU time of more than five hours increases but time remaining never decreases.

Should I abort? Hmmm, as I read farther below, others are having the same problem.

Good Luck
____________

adrianxw Profile
Avatar

Joined: Sep 18 05
Posts: 535
ID: 402
Credit: 1,057,641
RAC: 1,674
Message 57249 - Posted 26 Nov 2008 15:44:31 UTC

210279108 NAN in HBonding.
____________
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.

FalconFly Profile
Avatar

Joined: Jan 11 08
Posts: 23
ID: 234757
Credit: 2,163,056
RAC: 0
Message 57250 - Posted 26 Nov 2008 16:09:41 UTC - in response to Message ID 57249.
Last modified: 26 Nov 2008 16:12:56 UTC

I'm seeing a significantly above-average number of failures, which result in the shutdown/crash of BOINC (MiniRosetta 1.40).

Happens across all my Linux Systems with no determinable pattern (64bit BOINC V5.10.45) and naturally results in loss of computing power (need to restart BOINC or the System for ease of purpose)

Otherwise, I repeatedly see above-average numbers of WorkUnits stuck at a certain percentage, with the MiniRosetta task either failed or using 0% CPU power, each effectively blocking a CPU core. This also requires a BOINC restart to kick the affected WorkUnits into action again.
____________

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57253 - Posted 26 Nov 2008 18:41:49 UTC - in response to Message ID 57250.

I'm seeing a significantly above-average number of failures, which result in the shutdown/crash of BOINC (MiniRosetta 1.40).

Happens across all my Linux Systems with no determinable pattern (64bit BOINC V5.10.45) and naturally results in loss of computing power (need to restart BOINC or the System for ease of purpose)

Otherwise, I repeatedly see above-average numbers of WorkUnits stuck at a certain percentage, with the MiniRosetta task either failed or using 0% CPU power, each effectively blocking a CPU core. This also requires a BOINC restart to kick the affected WorkUnits into action again.



for the team to know what is going on, please post your affected work units links in your next message.

Sid Celery

Joined: Feb 11 08
Posts: 796
ID: 241409
Credit: 9,546,016
RAC: 7,460
Message 57257 - Posted 26 Nov 2008 19:36:57 UTC - in response to Message ID 57200.

read this. It was posted in another new thread by peter leman.

within that wiki article is the link to "lockfile" and it mentions: Where this becomes problematical is when a process dies (crashes) and the Lock File is never closed. This is usually corrected with a reboot action, but not always.

If you are going to delete it then you can find the lockfile that is actually called boinc_lockfile and it is in boinc folder then subfolder projects and then subfolder slots.

see if the reboot of boinc helps and if not then follow the directions in the wiki article.

Thanks for highlighting Peter's message on this subject, Greg.

I've closed all apps, ended the MiniRosetta processes, deleted the files and am about to do a re-boot. Fingers crossed. I promise to report back soon.
____________

FalconFly Profile
Avatar

Joined: Jan 11 08
Posts: 23
ID: 234757
Credit: 2,163,056
RAC: 0
Message 57258 - Posted 26 Nov 2008 21:09:02 UTC - in response to Message ID 57253.
Last modified: 26 Nov 2008 21:22:26 UTC

for the team to know what is going on, please post your affected work units links in your next message.


This is going to be a tedious task, as the WorkUnits (most of them) complete normally after the deadlock is solved.
And after BOINC has crashed, I have no way of telling which WorkUnit may have caused it, since I'm looking at up to 8 WorkUnits per Host which will restart all normal when re-launching BOINC.

For now I'm afraid I'm best off with just solving the deadlocks, had to do that ~8 times today already.

(the only real solution I'd see is to run BOINC in debug mode to get behind it crashing or the MiniRosetta Client failing, which I'm very hesitant to do on 24 active production Systems running 24/7 at full speed - sounds like loads of work :p )

Anyway, for now I haven't seen any such behaviour on my 32bit Win32 Systems so far, only my Linux Systems seem randomly affected.

-- edit --

Oh, forgot :
How does Rosetta react to undervolting of CPUs ?

Most of my Systems run with reduced Vcore tested stable with Prime95, given a small safety buffer and have 100% validation on other Projects (Einstein, MalariaControl, SETI, LHC).

I'm very careful before I blame anything on a Project Client when I'm not running hardware 100% to its specifications.
____________

Alec Rosa

Joined: Nov 11 08
Posts: 18
ID: 287524
Credit: 2,635
RAC: 0
Message 57259 - Posted 26 Nov 2008 21:30:16 UTC - in response to Message ID 57214.

read this. It was posted in another new thread by peter leman.

within that wiki article is the link to "lockfile" and it mentions: Where this becomes problematical is when a process dies (crashes) and the Lock File is never closed. This is usually corrected with a reboot action, but not always.

If you are going to delete it then you can find the lockfile that is actually called boinc_lockfile and it is in boinc folder then subfolder projects and then subfolder slots.

see if the reboot of boinc helps and if not then follow the directions in the wiki article.


Thank you!

It worked out, booting the computer. The 'slots' disappeared and, with them, the lock file. Now to see if the error won't happen again after I resume Rosetta's tasks.


glad to help, but also thanks to peter leman for creating the original thread with the lockfile topic.

Now the update:

The boot was no more than a (short-lived) temporary solution. It all happened again. I believe the error occurs when Rosetta tasks are paused (for the BOINC client to switch to other projects) and, when they start again, it all goes to crap (and this is the technical term).

These were the tasks. I let them be processed till the end:

http://boinc.bakerlab.org/rosetta/result.php?resultid=209770604
http://boinc.bakerlab.org/rosetta/result.php?resultid=209817742

More came by, I aborted them when they started the usual (afore transcribed) 'you may need to reset the project'.

The Rosetta project is now suspended again until a solution to this is 'Revealed' to me.

(_KoDAk_) Profile

Joined: Jul 18 06
Posts: 109
ID: 100677
Credit: 1,859,263
RAC: 0
Message 57261 - Posted 26 Nov 2008 21:56:54 UTC

http://boinc.bakerlab.org/rosetta/result.php?resultid=210070351
http://boinc.bakerlab.org/rosetta/result.php?resultid=210070348
http://boinc.bakerlab.org/rosetta/result.php?resultid=209966564
http://boinc.bakerlab.org/rosetta/result.php?resultid=208462198
http://boinc.bakerlab.org/rosetta/result.php?resultid=209224858
http://boinc.bakerlab.org/rosetta/result.php?resultid=209224828

____________

Sid Celery

Joined: Feb 11 08
Posts: 796
ID: 241409
Credit: 9,546,016
RAC: 7,460
Message 57264 - Posted 26 Nov 2008 23:16:41 UTC - in response to Message ID 57257.

read this. It was posted in another new thread by peter leman.

within that wiki article is the link to "lockfile" and it mentions: Where this becomes problematical is when a process dies (crashes) and the Lock File is never closed. This is usually corrected with a reboot action, but not always.

If you are going to delete it then you can find the lockfile that is actually called boinc_lockfile and it is in boinc folder then subfolder projects and then subfolder slots.

see if the reboot of boinc helps and if not then follow the directions in the wiki article.

Thanks for highlighting Peter's message on this subject, Greg.

I've closed all apps, ended the MiniRosetta processes, deleted the files and am about to do a re-boot. Fingers crossed. I promise to report back soon.

Sorry, no good whatsoever - possibly worse. 1 success, 6 failures. Of the 4 that errored out before I aborted them:


210309372
210290406
Can't acquire lockfile - exiting
Outcome Client error
Client state Compute error
Exit status -226 (0xffffff1e)

<core_client_version>6.2.19</core_client_version>
<![CDATA[
<message>too many exit(0)s</message>

And

Outcome Client error
Client state Compute error
Exit status 1 (0x1)

<core_client_version>6.2.19</core_client_version>
<![CDATA[
<message>Incorrect function. (0x1) - exit code 1 (0x1)</message>
210317441
<stderr_txt>
# cpu_run_time_pref: 7200
recovering checkpoint of tag S_1VYHA_5_00000001 with id abrelax_rg_state
Loops::add_loop error -- overlapping loop regions
existing loop begin/end: 92/124
new loop begin/end: 124/191
ERROR:: Exit from: ..\..\src\protocols\loops\LoopClass.cc line: 233
called boinc_finish
</stderr_txt>

210318343
<stderr_txt>
recovering checkpoint of tag S_1BE9A_3_00000001 with id abrelax_rg_state
Loops::add_loop error -- overlapping loop regions
existing loop begin/end: 1/20
new loop begin/end: 20/31
ERROR:: Exit from: ..\..\src\protocols\loops\LoopClass.cc line: 233
called boinc_finish
</stderr_txt>

Not sure what those last two were about tbh, but they fell over quick enough.
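For what it's worth, in both of those logs the new loop begins on the same residue where the existing loop ends (124 and 20). A minimal sketch of an inclusive range-overlap check that would flag exactly that case is below. This is only an illustration in Python: the function name `loops_overlap` is made up, and it is not the actual code in LoopClass.cc.

```python
def loops_overlap(existing, new):
    """True if two (begin, end) residue ranges share at least one residue."""
    (b1, e1), (b2, e2) = existing, new
    # Endpoints are treated as inclusive (an assumption), so two ranges
    # that merely touch at one residue count as overlapping.
    return b1 <= e2 and b2 <= e1


print(loops_overlap((92, 124), (124, 191)))  # ranges from the first log
print(loops_overlap((1, 20), (20, 31)))      # ranges from the second log
print(loops_overlap((1, 20), (21, 31)))      # adjacent ranges, nothing shared
```

Under that assumption the two pairs from the logs count as overlapping, while the adjacent pair in the last line does not.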

Any more ideas, anyone?
____________

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57265 - Posted 26 Nov 2008 23:46:16 UTC
Last modified: 26 Nov 2008 23:51:27 UTC

sid, that's pretty odd as the first 2 tasks have the same output errors as rochester and the others. even with the -226. so something else happened on your system.

looks like its my turn again for compute errors.

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57266 - Posted 26 Nov 2008 23:54:03 UTC

http://boinc.bakerlab.org/rosetta/result.php?resultid=209650572
loopbuild_boinc4_tex_cst_hombench_loopbuild_tex_cst_t326__IGNORE_THE_REST_1ZH8A_6_4790_9_0

http://boinc.bakerlab.org/rosetta/result.php?resultid=209650574
loopbuild_boinc4_tex_cst_hombench_loopbuild_tex_cst_t326__IGNORE_THE_REST_1ZH8A_6_4790_10

2X's - <core_client_version>6.2.19</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
# cpu_run_time_pref: 21600

ERROR: NANs occured in hbonding!
ERROR:: Exit from: ..\..\src\core\scoring\hbonds\hbonds_geom.cc line: 763
called boinc_finish

</stderr_txt>
]]>

2956 for the last one and 6905 seconds for the first

transient
Avatar

Joined: Sep 30 06
Posts: 376
ID: 115553
Credit: 7,590,569
RAC: 2,277
Message 57270 - Posted 27 Nov 2008 6:08:34 UTC

I've got a number of compute errors for ZNMP tasks.

http://boinc.bakerlab.org/rosetta/result.php?resultid=210185113
http://boinc.bakerlab.org/rosetta/result.php?resultid=210184507
http://boinc.bakerlab.org/rosetta/result.php?resultid=210184505

All display the error:
ERROR: NANs occured in hbonding!
ERROR:: Exit from: ..\..\src\core\scoring\hbonds\hbonds_geom.cc line: 763 called boinc_finish
____________

P . P . L .
Avatar

Joined: Aug 20 06
Posts: 581
ID: 105843
Credit: 4,864,105
RAC: 0
Message 57271 - Posted 27 Nov 2008 6:39:04 UTC
Last modified: 27 Nov 2008 6:39:48 UTC

Hi.

Here's another, after 3hrs, 38mins.

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=191817904

1g73A_ZNMP_ABRELAX_tetraL_IGNORE_THE_REST_ZINC_METALLOPROTEIN-1g73A-_4652_57735_0

<core_client_version>6.2.14</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)
</message>

ERROR: NANs occured in hbonding!
ERROR:: Exit from: src/core/scoring/hbonds/hbonds_geom.cc line: 763
called boinc_finish

pete.
____________


HA-SOFT, s.r.o.

Joined: Jan 27 07
Posts: 10
ID: 144015
Credit: 65,787,312
RAC: 52,014
Message 57272 - Posted 27 Nov 2008 7:27:11 UTC

I have a problem on W2008Server 64 bit, where all Minirosetta tasks hang at 0.00 progress. Rosetta beta works ok. BOINC 6.2.19

Zdenek
____________

BF

Joined: Dec 1 05
Posts: 1
ID: 26296
Credit: 3,854,531
RAC: 0
Message 57274 - Posted 27 Nov 2008 10:00:02 UTC

I have the same problem. Rosetta beta works well but rosetta mini gives compute error within seconds.
Most of the time, I got an access violation:
<![CDATA[
<message>
- exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x00030003

Engaging BOINC Windows Runtime Debugger...


(I can provide the complete file if needed).
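The negative exit codes in these reports are the same 32-bit values as the hex numbers printed beside them, just read as signed integers. A small sketch of the conversion, in Python, with a made-up helper name `exit_code_to_hex`:

```python
def exit_code_to_hex(code):
    """Show a signed 32-bit exit code as its unsigned 32-bit hex value."""
    # Masking with 0xFFFFFFFF reinterprets the two's-complement value
    # as an unsigned 32-bit number.
    return f"0x{code & 0xFFFFFFFF:08x}"


print(exit_code_to_hex(-1073741819))  # the access violation code above
print(exit_code_to_hex(-226))         # the "too many exit(0)s" status
```

This makes it easier to match a code from the BOINC results page against the hex value in the stderr text.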


This computer has WinXP SP2 - and a core 2 duo processor (E6600).

Another pc with the same configuration but with a pentium 4 works well.

BF
____________

David Ball

Joined: Nov 25 05
Posts: 25
ID: 19653
Credit: 1,270,528
RAC: 0
Message 57275 - Posted 27 Nov 2008 11:21:42 UTC

http://boinc.bakerlab.org/rosetta/result.php?resultid=210193423

2vik__BOINC_ABRELAX_SPLIT_SPLIT2_IGNORE_THE_REST-S25-9-S3-3--2vik_-_4768_1689_1

Vista home premium 64 bit system with 5 GB of ram. C2 Quad Q6600. Only running BOINC. 2 rosetta tasks were running along with 2 tasks from other projects. Lots of free memory and disk space. BOINC is set to leave tasks in memory. BOINC is not used as a screensaver. BOINC client version is 6.2.19.

The WU above was running but the CPU time (3 hours 50 minutes 2 seconds) and percent complete (about 69%) weren't increasing. I checked with task manager and it WAS using 25% cpu (1 of 4 cores in a C2Q Q6600). I suspended the WU and the status in the BOINC manager changed from running to waiting to run. However, windows task manager showed that it was still running. I had another rosetta task running so I suspended the second WU as well to make sure I had the right one. The second rosetta WU stopped using CPU when it was suspended but remained in memory as it should. BOINC manager now showed NO rosetta tasks running, but windows task manager showed the problem WU was still using all the cpu time it could get. I killed it in task manager and aborted the WU. When looking at the result, I found that I was the second person to get the WU and it had died on the other computer after about 3 minutes.

IIRC, the WU was on the 5th model when this happened.

Hope this helps.
____________
Have you read a good Science Fiction book lately?

AdeB Profile
Avatar

Joined: Dec 12 06
Posts: 45
ID: 135244
Credit: 2,358,915
RAC: 2,105
Message 57282 - Posted 27 Nov 2008 14:01:23 UTC - in response to Message ID 57258.

for the team to know what is going on, please post your affected work units links in your next message.


This is going to be a tedious task, as the WorkUnits (most of them) complete normally after the deadlock is solved.
And after BOINC has crashed, I have no way of telling which WorkUnit may have caused it, since I'm looking at up to 8 WorkUnits per Host which will restart all normal when re-launching BOINC.

For now I'm afraid I'm best off with just solving the deadlocks, had to do that ~8 times today already.

(the only real solution I'd see is to run BOINC in debug mode to get behind it crashing or the MiniRosetta Client failing, which I'm very hesitant to do on 24 active production Systems running 24/7 at full speed - sounds like loads of work :p )

Anyway, for now I haven't seen any such behaviour on my 32bit Win32 Systems so far, only my Linux Systems seem randomly affected.

-- edit --

Oh, forgot :
How does Rosetta react to undervolting of CPUs ?

Most of my Systems run with reduced Vcore tested stable with Prime95, given a small safety buffer and have 100% validation on other Projects (Einstein, MalariaControl, SETI, LHC).

I'm very careful before I blame anything on a Project Client when I'm not running hardware 100% to its specifications.


FalconFly, i noticed that you are crunching for LHC@home as well.
It might be that LHC@home is causing your crashes. I've had some crashes too this week. Next time it happens check your boinc.log file, the last message there, before SIGSEGV and the stack trace, is probably: [lhcathome] Scheduler request
A few weeks ago this has also been mentioned by several people in the LHC@home message boards.

AdeB
____________

rochester new york Profile
Avatar

Joined: Jul 2 06
Posts: 2562
ID: 98229
Credit: 958,139
RAC: 127
Message 57284 - Posted 27 Nov 2008 15:37:13 UTC

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=189759587

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57287 - Posted 27 Nov 2008 17:35:30 UTC - in response to Message ID 57284.

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=189759587


this is not really an issue with the task, but rather the time of 10 days to crunch the task and report back has expired. you may have too much work on your system and it is not active enough to complete the work assigned to it. I see no CPU time on this task, so it appears it never got crunched to begin with. There are no error codes either.

This was also the case of another task you reported earlier. It never got crunched in 10 days.

(_KoDAk_) Profile

Joined: Jul 18 06
Posts: 109
ID: 100677
Credit: 1,859,263
RAC: 0
Message 57289 - Posted 27 Nov 2008 20:49:36 UTC

http://boinc.bakerlab.org/rosetta/result.php?resultid=208906339

____________

Rifleman

Joined: Nov 19 08
Posts: 17
ID: 288725
Credit: 139,408
RAC: 0
Message 57290 - Posted 27 Nov 2008 22:07:00 UTC

Can someone take a quick look at my results and see if they know why I am getting massive numbers of errors and wasted time? The ones I terminated myself were still running in task manager after restarting BOINC so I'd end up with 8 WUs vying for CPU time while only 4 showed in BOINC.
Here is my results page and thanks. http://boinc.bakerlab.org/rosetta/results.php?userid=288725

Mike.Gibson

Joined: Nov 3 07
Posts: 19
ID: 217599
Credit: 194,329
RAC: 0
Message 57291 - Posted 27 Nov 2008 22:39:50 UTC

I am using a dual-core 3800+ with Vista Premium and Boinc 6.2.19.

If I have a mini 1.40 & Beta 5.98 running and suspend the project, both tasks are shown as suspended by user. However, the mini 1.40 keeps on running, albeit slowly.

Two other tasks start to run, one at normal speed and the other slowly.

Obviously, one of the new tasks is running on its own in one core and the other new task is sharing the second core with mini 1.40.

I have never seen a core sharing before. Is this ok, or is this a problem? None of my other projects show any signs of this phenomenon.

Any ideas?

Rifleman

Joined: Nov 19 08
Posts: 17
ID: 288725
Credit: 139,408
RAC: 0
Message 57292 - Posted 27 Nov 2008 23:15:33 UTC

Check your processes running in task manager by pressing control, alt, delete. Do you show more than the normal number of tasks running?

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57293 - Posted 27 Nov 2008 23:45:35 UTC - in response to Message ID 57290.
Last modified: 27 Nov 2008 23:46:46 UTC

Can someone take a quick look at my results and see if they know why I am getting massive numbers of errors and wasted time? The ones I terminated myself were still running in task manager after restarting BOINC so I'd end up with 8 WUs vying for CPU time while only 4 showed in BOINC.
Here is my results page and thanks. http://boinc.bakerlab.org/rosetta/results.php?userid=288725


Your link to your results page is not correct, that is your own internal link i think. Here is the public one: http://boinc.bakerlab.org/rosetta/results.php?hostid=948562

Read this message and then go up the board a bit and see what others did when it comes to lockfile issues.

I see that other results have Nan issues. No one has explained what this is or if they are working on a fix for it or not.
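As general background, NaN is the floating-point "not a number" value that a calculation returns when the result is undefined. A tiny sketch of how such a value arises and how a program can test for it is below. It is written in Python with a hypothetical `check_score` helper, and is not the check in hbonds_geom.cc.

```python
import math


def check_score(value, label):
    """Return value unchanged, or raise an error if it is NaN."""
    if math.isnan(value):
        raise ValueError(f"NaN found in {label}")
    return value


# inf - inf is one standard way a floating-point calculation yields NaN.
bad = float("inf") - float("inf")
print(math.isnan(bad))  # NaN is detected by isnan
print(bad == bad)       # NaN never compares equal, not even to itself
```

Once a NaN appears in a score it spreads through every later calculation that uses it, which may be why a program would rather stop than carry on.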

Your non nan and lockfile results that errored out are possibly due to an out-of-date graphics card device driver. read this message for an explanation.

hope this helps get you back on track.

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57294 - Posted 27 Nov 2008 23:50:25 UTC - in response to Message ID 57291.

I am using a dual-core 3800+ with Vista Premium and Boinc 6.2.19.

If I have a mini 1.40 & Beta 5.98 running and suspend the project, both tasks are shown as suspended by user. However, the mini 1.40 keeps on running, albeit slowly.

Two other tasks start to run, one at normal speed and the other slowly.

Obviously, one of the new tasks is running on its own in one core and the other new task is sharing the second core with mini 1.40.

I have never seen a core sharing before. Is this ok, or is this a problem? None of my other projects show any signs of this phenomenon.

Any ideas?



There have been a few problems that I and others have experienced with 1.4 tasks not suspending; that was mostly in the loopbuild tasks. I have found that you just have to exit boinc and restart it. you may also have to reboot your system, but that is probably last ditch. After one or both of these steps boinc mgr will act properly again.

Mike.Gibson

Joined: Nov 3 07
Posts: 19
ID: 217599
Credit: 194,329
RAC: 0
Message 57295 - Posted 28 Nov 2008 1:40:49 UTC - in response to Message ID 57292.

Check your processes running in task manager by pressing control, alt, delete. Do you show more than the normal number of tasks running?


Already checked - all 3 registered at variable amounts around 44%, 22% & 22%.

Mike.Gibson

Joined: Nov 3 07
Posts: 19
ID: 217599
Credit: 194,329
RAC: 0
Message 57296 - Posted 28 Nov 2008 1:45:37 UTC - in response to Message ID 57294.

I am using a dual-core 3800+ with Vista Premium and Boinc 6.2.19.

If I have a mini 1.40 & Beta 5.98 running and suspend the project, both tasks are shown as suspended by user. However, the mini 1.40 keeps on running, albeit slowly.

Two other tasks start to run, one at normal speed and the other slowly.

Obviously, one of the new tasks is running on its own in one core and the other new task is sharing the second core with mini 1.40.

I have never seen a core sharing before. Is this ok, or is this a problem? None of my other projects show any signs of this phenomenon.

Any ideas?



There have been a few problems that I and others have experienced with 1.4 tasks not suspending; that was mostly in the loopbuild tasks. I have found that you just have to exit boinc and restart it. you may also have to reboot your system, but that is probably last ditch. After one or both of these steps boinc mgr will act properly again.



I have tried all sorts of combinations including reboots but it recurs next time. It seems to happen with either suspending project or suspending task. However, suspending both can clear the problem until the next time.

(_KoDAk_) Profile

Joined: Jul 18 06
Posts: 109
ID: 100677
Credit: 1,859,263
RAC: 0
Message 57300 - Posted 28 Nov 2008 5:56:42 UTC

http://boinc.bakerlab.org/rosetta/result.php?resultid=210335636

ERROR: NANs occured in hbonding!
ERROR:: Exit from: ..\..\src\core\scoring\hbonds\hbonds_geom.cc line: 763
called boinc_finish

CPU time 39732.38 ((((((
____________

Alec Rosa

Joined: Nov 11 08
Posts: 18
ID: 287524
Credit: 2,635
RAC: 0
Message 57301 - Posted 28 Nov 2008 6:50:27 UTC

People said it -- I said it -- I insist:

Rosetta mini should be worked as a Beta project. It seems SO obvious!

We want to crunch Rosetta again. Start sending Rosetta Beta 5.xx WU again, please!

rochester new york Profile
Avatar

Joined: Jul 2 06
Posts: 2562
ID: 98229
Credit: 958,139
RAC: 127
Message 57302 - Posted 28 Nov 2008 6:58:34 UTC - in response to Message ID 57301.


http://boinc.bakerlab.org/rosetta/results.php?hostid=267483



People said it -- I said it -- I insist:

Rosetta mini should be worked as a Beta project. It seems SO obvious!

We want to crunch Rosetta again. Start sending Rosetta Beta 5.xx WU again, please!

A Few Good Men

Joined: Mar 25 07
Posts: 14
ID: 157915
Credit: 2,031,382
RAC: 23
Message 57303 - Posted 28 Nov 2008 7:08:48 UTC

Please send email to my account when an alternate to mini 1.40 test is available. Thanks in Advance.

FalconFly Profile
Avatar

Joined: Jan 11 08
Posts: 23
ID: 234757
Credit: 2,163,056
RAC: 0
Message 57304 - Posted 28 Nov 2008 7:34:35 UTC - in response to Message ID 57282.
Last modified: 28 Nov 2008 7:35:29 UTC

FalconFly, i noticed that you are crunching for LHC@home as well.
It might be that LHC@home is causing your crashes. I've had some crashes too this week. Next time it happens check your boinc.log file, the last message there, before SIGSEGV and the stack trace, is probably: [lhcathome] Scheduler request
A few weeks ago this has also been mentioned by several people in the LHC@home message boards.

AdeB


Darn, it seems you could be right on the spot with that. Nice catch!
I haven't seen any anomalies for >24hrs now, as the most recent batch of LHC WorkUnits have been processed.

Given the somewhat shaky state of LHC@Home, I'd say Rosetta is off the hook concerning my recent problems :)
____________

sarha1

Joined: Sep 23 05
Posts: 5
ID: 844
Credit: 6,339,735
RAC: 0
Message 57305 - Posted 28 Nov 2008 8:58:05 UTC

Validate error. WTH?
Extremely high claimed credit (100x more than expected).

http://boinc.bakerlab.org/rosetta/result.php?resultid=210214915
http://boinc.bakerlab.org/rosetta/result.php?resultid=210214913

Athlon 64 3200+ 1GB RAM WIN XP prof. SP3

Alec Rosa

Joined: Nov 11 08
Posts: 18
ID: 287524
Credit: 2,635
RAC: 0
Message 57307 - Posted 28 Nov 2008 11:24:06 UTC - in response to Message ID 57303.

Please send email to my account when an alternate to mini 1.40 test is available. Thanks in Advance.

I second that!

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57308 - Posted 28 Nov 2008 12:00:04 UTC - in response to Message ID 57307.

Please send email to my account when an alternate to mini 1.40 test is available. Thanks in Advance.

I second that!


i'll stay as my error rate is low, but i have to agree, the team needs to take and revamp all these tasks with stupid errors, such as NaNs, recovering checkpoints, and lockfile errors, along with all the other stupid problems that could be taken care of if they were tested on Ralph properly before being released to here.

the idea of Rosetta is research of proteins and not research of bad programming.

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57309 - Posted 28 Nov 2008 13:55:35 UTC

A workunit where my computer completed some models successfully without getting any credit:

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=191865519

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57310 - Posted 28 Nov 2008 13:59:18 UTC - in response to Message ID 57307.

Please send email to my account when an alternate to mini 1.40 test is available. Thanks in Advance.

I second that!


The problems seem to be mainly in workunits that use the new features, so an option to avoid getting any of the workunits using those features would be useful.

rochester new york Profile
Avatar

Joined: Jul 2 06
Posts: 2562
ID: 98229
Credit: 958,139
RAC: 127
Message 57312 - Posted 28 Nov 2008 14:14:18 UTC

am i doing something wrong here or what? http://boinc.bakerlab.org/rosetta/results.php?hostid=267483

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57315 - Posted 28 Nov 2008 14:51:37 UTC - in response to Message ID 57312.

am i doing something wrong here or what? http://boinc.bakerlab.org/rosetta/results.php?hostid=267483


nothing is wrong, other than you need to try out the stuff i pointed out about lockfiles in a previous message to you. if you give that a try it should clear up the problem.

the others, as i pointed out last time, seem to time out (10 days no processing or reporting) due to some unknown reason. too much work, not enough on time or cpu time being dedicated to rosetta, or just a rash of bad luck.

try solving the lockfile issue and then don't accept any new work until you have completed what you have in queue and when that is done then accept new work and see what results you have.

rochester new york Profile
Avatar

Joined: Jul 2 06
Posts: 2562
ID: 98229
Credit: 958,139
RAC: 127
Message 57317 - Posted 28 Nov 2008 15:37:26 UTC - in response to Message ID 57315.




i looked and can't find the procedure. what do i do?





am i doing something wrong here or what? http://boinc.bakerlab.org/rosetta/results.php?hostid=267483


nothing is wrong, other than you need to try out the stuff i pointed out about lockfiles in a previous message to you. if you give that a try it should clear up the problem.

the others, as i pointed out last time, seem to time out (10 days no processing or reporting) due to some unknown reason. too much work, not enough on time or cpu time being dedicated to rosetta, or just a rash of bad luck.

try solving the lockfile issue and then don't accept any new work until you have completed what you have in queue and when that is done then accept new work and see what results you have.

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57320 - Posted 28 Nov 2008 16:13:48 UTC - in response to Message ID 57317.

go here for my original post and go here for the boinc wiki description. here is where you will find the files you need to remove after you shut all boinc processes down: the lockfile is actually called boinc_lockfile, and it is in the boinc folder, then subfolder projects, and then subfolder slots.




i looked and can't find the procedure. what do i do?





am i doing something wrong here or what? http://boinc.bakerlab.org/rosetta/results.php?hostid=267483


nothing is wrong, other than you need to try out the stuff i pointed out about lockfiles in a previous message to you. if you give that a try it should clear up the problem.

the others, as i pointed out last time, seem to time out (10 days no processing or reporting) due to some unknown reason. too much work, not enough on time or cpu time being dedicated to rosetta, or just a rash of bad luck.

try solving the lockfile issue and then don't accept any new work until you have completed what you have in queue and when that is done then accept new work and see what results you have.


robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57328 - Posted 28 Nov 2008 16:59:18 UTC
Last modified: 28 Nov 2008 16:59:59 UTC

I just suspended BOINC entirely for my weekly antiviral and antispyware checks, then noticed that a rosetta@home workunit was still using CPU time on my computer:

http://boinc.bakerlab.org/rosetta/results.php?userid=264600

I then also suspended the rosetta@home project and that specific task; this didn't stop it from using CPU time. Since this is using only one core of my dual core PC, I'm going to try running the antiviral and antispyware programs as usual, even with that workunit still running.

11/28/2008 8:40:30 AM|rosetta@home|Starting 1shfA_BOINC_ABRELAX_SPLIT_SPLIT_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1shfA-_4844_644_1
11/28/2008 8:40:31 AM|rosetta@home|Starting task 1shfA_BOINC_ABRELAX_SPLIT_SPLIT_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1shfA-_4844_644_1 using minirosetta version 140

Alec Rosa

Joined: Nov 11 08
Posts: 18
ID: 287524
Credit: 2,635
RAC: 0
Message 57330 - Posted 28 Nov 2008 17:24:36 UTC - in response to Message ID 57308.

Please send email to my account when an alternate to mini 1.40 test is available. Thanks in Advance.

I second that!


i'll stay as my error rate is low, but i have to agree, the team needs to take and revamp all these tasks with stupid errors, such as Nan's and recovering checkpoints and lock file errors along with all the other stupid problems that could be taken care of if they were tested on Ralph properly before being released to here.

the idea of Rosetta is research of proteins and not research of bad programming.

Very well put.

So why do the project developers say nothing about this here?

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57334 - Posted 28 Nov 2008 20:25:53 UTC - in response to Message ID 57330.

Please send email to my account when an alternate to mini 1.40 test is available. Thanks in Advance.

I second that!


i'll stay as my error rate is low, but i have to agree, the team needs to take and revamp all these tasks with stupid errors, such as Nan's and recovering checkpoints and lock file errors along with all the other stupid problems that could be taken care of if they were tested on Ralph properly before being released to here.

the idea of Rosetta is research of proteins and not research of bad programming.

Very well put.

So why do the project developers say nothing about this here?


I suspect it's because they're too busy reading all the problem reports.

Do you think it would be enough to move just the workunits using the new features introduced in 1.39 and 1.40 back to Ralph, so they'd still have something for the rest of the participants to do until they fix the new problems?

Alec Rosa

Joined: Nov 11 08
Posts: 18
ID: 287524
Credit: 2,635
RAC: 0
Message 57337 - Posted 28 Nov 2008 22:42:06 UTC - in response to Message ID 57334.

Please send email to my account when an alternate to mini 1.40 test is available. Thanks in Advance.

I second that!


i'll stay as my error rate is low, but i have to agree, the team needs to take and revamp all these tasks with stupid errors, such as Nan's and recovering checkpoints and lock file errors along with all the other stupid problems that could be taken care of if they were tested on Ralph properly before being released to here.

the idea of Rosetta is research of proteins and not research of bad programming.

Very well put.

So why do the project developers say nothing about this here?


I suspect it's because they're too busy reading all the problem reports.

Do you think it would be enough to move just the workunits using the new features introduced in 1.39 and 1.40 back to Ralph, so they'd still have something for the rest of the participants to do until they fix the new problems?


Wouldn't that be best?

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57338 - Posted 28 Nov 2008 23:05:27 UTC - in response to Message ID 57337.

Please send email to my account when an alternate to mini 1.40 test is available. Thanks in Advance.

I second that!


i'll stay as my error rate is low, but i have to agree, the team needs to take and revamp all these tasks with stupid errors, such as Nan's and recovering checkpoints and lock file errors along with all the other stupid problems that could be taken care of if they were tested on Ralph properly before being released to here.

the idea of Rosetta is research of proteins and not research of bad programming.

Very well put.

So why do the project developers say nothing about this here?


I suspect it's because they're too busy reading all the problem reports.

Do you think it would be enough to move just the workunits using the new features introduced in 1.39 and 1.40 back to Ralph, so they'd still have something for the rest of the participants to do until they fix the new problems?


Wouldn't that be best?


I agree with you guys on this.

rochester new york Profile
Avatar

Joined: Jul 2 06
Posts: 2562
ID: 98229
Credit: 958,139
RAC: 127
Message 57341 - Posted 29 Nov 2008 1:50:11 UTC - in response to Message ID 57338.

this is getting too crazy. i'll give it 2 more days, disconnect, and then i'll be back in a couple weeks to see if this is back to working


Please send email to my account when an alternate to mini 1.40 test is available. Thanks in Advance.

I second that!


i'll stay as my error rate is low, but i have to agree, the team needs to take and revamp all these tasks with stupid errors, such as Nan's and recovering checkpoints and lock file errors along with all the other stupid problems that could be taken care of if they were tested on Ralph properly before being released to here.

the idea of Rosetta is research of proteins and not research of bad programming.

Very well put.

So why do the project developers say nothing about this here?


I suspect it's because they're too busy reading all the problem reports.

Do you think it would be enough to move just the workunits using the new features introduced in 1.39 and 1.40 back to Ralph, so they'd still have something for the rest of the participants to do until they fix the new problems?


Wouldn't that be best?


I agree with you guys on this.

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57342 - Posted 29 Nov 2008 2:24:08 UTC - in response to Message ID 57328.

I just suspended BOINC entirely for my weekly antiviral and antispyware checks, then noticed that a rosetta@home workunit was still using CPU time on my computer:

http://boinc.bakerlab.org/rosetta/results.php?userid=264600

I then also suspended the rosetta@home project and that specific task; this didn't stop it from using CPU time. Since this is using only one core of my dual core PC, I'm going to try running the antiviral and antispyware programs as usual, even with that workunit still running.

11/28/2008 8:40:30 AM|rosetta@home|Starting 1shfA_BOINC_ABRELAX_SPLIT_SPLIT_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1shfA-_4844_644_1
11/28/2008 8:40:31 AM|rosetta@home|Starting task 1shfA_BOINC_ABRELAX_SPLIT_SPLIT_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1shfA-_4844_644_1 using minirosetta version 140


The Ad-Aware 2008 program apparently ran correctly even with that workunit still running, without taking longer than usual. It found about twice as many cookies as usual, which makes me suspect that I forgot to run it last week. It was unable to remove all these cookies without restarting Vista - something which happens about half the time even when all workunits respond correctly to a suspend - so I let it restart Vista. Since I have to restart BOINC manually every time Vista restarts, I was then able to run the remaining antispyware programs and the antivirus program before restarting BOINC.

What filename should I expect for the cookie from Rosetta@home, so I can tell that program not to delete it?

When that workunit got a CPU core again, it repeated the same problem of continuing to run even after BOINC tries to give another workunit a turn on that CPU core.

I'm going to tell BOINC not to download any more Rosetta@home workunits until I have more time to watch for such behavior.

David Baker
Forum moderator
Project administrator
Project developer
Project scientist

Joined: Sep 17 05
Posts: 704
ID: 122
Credit: 559,847
RAC: 0
Message 57344 - Posted 29 Nov 2008 3:19:41 UTC

Very sorry about all the problems, we are working to fix them as fast as possible. One source of the problems is that we are now running a broader range of applications on rosetta@home so there are more sources of error. I do apologize for the problems; we have an absolute rule to check all work units first on ralph, but there are some errors which don't get caught this way. Our top priority now is to find the source of the problems and to fix them.
____________

Chu

Joined: Feb 23 06
Posts: 120
ID: 61076
Credit: 112,439
RAC: 0
Message 57345 - Posted 29 Nov 2008 3:19:47 UTC

Hi everyone,

the fix to the NAN hbonding problem will be included in the next update (probably after this weekend) and we are still investigating the problem of lockfile and that some WUs cannot be suspended. Sorry for the trouble and inconvenience and we will try our best to avoid such problems from happening on such a large scale in future.

Please continue to report other errors and problems that are not mentioned above.
____________

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57347 - Posted 29 Nov 2008 8:14:22 UTC

overlapping loop regions error
http://boinc.bakerlab.org/rosetta/result.php?resultid=210296907
cc_0_6_nocst_homo_bench_foldcst_chunk_general_t286__olange_IGNORE_THE_REST_1FXWF_7_4848_20_1
died at 13 secs
<core_client_version>6.2.19</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
recovering checkpoint of tag S_1FXWF_7_00000001 with id abrelax_rg_state
Loops::add_loop error -- overlapping loop regions
existing loop begin/end: 123/182
new loop begin/end: 182/202
ERROR:: Exit from: ..\..\src\protocols\loops\LoopClass.cc line: 233
called boinc_finish

</stderr_txt>
]]>
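
The two ranges in this log both include residue 182, which is presumably why the check fires. A minimal Python sketch of that kind of test, assuming loops are inclusive residue ranges (the name `loops_overlap` is hypothetical and this is not Rosetta's actual code):

```python
def loops_overlap(a_begin, a_end, b_begin, b_end):
    """Return True if two inclusive residue ranges share at least one residue."""
    return a_begin <= b_end and b_begin <= a_end


# Values taken from the stderr output above.
existing = (123, 182)
new = (182, 202)
print(loops_overlap(*existing, *new))  # True: both ranges include residue 182
```

Under that assumption, a new loop starting at 183 instead of 182 would not collide with the existing one.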

Evan

Joined: Dec 23 05
Posts: 268
ID: 42505
Credit: 402,585
RAC: 0
Message 57348 - Posted 29 Nov 2008 9:44:36 UTC

message 4366

This message from James describes among other matters some of the problems that are being solved on RALPH

Conan - those loop boundary errors were input errors by the person who submitted those workunits. The validate errors are the result of a new format added that's not yet supported by the BOINC server, and we'll have to update our server code to deal with it over the weekend. That slow workunit bug looks like something that we fixed several months ago, we've alerted the person who submitted those jobs and he's looking into them.





____________

rochester new york Profile
Avatar

Joined: Jul 2 06
Posts: 2562
ID: 98229
Credit: 958,139
RAC: 127
Message 57353 - Posted 29 Nov 2008 14:58:47 UTC

http://boinc.bakerlab.org/rosetta/result.php?resultid=210400827

Sid Celery

Joined: Feb 11 08
Posts: 796
ID: 241409
Credit: 9,546,016
RAC: 7,460
Message 57392 - Posted 1 Dec 2008 12:46:37 UTC - in response to Message ID 57345.

Hi everyone,

The fix to the NAN hbonding problem will be included in the next update (probably after this weekend) and we are still investigating the problem of lockfile and that some WUs cannot be suspended. Sorry for the trouble and inconvenience and we will try our best to avoid such problems from happening on such a large scale in future.

Please continue to report other errors and problems that are not mentioned above.

Thanks to you and David for the above comments.

As I'm out of work (just with results to upload) I'll take the opportunity to delete all the lockfiles again, as previously advised, and reset the project. Seems to me like the perfect opportunity.

I suggest others with similar problems do the same.
____________

Evan

Joined: Dec 23 05
Posts: 268
ID: 42505
Credit: 402,585
RAC: 0
Message 57393 - Posted 1 Dec 2008 12:55:30 UTC

I'll take the opportunity to delete all the lockfiles again, as previously advised, and reset the project. Seems to me like the perfect opportunity.

Make sure you upload your results before you reset, or you will lose everything.
____________

Alec Rosa

Joined: Nov 11 08
Posts: 18
ID: 287524
Credit: 2,635
RAC: 0
Message 57398 - Posted 1 Dec 2008 14:27:27 UTC

So, once again, the lock file thingy!

http://boinc.bakerlab.org/rosetta/result.php?resultid=211319613

Sid Celery

Joined: Feb 11 08
Posts: 796
ID: 241409
Credit: 9,546,016
RAC: 7,460
Message 57410 - Posted 1 Dec 2008 16:58:47 UTC - in response to Message ID 57393.
Last modified: 1 Dec 2008 16:59:26 UTC

I'll take the opportunity to delete all the lockfiles again, as previously advised, and reset the project. Seems to me like the perfect opportunity.

Make sure you upload your results before you reset, or you will lose everything.

Good point. I realised that just in time. I've set Boinc Manager not to get new WUs just yet and waiting for the upload to go through successfully before I reset.

Just noticed only 43k successes in the last 24hours. Some are obviously going through, but I don't know if there's a bottleneck or a problem receiving them on the Rosetta side. Nothing's going through for me yet.
____________

Mike Tyka

Joined: Oct 20 05
Posts: 96
ID: 5612
Credit: 2,190
RAC: 0
Message 57411 - Posted 1 Dec 2008 16:59:23 UTC - in response to Message ID 57392.


As I'm out of work (just with results to upload) I'll take the opportunity to delete all the lockfiles again, as previously advised, and reset the project. Seems to me like the perfect opportunity.



*All* the lock files ? Where do they accumulate ? Do they accumulate after every job ? Or only after failed ones ? This might be a leading thread to solving this silly lockfile problem!

Cheers, Mike


____________
http://beautifulproteins.blogspot.com/
http://www.miketyka.com/

Sid Celery

Joined: Feb 11 08
Posts: 796
ID: 241409
Credit: 9,546,016
RAC: 7,460
Message 57415 - Posted 1 Dec 2008 17:14:36 UTC - in response to Message ID 57411.

As I'm out of work (just with results to upload) I'll take the opportunity to delete all the lockfiles again, as previously advised, and reset the project. Seems to me like the perfect opportunity.


*All* the lock files ? Where do they accumulate ? Do they accumulate after every job ? Or only after failed ones ? This might be a leading thread to solving this silly lockfile problem!

In fact I spoke too soon before checking. The last time this was mentioned there were numerous 0-byte boinc_lockfiles in C:\ProgramData\BOINC\slots\0\ (and folders 1, 2, 3, 4 etc) - under Vista64 btw.

This time the slots folder was empty, so no lockfiles, even though I got many WUs with too many errors after repeated "Can't acquire lockfile" messages. I'd been away from home 11/27 to 11/30

See my results

Also, note this Validate error here

Server state Over
Outcome Validate error
Client state Done
Exit status 0 (0x0)

<core_client_version>6.2.19</core_client_version>
<![CDATA[
<stderr_txt>
# cpu_run_time_pref: 7200
======================================================
DONE :: 1 starting structures 4470.93 cpu seconds
This process generated 1 decoys from 1 attempts
======================================================

BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...
called boinc_finish
Can't set up shared mem: -1
Will run in standalone mode.
# cpu_run_time_pref: 7200
Can't acquire lockfile - exiting

</stderr_txt>
]]>

I don't think I've noticed this particular one before.
____________

Alec Rosa

Joined: Nov 11 08
Posts: 18
ID: 287524
Credit: 2,635
RAC: 0
Message 57417 - Posted 1 Dec 2008 17:40:44 UTC - in response to Message ID 57410.
Last modified: 1 Dec 2008 17:41:45 UTC

I'll take the opportunity to delete all the lockfiles again, as previously advised, and reset the project. Seems to me like the perfect opportunity.

Make sure you upload your results before you reset, or you will lose everything.

Good point. I realised that just in time. I've set Boinc Manager not to get new WUs just yet and waiting for the upload to go through successfully before I reset.

That was wise. What I've been doing is to set Boinc Manager not to get new tasks too. I then click 'Update', so that the client communicates with the Rosetta server(s). Finally, to avoid doing something wrong, I boot the computer. That makes the Rosetta slots disappear (and with them the lock file(s)). Like magic.

Of course, when I allow a new WU to be downloaded, the process is fraked up. Again.

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57418 - Posted 1 Dec 2008 17:46:01 UTC - in response to Message ID 57410.

Just noticed only 43k successes in the last 24hours. Some are obviously going through, but I don't know if there's a bottleneck or a problem receiving them on the Rosetta side. Nothing's going through for me yet.


The uploads server hasn't caught up with uploading all the results from all the workunits that completed during the recent fileserver problem. If you have enough free disk space to hold the results, and have told BOINC it can use enough of it that Rosetta@home's share will hold a few day's worth of results, all you should really have to do is wait for the uploads server to catch up.

P . P . L .
Avatar

Joined: Aug 20 06
Posts: 581
ID: 105843
Credit: 4,864,105
RAC: 0
Message 57453 - Posted 2 Dec 2008 6:43:23 UTC

Hi .

I've got two more of these, they don't want to stop when preempted.


1tig__BOINC_ABRELAX_SPLIT_SPLIT2_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1tig_-_4845_1488_0

1c9oA_BOINC_ABRELAX_SPLIT_SPLIT_IGNORE_THE_REST-S25-9-S3-3--1c9oA-_4678_404_1

pete.




____________


ramostol

Joined: Feb 6 07
Posts: 64
ID: 145835
Credit: 584,052
RAC: 0
Message 57460 - Posted 2 Dec 2008 9:50:03 UTC

My (MacBook) "abinitio_nohomfrag_70_A_1unpA_4466"-tasks show a failure rate of three out of four, all failures terminate after some hours' computing with finishing file absent.

Cannot link to a result, as I am unable to report to the project at the moment.

[B^S] HenryHunter Profile

Joined: May 28 08
Posts: 1
ID: 261511
Credit: 72,915
RAC: 0
Message 57464 - Posted 2 Dec 2008 11:35:41 UTC - in response to Message ID 56741.

Please report any bugs in this version here.

Sarel.


02.12.2008 04:58:10|rosetta@home|Message from server: Server error: can't attach shared memory
any solution?
CU

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57465 - Posted 2 Dec 2008 11:38:24 UTC - in response to Message ID 57464.

Please report any bugs in this version here.

Sarel.


02.12.2008 04:58:10|rosetta@home|Message from server: Server error: can't attach shared memory
any solution?
CU


see here if things do not resolve themselves automatically.
The team created a new server for task processing as the main server was getting overloaded. The address has changed, but should correct automatically. if not see the link.

mikylinux

Joined: Jul 25 07
Posts: 3
ID: 193561
Credit: 73,155
RAC: 0
Message 57466 - Posted 2 Dec 2008 11:46:07 UTC
Last modified: 2 Dec 2008 11:51:35 UTC

http://boinc.bakerlab.org/rosetta/result.php?resultid=211014218
http://boinc.bakerlab.org/rosetta/result.php?resultid=208363538
http://boinc.bakerlab.org/rosetta/result.php?resultid=208319555
http://boinc.bakerlab.org/rosetta/result.php?resultid=206052369



And workunits:
http://boinc.bakerlab.org/rosetta/result.php?resultid=209971190
and
http://boinc.bakerlab.org/rosetta/result.php?resultid=210257656

have been running for 37 and 19 hours.... I'll wait a bit and then stop the tasks....

upstatelabs

Joined: Jun 22 06
Posts: 10
ID: 96397
Credit: 516,767
RAC: 0
Message 57469 - Posted 2 Dec 2008 13:14:46 UTC

I have a pair of errors to report:

12/1/2008 11:07:42 PM|rosetta@home|Task 1vie__BOINC_ABRELAX_SPLIT_SPLIT2_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1vie_-_4845_1494_0 exited with zero status but no 'finished' file
12/1/2008 11:07:42 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
12/1/2008 11:07:43 PM|rosetta@home|Restarting task 1vie__BOINC_ABRELAX_SPLIT_SPLIT2_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1vie_-_4845_1494_0 using minirosetta version 140
12/1/2008 11:08:24 PM|rosetta@home|Task 1vie__BOINC_ABRELAX_SPLIT_SPLIT2_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1vie_-_4845_1494_0 exited with zero status but no 'finished' file
12/1/2008 11:08:24 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
12/1/2008 11:08:24 PM|rosetta@home|Restarting task 1vie__BOINC_ABRELAX_SPLIT_SPLIT2_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1vie_-_4845_1494_0 using minirosetta version 140
12/1/2008 11:09:05 PM|rosetta@home|Task 1vie__BOINC_ABRELAX_SPLIT_SPLIT2_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1vie_-_4845_1494_0 exited with zero status but no 'finished' file

Above repeating ~50 times.
And this:

12/2/2008 5:19:27 AM|rosetta@home|Task 1vie__BOINC_ABRELAX_SPLIT_SPLIT2_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1vie_-_4845_1471_0 exited with zero status but no 'finished' file
12/2/2008 5:19:27 AM|rosetta@home|If this happens repeatedly you may need to reset the project.
12/2/2008 5:19:27 AM|rosetta@home|Restarting task 1vie__BOINC_ABRELAX_SPLIT_SPLIT2_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1vie_-_4845_1471_0 using minirosetta version 140
12/2/2008 5:20:08 AM|rosetta@home|Task 1vie__BOINC_ABRELAX_SPLIT_SPLIT2_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1vie_-_4845_1471_0 exited with zero status but no 'finished' file
12/2/2008 5:20:08 AM|rosetta@home|If this happens repeatedly you may need to reset the project.
12/2/2008 5:20:08 AM|rosetta@home|Restarting task 1vie__BOINC_ABRELAX_SPLIT_SPLIT2_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1vie_-_4845_1471_0 using minirosetta version 140

Again repeating many times.
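
For anyone who wants to count these restarts rather than eyeball them, here is a rough Python sketch. It assumes the messages were saved as text in the `timestamp|project|message` layout shown above; the function name `count_unfinished_exits` is made up for illustration and is not part of BOINC.

```python
import re
from collections import Counter

# Matches the "exited with zero status" message and captures the task name.
PATTERN = re.compile(r"Task (\S+) exited with zero status but no 'finished' file")


def count_unfinished_exits(log_lines):
    """Count, per task name, how often an exit without a 'finished' file was logged."""
    counts = Counter()
    for line in log_lines:
        # Each line is expected to look like: timestamp|project|message
        parts = line.split("|", 2)
        if len(parts) != 3:
            continue  # skip anything that is not a three-field log line
        match = PATTERN.search(parts[2])
        if match:
            counts[match.group(1)] += 1
    return counts


sample = [
    "12/2/2008 5:19:27 AM|rosetta@home|Task 1vie__BOINC_ABRELAX_SPLIT_SPLIT2_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1vie_-_4845_1471_0 exited with zero status but no 'finished' file",
    "12/2/2008 5:19:27 AM|rosetta@home|If this happens repeatedly you may need to reset the project.",
    "12/2/2008 5:20:08 AM|rosetta@home|Task 1vie__BOINC_ABRELAX_SPLIT_SPLIT2_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1vie_-_4845_1471_0 exited with zero status but no 'finished' file",
]
for task, n in count_unfinished_exits(sample).items():
    print(n, task)
```

A per-task count like this might make it easier to tell one stuck workunit from a problem that hits every task on the host.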

Could someone look into this?

Thanks!

____________

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57471 - Posted 2 Dec 2008 14:29:53 UTC - in response to Message ID 57469.
Last modified: 2 Dec 2008 14:31:01 UTC

I have a pair of errors to report:

12/1/2008 11:07:42 PM|rosetta@home|Task 1vie__BOINC_ABRELAX_SPLIT_SPLIT2_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1vie_-_4845_1494_0 exited with zero status but no 'finished' file
12/1/2008 11:07:42 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
12/1/2008 11:07:43 PM|rosetta@home|Restarting task 1vie__BOINC_ABRELAX_SPLIT_SPLIT2_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1vie_-_4845_1494_0 using minirosetta version 140
12/1/2008 11:08:24 PM|rosetta@home|Task 1vie__BOINC_ABRELAX_SPLIT_SPLIT2_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1vie_-_4845_1494_0 exited with zero status but no 'finished' file
12/1/2008 11:08:24 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
12/1/2008 11:08:24 PM|rosetta@home|Restarting task 1vie__BOINC_ABRELAX_SPLIT_SPLIT2_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1vie_-_4845_1494_0 using minirosetta version 140
12/1/2008 11:09:05 PM|rosetta@home|Task 1vie__BOINC_ABRELAX_SPLIT_SPLIT2_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1vie_-_4845_1494_0 exited with zero status but no 'finished' file

Above repeating ~50 times.
And this:

12/2/2008 5:19:27 AM|rosetta@home|Task 1vie__BOINC_ABRELAX_SPLIT_SPLIT2_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1vie_-_4845_1471_0 exited with zero status but no 'finished' file
12/2/2008 5:19:27 AM|rosetta@home|If this happens repeatedly you may need to reset the project.
12/2/2008 5:19:27 AM|rosetta@home|Restarting task 1vie__BOINC_ABRELAX_SPLIT_SPLIT2_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1vie_-_4845_1471_0 using minirosetta version 140
12/2/2008 5:20:08 AM|rosetta@home|Task 1vie__BOINC_ABRELAX_SPLIT_SPLIT2_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1vie_-_4845_1471_0 exited with zero status but no 'finished' file
12/2/2008 5:20:08 AM|rosetta@home|If this happens repeatedly you may need to reset the project.
12/2/2008 5:20:08 AM|rosetta@home|Restarting task 1vie__BOINC_ABRELAX_SPLIT_SPLIT2_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1vie_-_4845_1471_0 using minirosetta version 140

Again repeating many times.

Could someone look into this?

Thanks!


could you post the links either in plain text or in a link so people can look directly at the files you're talking about? because you have two systems on rosetta it would take quite a long time to isolate the tasks you are talking about.

Sid Celery

Joined: Feb 11 08
Posts: 796
ID: 241409
Credit: 9,546,016
RAC: 7,460
Message 57504 - Posted 2 Dec 2008 18:53:31 UTC - in response to Message ID 57415.

As I'm out of work (just with results to upload) I'll take the opportunity to delete all the lockfiles again, as previously advised, and reset the project. Seems to me like the perfect opportunity.

*All* the lock files ? Where do they accumulate ? Do they accumulate after every job ? Or only after failed ones ? This might be a leading thread to solving this silly lockfile problem!

In fact I spoke too soon before checking. The last time this was mentioned there were numerous 0-byte boinc_lockfiles in C:\ProgramData\BOINC\slots\0\ (and folders 1, 2, 3, 4 etc) - under Vista64 btw.

This time the slots folder was empty, so no lockfiles, even though I got many WUs with too many errors after repeated "Can't acquire lockfile" messages. I'd been away from home 11/27 to 11/30

See my results

Final note, because I'm now officially depressed:
After uploading all previous results, changing server urls, resetting the project, dl'ing new WUs, my first 4 MiniRosetta WUs all crashed out in the usual way between 10 and 100 minutes. Can't acquire lockfile.

I now have 7 folders inside the \slots\ folder (named 0, 1, 2, 3, 4, 5 & 6) four of which contain a 0-byte boinc_lockfile, while only 2 mini-rosetta WUs are currently running.
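
A small Python sketch for listing those files, for anyone who wants to check their own machine. It only reports what it finds and deletes nothing. The default path is an assumption based on the Vista64 location mentioned earlier in this thread, and `find_lockfiles` is a hypothetical helper name.

```python
from pathlib import Path


def find_lockfiles(slots_dir):
    """Return (path, size_in_bytes) for every boinc_lockfile one level below slots_dir."""
    root = Path(slots_dir)
    if not root.is_dir():
        return []  # nothing to report if the slots folder does not exist
    found = []
    for slot in sorted(root.iterdir()):
        lockfile = slot / "boinc_lockfile"
        if lockfile.is_file():
            found.append((lockfile, lockfile.stat().st_size))
    return found


if __name__ == "__main__":
    # Assumed location; adjust to wherever your BOINC data directory lives.
    for path, size in find_lockfiles(r"C:\ProgramData\BOINC\slots"):
        print(f"{path} ({size} bytes)")
```

Comparing the slot folders it lists against the tasks actually running would show which lockfiles are left over from earlier tasks.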

I guess I should've let those WUs abort with the usual Computation Error so they could report properly, but I was that p'd off I aborted them to let some infallible Rosetta 5.98 WUs run.
____________

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57506 - Posted 2 Dec 2008 20:38:54 UTC

just delete those lockfiles and you should be able to get back on your way again.
hopefully the new work you get will not contain these problems.
i saw awhile back that they were going to look into that problem and fix it.

Evan

Joined: Dec 23 05
Posts: 268
ID: 42505
Credit: 402,585
RAC: 0
Message 57511 - Posted 2 Dec 2008 21:04:43 UTC

Instant remedy for getting out of lockfile depression - have a change of scenery - go over to RALPH - about 30,000 work units at last count ready to send!
____________

A Few Good Men

Joined: Mar 25 07
Posts: 14
ID: 157915
Credit: 2,031,382
RAC: 23
Message 57544 - Posted 3 Dec 2008 14:09:01 UTC

Result task id's for last 12 hours of Rosetta after resetting client.

All Client Errors

211601578
211514071
211514058
211512399
211512310
211512309
211512308
211512306
211512305

Compute Errors

211522797
211514072

Please Advise.

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57547 - Posted 3 Dec 2008 15:08:06 UTC - in response to Message ID 57544.

Result task id's for last 12 hours of Rosetta after resetting client.

All Client Errors

211601578
211514071
211514058
211512399
211512310
211512309
211512308
211512306
211512305

Compute Errors

211522797
211514072

Please Advise.



looks like it's a whole load of defective tasks. 2 different systems bombed them.
it is also possible that if your system is being OC'd that your speed is too fast for rosetta to handle. I was working with my OC percentage last night and crashed a whole bunch. Some of the tasks were successful with other users and some of them crashed again. Keep an eye on your current tasks and see if they crash with the same kind of error code. If you're running OC'd, lower your speed a little bit to see where the threshold is for Rosetta. 5-10 MHz can make the difference between a success and a crash.

A Few Good Men

Joined: Mar 25 07
Posts: 14
ID: 157915
Credit: 2,031,382
RAC: 23
Message 57550 - Posted 3 Dec 2008 15:49:06 UTC

I'll do a run at stock cpu, ram and fsb values. Thanks.

Dave Mickey

Joined: Dec 29 07
Posts: 33
ID: 231007
Credit: 4,136,957
RAC: 0
Message 57577 - Posted 4 Dec 2008 2:43:36 UTC

Just another data point - still have 1.40 tasks that
do not respond to BOINCs command to suspend.

this is not fixed yet.

Dave

DJStarfox

Joined: Jul 19 07
Posts: 140
ID: 191721
Credit: 560,560
RAC: 21
Message 57578 - Posted 4 Dec 2008 4:42:23 UTC

This is just ridiculous. Rosetta Mini 1.40 on Linux does NOT obey the BOINC API to suspend the task. I think it does this whenever it's creating the first decoy in the simulation.

Other than a dedicated server, this makes it really hard to let Rosetta run on a workstation box. No new work for me until you fix this and I hear back.

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57580 - Posted 4 Dec 2008 4:58:15 UTC - in response to Message ID 57578.

This is just ridiculous. Rosetta Mini 1.40 on Linux does NOT obey the BOINC API to suspend the task. I think it does this whenever it's creating the first decoy in the simulation.

Other than a dedicated server, this makes it really hard to let Rosetta run on a workstation box. No new work for me until you fix this and I hear back.


Suggestion of how to handle at least part of a fix: Allow it to suspend even during the first decoy if the leave in memory option is selected, as long as paging to the swapfile won't hurt.

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57581 - Posted 4 Dec 2008 9:00:45 UTC

What's with all this text that shows up in the stderr output?

recovering checkpoint of tag S_U12X5X_00000001 with id abrelax_rg_state
recovering checkpoint of tag S_U12X5X_00000001 with id stage_1
recovering checkpoint of tag S_U12X5X_00000001 with id stage_2


This keeps showing up in a lot of tasks. The tasks complete OK.

Tony Profile

Joined: Dec 12 05
Posts: 7
ID: 35547
Credit: 6,724,341
RAC: 598
Message 57586 - Posted 4 Dec 2008 15:58:24 UTC

I think it is not only on Linux that minirosetta doesn't suspend. It seems to be like an unruly child that will not mind. In Windows, start the Task Manager to see all running processes and sort by CPU usage. Some of the processes seem to obey, but some keep running after a snooze or suspend. A restart seems to make it behave. I think it may be errors that will not let it stop the running task. I seem to be having lots of errors on three different computers I just started crunching with.
____________

Tony Profile

Joined: Dec 12 05
Posts: 7
ID: 35547
Credit: 6,724,341
RAC: 598
Message 57587 - Posted 4 Dec 2008 16:21:04 UTC - in response to Message ID 57586.

Mostly problems with a new computer I just built. It is not overclocked but is running Vista Ultimate 64-bit with 8 GB of memory and an AMD 9950 processor.
____________

Tony Profile

Joined: Dec 12 05
Posts: 7
ID: 35547
Credit: 6,724,341
RAC: 598
Message 57588 - Posted 4 Dec 2008 17:18:45 UTC - in response to Message ID 57587.

This is on an older computer.

12/4/2008 12:08:18 PM||Suspending computation - user is active
12/4/2008 12:08:18 PM||Suspending network activity - user is active
12/4/2008 12:08:35 PM|rosetta@home|Task cc_0_8_nocst4_homo_bench_foldcst_chunk_general_t303__olange_IGNORE_THE_REST_2GO7A_7_5161_15_0 exited with zero status but no 'finished' file
12/4/2008 12:08:35 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
This is repeated many times.

With this message, the task still seems to be running even though BOINC says computation is suspended.
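
For anyone who wants to check their own message log for the same pattern, here is a minimal Python sketch. It assumes the log was saved as text in the `timestamp|project|message` layout shown above. The helper name `events_after_suspend` and the 60-second window are my own choices for illustration, not anything BOINC provides.

```python
from datetime import datetime, timedelta

# Timestamp layout as it appears in the log lines above
# (assumed: US-style date, 12-hour clock).
STAMP = "%m/%d/%Y %I:%M:%S %p"

def events_after_suspend(lines, window_seconds=60):
    """Return (delay, project, message) for project messages logged
    shortly after a 'Suspending computation' line (hypothetical helper)."""
    suspended_at = None
    hits = []
    for line in lines:
        parts = line.strip().split("|", 2)
        if len(parts) != 3:
            continue  # not a log line, e.g. a comment typed by the poster
        when = datetime.strptime(parts[0], STAMP)
        project, message = parts[1], parts[2]
        if message.startswith("Suspending computation"):
            suspended_at = when
        elif project and suspended_at is not None:
            delay = when - suspended_at
            if delay <= timedelta(seconds=window_seconds):
                hits.append((delay, project, message))
    return hits

sample = [
    "12/4/2008 12:08:18 PM||Suspending computation - user is active",
    "12/4/2008 12:08:18 PM||Suspending network activity - user is active",
    "12/4/2008 12:08:35 PM|rosetta@home|Task cc_0_8_nocst4_homo_bench_foldcst_chunk_general_t303__olange_IGNORE_THE_REST_2GO7A_7_5161_15_0 exited with zero status but no 'finished' file",
    "12/4/2008 12:08:35 PM|rosetta@home|If this happens repeatedly you may need to reset the project.",
    "This is repeated many times.",
]

hits = events_after_suspend(sample)
for delay, project, message in hits:
    print(int(delay.total_seconds()), project, message[:40])
```

On the lines above, this reports the two rosetta@home messages that arrived 17 seconds after the suspend. It only shows that the client logged something about the task after suspending; it cannot tell you whether the process was really still using CPU.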

____________

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57591 - Posted 4 Dec 2008 19:02:43 UTC - in response to Message ID 57586.

I think it is not only on Linux that minirosetta doesn't suspend. It seems to be like an unruly child that will not mind. In Windows, start the Task Manager to see all running processes and sort by CPU usage. Some of the processes seem to obey, but some keep running after a snooze or suspend. A restart seems to make it behave. I think it may be errors that will not let it stop the running task. I seem to be having lots of errors on three different computers I just started crunching with.


Under 32-bit Windows Vista SP1, my results indicate that the suspend problem occurs under Vista also, but not in all workunits. I suspect that it's only in workunits that use the new features added under minirosetta 1.39 and 1.40, and not even all of those. I would like to see Rosetta@home add the option to select which types of workunits a particular computer gets, in order to avoid some of the more problematic new types.

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57592 - Posted 4 Dec 2008 19:22:47 UTC - in response to Message ID 57588.

This is on an older computer.

12/4/2008 12:08:18 PM||Suspending computation - user is active
12/4/2008 12:08:18 PM||Suspending network activity - user is active
12/4/2008 12:08:35 PM|rosetta@home|Task cc_0_8_nocst4_homo_bench_foldcst_chunk_general_t303__olange_IGNORE_THE_REST_2GO7A_7_5161_15_0 exited with zero status but no 'finished' file
12/4/2008 12:08:35 PM|rosetta@home|If this happens repeatedly you may need to reset the project.
This is repeated many times.

With this message, the task still seems to be running even though BOINC says computation is suspended.


The first part of that seems likely for workunits that go for a long time between checkpoints on machines that don't have enough memory to allow the workunit to stay in memory, and don't allow BOINC to use enough disk space and swap file space to save the current contents of the memory during user interruptions.

For my computer, about US $50 worth of added memory put it up to the maximum amount of memory that model of computer can handle.

mfbabb2

Joined: Oct 10 08
Posts: 4
ID: 283282
Credit: 10,345
RAC: 0
Message 57594 - Posted 4 Dec 2008 19:28:14 UTC

Running on Vista w/SP1:

Computation Error and no apparent progress.

Project has been reset several times. Rosetta used to work.

12/4/2008 11:57:10 AM|rosetta@home|Restarting task cc_1_0_nocst4_homo_bench_foldcst_chunk_general_t364__olange_IGNORE_THE_REST_1S5UA_5_5206_5_0 using minirosetta version 140
12/4/2008 11:57:51 AM|rosetta@home|Task cc_1_0_nocst4_homo_bench_foldcst_chunk_general_t364__olange_IGNORE_THE_REST_1S5UA_5_5206_5_0 exited with zero status but no 'finished' file
12/4/2008 11:57:51 AM|rosetta@home|If this happens repeatedly you may need to reset the project.

____________

Mod.Sense
Forum moderator
Project administrator

Joined: Aug 22 06
Posts: 3381
ID: 106194
Credit: 0
RAC: 0
Message 57598 - Posted 4 Dec 2008 19:53:10 UTC

The version being tested now on Ralph is 1.45. I'm pretty sure the issue with tasks not suspending when BOINC tells them to has been resolved. Hopefully coming very soon to Rosetta.
____________
Rosetta Moderator: Mod.Sense

Ma3threeX

Joined: Aug 22 08
Posts: 3
ID: 274775
Credit: 347,217
RAC: 0
Message 57604 - Posted 4 Dec 2008 21:52:06 UTC

I don't know if this is the best thread for it, but... whatever.

I now have at least 8 WUs that are 100% crunched and uploaded, but they don't disappear from the list; it seems like they are waiting for something. I also get a message from the Rosetta server: "Cant attach shared Memory"

Does anybody know the problem?

greetings
Ma3threeX

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57607 - Posted 4 Dec 2008 22:13:33 UTC - in response to Message ID 57604.

The team moved the server to a new address. Just let BOINC Manager sort it out. It needs to get the new info from the new master file. Some guys are hitting update 10 times to get to the new master file, but the team says just let the program take its course; it will self-correct.



Nicolai

Joined: Jun 21 08
Posts: 1
ID: 265223
Credit: 142,530
RAC: 0
Message 57632 - Posted 5 Dec 2008 20:00:45 UTC - in response to Message ID 57594.

Running on Vista w/SP1:

Computation Error and no apparent progress.

Project has been reset several times. Rosetta used to work.

12/4/2008 11:57:10 AM|rosetta@home|Restarting task cc_1_0_nocst4_homo_bench_foldcst_chunk_general_t364__olange_IGNORE_THE_REST_1S5UA_5_5206_5_0 using minirosetta version 140
12/4/2008 11:57:51 AM|rosetta@home|Task cc_1_0_nocst4_homo_bench_foldcst_chunk_general_t364__olange_IGNORE_THE_REST_1S5UA_5_5206_5_0 exited with zero status but no 'finished' file
12/4/2008 11:57:51 AM|rosetta@home|If this happens repeatedly you may need to reset the project.



I have been having the same problem for more than a while now...

Alec Rosa

Joined: Nov 11 08
Posts: 18
ID: 287524
Credit: 2,635
RAC: 0
Message 57633 - Posted 5 Dec 2008 20:06:43 UTC

I don't care anymore! Version 1.45 works! (For now anyway.)
Yay.

bluelady9

Joined: Nov 7 08
Posts: 1
ID: 287053
Credit: 10,331
RAC: 0
Message 57637 - Posted 5 Dec 2008 22:56:27 UTC

Hi
I am getting this message constantly on the rosetta project currently running:
restarting task cc_3_5_nocst4_homo_bench_foldst_chunk_general_t368_olange__IGNORE_THE_REST_1NADA_4_5376_10_0_usingminirosettaversion140

over and over again, I have over 60 of these messages one after the other in the messages window. What's going on? Should I just reset the whole thing? I'd appreciate any help with this. Thank you.

Mike Tyka

Joined: Oct 20 05
Posts: 96
ID: 5612
Credit: 2,190
RAC: 0
Message 57642 - Posted 6 Dec 2008 2:19:05 UTC - in response to Message ID 57633.

I don't care anymore! Version 1.45 works! (For now anyway.)
Yay.


Well awesome! That's what we like to hear :) Keep us posted if you see any issues.
I know there are still issues with 1.45 to do with the graphics, which we'll address in future updates, but they are cosmetic, and the errors concerning lockfiles, validator errors, etc. had priority. Stability first.

Thanks for crunching!

Mike

____________
http://beautifulproteins.blogspot.com/
http://www.miketyka.com/

rochester new york Profile
Avatar

Joined: Jul 2 06
Posts: 2562
ID: 98229
Credit: 958,139
RAC: 127
Message 57660 - Posted 6 Dec 2008 20:21:01 UTC

http://boinc.bakerlab.org/rosetta/results.php?hostid=267483&offset=20

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57661 - Posted 6 Dec 2008 20:51:20 UTC - in response to Message ID 57660.

http://boinc.bakerlab.org/rosetta/results.php?hostid=267483&offset=20


More detached clients? Have you closed BOINC Manager and restarted it or your system since this started? The only other thing I would suggest is to NOT accept any new work and then reset the project after all the current work has finished and reported.

gabberattack (johnny, eriq, segfault, r2k4, bully, sifon) Profile

Joined: Sep 27 05
Posts: 12
ID: 1341
Credit: 4,598,823
RAC: 181
Message 57666 - Posted 7 Dec 2008 5:37:55 UTC
Last modified: 7 Dec 2008 5:38:14 UTC

iMac Intel Core2Duo 1,86 GHz, OSX 10.5.5, BOINC 5.10.45, CPU use limited to 75%.

This WU http://boinc.bakerlab.org/rosetta/result.php?resultid=211924207 and this one http://boinc.bakerlab.org/rosetta/result.php?resultid=211927271 have been running for 18 and 25 hours already using Minirosetta 1.40. Should I reset the project or let them run? Progress is frozen at 99.079% and 99.353%; remaining time shows 9 minutes 56 seconds on both WUs.
____________

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57669 - Posted 7 Dec 2008 8:56:57 UTC - in response to Message ID 57666.


That type of apparently frozen progress is typical when you get a minirosetta workunit that takes significantly more CPU time than predicted. I got one with a predicted time of 6 CPU hours; it actually took 19.5. What predicted length of workunits have you asked for, or have you left it as the default? If I remember correctly, the default has recently been raised to 6 CPU hours. I'd let it run a few more hours before doing anything.

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57671 - Posted 7 Dec 2008 9:22:12 UTC
Last modified: 7 Dec 2008 9:29:05 UTC

http://boinc.bakerlab.org/rosetta/result.php?resultid=211774333
1scjB_BOINC_ABRELAX_SPLIT_CONTROL_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1scjB-_4846_1647_0
Outcome: Client error
Client state: Compute error
Exit status: -1073741819 (0xc0000005)
CPU time: 14839.63

This ran 4 out of 6 hrs and crashed. No credit. That sucks big time.
I noticed this task has not been reassigned to anyone.

---------

Same with this older task:
it ran for a good length of time and then crashed.

http://boinc.bakerlab.org/rosetta/result.php?resultid=211601090
h014__BOINC_ABRELAX_RANGE_yebf_IGNORE_THE_REST-S25-10-S3-3--h014_-_4675_302_0
Outcome: Client error
Client state: Compute error
Exit status: -1073741819 (0xc0000005)
CPU time: 17704.41

--------

Older still, with the same error code:
http://boinc.bakerlab.org/rosetta/result.php?resultid=211595747
1louA_BOINC_ABRELAX_SPLIT_SPLIT2_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1louA-_4845_1323_0
Outcome: Client error
Client state: Compute error
Exit status: -1073741819 (0xc0000005)
CPU time: 3769.953
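
A side note on reading that exit status: the negative number and the hex value in parentheses are the same 32-bit pattern shown two ways. A small Python sketch of the conversion follows; the function name `to_unsigned32` is just an illustrative helper. On Windows, 0xC0000005 is the status code for an access violation, meaning the program touched memory it was not allowed to, which fits a crash but does not say what caused it.

```python
def to_unsigned32(status):
    """Reinterpret a signed 32-bit exit status as an unsigned value."""
    return status & 0xFFFFFFFF

exit_status = -1073741819          # as shown on the result pages above
code = to_unsigned32(exit_status)

print(hex(code))                   # 0xc0000005
```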

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57676 - Posted 7 Dec 2008 13:01:22 UTC

Task ID 211937560
Name cc_0_8_nocst4_homo_bench_foldcst_chunk_general_t362__olange_IGNORE_THE_REST_2GF6A_2_5176_11_0

This shows in the BOINC Manager as waiting to run while there is 1 Rosetta and 1 Einstein task already running. The CPU time is running and, when clicking on the graphics, the graphics sequence is also running.

How is this possible? I have only 2 cores. (Note: as I am writing this, it has switched to running and the Einstein task has gone to waiting.)

gabberattack (johnny, eriq, segfault, r2k4, bully, sifon) Profile

Joined: Sep 27 05
Posts: 12
ID: 1341
Credit: 4,598,823
RAC: 181
Message 57679 - Posted 7 Dec 2008 14:49:40 UTC - in response to Message ID 57669.


Update: now the WUs show 22 and 29,5 hours, and progress is at 99.245 and 99.441 respectively, so it is progressing but very, very slowly. Remaining time still shows 9:56 and 9:57 to completion. The iMac seems to be a bit unstable and sometimes does not respond for a couple of seconds, but I'll let it run and see how far it can get. I had not selected any predicted time or any graphics CPU time, so it should be at the lowest numbers (3 hours, 10% graphics).
____________

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57692 - Posted 8 Dec 2008 0:54:33 UTC - in response to Message ID 57676.

Task ID 211937560
Name cc_0_8_nocst4_homo_bench_foldcst_chunk_general_t362__olange_IGNORE_THE_REST_2GF6A_2_5176_11_0

This shows in the BOINC Manager as waiting to run while there is 1 Rosetta and 1 Einstein task already running. The CPU time is running and, when clicking on the graphics, the graphics sequence is also running.

How is this possible? I have only 2 cores. (Note: as I am writing this, it has switched to running and the Einstein task has gone to waiting.)


Minirosetta 1.40 sometimes fails to suspend when it comes time for it to yield a timeslice to some other project, and then ends up sharing that CPU core with the other project during the next timeslice and only getting half of the CPU time it thinks it is getting. Expect the einstein workunit to have only gotten half the CPU time it thinks it got during that timeslice also. You can hope to get some minirosetta 1.45 workunits soon, which are more likely to be problem-free.

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57693 - Posted 8 Dec 2008 1:19:52 UTC - in response to Message ID 57679.


Since you have not selected any predicted time, it should be the default instead. You still have the option of selecting 3 hours instead, if that is what you prefer.

I'd let the workunits run for about 5 times the selection of the predicted time when you got those workunits, to see if the automatic cutoff of workunits that take too long is able to work for those workunits. For you, that should be about 30 hours CPU time, but even more wall clock time. I'd then abort them, after making sure that I had reported which workunits this happened to.

Under Vista SP1, I had enough free disk space that I was able to improve performance by telling BOINC that it could use more disk space and a higher percentage of the swap space. I have no idea whether that will work on an iMac also. Months ago, I was able to improve performance by adding more RAM to my computer, but I'm now at the limit this model of computer can handle. Also, note that on machines with a 32-bit operating system, such as most of those sold with less than 4 GB of RAM already installed, 4 GB or less is the limit of what you can use even if more is installed; you need to switch to a 64-bit operating system to get beyond that limit. Most computers sold these days have the capability to switch to a 64-bit operating system, but do not come with one already.
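
To put numbers on the two points above, here is a tiny Python sketch. The factor of 5 is only the rule of thumb from this post, not a documented project setting, and `abort_threshold_hours` is a made-up helper name. The second figure is plain arithmetic: 32-bit addresses can name at most 2**32 bytes.

```python
def abort_threshold_hours(target_hours, factor=5):
    """CPU hours to wait before giving up, per the rule of thumb above."""
    return target_hours * factor

# 32-bit addresses can name at most 2**32 bytes; express that in GiB.
ADDRESSABLE_GIB_32BIT = 2**32 / 2**30

print(abort_threshold_hours(6))   # 30
print(abort_threshold_hours(3))   # 15
print(ADDRESSABLE_GIB_32BIT)      # 4.0
```

In practice a 32-bit system usually makes somewhat less than that available to programs, which is why "4 GB or less" is the safer way to state it.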

gabberattack (johnny, eriq, segfault, r2k4, bully, sifon) Profile

Joined: Sep 27 05
Posts: 12
ID: 1341
Credit: 4,598,823
RAC: 181
Message 57694 - Posted 8 Dec 2008 2:23:15 UTC - in response to Message ID 57693.


OK, so both tasks finished successfully - 25 hrs and 33 hrs. I changed CPU load to 100% to speed up the process. Manager is asking for 438 and 332 credits; they are pending so far. I will allow more swap space and see if it helps. I have 1 GB now, but I'll change to 2 GB, because I see MiniRosetta is using 865 MB of virtual memory for each WU, so it is almost 2 GB together. Thanks for the help, I appreciate it a lot.
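
The arithmetic behind that, as a small Python sketch. The 865 MB figure is the one reported above; the assumption that two workunits run at once (one per core) and the helper name `fits` are mine.

```python
VIRTUAL_MB_PER_WU = 865   # figure reported in the post above
RUNNING_WUS = 2           # assumption: one workunit per core on a Core2Duo

total_mb = VIRTUAL_MB_PER_WU * RUNNING_WUS

def fits(limit_gb, needed_mb=total_mb):
    """True if the combined virtual memory fits under a limit given in GB."""
    return needed_mb <= limit_gb * 1024

print(total_mb)   # 1730
print(fits(1))    # False
print(fits(2))    # True
```

So a 1 GB allowance is well short of what two such workunits ask for, while 2 GB covers it with a little room to spare.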
____________

gabberattack (johnny, eriq, segfault, r2k4, bully, sifon) Profile

Joined: Sep 27 05
Posts: 12
ID: 1341
Credit: 4,598,823
RAC: 181
Message 57695 - Posted 8 Dec 2008 4:00:50 UTC - in response to Message ID 57694.


OK, both WUs got granted credit - first one 26.66, second one 10.43 - the time was not worth the credit - next time I will abort if any WU goes over 5 hours.
____________

robertmiles Profile

Joined: Jun 16 08
Posts: 656
ID: 264600
Credit: 3,462,248
RAC: 2,198
Message 57698 - Posted 8 Dec 2008 4:48:06 UTC - in response to Message ID 57694.


OK, so both tasks finished successfully - 25 hrs and 33 hrs. I changed CPU load to 100% to speed up the process. Manager is asking for 438 and 332 credits; they are pending so far. I will allow more swap space and see if it helps. I have 1 GB now, but I'll change to 2 GB, because I see MiniRosetta is using 865 MB of virtual memory for each WU, so it is almost 2 GB together. Thanks for the help, I appreciate it a lot.


If that helps, but not enough, you may also want to try something more than 2 GB multiplied by number of BOINC projects you are participating in for the swap space but not the RAM memory, like I did, in case BOINC is dividing the available swap space equally among the projects before deciding how much to allocate to each workunit.

Greg_BE Profile
Avatar

Joined: May 30 06
Posts: 4835
ID: 85645
Credit: 2,948,921
RAC: 243
Message 57711 - Posted 8 Dec 2008 20:57:11 UTC
Last modified: 8 Dec 2008 21:02:46 UTC

http://boinc.bakerlab.org/rosetta/result.php?resultid=212000802
cc_2_2_nocst4_homo_bench_foldcst_chunk_general_t332__olange_IGNORE_THE_REST_1ZJRA_12_5312_16_0

Died after 3000+ secs and was sent out again.


Also, earlier this one died at 14839 seconds:
http://boinc.bakerlab.org/rosetta/result.php?resultid=211774333
1scjB_BOINC_ABRELAX_SPLIT_CONTROL_NOHATR_IGNORE_THE_REST-S25-9-S3-3--1scjB-_4846_1647_0

Also sent out to another person.

Can someone tell me if these errors are a result of the task not liking my OC speed or what? Funny thing is that 5 tasks were completed OK in between these errors. Something odd is going on.

gabberattack (johnny, eriq, segfault, r2k4, bully, sifon) Profile

Joined: Sep 27 05
Posts: 12
ID: 1341
Credit: 4,598,823
RAC: 181
Message 57724 - Posted 9 Dec 2008 0:51:34 UTC - in response to Message ID 57698.


If that helps, but not enough, you may also want to try something more than 2 GB multiplied by number of BOINC projects you are participating in for the swap space but not the RAM memory, like I did, in case BOINC is dividing the available swap space equally among the projects before deciding how much to allocate to each workunit.


I have just Rosetta on my iMac. I set the disk space to 4 GB anyway. So far no strange WUs and all new WUs use Minirosetta 1.45 so I hope this should not be a problem anymore. Thanks for help.

____________

Reaper Profile

Joined: Feb 12 06
Posts: 6
ID: 58354
Credit: 364,880
RAC: 0
Message 57777 - Posted 10 Dec 2008 16:40:06 UTC

This one has been running for days and resetting every few minutes.
It is currently at 99.280% complete after just over 23 hours of CPU time.

I had restarted the system on the 7th and another project ran on it until 7:37 AM on the 8th. I believe the progress was already at or near 99+ percent complete then as well.
While typing this up I noticed that the latest reset reduced CPU time to less than 23 hours. I am aborting since I don't seem to be making any progress.
Sorry I didn't notice this sooner.

12/8/2008 7:37:24 AM|rosetta@home|Restarting task cc_0_6_nocst4_homo_bench_foldcst_chunk_general_t317__olange_IGNORE_THE_REST_2ESBA_10_5137_6_0 using minirosetta version 140
12/8/2008 7:53:47 AM|rosetta@home|Restarting task cc_0_6_nocst4_homo_bench_foldcst_chunk_general_t317__olange_IGNORE_THE_REST_2ESBA_10_5137_6_0 using minirosetta version 140
12/8/2008 8:10:54 AM|rosetta@home|Restarting task cc_0_6_nocst4_homo_bench_foldcst_chunk_general_t317__olange_IGNORE_THE_REST_2ESBA_10_5137_6_0 using minirosetta version 140
12/8/2008 8:28:24 AM|rosetta@home|Restarting task cc_0_6_nocst4_homo_bench_foldcst_chunk_general_t317__olange_IGNORE_THE_REST_2ESBA_10_5137_6_0 using minirosetta version 140


... time passes, with resets like the ones below
12/10/2008 4:30:21 AM|rosetta@home|Restarting task cc_0_6_nocst4_homo_bench_foldcst_chunk_general_t317__olange_IGNORE_THE_REST_2ESBA_10_5137_6_0 using minirosetta version 140
12/10/2008 4:46:42 AM|rosetta@home|Restarting task cc_0_6_nocst4_homo_bench_foldcst_chunk_general_t317__olange_IGNORE_THE_REST_2ESBA_10_5137_6_0 using minirosetta version 140
12/10/2008 5:03:52 AM|rosetta@home|Restarting task cc_0_6_nocst4_homo_bench_foldcst_chunk_general_t317__olange_IGNORE_THE_REST_2ESBA_10_5137_6_0 using minirosetta version 140
12/10/2008 5:34:01 AM|rosetta@home|Restarting task cc_0_6_nocst4_homo_bench_foldcst_chunk_general_t317__olange_IGNORE_THE_REST_2ESBA_10_5137_6_0 using minirosetta version 140
12/10/2008 5:50:25 AM|rosetta@home|Restarting task cc_0_6_nocst4_homo_bench_foldcst_chunk_general_t317__olange_IGNORE_THE_REST_2ESBA_10_5137_6_0 using minirosetta version 140
12/10/2008 6:07:49 AM|rosetta@home|Restarting task cc_0_6_nocst4_homo_bench_foldcst_chunk_general_t317__olange_IGNORE_THE_REST_2ESBA_10_5137_6_0 using minirosetta version 140
12/10/2008 6:24:11 AM|rosetta@home|Restarting task cc_0_6_nocst4_homo_bench_foldcst_chunk_general_t317__olange_IGNORE_THE_REST_2ESBA_10_5137_6_0 using minirosetta version 140
12/10/2008 6:52:30 AM|rosetta@home|Restarting task cc_0_6_nocst4_homo_bench_foldcst_chunk_general_t317__olange_IGNORE_THE_REST_2ESBA_10_5137_6_0 using minirosetta version 140
12/10/2008 7:08:48 AM|rosetta@home|Restarting task cc_0_6_nocst4_homo_bench_foldcst_chunk_general_t317__olange_IGNORE_THE_REST_2ESBA_10_5137_6_0 using minirosetta version 140
12/10/2008 7:25:21 AM|rosetta@home|Restarting task cc_0_6_nocst4_homo_bench_foldcst_chunk_general_t317__olange_IGNORE_THE_REST_2ESBA_10_5137_6_0 using minirosetta version 140
12/10/2008 7:41:57 AM|rosetta@home|Restarting task cc_0_6_nocst4_homo_bench_foldcst_chunk_general_t317__olange_IGNORE_THE_REST_2ESBA_10_5137_6_0 using minirosetta version 140
12/10/2008 7:57:46 AM|rosetta@home|Restarting task cc_0_6_nocst4_homo_bench_foldcst_chunk_general_t317__olange_IGNORE_THE_REST_2ESBA_10_5137_6_0 using minirosetta version 140
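
A restart loop like the one in the log above can be spotted without reading every line. The following is a minimal sketch in Python, assuming the messages were saved as plain text in the `timestamp|project|message` layout shown above; the function names are hypothetical and this is not a BOINC tool. It also assumes an English locale, so that `AM`/`PM` parse correctly.

```python
from datetime import datetime

# Timestamp layout used in the log lines above, e.g. "12/8/2008 7:37:24 AM".
TIMESTAMP_FORMAT = "%m/%d/%Y %I:%M:%S %p"


def restart_times(log_lines, task_name):
    """Return timestamps of 'Restarting task' messages for task_name."""
    times = []
    for line in log_lines:
        parts = line.strip().split("|", 2)
        if len(parts) != 3:
            continue  # not a timestamp|project|message line
        stamp, _project, message = parts
        if message.startswith("Restarting task " + task_name + " "):
            times.append(datetime.strptime(stamp, TIMESTAMP_FORMAT))
    return times


def restart_gaps_minutes(times):
    """Minutes between consecutive restarts."""
    return [(b - a).total_seconds() / 60 for a, b in zip(times, times[1:])]


if __name__ == "__main__":
    task = ("cc_0_6_nocst4_homo_bench_foldcst_chunk_general_t317__olange_"
            "IGNORE_THE_REST_2ESBA_10_5137_6_0")
    suffix = "|rosetta@home|Restarting task " + task + " using minirosetta version 140"
    sample = [
        "12/8/2008 7:37:24 AM" + suffix,
        "12/8/2008 7:53:47 AM" + suffix,
        "12/8/2008 8:10:54 AM" + suffix,
    ]
    gaps = restart_gaps_minutes(restart_times(sample, task))
    print([round(g, 1) for g in gaps])
```

Run against the excerpt above, the gaps fall between roughly 16 and 30 minutes. A long run of similar gaps for the same task, with no progress in between, is the pattern that suggests aborting.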


____________


Copyright © 2017 University of Washington

Last Modified: 10 Nov 2010 1:51:38 UTC