Posts by Mike*

1) Message boards : Cafe Rosetta : Team Thread (Ads only, not for discussions) (Message 64579)
Posted 23 Dec 2009 by Profile Mike*
Post:



If you are a first-time cruncher,
If you are a large multi-system crunching farm,
Or anyone in between.
If you are looking for a friendly team to join,
We welcome you.






The universe contains all of us. Why shouldn't we look for the universe in you...

2) Message boards : Number crunching : Problems with Minirosetta Version 1.71 (Message 61650)
Posted 9 Jun 2009 by Profile Mike*
Post:
I have had 5 WUs all error out with the same result as this one:
<file_xfer_error>
<file_name>lb_thread_all_multi_hb_t328__IGNORE_THE_REST_12734_447_0_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>

All were also lb_thread_all_multi...
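
For anyone comparing several failed WUs, here is a minimal sketch of pulling the file name and error code out of a fragment like the one above. It uses only Python's standard library; the helper name `parse_xfer_error` is made up for illustration and is not part of BOINC, and the sketch does not try to interpret what -161 means.

```python
import xml.etree.ElementTree as ET

# Fragment copied verbatim from the error report above.
fragment = """<file_xfer_error>
<file_name>lb_thread_all_multi_hb_t328__IGNORE_THE_REST_12734_447_0_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>"""


def parse_xfer_error(text):
    """Return (file_name, error_code) from one <file_xfer_error> element.

    Hypothetical helper for illustration only.
    """
    root = ET.fromstring(text)
    name = root.findtext("file_name")
    code = int(root.findtext("error_code"))
    return name, code


name, code = parse_xfer_error(fragment)
# Check whether this failure belongs to the same batch as the others.
same_batch = name.startswith("lb_thread_all_multi")
print(name, code, same_batch)
```

Running the same helper over each failed result would show quickly whether they all share the batch prefix and the error code.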

One was even reprocessed by another host and IT also has the same error.

I currently have 7 successful WUs, with 6 left in cache, 3 started.

Host is 1077338 (Core i7, Vista 64 Ultimate, 12 GB memory)

Mike

3) Message boards : Number crunching : Problems with Minirosetta v1.54 (Message 59867)
Posted 27 Feb 2009 by Profile Mike*
Post:
Another question that may help pin down the problem:

Did you have graphics enabled at any time during those runs? When I run minirosetta 1.58 for RALPH@home, it completes successfully if I never enable graphics, but fails if I have graphics enabled for a short time during the run.


No, I did not have the graphics running; the process crashed immediately upon startup (or at least within a few seconds).

Interesting thing..

Normally I only have 1 to 3 projects un-suspended at one time. I had more than that un-suspended, but set to No New Tasks.
I suspended ALL projects, shut down, and re-booted.
Started up BOINC, set it to not keep projects in memory and to 50% CPU (to use the 1 core, non-HT), un-suspended Rosetta, allowed new tasks, and hit update. It gave me 6 tasks and I then let it do its thing..
Guess what.. no issues..
I suspended 5 of the tasks to let the 1 run.
I also re-adjusted to 100% to use HT, re-started Docking, and had several Docking and 1 Rosetta finish..

Might be due to allocating memory among the active projects..

Am wondering if any of the other bugs I saw here are the same issue with too many "active projects".
The programmer in me suspects that, but not knowing what goes on inside BOINC, I could not tell (besides, I don't do C++ or later).

Thanks for the "insight"..
Mike

P.S. Added the answer on graphics and fixed spellings.

4) Message boards : Number crunching : Problems with Minirosetta v1.54 (Message 59846)
Posted 27 Feb 2009 by Profile Mike*
Post:
A few questions that may help pin down the problem:

Are you able to find BOINC 6.2.28, and willing to upgrade to it? That's the only version I have used since 5.10.45, and I don't have that problem.

Have you gone to any extra effort to tell BOINC that it could use more virtual memory than the default?

Have you gone to any extra effort to tell your copy of Windows to allow a bigger swap file than the default?

How many BOINC projects do you have your BOINC Manager set up to recognize? I've seen some signs, so far rather indistinct, that BOINC divides the disk space it is allowed to use into equal sections for each BOINC project it recognizes before it starts dividing those sections into smaller subsections for each workunit. Therefore, if one BOINC project is heavy on disk space use, workunits for that project might run out of disk space even if some other BOINC project doesn't need all that is reserved for it.

Does this site tell you how much memory your machine has now and what the maximum for that model of computer is?

http://www.crucial.com/

I had problems getting my dual-core CPU to run two Rosetta@home workunits at the same time back when I had only 1 GB of memory to share between Vista and the two workunits, so I ordered an upgrade to the 2 GB maximum my model of computer can handle; now I can run two such workunits at once even while typing this.



The odd thing is that I had successfully finished 3 models a few days ago, and a couple before that (can't remember the version offhand; only 1 WU at a time), with no issues. I am attached to 7 projects but am not running them all. I NNT the projects and keep a small buffer so as not to have to worry about having too much. (Yeah, I know BOINC manages it, but I want to make sure everything gets done quickly.)
When you mentioned BOINC dividing the disk space, I am wondering if I had the non-active projects suspended, which I usually have done in the past..
I will retry after I get through the SIMAP run (this is why I keep the tasks low), making sure my buffer is small so as to hopefully not grab 11 tasks.


Thanks

Mike


5) Message boards : Number crunching : Problems with Minirosetta v1.54 (Message 59835)
Posted 27 Feb 2009 by Profile Mike*
Post:
Hi all,
Had the below error show up.
I initially DLed 3 WUs; the first 2 bombed and I aborted the 3rd. I then detached, re-attached, and DLed 11 new ones.

Every one of them went south..

Boinc mgr is 6.2.18

Free disk is 88g
Used by boinc is 4.81
Use at most 100g
Leave 0
Use up to 50% disk
Leave apps in memory.
Only other project (which was suspended) was CPDN at 55% @ 1004 hrs (do not want to lose this)

My host is 1008545 (should be viewable)

At this point, I will wait till next week (SIMAP is starting soon with its monthly run :)) and will try again.
Don't want to keep trashing WUs for no reason.

I do have the messages from BOINC stored if they would be useful. Here is one thing I see, though it may only be due to the process crashing:

2/26/2009 8:04:04 PM|rosetta@home|Starting lr8_A_score12_rlbd_2ci2_IGNORE_THE_REST_DECOY_SAVE_ALL_OUT_7089_1093_0
2/26/2009 8:04:05 PM|rosetta@home|Starting task lr8_A_score12_rlbd_2ci2_IGNORE_THE_REST_DECOY_SAVE_ALL_OUT_7089_1093_0 using minirosetta version 154
2/26/2009 8:04:19 PM|rosetta@home|Computation for task lr8_A_score12_rlbd_2ci2_IGNORE_THE_REST_DECOY_SAVE_ALL_OUT_7089_1093_0 finished
2/26/2009 8:04:19 PM|rosetta@home|Output file lr8_A_score12_rlbd_2ci2_IGNORE_THE_REST_DECOY_SAVE_ALL_OUT_7089_1093_0_0 for task lr8_A_score12_rlbd_2ci2_IGNORE_THE_REST_DECOY_SAVE_ALL_OUT_7089_1093_0 absent
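
To show how quickly the task died, the four lines above can be split on the `|` separator and the timestamps compared. This is only a rough sketch: it assumes the `timestamp|project|message` layout seen in these lines, the helper name `parse_line` is made up, and `%p` parsing assumes an English (or C) locale.

```python
from datetime import datetime

# The four message-log lines quoted above, copied verbatim.
LOG = """\
2/26/2009 8:04:04 PM|rosetta@home|Starting lr8_A_score12_rlbd_2ci2_IGNORE_THE_REST_DECOY_SAVE_ALL_OUT_7089_1093_0
2/26/2009 8:04:05 PM|rosetta@home|Starting task lr8_A_score12_rlbd_2ci2_IGNORE_THE_REST_DECOY_SAVE_ALL_OUT_7089_1093_0 using minirosetta version 154
2/26/2009 8:04:19 PM|rosetta@home|Computation for task lr8_A_score12_rlbd_2ci2_IGNORE_THE_REST_DECOY_SAVE_ALL_OUT_7089_1093_0 finished
2/26/2009 8:04:19 PM|rosetta@home|Output file lr8_A_score12_rlbd_2ci2_IGNORE_THE_REST_DECOY_SAVE_ALL_OUT_7089_1093_0_0 for task lr8_A_score12_rlbd_2ci2_IGNORE_THE_REST_DECOY_SAVE_ALL_OUT_7089_1093_0 absent
"""


def parse_line(line):
    """Split one log line into (timestamp, project, message).

    Hypothetical helper; the field layout is assumed from the lines above.
    """
    stamp, project, message = line.split("|", 2)
    when = datetime.strptime(stamp, "%m/%d/%Y %I:%M:%S %p")
    return when, project, message


entries = [parse_line(line) for line in LOG.splitlines() if line.strip()]

# First "Starting" line and the "Computation ... finished" line.
started = next(t for t, _, m in entries if m.startswith("Starting"))
finished = next(
    t for t, _, m in entries
    if m.startswith("Computation") and m.endswith("finished")
)
elapsed = (finished - started).total_seconds()

# An "absent" output file suggests the task ended without producing results.
output_missing = any(m.endswith("absent") for _, _, m in entries)

print(elapsed, output_missing)
```

With these four lines the gap between the first "Starting" entry and "finished" comes out to 15 seconds, which fits a crash at startup rather than a completed model.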

Thanks

mike

(extra blank lines removed)
<core_client_version>6.2.18</core_client_version>
<![CDATA[
<message>
- exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>
BOINC:: Initializing ... ok.
[2009- 2-26 20:10: 2:] :: BOINC :: boinc_init()
BOINC:: Setting up shared resources ... ok.
BOINC:: Setting up semaphores ... ok.
BOINC:: Updating status ... ok.
BOINC:: Registering timer callback... ok.
BOINC:: Worker initialized successfully.
Registering options..
Registered extra options.
Initializing core...
Initializing options.... ok
Unhandled Exception Detected...
- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x7C910193 write attempt to address 0x009882EA
Engaging BOINC Windows Runtime Debugger...
Unhandled Exception Detected...
- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x7C910193 write attempt to address 0x0040118E
Engaging BOINC Windows Runtime Debugger...
</stderr_txt>
]]>
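
The negative exit code and the hex value in the report above are the same number: the 32-bit status is just being printed as a signed integer. A minimal sketch of the conversion (the function names are made up for illustration):

```python
def to_unsigned32(code):
    """Reinterpret a signed 32-bit exit code as an unsigned 32-bit value."""
    return code & 0xFFFFFFFF


def to_signed32(value):
    """Reinterpret an unsigned 32-bit value as a signed 32-bit integer."""
    return value - 0x100000000 if value >= 0x80000000 else value


exit_code = -1073741819          # as shown in the <message> block
status = to_unsigned32(exit_code)

print(hex(status))               # matches the 0xc0000005 in the report
print(to_signed32(0xC0000005))   # back to the signed form
```

0xC0000005 is the Windows access-violation status, which agrees with the "Access Violation" lines in the stderr output.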






