Minirosetta v1.28 bug thread

BrnmccO1

Joined: 26 Jun 07
Posts: 17
Credit: 578,825
RAC: 0
Message 54024 - Posted: 27 Jun 2008, 2:13:44 UTC

2 more Mini128 bugs/crashes.

158211853
157910486

One failed on another computer, one completed successfully; both are CASP8 tasks.
Alexander Klauer

Joined: 10 Mar 08
Posts: 3
Credit: 110,308
RAC: 0
Message 54027 - Posted: 27 Jun 2008, 9:57:13 UTC

Hi,

Occasionally, when I suspend a 1.28 task and resume it later, it finishes immediately and is returned to the server as a success. For example:

https://boinc.bakerlab.org/rosetta/result.php?resultid=173640156

is such a task. Claimed and granted credit appear to be correctly decreased proportionally, but still, why doesn't the task finish normally?

Best regards,
Alexander
dcdc
Joined: 3 Nov 05
Posts: 1676
Credit: 89,068,443
RAC: 48,420
Message 54028 - Posted: 27 Jun 2008, 10:02:08 UTC - in response to Message 54027.  

Rosetta probably recalculated that creating a new decoy would take longer than your time-limit preference allows. It therefore packages that one decoy and reports it. A Rosetta task consists of as many decoys as can be produced within the time limit that you set (4 hours by default, I believe), so the task did finish normally.
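
As a rough illustration of the behaviour described in this reply, the Python sketch below models a task that keeps producing decoys until the next one is not expected to fit in the remaining run-time budget. The function name, its parameters, and the use of a simple average are assumptions made for illustration; the actual minirosetta logic is not shown in this thread.

```python
def should_start_next_decoy(elapsed_s, decoys_done, run_time_pref_s):
    """Return True if another decoy is expected to fit in the time budget.

    Hypothetical model: estimate the cost of the next decoy as the average
    CPU time of the decoys finished so far.
    """
    if decoys_done == 0:
        # Nothing finished yet, so there is no estimate; keep working.
        return True
    avg_per_decoy_s = elapsed_s / decoys_done
    return elapsed_s + avg_per_decoy_s <= run_time_pref_s


# One decoy took 2.5 h with a 4 h (14400 s) preference: a second decoy
# would be expected to overrun, so the task would report what it has.
print(should_start_next_decoy(9000, 1, 14400))  # False
print(should_start_next_decoy(3000, 1, 14400))  # True
```

Under this model, a task resumed late in its time budget would finish right away, which matches the observation that the result was still reported as a success.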
Alexander Klauer

Joined: 10 Mar 08
Posts: 3
Credit: 110,308
RAC: 0
Message 54029 - Posted: 27 Jun 2008, 11:57:55 UTC - in response to Message 54028.  

Ah, that explains things. Thank you!
Pepo
Joined: 28 Sep 05
Posts: 115
Credit: 101,358
RAC: 0
Message 54030 - Posted: 27 Jun 2008, 15:15:18 UTC - in response to Message 54006.  

How can I get work only for Rosetta 5.96 (5.98)?

You'd have to write your own anonymous platform description file, app_info.xml, for the Rosetta project. It would contain just the required application(s) and describe all other necessary files.

Or someone can volunteer...

Pete Leman did. See this message (or better the whole thread).

Peter
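
For readers unfamiliar with the anonymous platform mechanism mentioned above, here is a minimal Python sketch that generates the general shape of an app_info.xml file. The element names follow the BOINC anonymous platform format as I understand it; the app name, version number, and executable file name in the usage line are hypothetical placeholders, not values taken from this thread or from the file Pete Leman posted.

```python
import xml.etree.ElementTree as ET


def build_app_info(app_name, version_num, executable, extra_files=()):
    """Build a minimal anonymous-platform app_info.xml document as a string."""
    root = ET.Element("app_info")
    app = ET.SubElement(root, "app")
    ET.SubElement(app, "name").text = app_name

    # Describe every file the application needs; flag the executable.
    for file_name in (executable, *extra_files):
        info = ET.SubElement(root, "file_info")
        ET.SubElement(info, "name").text = file_name
        if file_name == executable:
            ET.SubElement(info, "executable")

    # Tie the files to one application version.
    version = ET.SubElement(root, "app_version")
    ET.SubElement(version, "app_name").text = app_name
    ET.SubElement(version, "version_num").text = str(version_num)
    for file_name in (executable, *extra_files):
        ref = ET.SubElement(version, "file_ref")
        ET.SubElement(ref, "file_name").text = file_name
        if file_name == executable:
            ET.SubElement(ref, "main_program")
    return ET.tostring(root, encoding="unicode")


# Hypothetical values: check the real app name, version and file names
# in the project directory before using anything like this.
print(build_app_info("rosetta", 596, "rosetta_5.96_example.exe"))
```

The referenced thread remains the better source for a file that is known to work with the project.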
(_KoDAk_)
Joined: 18 Jul 06
Posts: 109
Credit: 1,859,263
RAC: 0
Message 54033 - Posted: 27 Jun 2008, 21:30:38 UTC

restart PC and ((
https://boinc.bakerlab.org/rosetta/result.php?resultid=173384058
https://boinc.bakerlab.org/rosetta/result.php?resultid=173167237
- exit code 1073807364 (0x40010004)
https://boinc.bakerlab.org/rosetta/result.php?resultid=173166660
https://boinc.bakerlab.org/rosetta/result.php?resultid=173165661

Ingleside

Joined: 25 Sep 05
Posts: 105
Credit: 434,745
RAC: 0
Message 54041 - Posted: 28 Jun 2008, 13:39:26 UTC - in response to Message 53993.  

The errors may indicate a corrupted database file. Can you try resetting the project?

If files becoming corrupt after download is a problem, it would be an idea for Rosetta@home to enable <verify_files_on_app_start>
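
To make the idea concrete, the sketch below shows what verifying a file before starting an application amounts to conceptually: recompute a checksum and compare it with the one recorded at download time. This is an illustration in Python under the assumption that an MD5 checksum is available for each file; it is not the BOINC client's actual implementation, and the function names are hypothetical.

```python
import hashlib


def file_md5(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_intact(path, expected_md5):
    """Compare a file's digest with the checksum recorded at download."""
    return file_md5(path) == expected_md5.lower()
```

A file that fails such a check would need to be downloaded again rather than handed to the application.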

"I make so many mistakes. But then just think of all the mistakes I don't make, although I might."
Greg_BE
Joined: 30 May 06
Posts: 4875
Credit: 4,515,389
RAC: 1,001
Message 54045 - Posted: 28 Jun 2008, 14:51:29 UTC

https://boinc.bakerlab.org/rosetta/result.php?resultid=172380154

Outcome Client error
Client state Compute error
Exit status -1073741819 (0xc0000005)

CPU time 65.46875
stderr out

<core_client_version>6.2.6</core_client_version>
<![CDATA[
<message>
- exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>
# cpu_run_time_pref: 14400


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x7C9113C8 write attempt to address 0x028B5FFC

Big debugger run after this

I was testing out my OC speed when this died and crashed my system. Not sure if that has anything to do with it or not.
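
The exit statuses quoted in this thread are Windows status codes shown both as signed 32-bit decimals and as hex. A small Python sketch, offered only as a reading aid, shows how the two forms relate; the helper name is hypothetical, and the two symbolic names come from the standard Windows NTSTATUS definitions.

```python
def to_ntstatus(exit_status):
    """Map a (possibly negative) 32-bit exit status to its unsigned hex form."""
    return f"0x{exit_status & 0xFFFFFFFF:08x}"


# Names taken from the Windows NTSTATUS definitions.
KNOWN_CODES = {
    "0xc0000005": "STATUS_ACCESS_VIOLATION",
    "0x40010004": "DBG_TERMINATE_PROCESS",
}

for status in (-1073741819, 1073807364):
    code = to_ntstatus(status)
    print(status, code, KNOWN_CODES.get(code, "unknown"))
```

The first value is the access violation reported here; the second is the code seen in the tasks that failed after a PC restart.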
Greg_BE
Joined: 30 May 06
Posts: 4875
Credit: 4,515,389
RAC: 1,001
Message 54048 - Posted: 28 Jun 2008, 16:04:56 UTC - in response to Message 54045.  

Edit: It certainly does as I crashed a few other work units as well. Sorry folks.
googloo
Joined: 15 Sep 06
Posts: 127
Credit: 18,026,802
RAC: 8,351
Message 54063 - Posted: 29 Jun 2008, 13:52:08 UTC

This one ran over 5 hours (my preference is set to the default, 3) and seemed to be stalled, as the time to completion wasn't changing. I clicked on "Show graphics" and after a few seconds the work unit finished. The graphics never moved. Coincidence? Or flaw in 1.28?

Task ID 173964319
Name h004__BOINC_CASP8_ABRELAX_RANGE_t443_FULL_IGNORE_THE_REST-S25-9-S3-5--h004_-_4119_239_0
Workunit 158766926
Created 27 Jun 2008 18:53:28 UTC
Sent 27 Jun 2008 18:54:32 UTC
Received 29 Jun 2008 9:37:18 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x0)
Computer ID 307276
Report deadline 7 Jul 2008 18:54:32 UTC
CPU time 18349.67
stderr out

<core_client_version>5.10.45</core_client_version>
<![CDATA[
<stderr_txt>
======================================================
DONE :: 1 starting structures 18349.5 cpu seconds
This process generated 2 decoys from 2 attempts
======================================================

BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...
called boinc_finish

</stderr_txt>
]]>

Validate state Valid
Claimed credit 47.9030125857267
Granted credit 44.3947554487014
application version 1.28
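
The summary lines in the stderr output above carry enough information to work out the average CPU time per decoy. The following Python sketch parses them; the regular expressions are assumptions based only on the two summary lines as they appear in this thread, and the function name is hypothetical.

```python
import re

STDERR_SUMMARY = """\
DONE :: 1 starting structures 18349.5 cpu seconds
This process generated 2 decoys from 2 attempts
"""


def summarize(stderr_text):
    """Extract CPU seconds, decoys and attempts from a stderr summary."""
    cpu = re.search(r"starting structures\s+([\d.]+)\s+cpu seconds", stderr_text)
    made = re.search(r"generated\s+(\d+)\s+decoys from\s+(\d+)\s+attempts", stderr_text)
    if cpu is None or made is None:
        return None  # Summary missing, e.g. an empty <stderr_txt> block.
    return float(cpu.group(1)), int(made.group(1)), int(made.group(2))


cpu_seconds, decoys, attempts = summarize(STDERR_SUMMARY)
print(cpu_seconds / decoys)  # average CPU seconds per decoy
```

Note that the summary is only written when the task ends, so this does not report progress on a task that is still running, and a task that reports 0 decoys needs a guard before dividing.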
Evan

Joined: 23 Dec 05
Posts: 268
Credit: 402,585
RAC: 0
Message 54100 - Posted: 1 Jul 2008, 10:00:33 UTC

This one (174259703) stalled after about 2 hours, eventually uploaded itself, and was reported yesterday. However, there is no record of it, so it appears to be lost somewhere in cyberspace.
David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Joined: 1 Jul 05
Posts: 1018
Credit: 4,003,213
RAC: 0
Message 54110 - Posted: 1 Jul 2008, 15:22:00 UTC - in response to Message 54041.  

The errors may indicate a corrupted database file. Can you try resetting the project?

If files becoming corrupt after download is a problem, it would be an idea for Rosetta@home to enable <verify_files_on_app_start>


will do, thanks for pointing that out.
Speedy
Joined: 25 Sep 05
Posts: 162
Credit: 703,854
RAC: 0
Message 54123 - Posted: 1 Jul 2008, 21:48:33 UTC

Is there a way to tell which model the work unit is on and how many decoys it has done? I have had a look in the slot '0' and slot '1' folders but I can't see anything relating to models and decoys. Does every checkpoint mean that a model/decoy has been finished?
Thanks in advance
Speedy
Have a crunching good day!!
Arthur Esseling

Joined: 20 Jan 06
Posts: 1
Credit: 34,840
RAC: 0
Message 54131 - Posted: 2 Jul 2008, 2:24:46 UTC

This is the first time I have run a minirosetta task since I signed up quite a while ago. The graphics look like barren tree branches in the wintertime. Is this normal?
The_Bad_Penguin
Joined: 5 Jun 06
Posts: 2751
Credit: 2,962,224
RAC: 3,583
Message 54134 - Posted: 2 Jul 2008, 3:11:13 UTC

h001a_BOINC_CASP8_ABRELAX_RANGE_t443_IGNORE_THE_REST-S25-6-S3-9--h001a-_4108_170


CPU time 10188.64
stderr out <core_client_version>5.10.13</core_client_version>
<![CDATA[
<stderr_txt>

</stderr_txt>
]]>


Validate state Invalid
Claimed credit 43.4047540418738
Granted credit 0
application version 1.28
The_Bad_Penguin
Joined: 5 Jun 06
Posts: 2751
Credit: 2,962,224
RAC: 3,583
Message 54136 - Posted: 2 Jul 2008, 3:17:12 UTC

h003__BOINC_CASP8_ABRELAX_RANGE_t405_IGNORE_THE_REST-S25-7-S3-12--h003_-_3755_123_0


CPU time 10734.16
stderr out <core_client_version>5.10.13</core_client_version>
<![CDATA[
<stderr_txt>

</stderr_txt>
]]>


Validate state Invalid
Claimed credit 45.7287306888967
Granted credit 0
application version 1.28
The_Bad_Penguin
Joined: 5 Jun 06
Posts: 2751
Credit: 2,962,224
RAC: 3,583
Message 54137 - Posted: 2 Jul 2008, 3:19:31 UTC

rb_06_20_11699_20969_T0465_IGNORE_THE_REST_04_08_4098_465_0


CPU time 1429.313
stderr out <core_client_version>5.10.13</core_client_version>
<![CDATA[
<stderr_txt>
Too many restarts with no progress. Keep application in memory while preempted.
======================================================
DONE :: 1 starting structures 1429 cpu seconds
This process generated 0 decoys from 0 attempts
======================================================

BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...
called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
<file_name>rb_06_20_11699_20969_T0465_IGNORE_THE_REST_04_08_4098_465_0_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>

</message>
]]>


Validate state Invalid
Claimed credit 6.18136630971515
Granted credit 0
application version 1.28
The_Bad_Penguin
Joined: 5 Jun 06
Posts: 2751
Credit: 2,962,224
RAC: 3,583
Message 54138 - Posted: 2 Jul 2008, 3:22:17 UTC

d001a_BOINC_CASP8_ABRELAX_main_t405_IGNORE_THE_REST-S25-10-S3-9--d001a-_3648_14243_1


CPU time 0.670804
stderr out <core_client_version>5.10.13</core_client_version>
<![CDATA[
<message>
- exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x004C1C41 read attempt to address 0x0000000C

Engaging BOINC Windows Runtime Debugger...



Validate state Invalid
Claimed credit 0.00290103374559817
Granted credit 0.00290103374559817
application version 1.28
Speedy
Joined: 25 Sep 05
Posts: 162
Credit: 703,854
RAC: 0
Message 54139 - Posted: 2 Jul 2008, 3:28:41 UTC - in response to Message 54131.  

This is the first time I have run a minirosetta task since I signed up quite a while ago. The graphics look like barren tree branches in the wintertime. Is this normal?

Yes this is normal for the mini application graphics.
Speedy
Have a crunching good day!!
adrianxw
Joined: 18 Sep 05
Posts: 610
Credit: 9,785,375
RAC: 3,989
Message 54152 - Posted: 3 Jul 2008, 9:06:21 UTC

Unqualified validation error 175005020
<core_client_version>5.10.45</core_client_version>
<![CDATA[
<stderr_txt>

</stderr_txt>
]]>

Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.