Report Problems with Rosetta Version 5.25

Message boards : Number crunching : Report Problems with Rosetta Version 5.25

To post messages, you must log in.

1 · 2 · 3 · 4 . . . 12 · Next

AuthorMessage
Rhiju
Volunteer moderator

Send message
Joined: 8 Jan 06
Posts: 223
Credit: 3,546
RAC: 0
Message 19529 - Posted: 30 Jun 2006, 4:36:34 UTC

Thanks for posting problems with the new app here...
ID: 19529 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
tralala

Send message
Joined: 8 Apr 06
Posts: 376
Credit: 581,806
RAC: 0
Message 19591 - Posted: 30 Jun 2006, 19:34:21 UTC

I just got this WU. Its named Fra_t103_Casp7... I assume that is a typo and probably means t303 since no t103 exists in CASP7.
ID: 19591 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Keith Akins

Send message
Joined: 22 Oct 05
Posts: 176
Credit: 71,779
RAC: 0
Message 19600 - Posted: 1 Jul 2006, 2:42:22 UTC

When the last 5.24 WU finished and the first 5.25 WU began a wierd graphics glitch appeared just for a second, then cleared up.

Probably just the transition.

Eyes pealed.
ID: 19600 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
tralala

Send message
Joined: 8 Apr 06
Posts: 376
Credit: 581,806
RAC: 0
Message 19626 - Posted: 1 Jul 2006, 12:15:23 UTC - in response to Message 19591.  

I just got this WU. Its named Fra_t103_Casp7... I assume that is a typo and probably means t303 since no t103 exists in CASP7.


Now I got a WU named "FRA_t130_CASP7...". This seems to be a typo as well. Rhiju can you comment on that?
ID: 19626 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
J D K
Avatar

Send message
Joined: 23 Sep 05
Posts: 168
Credit: 101,266
RAC: 0
Message 19661 - Posted: 2 Jul 2006, 1:53:49 UTC

Just had eight tasks do this,

Result ID 26682014
Name FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_41_852_26_1
Workunit 22693343
Created 2 Jul 2006 1:12:20 UTC
Sent 2 Jul 2006 1:14:26 UTC
Received 2 Jul 2006 1:35:10 UTC
Server state Over
Outcome Client error
Client state Computing
Exit status 0 (0x0)
Computer ID 178904
Report deadline 9 Jul 2006 1:14:26 UTC
CPU time 22.765625
stderr out <core_client_version>5.4.9</core_client_version>
<stderr_txt>
WARNING! attempt to gzip file .aat329.out failed: file does not exist.
# DONE :: 1 starting structures built 0 (nstruct) times
# This process generated 0 decoys from 0 attempts
# 1 starting pdbs were skipped


BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...

</stderr_txt>
<message>
<file_xfer_error>
<file_name>FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_41_852_26_1_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>

</message>


Validate state Invalid
Claimed credit 0.0410291910172216
Granted credit 0
application version 5.25


running 840ee ht enabled, 3 gig ram
BOINC Wiki

ID: 19661 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile anders n

Send message
Joined: 19 Sep 05
Posts: 403
Credit: 537,991
RAC: 0
Message 19666 - Posted: 2 Jul 2006, 3:39:52 UTC

ID: 19666 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mark Henderson

Send message
Joined: 24 May 06
Posts: 9
Credit: 643,001
RAC: 0
Message 19672 - Posted: 2 Jul 2006, 5:14:49 UTC

Same Here......
ID: 19672 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Ananas

Send message
Joined: 1 Jan 06
Posts: 232
Credit: 752,471
RAC: 0
Message 19674 - Posted: 2 Jul 2006, 6:01:05 UTC
Last modified: 2 Jul 2006, 6:19:07 UTC

2 errors with FRA_t329... here too.

But I just see ... I have one FRA_t329 that is running and survived the first few seconds (at 6 hours now, going for 10 hours) :

wuid=22684007

so it is probably not the whole series.

Damaged :
FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_502_852_6
FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1230_852_1

running :
FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_560_852_6

The gzip error and the -161 are not the cause of this btw., it's just a followup error, the result file that has not been created, caused by some different error that happened before.
ID: 19674 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Shoikan

Send message
Joined: 4 Apr 06
Posts: 14
Credit: 180,211
RAC: 0
Message 19678 - Posted: 2 Jul 2006, 9:07:30 UTC

I've had 5 FRA_t329 WU's resulting in compute errors just in a few seconds of computing all in a row and with different computers.

Here they are:

26659064
26667400
26668807
26669392
26670318

Regards
ID: 19678 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
_heinz

Send message
Joined: 30 Jun 06
Posts: 24
Credit: 38,697
RAC: 0
Message 19680 - Posted: 2 Jul 2006, 9:58:01 UTC
Last modified: 2 Jul 2006, 10:03:26 UTC

I got the following error:
02.07.2006 11:53:51|rosetta@home|Unrecoverable error for result FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1171_852_28_0 (<file_xfer_error> <file_name>FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1171_852_28_0_0</file_name> <error_code>-161</error_code> <error_message></error_message></file_xfer_error>)
26652396 <----
ID: 19680 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
_heinz

Send message
Joined: 30 Jun 06
Posts: 24
Credit: 38,697
RAC: 0
Message 19685 - Posted: 2 Jul 2006, 12:44:18 UTC
Last modified: 2 Jul 2006, 12:45:49 UTC

next errors happen
02.07.2006 14:30:17|rosetta@home|Unrecoverable error for result FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_115_852_28_0 (<file_xfer_error> <file_name>FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_115_852_28_0_0</file_name> <error_code>-161</error_code> <error_message></error_message></file_xfer_error>)
02.07.2006 14:30:51|rosetta@home|Unrecoverable error for result FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1150_852_28_0 (<file_xfer_error> <file_name>FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1150_852_28_0_0</file_name> <error_code>-161</error_code> <error_message></error_message></file_xfer_error>)
02.07.2006 14:31:26|rosetta@home|Unrecoverable error for result FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1099_852_28_0 (<file_xfer_error> <file_name>FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1099_852_28_0_0</file_name> <error_code>-161</error_code> <error_message></error_message></file_xfer_error>)
look here ---->
26652379
26652378
26652337
ID: 19685 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
[DPC]Division_Brabant~OldButNotSoWise
Avatar

Send message
Joined: 23 Jan 06
Posts: 42
Credit: 371,797
RAC: 0
Message 19686 - Posted: 2 Jul 2006, 12:52:17 UTC
Last modified: 2 Jul 2006, 12:57:57 UTC

02/07/2006 12:23:25|rosetta@home|Unrecoverable error for result FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1161_852_10_0 (<file_xfer_error> <file_name>FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1161_852_10_0_0</file_name> <error_code>-161</error_code> <error_message></error_message></file_xfer_error>)

02/07/2006 12:53:50|rosetta@home|Unrecoverable error for result FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_100_852_8_1 (<file_xfer_error> <file_name>FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_100_852_8_1_0</file_name> <error_code>-161</error_code> <error_message></error_message></file_xfer_error>)

02/07/2006 13:41:21|rosetta@home|Unrecoverable error for result FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_112_852_8_1 (<file_xfer_error> <file_name>FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_112_852_8_1_0</file_name> <error_code>-161</error_code> <error_message></error_message></file_xfer_error>)

02/07/2006 13:41:56|rosetta@home|Unrecoverable error for result FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_108_852_6_1 (<file_xfer_error> <file_name>FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_108_852_6_1_0</file_name> <error_code>-161</error_code> <error_message></error_message></file_xfer_error>)

02/07/2006 13:42:32|rosetta@home|Unrecoverable error for result FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1088_852_8_1 (<file_xfer_error> <file_name>FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1088_852_8_1_0</file_name> <error_code>-161</error_code> <error_message></error_message></file_xfer_error>)

etc etc etc

Second PC:

02/07/2006 10:45:33|rosetta@home|Unrecoverable error for result FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1378_852_24_0 (<file_xfer_error> <file_name>FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1378_852_24_0_0</file_name> <error_code>-161</error_code> <error_message></error_message></file_xfer_error>)

02/07/2006 10:46:08|rosetta@home|Unrecoverable error for result FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1278_852_28_0 (<file_xfer_error> <file_name>FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1278_852_28_0_0</file_name> <error_code>-161</error_code> <error_message></error_message></file_xfer_error>)

02/07/2006 10:46:41|rosetta@home|Unrecoverable error for result FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1292_852_28_0 (<file_xfer_error> <file_name>FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1292_852_28_0_0</file_name> <error_code>-161</error_code> <error_message></error_message></file_xfer_error>)

02/07/2006 11:30:30|rosetta@home|Unrecoverable error for result FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1308_852_28_0 (<file_xfer_error> <file_name>FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1308_852_28_0_0</file_name> <error_code>-161</error_code> <error_message></error_message></file_xfer_error>)

02/07/2006 12:16:34|rosetta@home|Unrecoverable error for result FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_133_852_28_0 (<file_xfer_error> <file_name>FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_133_852_28_0_0</file_name> <error_code>-161</error_code> <error_message></error_message></file_xfer_error>)

02/07/2006 13:04:57|rosetta@home|Unrecoverable error for result FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_42_852_11_0 (<file_xfer_error> <file_name>FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_42_852_11_0_0</file_name> <error_code>-161</error_code> <error_message></error_message></file_xfer_error>)

02/07/2006 13:05:32|rosetta@home|Unrecoverable error for result FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_326_852_14_1 (<file_xfer_error> <file_name>FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_326_852_14_1_0</file_name> <error_code>-161</error_code> <error_message></error_message></file_xfer_error>)

02/07/2006 13:06:04|rosetta@home|Unrecoverable error for result FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_390_852_18_0 (<file_xfer_error> <file_name>FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_390_852_18_0_0</file_name> <error_code>-161</error_code> <error_message></error_message></file_xfer_error>)

etc etc etc


After checking my workqeue and noticed he was filled with the same WU's, I've reset the project.
ID: 19686 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
AMD_is_logical

Send message
Joined: 20 Dec 05
Posts: 299
Credit: 31,460,681
RAC: 0
Message 19690 - Posted: 2 Jul 2006, 14:04:45 UTC

ID: 19690 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Rhiju
Volunteer moderator

Send message
Joined: 8 Jan 06
Posts: 223
Credit: 3,546
RAC: 0
Message 19693 - Posted: 2 Jul 2006, 15:31:21 UTC - in response to Message 19690.  

ID: 19693 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
_heinz

Send message
Joined: 30 Jun 06
Posts: 24
Credit: 38,697
RAC: 0
Message 19702 - Posted: 2 Jul 2006, 17:01:48 UTC

next error:
02.07.2006 18:16:06|rosetta@home|Unrecoverable error for result FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1152_852_28_0 (<file_xfer_error> <file_name>FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1152_852_28_0_0</file_name> <error_code>-161</error_code> <error_message></error_message></file_xfer_error>)
02.07.2006 18:53:08|rosetta@home|Unrecoverable error for result FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1087_852_28_0 (<file_xfer_error> <file_name>FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1087_852_28_0_0</file_name> <error_code>-161</error_code> <error_message></error_message></file_xfer_error>)

26652329
26652338
26652380
26652397
ID: 19702 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Reverend Scott E. Lee
Avatar

Send message
Joined: 14 Jun 06
Posts: 1
Credit: 2,570
RAC: 0
Message 19707 - Posted: 2 Jul 2006, 19:02:05 UTC

FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1468_858_8_0
FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1488_858_8_0
FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1523_858_8_0

I suspended rosetta after these three died on the table. They all had this:

Unrecoverable error for result above
(<file_xfer_error>
<file_name>above</file_name>
<error_code>-161</error_code>
</file_xfer_error>)

I have seven more WUs that will probably die the same, horrid death.

-The Rev.
ID: 19707 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Studer SL

Send message
Joined: 6 Jun 06
Posts: 28
Credit: 11,166
RAC: 0
Message 19709 - Posted: 2 Jul 2006, 20:09:54 UTC

This task crashed about 1 minute 30 seconds in - -

7/2/2006 3:54:56 PM|rosetta@home|Starting task FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1479_858_4_0 using rosetta version 525
7/2/2006 3:55:28 PM|rosetta@home|Computation for task FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1479_858_4_0 finished
7/2/2006 3:55:29 PM|rosetta@home|Unrecoverable error for result FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1479_858_4_0 (<file_xfer_error> <file_name>FRA_t329_CASP7_hom001_6_t329_6_2ah5A_IGNORE_THE_REST_1479_858_4_0_0</file_name> <error_code>-161</error_code></file_xfer_error>)

This task started after the above task crashed. So far, it is still running.

7/2/2006 3:55:28 PM|rosetta@home|Starting task FRA_t338_CASP7_hom001_6_IGNORE_THE_REST_t338_6_1vin___1605.pdb_862_7_0 using rosetta version 525

ID: 19709 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile cmds
Avatar

Send message
Joined: 29 Jun 06
Posts: 13
Credit: 41,811
RAC: 0
Message 19714 - Posted: 2 Jul 2006, 21:22:13 UTC - in response to Message 19709.  
Last modified: 2 Jul 2006, 21:28:22 UTC

This task crashed about 1 minute 30 seconds in - -


My Pentium 4 with WInXP crashed after 25 seconds, activated HT, so 2 WUs were killed.
Example:
https://boinc.bakerlab.org/rosetta/result.php?resultid=26791193
This Wu also a 329 works fine for now 90 Min on a Linux-Host
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=22790026
https://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=266145

Chris
ID: 19714 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Dimitris Hatzopoulos

Send message
Joined: 5 Jan 06
Posts: 336
Credit: 80,939
RAC: 0
Message 19716 - Posted: 2 Jul 2006, 21:28:22 UTC
Last modified: 2 Jul 2006, 21:29:14 UTC

I also had FRA_t329* WU crash on WinXP. Perhaps the project should delete them from the queue?

Best UFO Resources
Wikipedia R@h
How-To: Join Distributed Computing projects that benefit humanity
ID: 19716 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bin Qian

Send message
Joined: 13 Jul 05
Posts: 33
Credit: 36,897
RAC: 0
Message 19719 - Posted: 2 Jul 2006, 23:05:40 UTC - in response to Message 19716.  

Thanks for reporting this error! The jobs have been deleted from the queue, but if you still got those queued up on your system, please feel free to delete them. Sorry!

About a third of the 1000 starting structures were not recognized by the rosetta application for this WU. When that happened, Rosetta will finish within a couple of minutes of starting. Since no .out.gz result file was produced, boinc reports a "file not found" error. Apparently those bad starting structures were not included in our ralph test of this WU where only a samll subset of the starting structures were used.

We will remove the bad starting structures and resend the jobs. Thanks again for reporting this!

I also had FRA_t329* WU crash on WinXP. Perhaps the project should delete them from the queue?


ID: 19719 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
1 · 2 · 3 · 4 . . . 12 · Next

Message boards : Number crunching : Report Problems with Rosetta Version 5.25



©2024 University of Washington
https://www.bakerlab.org