Problems with Rosetta version 5.80

M.L.
Joined: 21 Nov 06
Posts: 182
Credit: 180,462
RAC: 0
Message 47428 - Posted: 5 Oct 2007, 21:24:23 UTC
Last modified: 5 Oct 2007, 21:26:03 UTC

Rhiju,
Do you mean batch 2156 or 2056?
(Both numbers appear in your post.)

Mark Henderson
Joined: 24 May 06
Posts: 9
Credit: 643,001
RAC: 0
Message 47430 - Posted: 6 Oct 2007, 1:30:15 UTC
Last modified: 6 Oct 2007, 1:47:58 UTC

These two errored out on me after completion:

100299846 sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_442

100299822 sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_418

JChojnacki
Joined: 17 Sep 05
Posts: 71
Credit: 10,747,694
RAC: 4,384
Message 47433 - Posted: 6 Oct 2007, 7:13:14 UTC
Last modified: 6 Oct 2007, 7:14:49 UTC

Looks like I had another 5.80 error, with this work unit:

FLUA__BOINC_LONGNOE_JUMPRELAX_SAVE_ALL_OUT_BARCODE-FLUA_-_2120_46884_1
https://boinc.bakerlab.org/rosetta/result.php?resultid=110504357

Joel

Bletchley Park
Joined: 4 Oct 07
Posts: 4
Credit: 18,052
RAC: 0
Message 47437 - Posted: 6 Oct 2007, 9:25:01 UTC
Last modified: 6 Oct 2007, 9:27:22 UTC

I have another issue with a 5.80 work unit: a computation error.
sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_4758_0.

Path7
Joined: 25 Aug 07
Posts: 128
Credit: 61,751
RAC: 0
Message 47439 - Posted: 6 Oct 2007, 9:35:31 UTC
Last modified: 6 Oct 2007, 10:15:15 UTC

WU 100348417 ended with an error:

https://boinc.bakerlab.org/rosetta/result.php?resultid=110420517

</stderr_txt>
<message>
<file_xfer_error>
<file_name>sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_33023_0_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>

Does anyone know what "file xfer error" actually means?

I think I found half the answer to my question in “stdoutdae.txt”:

2007-10-05 22:59:47 [rosetta@home] Computation for task sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_33023_0 finished
2007-10-05 22:59:47 [rosetta@home] Output file sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_33023_0_0 for task sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_33023_0 absent

I'm not sure whether the absence of the file is a 5.80 error or a common error. Does anybody have an idea?

Path7.
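
For anyone who wants to check their own client log for the same symptom, here is a minimal sketch in Python. It assumes the log lines look exactly like the two quoted above from stdoutdae.txt; other client versions may format them differently. The function name `find_absent_outputs` and the regular expression are illustrative only and are not part of BOINC.

```python
import re

# Pattern modelled on the stdoutdae.txt lines quoted above.
# Assumption: "YYYY-MM-DD HH:MM:SS [project] Output file X for task Y absent".
ABSENT_RE = re.compile(
    r"^(?P<date>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}) "
    r"\[(?P<project>[^\]]+)\] "
    r"Output file (?P<file>\S+) for task (?P<task>\S+) absent$"
)


def find_absent_outputs(lines):
    """Return (timestamp, task, file) for every 'Output file ... absent' line."""
    hits = []
    for line in lines:
        match = ABSENT_RE.match(line.strip())
        if match:
            hits.append((match.group("date"), match.group("task"), match.group("file")))
    return hits


if __name__ == "__main__":
    sample = [
        "2007-10-05 22:59:47 [rosetta@home] Computation for task "
        "sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_33023_0 finished",
        "2007-10-05 22:59:47 [rosetta@home] Output file "
        "sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_33023_0_0 for task "
        "sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_33023_0 absent",
    ]
    for timestamp, task, filename in find_absent_outputs(sample):
        print(timestamp, task, filename)
```

This only lists the tasks whose output file was reported absent; it does not explain why the file was never written.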


Marky-UK
Joined: 1 Nov 05
Posts: 73
Credit: 1,689,495
RAC: 0
Message 47443 - Posted: 6 Oct 2007, 10:47:21 UTC

I'm getting several -161 errors on 5.80 WUs now too.

</stderr_txt>
<message>
<file_xfer_error>
<file_name>sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_21387_0_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>

</message>
]]>


Waste of CPU time...

`
Joined: 21 Oct 06
Posts: 254
Credit: 56,691
RAC: 0
Message 47448 - Posted: 6 Oct 2007, 13:54:31 UTC

I received one also:

</stderr_txt>
<message>
<file_xfer_error>
<file_name>sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_21608_0_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>


Jmarks
Joined: 16 Jul 07
Posts: 132
Credit: 98,025
RAC: 0
Message 47449 - Posted: 6 Oct 2007, 14:21:53 UTC

2120
FLUA__BOINC_LONGNOE_JUMPRELAX_SAVE_ALL_OUT_BARCODE-FLUA_-_2120_33288_0
Jmarks

googloo
Joined: 15 Sep 06
Posts: 133
Credit: 22,813,645
RAC: 3,531
Message 47456 - Posted: 6 Oct 2007, 18:44:49 UTC

Just noticed that I got this bad result on September 28.

Nothing But Idle Time
Joined: 28 Sep 05
Posts: 209
Credit: 139,545
RAC: 0
Message 47483 - Posted: 7 Oct 2007, 10:58:32 UTC

Eight hours wasted on this; wingman also bombed on it:

<core_client_version>5.10.13</core_client_version>
<![CDATA[
<stderr_txt>
# cpu_run_time_pref: 28800
# random seed: 3947179
======================================================
DONE :: 1 starting structures 29178.8 cpu seconds
This process generated 13 decoys from 13 attempts
======================================================
BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...
</stderr_txt>
<message>
<file_xfer_error>
<file_name>sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_32822_1_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>
...etc

Mike Francis
Joined: 24 Nov 05
Posts: 8
Credit: 623,519
RAC: 0
Message 47487 - Posted: 7 Oct 2007, 15:31:08 UTC

I had a compute error on Beta.
This is the error message I received:

10/7/2007 8:10:01 AM|rosetta@home|Deferring communication for 1 min 0 sec
10/7/2007 8:10:01 AM|rosetta@home|Reason: Unrecoverable error for result mcr1__BOINC_RG_FULLWEIGHT_SYMM_FOLD_AND_DOCK_RELAX-mcr1_-mfr__2128_12945_0 ( - exit code -1073741819 (0xc0000005))
10/7/2007 8:10:01 AM|rosetta@home|Computation for task mcr1__BOINC_RG_FULLWEIGHT_SYMM_FOLD_AND_DOCK_RELAX-mcr1_-mfr__2128_12945_0 finished
10/7/2007 8:10:01 AM|rosetta@home|Output file mcr1__BOINC_RG_FULLWEIGHT_SYMM_FOLD_AND_DOCK_RELAX-mcr1_-mfr__2128_12945_0_0 for task mcr1__BOINC_RG_FULLWEIGHT_SYMM_FOLD_AND_DOCK_RELAX-mcr1_-mfr__2128_12945_0 absent
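
As a side note on the two numbers in that log line: -1073741819 and 0xc0000005 are the same 32-bit value, once read as signed and once as unsigned. 0xC0000005 is the Windows status code for an access violation. A small Python sketch of the conversion follows; the helper names are made up for illustration.

```python
def to_unsigned32(code):
    """Reinterpret a signed 32-bit exit code as an unsigned 32-bit value."""
    return code & 0xFFFFFFFF


def format_exit_code(code):
    """Format an exit code as 8 hex digits, the way the client log shows it."""
    return f"0x{to_unsigned32(code):08x}"


if __name__ == "__main__":
    print(format_exit_code(-1073741819))  # prints 0xc0000005
```

The conversion only identifies the kind of crash; it says nothing about which part of the application triggered it.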


ziegenmelker
Joined: 26 Jul 06
Posts: 10
Credit: 26,061
RAC: 0
Message 47492 - Posted: 7 Oct 2007, 18:49:21 UTC

Two compute errors with 5.80 (64-bit Linux on an X2 4400, not overclocked).

First one:

###BEGIN############################################################
<core_client_version>5.10.8</core_client_version>
<![CDATA[
<stderr_txt>
Graphics are disabled due to configuration...
# cpu_run_time_pref: 14400
# random seed: 3979598
*** glibc detected *** corrupted double-linked list: 0x09647f08 ***
SIGABRT: abort called
Stack trace (19 frames):
[0x8d7cf2f]
[0x8d77d1c]
[0xffffe500]
[0x8de8234]
[0x8dfd0ce]
[0x8e01ae2]
[0x8e02774]
[0x8e04045]
[0x8dd24b7]
[0x8dd3f51]
[0x8b1c308]
[0x8ccedcd]
[0x84b7f90]
[0x80d82b5]
[0x85f6c37]
[0x87320a7]
[0x8732152]
[0x8de10f4]
[0x8048121]

Exiting...
Graphics are disabled due to configuration...
# cpu_run_time_pref: 14400
*** glibc detected *** corrupted double-linked list: 0x099a9ea0 ***
Graphics are disabled due to configuration...
# cpu_run_time_pref: 14400
======================================================
DONE :: 1 starting structures 14009.8 cpu seconds
This process generated 10 decoys from 10 attempts
======================================================


BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...

</stderr_txt>
<message>
<file_xfer_error>
<file_name>sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_403_1_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
###END########################################################################

And the second one:

###BEGIN######################################################################
<core_client_version>5.10.8</core_client_version>
<![CDATA[
<stderr_txt>
Graphics are disabled due to configuration...
# cpu_run_time_pref: 14400
# random seed: 3979904
*** glibc detected *** corrupted double-linked list: 0x0991f6b8 ***
Graphics are disabled due to configuration...
# cpu_run_time_pref: 14400
======================================================
DONE :: 1 starting structures 13128 cpu seconds
This process generated 9 decoys from 9 attempts
======================================================


BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...

</stderr_txt>
<message>
<file_xfer_error>
<file_name>sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_97_1_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
###END######################################################################

Maybe I should reduce the crunching time to one hour until the problems are solved?

cu,
Michael
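
To compare reports like the two above without reading each one by eye, the interesting fields can be pulled out of the stderr text. This is a rough Python sketch that assumes the stderr has the same layout as the reports pasted in this thread; the function name `summarize_stderr` and the dictionary keys are hypothetical.

```python
import re


def summarize_stderr(text):
    """Extract a few fields from a Rosetta stderr report pasted as plain text."""
    cpu = re.search(r"DONE ::\s+\d+ starting structures\s+([\d.]+) cpu seconds", text)
    decoys = re.search(r"This process generated\s+(\d+) decoys from\s+(\d+) attempts", text)
    error = re.search(r"<error_code>(-?\d+)</error_code>", text)
    return {
        # Number of "*** glibc detected ***" lines, i.e. reported heap corruptions.
        "glibc_reports": len(re.findall(r"\*\*\* glibc detected \*\*\*", text)),
        "cpu_seconds": float(cpu.group(1)) if cpu else None,
        "decoys": int(decoys.group(1)) if decoys else None,
        "attempts": int(decoys.group(2)) if decoys else None,
        "xfer_error_code": int(error.group(1)) if error else None,
    }


if __name__ == "__main__":
    report = """
*** glibc detected *** corrupted double-linked list: 0x0991f6b8 ***
DONE :: 1 starting structures 13128 cpu seconds
This process generated 9 decoys from 9 attempts
<error_code>-161</error_code>
"""
    print(summarize_stderr(report))
```

In both reports above the run still reaches the DONE line and then fails with -161, so a summary like this makes it easy to see that the decoys were generated but the output file was not accepted.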

`
Joined: 21 Oct 06
Posts: 254
Credit: 56,691
RAC: 0
Message 47494 - Posted: 7 Oct 2007, 19:01:20 UTC


ziegenmelker
Joined: 26 Jul 06
Posts: 10
Credit: 26,061
RAC: 0
Message 47498 - Posted: 7 Oct 2007, 20:17:50 UTC

I think it's not because of the app, but because of the WU type. All my wingmen got errors too. My next WU is a different type (1ubi__BOINC_ABRELAX_SHORTRELAX_SAVE_ALL_OUT-1ubi_-frags83__2162...), so I'm curious whether this one will crash too.

cu,
Michael

BitSpit
Joined: 5 Nov 05
Posts: 33
Credit: 4,147,344
RAC: 0
Message 47504 - Posted: 7 Oct 2007, 22:07:10 UTC

Guess I'll do the job of the admins at the moment and say feel free to abort all sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155 jobs. They've been marked as canceled.
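
If you have a long task list, a quick way to see which tasks belong to that batch is to match on the name prefix. A minimal Python sketch, assuming you have already copied your task names into a list; `tasks_to_abort` is a made-up helper, and it only lists names, it does not abort anything.

```python
# Prefix of the batch mentioned above.
CANCELLED_PREFIX = "sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_"


def tasks_to_abort(task_names, prefix=CANCELLED_PREFIX):
    """Return the task names that belong to the cancelled batch."""
    return [name for name in task_names if name.startswith(prefix)]


if __name__ == "__main__":
    my_tasks = [
        "sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_4758_0",
        "FLUA__BOINC_LONGNOE_JUMPRELAX_SAVE_ALL_OUT_BARCODE-FLUA_-_2120_33288_0",
    ]
    for name in tasks_to_abort(my_tasks):
        print(name)
```

The trailing underscore in the prefix keeps the match to batch 2155 only, so a name from some other batch that merely starts with the same digits would not be picked up.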

M.L.
Joined: 21 Nov 06
Posts: 182
Credit: 180,462
RAC: 0
Message 47506 - Posted: 8 Oct 2007, 0:12:27 UTC
Last modified: 8 Oct 2007, 0:15:11 UTC

And another...
Result ID 110379414
Name sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_11869_0
Workunit 100311273
Created 5 Oct 2007 3:10:59 UTC
Sent 5 Oct 2007 11:23:37 UTC
Received 8 Oct 2007 0:07:03 UTC
Server state Over
Outcome Client error
Client state Compute error
Exit status 0 (0x0)
Computer ID 510574
Report deadline 15 Oct 2007 11:23:37 UTC
CPU time 20188.78125
stderr out <core_client_version>5.10.20</core_client_version>
<![CDATA[
<stderr_txt>
# cpu_run_time_pref: 21600
# random seed: 3968132
======================================================
DONE :: 1 starting structures 20188.1 cpu seconds
This process generated 14 decoys from 14 attempts
======================================================


BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...

</stderr_txt>
<message>
<file_xfer_error>
<file_name>sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_11869_0_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>

</message>
]]>


Validate state Invalid
Claimed credit 84.3860161877551
Granted credit 0
application version 5.80
-----
Is this WU one of the ones to be 'aborted' or not? I would love to hear the official verdict.


Mysteron347
Joined: 10 May 07
Posts: 1
Credit: 3,855,444
RAC: 361
Message 47530 - Posted: 9 Oct 2007, 2:43:14 UTC

Result ID 110418391
Name sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_31217_0
Workunit 100346611
Created 5 Oct 2007 7:33:41 UTC
Sent 5 Oct 2007 7:36:21 UTC
Received 8 Oct 2007 13:21:37 UTC
Server state Over
Outcome Client error
Client state Compute error
Exit status 0 (0x0)
Computer ID 500090
Report deadline 15 Oct 2007 7:36:21 UTC
CPU time 9604.046875
stderr out

<core_client_version>5.10.20</core_client_version>
<![CDATA[
<stderr_txt>
# cpu_run_time_pref: 10800
# random seed: 3948784
# cpu_run_time_pref: 10800
# cpu_run_time_pref: 10800
======================================================
DONE :: 1 starting structures 9603.16 cpu seconds
This process generated 5 decoys from 5 attempts
======================================================


BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...

</stderr_txt>
<message>
<file_xfer_error>
<file_name>sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_31217_0_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>

</message>
]]>

Ditto result IDs

110418409
110418408
110418407
110418390

(all error code -161)

All sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_31217
(31217 -> 31216, 31233, 31234, 31235)

Also:

I have 179 WUs reported as being "In Progress" according to https://boinc.bakerlab.org/rosetta/results.php?userid=175558
Yet BOINC shows only 29 WUs.

Something is not quite right here.

Irwan Adinatha
Joined: 18 Jan 06
Posts: 5
Credit: 1,245,260
RAC: 0
Message 47551 - Posted: 9 Oct 2007, 17:58:42 UTC

Here are two mcr1 work units: one with an error and one that finished fine.
Error:
10/9/2007 8:57:58 PM|rosetta@home|Reason: Unrecoverable error for result mcr1__BOINC_RG_FULLWEIGHT_SYMM_FOLD_AND_DOCK_RELAX-mcr1_-mfr__2128_9186_0 (Incorrect function. (0x1) - exit code 1 (0x1))
10/9/2007 8:57:58 PM|rosetta@home|Computation for task mcr1__BOINC_RG_FULLWEIGHT_SYMM_FOLD_AND_DOCK_RELAX-mcr1_-mfr__2128_9186_0 finished
10/9/2007 8:57:58 PM|rosetta@home|Output file mcr1__BOINC_RG_FULLWEIGHT_SYMM_FOLD_AND_DOCK_RELAX-mcr1_-mfr__2128_9186_0_0 for task mcr1__BOINC_RG_FULLWEIGHT_SYMM_FOLD_AND_DOCK_RELAX-mcr1_-mfr__2128_9186_0 absent

OK:
10/9/2007 9:34:17 PM|rosetta@home|Starting mcr1__BOINC_SYMM_FOLD_AND_DOCK_RELAX-mcr1_-mfr__2128_11025_0
10/9/2007 9:34:17 PM|rosetta@home|Starting task mcr1__BOINC_SYMM_FOLD_AND_DOCK_RELAX-mcr1_-mfr__2128_11025_0 using rosetta_beta version 580
10/10/2007 12:20:28 AM|rosetta@home|Computation for task mcr1__BOINC_SYMM_FOLD_AND_DOCK_RELAX-mcr1_-mfr__2128_11025_0 finished

(_KoDAk_)
Joined: 18 Jul 06
Posts: 109
Credit: 1,859,263
RAC: 0
Message 47552 - Posted: 9 Oct 2007, 18:11:02 UTC

2007-10-09 15:01:12 [rosetta@home] Deferring communication for 1 min 0 sec
2007-10-09 15:01:12 [rosetta@home] Reason: Unrecoverable error for result sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_37278_0 (<file_xfer_error>
<file_name>sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_37278_0_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>
)

2007-10-09 19:11:43 [rosetta@home] Deferring communication for 1 min 0 sec
2007-10-09 19:11:43 [rosetta@home] Reason: Unrecoverable error for result sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_13617_0 (<file_xfer_error>
<file_name>sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_13617_0_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>
)


Nemesis
Joined: 12 Mar 06
Posts: 149
Credit: 21,395
RAC: 0
Message 47566 - Posted: 9 Oct 2007, 19:59:50 UTC

You would think that they would send this turkey back to RALPH by now.
Nemesis n. A righteous infliction of retribution manifested by an appropriate agent.



©2024 University of Washington
https://www.bakerlab.org