Message boards : Number crunching : Problems with Rosetta version 5.80
Previous · 1 . . . 4 · 5 · 6 · 7 · 8 · 9 · 10 · Next
Author | Message |
---|---|
M.L. Send message Joined: 21 Nov 06 Posts: 182 Credit: 180,462 RAC: 0 |
Rhiju. Do you mean batch 2156 or 2056? [both nbrs appear in your post]. |
Mark Henderson Send message Joined: 24 May 06 Posts: 9 Credit: 643,001 RAC: 0 |
These 2 errored on me after completion 100299846 sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_442 100299822 sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_418 |
JChojnacki Send message Joined: 17 Sep 05 Posts: 71 Credit: 10,747,694 RAC: 4,384 |
Looks like I had another 5.80 error, with this work unit: FLUA__BOINC_LONGNOE_JUMPRELAX_SAVE_ALL_OUT_BARCODE-FLUA_-_2120_46884_1 https://boinc.bakerlab.org/rosetta/result.php?resultid=110504357 Joel |
Bletchley Park Send message Joined: 4 Oct 07 Posts: 4 Credit: 18,052 RAC: 0 |
I have another issue with a workunit for 5.80. computation error. sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_4758_0. |
Path7 Send message Joined: 25 Aug 07 Posts: 128 Credit: 61,751 RAC: 0 |
WU 100348417 ended with an error: https://boinc.bakerlab.org/rosetta/result.php?resultid=110420517 </stderr_txt> <message> <file_xfer_error> <file_name>sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_33023_0_0</file_name> <error_code>-161</error_code> </file_xfer_error> Does anyone knows what "file xfer error" actually means? Think I found ½ the answer to my question in “stdoutdae.txt”: 2007-10-05 22:59:47 [rosetta@home] Computation for task sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_33023_0 finished 2007-10-05 22:59:47 [rosetta@home] Output file sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_33023_0_0 for task sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_33023_0 absent I'm not sure whether the absence of the file is a 5.80 error, or a common error. Anybody an Idea? Path7. |
Marky-UK Send message Joined: 1 Nov 05 Posts: 73 Credit: 1,689,495 RAC: 0 |
I'm getting several -161 errors on 5.80 WUs now too. </stderr_txt> <message> <file_xfer_error> <file_name>sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_21387_0_0</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> Waste of CPU time... |
` Send message Joined: 21 Oct 06 Posts: 254 Credit: 56,691 RAC: 0 |
I received one also: </stderr_txt> <message> <file_xfer_error> <file_name>sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_21608_0_0</file_name> <error_code>-161</error_code> </file_xfer_error> |
Jmarks Send message Joined: 16 Jul 07 Posts: 132 Credit: 98,025 RAC: 0 |
|
googloo Send message Joined: 15 Sep 06 Posts: 133 Credit: 22,813,645 RAC: 3,531 |
Just noticed that I got this bad result on September 28. |
Nothing But Idle Time Send message Joined: 28 Sep 05 Posts: 209 Credit: 139,545 RAC: 0 |
Eight hours wasted on this; wingman also bombed on it: <core_client_version>5.10.13</core_client_version> <![CDATA[ <stderr_txt> # cpu_run_time_pref: 28800 # random seed: 3947179 ====================================================== DONE :: 1 starting structures 29178.8 cpu seconds This process generated 13 decoys from 13 attempts ====================================================== BOINC :: Watchdog shutting down... BOINC :: BOINC support services shutting down... </stderr_txt> <message> <file_xfer_error> <file_name>sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_32822_1_0</file_name> <error_code>-161</error_code> </file_xfer_error> ...etc |
Mike Francis Send message Joined: 24 Nov 05 Posts: 8 Credit: 623,519 RAC: 0 |
I had a compute error on Beta; This is the error message I received. 10/7/2007 8:10:01 AM|rosetta@home|Deferring communication for 1 min 0 sec 10/7/2007 8:10:01 AM|rosetta@home|Reason: Unrecoverable error for result mcr1__BOINC_RG_FULLWEIGHT_SYMM_FOLD_AND_DOCK_RELAX-mcr1_-mfr__2128_12945_0 ( - exit code -1073741819 (0xc0000005)) 10/7/2007 8:10:01 AM|rosetta@home|Computation for task mcr1__BOINC_RG_FULLWEIGHT_SYMM_FOLD_AND_DOCK_RELAX-mcr1_-mfr__2128_12945_0 finished 10/7/2007 8:10:01 AM|rosetta@home|Output file mcr1__BOINC_RG_FULLWEIGHT_SYMM_FOLD_AND_DOCK_RELAX-mcr1_-mfr__2128_12945_0_0 for task mcr1__BOINC_RG_FULLWEIGHT_SYMM_FOLD_AND_DOCK_RELAX-mcr1_-mfr__2128_12945_0 absent |
ziegenmelker Send message Joined: 26 Jul 06 Posts: 10 Credit: 26,061 RAC: 0 |
Two compute errors with 5.80 (64-Bit Linux on X2 4400 no oc). First one: ###BEGIN############################################################ <core_client_version>5.10.8</core_client_version> <![CDATA[ <stderr_txt> Graphics are disabled due to configuration... # cpu_run_time_pref: 14400 # random seed: 3979598 *** glibc detected *** corrupted double-linked list: 0x09647f08 *** SIGABRT: abort called Stack trace (19 frames): [0x8d7cf2f] [0x8d77d1c] [0xffffe500] [0x8de8234] [0x8dfd0ce] [0x8e01ae2] [0x8e02774] [0x8e04045] [0x8dd24b7] [0x8dd3f51] [0x8b1c308] [0x8ccedcd] [0x84b7f90] [0x80d82b5] [0x85f6c37] [0x87320a7] [0x8732152] [0x8de10f4] [0x8048121] Exiting... Graphics are disabled due to configuration... # cpu_run_time_pref: 14400 *** glibc detected *** corrupted double-linked list: 0x099a9ea0 *** Graphics are disabled due to configuration... # cpu_run_time_pref: 14400 ====================================================== DONE :: 1 starting structures 14009.8 cpu seconds This process generated 10 decoys from 10 attempts ====================================================== BOINC :: Watchdog shutting down... BOINC :: BOINC support services shutting down... </stderr_txt> <message> <file_xfer_error> <file_name>sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_403_1_0</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> ###END######################################################################## And the second one: ###BEGIN###################################################################### <core_client_version>5.10.8</core_client_version> <![CDATA[ <stderr_txt> Graphics are disabled due to configuration... # cpu_run_time_pref: 14400 # random seed: 3979904 *** glibc detected *** corrupted double-linked list: 0x0991f6b8 *** Graphics are disabled due to configuration... # cpu_run_time_pref: 14400 ====================================================== DONE :: 1 starting structures 13128 cpu seconds This process generated 9 decoys from 9 attempts ====================================================== BOINC :: Watchdog shutting down... BOINC :: BOINC support services shutting down... </stderr_txt> <message> <file_xfer_error> <file_name>sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_97_1_0</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> ###END###################################################################### Maybe I should adjust crunching time to one h till the problems are solved? cu, Michael |
` Send message Joined: 21 Oct 06 Posts: 254 Credit: 56,691 RAC: 0 |
Another -161 error to report: sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_21898 |
ziegenmelker Send message Joined: 26 Jul 06 Posts: 10 Credit: 26,061 RAC: 0 |
I think it's not because of the app, but because of the WU-type. All my wingmen got errors too. My next WU is a different type(1ubi__BOINC_ABRELAX_SHORTRELAX_SAVE_ALL_OUT-1ubi_-frags83__2162...), so I'm curious if this on will crash too. cu, Michael |
BitSpit Send message Joined: 5 Nov 05 Posts: 33 Credit: 4,147,344 RAC: 0 |
Guess I'll do the job of the admins at the moment and say feel free to abort all sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155 jobs. They've been marked as canceled. |
M.L. Send message Joined: 21 Nov 06 Posts: 182 Credit: 180,462 RAC: 0 |
And another... Result ID 110379414 Name sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_11869_0 Workunit 100311273 Created 5 Oct 2007 3:10:59 UTC Sent 5 Oct 2007 11:23:37 UTC Received 8 Oct 2007 0:07:03 UTC Server state Over Outcome Client error Client state Compute error Exit status 0 (0x0) Computer ID 510574 Report deadline 15 Oct 2007 11:23:37 UTC CPU time 20188.78125 stderr out <core_client_version>5.10.20</core_client_version> <![CDATA[ <stderr_txt> # cpu_run_time_pref: 21600 # random seed: 3968132 ====================================================== DONE :: 1 starting structures 20188.1 cpu seconds This process generated 14 decoys from 14 attempts ====================================================== BOINC :: Watchdog shutting down... BOINC :: BOINC support services shutting down... </stderr_txt> <message> <file_xfer_error> <file_name>sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_11869_0_0</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> Validate state Invalid Claimed credit 84.3860161877551 Granted credit 0 application version 5.80 ----- Is this WU one of the ones to be 'aborted' or not?? Would love to hear the official verdict. |
Mysteron347 Send message Joined: 10 May 07 Posts: 1 Credit: 3,855,444 RAC: 361 |
Result ID 110418391 Name sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_31217_0 Workunit 100346611 Created 5 Oct 2007 7:33:41 UTC Sent 5 Oct 2007 7:36:21 UTC Received 8 Oct 2007 13:21:37 UTC Server state Over Outcome Client error Client state Compute error Exit status 0 (0x0) Computer ID 500090 Report deadline 15 Oct 2007 7:36:21 UTC CPU time 9604.046875 stderr out <core_client_version>5.10.20</core_client_version> <![CDATA[ <stderr_txt> # cpu_run_time_pref: 10800 # random seed: 3948784 # cpu_run_time_pref: 10800 # cpu_run_time_pref: 10800 ====================================================== DONE :: 1 starting structures 9603.16 cpu seconds This process generated 5 decoys from 5 attempts ====================================================== BOINC :: Watchdog shutting down... BOINC :: BOINC support services shutting down... </stderr_txt> <message> <file_xfer_error> <file_name>sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_31217_0_0</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> Ditto result IDs 110418409 110418408 110418407 110418390 (all error code -161) All sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_31217 (31217 -> 31216, 31233, 31234, 31235) Also: I have 179 WUs reported as being "In Progress" according to https://boinc.bakerlab.org/rosetta/results.php?userid=175558 Yet BOINC shows only 29WUs. Something not quite right here..... |
Irwan Adinatha Send message Joined: 18 Jan 06 Posts: 5 Credit: 1,245,260 RAC: 0 |
Here two mcr51 one with error and one fine. error: 10/9/2007 8:57:58 PM|rosetta@home|Reason: Unrecoverable error for result mcr1__BOINC_RG_FULLWEIGHT_SYMM_FOLD_AND_DOCK_RELAX-mcr1_-mfr__2128_9186_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 10/9/2007 8:57:58 PM|rosetta@home|Computation for task mcr1__BOINC_RG_FULLWEIGHT_SYMM_FOLD_AND_DOCK_RELAX-mcr1_-mfr__2128_9186_0 finished 10/9/2007 8:57:58 PM|rosetta@home|Output file mcr1__BOINC_RG_FULLWEIGHT_SYMM_FOLD_AND_DOCK_RELAX-mcr1_-mfr__2128_9186_0_0 for task mcr1__BOINC_RG_FULLWEIGHT_SYMM_FOLD_AND_DOCK_RELAX-mcr1_-mfr__2128_9186_0 absent OK: 10/9/2007 9:34:17 PM|rosetta@home|Starting mcr1__BOINC_SYMM_FOLD_AND_DOCK_RELAX-mcr1_-mfr__2128_11025_0 10/9/2007 9:34:17 PM|rosetta@home|Starting task mcr1__BOINC_SYMM_FOLD_AND_DOCK_RELAX-mcr1_-mfr__2128_11025_0 using rosetta_beta version 580 10/10/2007 12:20:28 AM|rosetta@home|Computation for task mcr1__BOINC_SYMM_FOLD_AND_DOCK_RELAX-mcr1_-mfr__2128_11025_0 finished |
(_KoDAk_) Send message Joined: 18 Jul 06 Posts: 109 Credit: 1,859,263 RAC: 0 |
2007-10-09 15:01:12 [rosetta@home] Deferring communication for 1 min 0 sec 2007-10-09 15:01:12 [rosetta@home] Reason: Unrecoverable error for result sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_37278_0 (<file_xfer_error> <file_name>sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_37278_0_0</file_name> <error_code>-161</error_code> </file_xfer_error> ) 2007-10-09 19:11:43 [rosetta@home] Deferring communication for 1 min 0 sec 2007-10-09 19:11:43 [rosetta@home] Reason: Unrecoverable error for result sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_13617_0 (<file_xfer_error> <file_name>sen15_RESAMPLE_BOINC_MFR_ABRELAX_PICKED_2155_13617_0_0</file_name> <error_code>-161</error_code> </file_xfer_error> ) |
Nemesis Send message Joined: 12 Mar 06 Posts: 149 Credit: 21,395 RAC: 0 |
You would think that they would send this turkey back to RALPH by now. Nemesis n. A righteous infliction of retribution manifested by an appropriate agent. |
Message boards :
Number crunching :
Problems with Rosetta version 5.80
©2024 University of Washington
https://www.bakerlab.org