Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 25 · 26 · 27 · 28 · 29 · 30 · 31 . . . 309 · Next

AuthorMessage
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2140
Credit: 41,518,559
RAC: 12,941
Message 90972 - Posted: 5 Aug 2019, 4:00:19 UTC

Current server status

rah_assimilator_rosetta1 (rosetta) bwsrv2 Running
rah_assimilator_rosetta2 (rosetta) bwsrv2 Not Running
rah_assimilator_mini1 (minirosetta) bwsrv2 Not Running
rah_assimilator_mini2 (minirosetta) bwsrv2 Running
rah_validator_rosetta1 (rosetta) bwsrv2 Not Running
rah_validator_rosetta2 (rosetta) bwsrv2 Running
rah_validator_mini1 (minirosetta) bwsrv2 Not Running
rah_validator_mini2 (minirosetta) bwsrv2 Running
file_deleter1 bwsrv2 Running
file_deleter2 bwsrv2 Running
db_purge bwsrv2 Not Running

Workunits waiting for validation 2057
Workunits waiting for assimilation 1031

I have some unvalidated tasks held up here - while others go straight through.

Any chance of a bump please?
ID: 90972 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2140
Credit: 41,518,559
RAC: 12,941
Message 90976 - Posted: 5 Aug 2019, 16:40:26 UTC - in response to Message 90972.  

The whole of Server BWSRV2 is not running right now. Hopefully that means it's being worked on

Workunits waiting for validation 62381
Workunits waiting for assimilation 11088
ID: 90976 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
JohnH

Send message
Joined: 25 Mar 13
Posts: 43
Credit: 2,319,355
RAC: 0
Message 90977 - Posted: 5 Aug 2019, 17:57:43 UTC

rah_assimilator_rosetta1 (rosetta) bwsrv2 Not Running
rah_assimilator_rosetta2 (rosetta) bwsrv2 Not Running
rah_assimilator_mini1 (minirosetta) bwsrv2 Not Running
rah_assimilator_mini2 (minirosetta) bwsrv2 Not Running
rah_validator_rosetta1 (rosetta) bwsrv2 Not Running
rah_validator_rosetta2 (rosetta) bwsrv2 Not Running
rah_validator_mini1 (minirosetta) bwsrv2 Not Running
rah_validator_mini2 (minirosetta) bwsrv2 Not Running
file_deleter1 bwsrv2 Not Running
file_deleter2 bwsrv2 Not Running
db_purge bwsrv2 Not Running

Anybody minding the store ???
ID: 90977 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 2002
Credit: 9,780,807
RAC: 6,697
Message 90979 - Posted: 5 Aug 2019, 20:09:39 UTC - in response to Message 90977.  

Anybody minding the store ???


Maybe all the people is here: RosettaCon 2019
ID: 90979 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2140
Credit: 41,518,559
RAC: 12,941
Message 90983 - Posted: 6 Aug 2019, 0:20:53 UTC - in response to Message 90979.  

Anybody minding the store ???

Maybe all the people is here: RosettaCon 2019

Oh! Could be. Meanwhile...

Workunits waiting for validation 111762
Workunits waiting for assimilation 12405
ID: 90983 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2140
Credit: 41,518,559
RAC: 12,941
Message 90993 - Posted: 6 Aug 2019, 20:44:56 UTC - in response to Message 90976.  

The whole of Server BWSRV2 is not running right now. Hopefully that means it's being worked on

Workunits waiting for validation 62381
Workunits waiting for assimilation 11088

Aaaannnnddd... we're back. Sort of.

bwsvr2 running, validation all up to date, assimilation not yet but I don't actually know what that does tbh

Workunits waiting for validation 6
Workunits waiting for assimilation 61639
ID: 90993 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2140
Credit: 41,518,559
RAC: 12,941
Message 91308 - Posted: 28 Oct 2019, 10:25:23 UTC

BWSRV2 not running and seems to have been down for 5 or 6 hours.

Task validation is building up - currently at 49905
ID: 91308 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2140
Credit: 41,518,559
RAC: 12,941
Message 91312 - Posted: 29 Oct 2019, 10:35:30 UTC - in response to Message 91308.  

BWSRV2 not running and seems to have been down for 5 or 6 hours.

Task validation is building up - currently at 49905

Noticed earlier this morning everything's back running and all my validation is caught up. Thanks for fixing.
ID: 91312 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
danny

Send message
Joined: 2 Nov 19
Posts: 1
Credit: 0
RAC: 0
Message 91328 - Posted: 2 Nov 2019, 18:36:32 UTC - in response to Message 80897.  

Excellent. Thanks.
ID: 91328 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Stevie G

Send message
Joined: 15 Dec 18
Posts: 107
Credit: 865,910
RAC: 993
Message 91353 - Posted: 11 Nov 2019, 15:10:00 UTC

Rosetta is doing it again. Lots of wasted computer time and it's pissing me off.

1104110555 993281955 3551508 11 Nov 2019, 3:13:36 UTC 11 Nov 2019, 14:03:47 UTC Error while computing 32,815.09 29,296.58 --- Rosetta v4.07
windows_x86_64
windows_intelx86

1102262280 992814189 3551508 30 Oct 2019, 13:29:52 UTC 31 Oct 2019, 8:17:27 UTC Error while computing 31,262.13 29,037.15 --- Rosetta v4.07
windows_x86_64

1102262227 992814166 3551508 30 Oct 2019, 13:29:52 UTC 4 Nov 2019, 18:17:19 UTC Error while computing 33,356.00 29,107.82 --- Rosetta v4.07
windows_x86_64

Steven Gaber
Oldsmar, FL
ID: 91353 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
benefique pour tous

Send message
Joined: 12 Oct 19
Posts: 2
Credit: 2,629,601
RAC: 0
Message 91469 - Posted: 19 Dec 2019, 7:55:48 UTC - in response to Message 80621.  

I must put 3 of my computers at cleaning
The perform is Under usual hability to compute
It overheat same with only 40% of processors with only 25 % power for each processors
now came the time to eliminate the dust
I am happy because the cruncher solved over 12 millions tasks this year
Happy Crunching
Happy Christmas feasts
Guy PFLIEGER
ID: 91469 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Hal Bregg

Send message
Joined: 16 Sep 18
Posts: 2
Credit: 418,479
RAC: 0
Message 91546 - Posted: 13 Jan 2020, 22:31:24 UTC

One of the WUs errored out after 5hrs of running. It should run around 1 hour as per project settings.

Name 	rb_01_09_13386_13515_ab_t000__robetta_cstwt_5.0_FT_IGNORE_THE_REST_07_11_885102_10_1

Exit status 	139 (0x0000008B) Unknown error code


<core_client_version>7.9.3</core_client_version>
<![CDATA[
<message>
process got signal 11</message>
<stderr_txt>
command: ../../projects/boinc.bakerlab.org_rosetta/rosetta_4.07_i686-pc-linux-gnu @rb_01_09_13386_13515_ab_t000__robetta_FLAGS -in::file::fasta t000_.fasta -psipred_ss2 t000_.spider3_ss2 -kill_hairpins t000_.nobuformat.spider3_ss2 -jumps:pairing_file t000_.fasta.bbcontacts.jumps -abinitio::use_filters false -skip_convergence_check -jumps:overlap_chainbreak -seq_sep_stages 1 1 1 -ramp_chainbreaks -sep_switch_accelerate 0.8 -jumps:random_sheets 5 4 1 -constraints::cst_file t000_.fasta.CB.cst -constraints:cst_weight 5.0 -constraints::cst_fa_file t000_.fasta.MIN.cst -constraints:cst_fa_weight 5.0 -in:file:boinc_wu_zip rb_01_09_13386_13515_ab_t000__robetta.zip -frag3 rb_01_09_13386_13515_ab_t000__robetta.200.3mers.index.gz -fragA rb_01_09_13386_13515_ab_t000__robetta.200.11mers.index.gz -fragB rb_01_09_13386_13515_ab_t000__robetta.200.7mers.index.gz -nstruct 10000 -cpu_run_time 28800 -watchdog -boinc:max_nstruct 600 -checkpoint_interval 120 -mute all -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -run::rng mt19937 -constant_seed -jran 1009779
Starting watchdog...
Watchdog active.
BOINC:: CPU time: 18038.9s, 14400s + 3600s[2020- 1-13  6:54: 3:] :: BOINC 
WARNING! cannot get file size for default.out.gz: could not open file.
Output exists: default.out.gz Size: -1
InternalDecoyCount: 0 (GZ)
-----
0
-----
Stream information inconsistent.
Writing W_0000001
======================================================
DONE ::     1 starting structures  18038.9 cpu seconds
This process generated      1 decoys from       1 attempts
======================================================
06:54:03 (24943): called boinc_finish(0)

</stderr_txt>
]]>

ID: 91546 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2140
Credit: 41,518,559
RAC: 12,941
Message 91549 - Posted: 15 Jan 2020, 2:33:31 UTC

New tasks are a bit stopstart today. No tasks earlier today, then a batch of new ones came through, but empty again right now.
ID: 91549 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2140
Credit: 41,518,559
RAC: 12,941
Message 91554 - Posted: 15 Jan 2020, 17:54:50 UTC - in response to Message 91549.  

Bump. Hand to mouth over the last 24hrs.
Though tbf this is pretty typical for January
New tasks are a bit stopstart today. No tasks earlier today, then a batch of new ones came through, but empty again right now.

ID: 91554 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 91555 - Posted: 15 Jan 2020, 18:06:22 UTC - in response to Message 91554.  
Last modified: 15 Jan 2020, 18:08:06 UTC

Now that you mention it, I have one core free too. A manual update did not get anything.
It appears that the Christmas holiday has come late (or early) this year.
ID: 91555 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
premier

Send message
Joined: 30 Dec 05
Posts: 14
Credit: 23,872,868
RAC: 0
Message 91557 - Posted: 16 Jan 2020, 7:31:43 UTC - in response to Message 91555.  

Same Here. 11 Machines are boring ATM. Not getting new job.
ID: 91557 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2140
Credit: 41,518,559
RAC: 12,941
Message 91573 - Posted: 18 Jan 2020, 1:35:00 UTC - in response to Message 91554.  

Bump. Hand to mouth over the last 24hrs.
Though tbf this is pretty typical for January
New tasks are a bit stopstart today. No tasks earlier today, then a batch of new ones came through, but empty again right now.

A fair few tasks became available today, but it seems our buffers are so empty they all got taken and we're back to zero again.

Some efforts seem to have made, but more still needed
ID: 91573 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Igor Kurpis
Avatar

Send message
Joined: 13 Jan 20
Posts: 2
Credit: 103,514
RAC: 0
Message 91601 - Posted: 23 Jan 2020, 21:55:57 UTC

Hello!

I was looking at my results recently I there is something odd in output of Rosetta task. Don't know if I should care or if there is a way to resolve this.

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
ter: 0x0000000005f08e93 ***
*** Error in `../../projects/boinc.bakerlab.org_rosetta/rosetta_4.08_x86_64-pc-linux-gnu': free(): invalid pointer: 0x0000000005f08e93 ***
*** Error in `../../projects/boinc.bakerlab.org_rosetta/rosetta_4.08_x86_64-pc-linux-gnu': free(): invalid pointer: 0x0000000005f08e93 ***
*** Error in `../../projects/boinc.bakerlab.org_rosetta/rosetta_4.08_x86_64-pc-linux-gnu': free(): invalid pointer: 0x0000000005f08e93 ***
*** Error in `../../projects/boinc.bakerlab.org_rosetta/rosetta_4.08_x86_64-pc-linux-gnu': free(): invalid pointer: 0x0000000005f08e93 ***
*** Error in `../../projects/boinc.bakerlab.org_rosetta/rosetta_4.08_x86_64-pc-linux-gnu': free(): invalid pointer: 0x0000000005f08e93 ***
[...]
** Error in `../../projects/boinc.bakerlab.org_rosetta/rosetta_4.08_x86_64-pc-linux-gnu': free(): invalid pointer: 0x0000000005f08e93 ***
*** Error in `../../projects/boinc.bakerlab.org_rosetta/rosetta_4.08_x86_64-pc-linux-gnu': free(): invalid pointer: 0x0000000005f08e93 ***
*** Error in `../../projects/boinc.bakerlab.org_rosetta/rosetta_4.08_x86_64-pc-linux-gnu': free(): invalid pointer: 0x0000000005f08e93 ***
*** Error in `../../projects/boinc.bakerlab.org_rosetta/rosetta_4.08_x86_64-pc-linux-gnu': free(): invalid pointer: 0x0000000005f08e93 ***
*** Error in `../../projects/boinc.bakerlab.org_rosetta/rosetta_4.08_x86_64-pc-linux-gnu': free(): invalid pointer: 0x0000000005f08e93 ***
*** Error in `../../projects/boinc.bakerlab.org_rosetta/rosetta_4.08_x86_64-pc-linux-gnu': free(): invalid pointer: 0x0000000005f08e93 ***
Starting watchdog...
Watchdog active.
Starting watchdog...
Watchdog active.
======================================================
DONE ::     1 starting structures    28302 cpu seconds
This process generated     41 decoys from      41 attempts
======================================================
BOINC :: WS_max 3.91049e+08

BOINC :: Watchdog shutting down...
03:06:19 (14253): called boinc_finish(0)

</stderr_txt>
]]>


1118219840

Task itself completed succesfuly and has been validated, but still those errors doesn't look like something normal. In MiniRosetta there are no such errors.[/url]
ID: 91601 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2140
Credit: 41,518,559
RAC: 12,941
Message 91630 - Posted: 31 Jan 2020, 1:25:06 UTC
Last modified: 31 Jan 2020, 1:26:04 UTC

Validators have been down for about 12 hours

Anyone around to give bwsrv2 a kick?

93k tasks awaiting validation - 30 of them mine
ID: 91630 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2140
Credit: 41,518,559
RAC: 12,941
Message 91634 - Posted: 31 Jan 2020, 14:12:36 UTC - in response to Message 91630.  

Validators have been down for about 12 hours

Anyone around to give bwsrv2 a kick?

93k tasks awaiting validation - 30 of them mine

No change after 24hrs - bwsrv2 still down

185, 069 tasks awaiting validation
ID: 91634 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 25 · 26 · 27 · 28 · 29 · 30 · 31 . . . 309 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org