Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 25 · 26 · 27 · 28 · 29 · 30 · 31 . . . 56 · Next

AuthorMessage
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 1262
Credit: 23,899,127
RAC: 6,765
Message 90685 - Posted: 17 Apr 2019, 22:25:50 UTC

Validation seems to be offline for the last half hour
ID: 90685 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 1262
Credit: 23,899,127
RAC: 6,765
Message 90686 - Posted: 18 Apr 2019, 0:24:03 UTC - in response to Message 90685.  

Validation seems to be offline for the last half hour

And back about 30mins ago, I think
ID: 90686 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 104
Credit: 3,640,091
RAC: 7,681
Message 90706 - Posted: 21 Apr 2019, 20:44:07 UTC - in response to Message 90686.  

Validation seems to be offline for the last half hour

And back about 30mins ago, I think


And off again since 04:00 this morning
ID: 90706 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Trevor ct

Send message
Joined: 7 Oct 14
Posts: 2
Credit: 14,056,393
RAC: 18,916
Message 90964 - Posted: 2 Aug 2019, 15:19:23 UTC

Rosetta 4.07 work tasks are reporting 'computational error' immediately they are opened on one of my two computers. Only known difference is affected computer BOINC version is 7.14.2 and the non-affected is earlier version 7.6.33.

I cannot discover how to revert version as a trial.

Trevor ct
ID: 90964 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 424
Credit: 19,046,906
RAC: 83,879
Message 90965 - Posted: 2 Aug 2019, 16:14:58 UTC - in response to Message 90964.  

Rosetta 4.07 work tasks are reporting 'computational error' immediately they are opened on one of my two computers. Only known difference is affected computer BOINC version is 7.14.2 and the non-affected is earlier version 7.6.33.

It is not the BOINC version. I see the errors too (one on each of two Ubuntu machines), and they are both running BOINC 7.14.2.
ID: 90965 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
LarryMajor

Send message
Joined: 1 Apr 16
Posts: 22
Credit: 31,261,057
RAC: 0
Message 90966 - Posted: 2 Aug 2019, 20:47:19 UTC - in response to Message 90965.  

Yeah, I got a bunch of these on different machines, and they all fail when they are resent to someone else.
It's the work units, not your computer.
ID: 90966 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Trevor ct

Send message
Joined: 7 Oct 14
Posts: 2
Credit: 14,056,393
RAC: 18,916
Message 90968 - Posted: 3 Aug 2019, 21:30:12 UTC - in response to Message 90966.  

Thank you. Comforting I am not running a rogue program.
ID: 90968 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 1262
Credit: 23,899,127
RAC: 6,765
Message 90972 - Posted: 5 Aug 2019, 4:00:19 UTC

Current server status

rah_assimilator_rosetta1 (rosetta) bwsrv2 Running
rah_assimilator_rosetta2 (rosetta) bwsrv2 Not Running
rah_assimilator_mini1 (minirosetta) bwsrv2 Not Running
rah_assimilator_mini2 (minirosetta) bwsrv2 Running
rah_validator_rosetta1 (rosetta) bwsrv2 Not Running
rah_validator_rosetta2 (rosetta) bwsrv2 Running
rah_validator_mini1 (minirosetta) bwsrv2 Not Running
rah_validator_mini2 (minirosetta) bwsrv2 Running
file_deleter1 bwsrv2 Running
file_deleter2 bwsrv2 Running
db_purge bwsrv2 Not Running

Workunits waiting for validation 2057
Workunits waiting for assimilation 1031

I have some unvalidated tasks held up here - while others go straight through.

Any chance of a bump please?
ID: 90972 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 1262
Credit: 23,899,127
RAC: 6,765
Message 90976 - Posted: 5 Aug 2019, 16:40:26 UTC - in response to Message 90972.  

The whole of Server BWSRV2 is not running right now. Hopefully that means it's being worked on

Workunits waiting for validation 62381
Workunits waiting for assimilation 11088
ID: 90976 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
JohnH

Send message
Joined: 25 Mar 13
Posts: 43
Credit: 1,899,280
RAC: 672
Message 90977 - Posted: 5 Aug 2019, 17:57:43 UTC

rah_assimilator_rosetta1 (rosetta) bwsrv2 Not Running
rah_assimilator_rosetta2 (rosetta) bwsrv2 Not Running
rah_assimilator_mini1 (minirosetta) bwsrv2 Not Running
rah_assimilator_mini2 (minirosetta) bwsrv2 Not Running
rah_validator_rosetta1 (rosetta) bwsrv2 Not Running
rah_validator_rosetta2 (rosetta) bwsrv2 Not Running
rah_validator_mini1 (minirosetta) bwsrv2 Not Running
rah_validator_mini2 (minirosetta) bwsrv2 Not Running
file_deleter1 bwsrv2 Not Running
file_deleter2 bwsrv2 Not Running
db_purge bwsrv2 Not Running

Anybody minding the store ???
ID: 90977 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1096
Credit: 4,563,054
RAC: 4,219
Message 90979 - Posted: 5 Aug 2019, 20:09:39 UTC - in response to Message 90977.  

Anybody minding the store ???


Maybe all the people is here: RosettaCon 2019
ID: 90979 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 1262
Credit: 23,899,127
RAC: 6,765
Message 90983 - Posted: 6 Aug 2019, 0:20:53 UTC - in response to Message 90979.  

Anybody minding the store ???

Maybe all the people is here: RosettaCon 2019

Oh! Could be. Meanwhile...

Workunits waiting for validation 111762
Workunits waiting for assimilation 12405
ID: 90983 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 1262
Credit: 23,899,127
RAC: 6,765
Message 90993 - Posted: 6 Aug 2019, 20:44:56 UTC - in response to Message 90976.  

The whole of Server BWSRV2 is not running right now. Hopefully that means it's being worked on

Workunits waiting for validation 62381
Workunits waiting for assimilation 11088

Aaaannnnddd... we're back. Sort of.

bwsvr2 running, validation all up to date, assimilation not yet but I don't actually know what that does tbh

Workunits waiting for validation 6
Workunits waiting for assimilation 61639
ID: 90993 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 1262
Credit: 23,899,127
RAC: 6,765
Message 91308 - Posted: 28 Oct 2019, 10:25:23 UTC

BWSRV2 not running and seems to have been down for 5 or 6 hours.

Task validation is building up - currently at 49905
ID: 91308 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 1262
Credit: 23,899,127
RAC: 6,765
Message 91312 - Posted: 29 Oct 2019, 10:35:30 UTC - in response to Message 91308.  

BWSRV2 not running and seems to have been down for 5 or 6 hours.

Task validation is building up - currently at 49905

Noticed earlier this morning everything's back running and all my validation is caught up. Thanks for fixing.
ID: 91312 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
danny

Send message
Joined: 2 Nov 19
Posts: 1
Credit: 0
RAC: 0
Message 91328 - Posted: 2 Nov 2019, 18:36:32 UTC - in response to Message 80897.  

Excellent. Thanks.
ID: 91328 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Stevie G

Send message
Joined: 15 Dec 18
Posts: 18
Credit: 107,428
RAC: 271
Message 91353 - Posted: 11 Nov 2019, 15:10:00 UTC

Rosetta is doing it again. Lots of wasted computer time and it's pissing me off.

1104110555 993281955 3551508 11 Nov 2019, 3:13:36 UTC 11 Nov 2019, 14:03:47 UTC Error while computing 32,815.09 29,296.58 --- Rosetta v4.07
windows_x86_64
windows_intelx86

1102262280 992814189 3551508 30 Oct 2019, 13:29:52 UTC 31 Oct 2019, 8:17:27 UTC Error while computing 31,262.13 29,037.15 --- Rosetta v4.07
windows_x86_64

1102262227 992814166 3551508 30 Oct 2019, 13:29:52 UTC 4 Nov 2019, 18:17:19 UTC Error while computing 33,356.00 29,107.82 --- Rosetta v4.07
windows_x86_64

Steven Gaber
Oldsmar, FL
ID: 91353 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Guy PF Masevaux

Send message
Joined: 12 Oct 19
Posts: 2
Credit: 2,523,372
RAC: 579
Message 91469 - Posted: 19 Dec 2019, 7:55:48 UTC - in response to Message 80621.  

I must put 3 of my computers at cleaning
The perform is Under usual hability to compute
It overheat same with only 40% of processors with only 25 % power for each processors
now came the time to eliminate the dust
I am happy because the cruncher solved over 12 millions tasks this year
Happy Crunching
Happy Christmas feasts
Guy PFLIEGER
ID: 91469 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Hal Bregg

Send message
Joined: 16 Sep 18
Posts: 2
Credit: 379,764
RAC: 98
Message 91546 - Posted: 13 Jan 2020, 22:31:24 UTC

One of the WUs errored out after 5hrs of running. It should run around 1 hour as per project settings.

Name 	rb_01_09_13386_13515_ab_t000__robetta_cstwt_5.0_FT_IGNORE_THE_REST_07_11_885102_10_1

Exit status 	139 (0x0000008B) Unknown error code


<core_client_version>7.9.3</core_client_version>
<![CDATA[
<message>
process got signal 11</message>
<stderr_txt>
command: ../../projects/boinc.bakerlab.org_rosetta/rosetta_4.07_i686-pc-linux-gnu @rb_01_09_13386_13515_ab_t000__robetta_FLAGS -in::file::fasta t000_.fasta -psipred_ss2 t000_.spider3_ss2 -kill_hairpins t000_.nobuformat.spider3_ss2 -jumps:pairing_file t000_.fasta.bbcontacts.jumps -abinitio::use_filters false -skip_convergence_check -jumps:overlap_chainbreak -seq_sep_stages 1 1 1 -ramp_chainbreaks -sep_switch_accelerate 0.8 -jumps:random_sheets 5 4 1 -constraints::cst_file t000_.fasta.CB.cst -constraints:cst_weight 5.0 -constraints::cst_fa_file t000_.fasta.MIN.cst -constraints:cst_fa_weight 5.0 -in:file:boinc_wu_zip rb_01_09_13386_13515_ab_t000__robetta.zip -frag3 rb_01_09_13386_13515_ab_t000__robetta.200.3mers.index.gz -fragA rb_01_09_13386_13515_ab_t000__robetta.200.11mers.index.gz -fragB rb_01_09_13386_13515_ab_t000__robetta.200.7mers.index.gz -nstruct 10000 -cpu_run_time 28800 -watchdog -boinc:max_nstruct 600 -checkpoint_interval 120 -mute all -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -run::rng mt19937 -constant_seed -jran 1009779
Starting watchdog...
Watchdog active.
BOINC:: CPU time: 18038.9s, 14400s + 3600s[2020- 1-13  6:54: 3:] :: BOINC 
WARNING! cannot get file size for default.out.gz: could not open file.
Output exists: default.out.gz Size: -1
InternalDecoyCount: 0 (GZ)
-----
0
-----
Stream information inconsistent.
Writing W_0000001
======================================================
DONE ::     1 starting structures  18038.9 cpu seconds
This process generated      1 decoys from       1 attempts
======================================================
06:54:03 (24943): called boinc_finish(0)

</stderr_txt>
]]>

ID: 91546 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 1262
Credit: 23,899,127
RAC: 6,765
Message 91549 - Posted: 15 Jan 2020, 2:33:31 UTC

New tasks are a bit stopstart today. No tasks earlier today, then a batch of new ones came through, but empty again right now.
ID: 91549 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 25 · 26 · 27 · 28 · 29 · 30 · 31 . . . 56 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2020 University of Washington
https://www.bakerlab.org