Rosetta 4.1+

Message boards : Number crunching : Rosetta 4.1+

To post messages, you must log in.

AuthorMessage
Admin
Project administrator

Send message
Joined: 1 Jul 05
Posts: 4510
Credit: 0
RAC: 0
Message 88767 - Posted: 27 Apr 2018, 23:24:44 UTC

Please post issues regarding Rosetta and Rosetta android 4.1+ application versions here.
ID: 88767 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Emerald42

Send message
Joined: 9 Jun 08
Posts: 9
Credit: 1,255,484
RAC: 65
Message 88770 - Posted: 28 Apr 2018, 10:32:53 UTC
Last modified: 28 Apr 2018, 10:54:54 UTC

1st issue

On one device there's download the files, but won't crunsh anything. Says "download error".

Android 7.1.x 64-bit Amlogic

On Android 6.0 Marshmellow NO ISSUES, it crunshes well.
ID: 88770 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1008
Credit: 3,715,941
RAC: 1,187
Message 88772 - Posted: 28 Apr 2018, 17:37:18 UTC - in response to Message 88770.  

Hmmm, any other information? Let us know if it persists.
ID: 88772 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
supdood

Send message
Joined: 3 Aug 15
Posts: 6
Credit: 189,013
RAC: 0
Message 88910 - Posted: 15 May 2018, 16:37:54 UTC

I had a whole batch of WUs error with 30 seconds runtime and 10 seconds CPU time. The device does not normally have any issues (has over 100k credits).

Rosetta for Android v4.10 running on Android 6.0.1

<core_client_version>7.4.53</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)
</message>
<stderr_txt>
command: ../../projects/boinc.bakerlab.org_rosetta/rosetta_android_4.10_arm-android-linux-gnu @cispro_backbone_scaffold_test_7res_PPXXXXX.flags -nstruct 10000 -cpu_run_time 14400 -watchdog -boinc:max_nstruct 600 -checkpoint_interval 120 -mute all -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -run::rng mt19937 -constant_seed -jran 3911373

ERROR: in::file::boinc_wu_zip cispro_backbone_scaffold_7res_PPXXXXX.zip does not exist!
ERROR:: Exit from: src/apps/public/boinc/minirosetta.cc line: 180
BOINC:: Error reading and gzipping output datafile: default.out
called boinc_finish(1)

</stderr_txt>
]]>
ID: 88910 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
James W

Send message
Joined: 25 Nov 12
Posts: 38
Credit: 424,768
RAC: 189
Message 88912 - Posted: 16 May 2018, 7:08:04 UTC
Last modified: 16 May 2018, 7:19:58 UTC

Application version Rosetta for Android v4.10
Device 3396190, Task 998505726, and WU 899502854.
Status: Error while computing.
Errors: Too many errors (may have bug). Too many total results.
Exit status: 1 (0x00000001) Unknown error code

CPU type ARM AArch64 Processor rev 3 (aarch64)
Operating System Android 3.18.19-g407ac4f82ff (Android 5.1.1)
Device: Amazon Fire tablet
<message> process exited with code 1 (0x1, -255) </message>
WARNING: linker: ../../projects/boinc.bakerlab.org_rosetta/rosetta_android_4.10_arm-android-linux-gnu: unused DT entry: type 0x6ffffffe arg 0x26cc
WARNING: linker: ../../projects/boinc.bakerlab.org_rosetta/rosetta_android_4.10_arm-android-linux-gnu: unused DT entry: type 0x6fffffff arg 0x2

ERROR: in::file::boinc_wu_zip cispro_backbone_scaffold_7res_PPXXXXX.zip does not exist!
ERROR:: Exit from: src/apps/public/boinc/minirosetta.cc line: 180
BOINC:: Error reading and gzipping output datafile: default.out called boinc_finish(1)

Note that mine was 2nd host to try to process this WU, with basically the same error other than the WARNINGs above.

Also had same error with 2 other WUs:
899502576
899503341
ID: 88912 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Usuario1_S

Send message
Joined: 24 Mar 14
Posts: 88
Credit: 3,059,705
RAC: 500
Message 89161 - Posted: 27 Jun 2018, 19:36:49 UTC

I think it's not as light as it should be, I set BOINC to run 100% of CPUs= 1=8 Cores, and 100% of the time, the processes run on Low Priority on Win 8.1 64-bit, btu they hog the computer sometimes, not as responsive as it used to be, so please fix this, same thing on the Android client, if you make it really transparent as it used to be a few months/years ago would be better for everyone, because sometimes I need to pause the computation and I forget about it, and BOINC resumes after 1 hour later, so I'd like to avoid to pause BOINC
ID: 89161 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
nostremitus

Send message
Joined: 7 May 18
Posts: 2
Credit: 122,055
RAC: 1
Message 89162 - Posted: 28 Jun 2018, 8:34:30 UTC
Last modified: 28 Jun 2018, 8:37:44 UTC

Double post
ID: 89162 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
nostremitus

Send message
Joined: 7 May 18
Posts: 2
Credit: 122,055
RAC: 1
Message 89163 - Posted: 28 Jun 2018, 8:35:35 UTC

No errors, but I've stopped receiving tasks.

I finished out a queue overnight and I'm not getting any new ones from the server. Running on Android.
ID: 89163 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
djoser

Send message
Joined: 7 Apr 18
Posts: 4
Credit: 50,340
RAC: 0
Message 89169 - Posted: 28 Jun 2018, 15:21:26 UTC - in response to Message 89163.  

There are currently no tasks available for Android.
See server status page...
Why mine when you can research? - GRIDCOIN - Real cryptocurrency without wasting hashes! www.gridcoin.us
ID: 89169 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Peter

Send message
Joined: 24 Mar 18
Posts: 1
Credit: 964,825
RAC: 577
Message 89664 - Posted: 29 Sep 2018, 23:54:43 UTC

What has happened to Android WU ?
Nothing for over a week now.
No WU unsent, very few in progress.
ID: 89664 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Daedalus

Send message
Joined: 1 Aug 08
Posts: 14
Credit: 4,557,810
RAC: 5,202
Message 89667 - Posted: 30 Sep 2018, 14:40:19 UTC

Look here: http://boinc.bakerlab.org/rosetta/server_status.php

There are no work units available for android it seems. 204 units running with 13 users in total. While there are around 18 000 users running PC and Mac
ID: 89667 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
rjs5

Send message
Joined: 22 Nov 10
Posts: 226
Credit: 5,690,987
RAC: 3,502
Message 89668 - Posted: 30 Sep 2018, 16:01:53 UTC - in response to Message 89667.  

Look here: http://boinc.bakerlab.org/rosetta/server_status.php

There are no work units available for android it seems. 204 units running with 13 users in total. While there are around 18 000 users running PC and Mac


Right now, there are
0 Rosetta Android 4.0 jobs in queue for 204 users and
3 Rosetta 4.0 jobs in the queue for 8192 PC/MAC users (I am not sure what the average number of computers per "user" is).
14,985 Rosetta 3.78 jobs in the queue.

Android cannot run the 3.78 jobs. My guess is that the researchers can select either 3.78 or 4.0 applications and many select the 3.78 because they trust their jobs will complete. There has been a regular failure of the 4.0 jobs on the newer Linux distributions.

Once David gets the new Rosetta with the putenv() fix qualified, I suspect that there will be less researcher focus on running Rosetta 3.78 jobs. I don't know why the server needed to distinguish between any of the 4.0 application computers and should be able to just deploy to the next requesting machine.

The Rosetta job dispatcher, however, does not seem to be one of the more robust or automated implementations.
ID: 89668 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 937
Credit: 17,091,759
RAC: 15,580
Message 89689 - Posted: 4 Oct 2018, 1:10:23 UTC

The server Status page finally shows Android tasks - 4655 right now

I currently can't get any because my buffer is full of WCG tasks, but they should start coming through very soon, hopefully
ID: 89689 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
rjs5

Send message
Joined: 22 Nov 10
Posts: 226
Credit: 5,690,987
RAC: 3,502
Message 89692 - Posted: 4 Oct 2018, 15:42:39 UTC - in response to Message 89689.  

The server Status page finally shows Android tasks - 4655 right now

I currently can't get any because my buffer is full of WCG tasks, but they should start coming through very soon, hopefully



When I want to fix that, I just SUSPEND the project (WCG in this case) for a few seconds and disrupt the scheduler. It downloads the new WU and then rotates between them. BUT, .... I have my buffer set at 0.1 day's work AND several backup projects at 0 resource.
ID: 89692 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 937
Credit: 17,091,759
RAC: 15,580
Message 89694 - Posted: 4 Oct 2018, 23:18:55 UTC - in response to Message 89692.  

The Server Status page finally shows Android tasks - 4655 right now

I currently can't get any because my buffer is full of WCG tasks, but they should start coming through very soon, hopefully

When I want to fix that, I just SUSPEND the project (WCG in this case) for a few seconds and disrupt the scheduler. It downloads the new WU and then rotates between them. BUT... I have my buffer set at 0.1 day's work AND several backup projects at 0 resource.

I couldn't be bothered at the time, but I've set WCG to No New Tasks now.

And, of course, all Android tasks have gone to others and no more have come through, so this situation could linger further. Everyone stand down...

Ah well, I'll take another glance in the morning.
ID: 89694 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 937
Credit: 17,091,759
RAC: 15,580
Message 89695 - Posted: 4 Oct 2018, 23:25:32 UTC - in response to Message 89694.  

Tell a lie, 4 came down and I already completed them during the day, only to find none further to grab. Ok, that's not quite so bad. At least I got my share while they were available.
ID: 89695 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 937
Credit: 17,091,759
RAC: 15,580
Message 89707 - Posted: 9 Oct 2018, 0:30:22 UTC - in response to Message 89695.  

Finally took a look at my android tasks - still none, but a quick update brought 4 down and Server Status shows ~10k to grab.

Could we be back on stream?
ID: 89707 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Daedalus

Send message
Joined: 1 Aug 08
Posts: 14
Credit: 4,557,810
RAC: 5,202
Message 89717 - Posted: 12 Oct 2018, 11:26:54 UTC

Back to regular flow. :)
ID: 89717 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Number crunching : Rosetta 4.1+



©2018 University of Washington
http://www.bakerlab.org