If You Don't Know Where to Put it, Post it here.

Message boards : Number crunching : If You Don't Know Where to Put it, Post it here.

Sid Celery
Joined: 11 Feb 08
Posts: 1965
Credit: 38,174,417
RAC: 10,123
Message 94784 - Posted: 18 Apr 2020, 17:20:06 UTC - in response to Message 94768.  

My task list here says the tasks I got this morning are the x86_64 64 bit tasks. Yet, my tasks properties say they are i686 tasks. Why is that? Or is it that the tasks are intelligent and decide which app is gonna crunch them? lol ;) Or is it that the server itself does not know which app is gonna crunch the tasks so it shows the better of the 2? lol ;)

Something else I don't quite understand is why, if I'm running a 64 bit processor and a 64 bit OS (Linux Mint), do I run a 32 bit app for these tasks?

I may be out of date, but my understanding is that <all> tasks are 32-bit but some/many have a 64-bit wrapper.
I could easily be wrong - this isn't my strong point at all and lots of things have changed recently.
ID: 94784 · Rating: 0
Mod.Sense
Volunteer moderator
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 94788 - Posted: 18 Apr 2020, 17:52:18 UTC

I noticed the other day that some completed tasks showed they were x86_64 in the WU header, but in the WU output you could see that the command actually ran an i686 executable. So yes, it seems the WU itself doesn't care which app is used.

Do let us know if project reset gets predictable control of the app. I've been concerned about how this cc_config alt_platform stuff is going for Linux folks.
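For readers unfamiliar with the setting mentioned here, the following is a minimal sketch of what a `cc_config.xml` containing an `<alt_platform>` option might look like, generated with Python's standard library. The platform name `x86_64-pc-linux-gnu` is an assumption, chosen by analogy with the `i686-pc-linux-gnu` app name seen in task output; the function name `build_cc_config` is hypothetical, and nothing in this thread confirms which value any poster actually used.

```python
import xml.etree.ElementTree as ET


def build_cc_config(alt_platform: str) -> str:
    """Return the text of a cc_config.xml holding a single alt_platform option."""
    root = ET.Element("cc_config")
    options = ET.SubElement(root, "options")
    # alt_platform names an additional platform for the client to report.
    # The value passed in below is an assumption for illustration only.
    ET.SubElement(options, "alt_platform").text = alt_platform
    return ET.tostring(root, encoding="unicode")


config_text = build_cc_config("x86_64-pc-linux-gnu")
print(config_text)
```

The sketch only builds the text; where the file lives and how the client picks it up depends on the installation, so check the BOINC client configuration documentation before relying on it.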
Rosetta Moderator: Mod.Sense
ID: 94788 · Rating: 0
Mod.Sense
Volunteer moderator
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 94794 - Posted: 18 Apr 2020, 18:59:58 UTC - in response to Message 94788.  

DK tells me they have found and implemented a server-side change that should help machines get the desired app. Please continue watching and reporting.
Rosetta Moderator: Mod.Sense
ID: 94794 · Rating: 0
Siran d'Vel'nahr
Joined: 15 Nov 06
Posts: 72
Credit: 2,674,678
RAC: 0
Message 94872 - Posted: 19 Apr 2020, 13:54:36 UTC

Greetings,

Well one thing is fer sure. I have eliminated the tasks running in high priority doing what I am doing.

Now I'm waiting for my last 8 to finish so I can reset the project and hopefully rid myself from the use of the i686 app. :)

Have a great day! :)

Siran
CAPT Siran d'Vel'nahr XO
USS Vre'kasht NCC-33187

"Logic is the cement of our civilization with which we ascend from chaos using reason as our guide." - T'Plana-hath
ID: 94872 · Rating: 0
Siran d'Vel'nahr
Joined: 15 Nov 06
Posts: 72
Credit: 2,674,678
RAC: 0
Message 94889 - Posted: 19 Apr 2020, 15:32:46 UTC

Greetings;

Is this [marked **] a cause for concern?
<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
command: ../../projects/boinc.bakerlab.org_rosetta/rosetta_4.15_i686-pc-linux-gnu -run:protocol jd2_scripting -parser:protocol jhr_boinc_v3_cart.xml @flags -in:file:silent Junior_HalfRoid_design4_cart_COVID-19_SAVE_ALL_OUT_IGNORE_THE_REST_6lv4ll1f.silent -in:file:silent_struct_type binary -silent_gz -mute all -out:file:silent_struct_type binary -out:file:silent default.out -in:file:boinc_wu_zip Junior_HalfRoid_design4_cart_COVID-19_SAVE_ALL_OUT_IGNORE_THE_REST_6lv4ll1f.zip @Junior_HalfRoid_design4_cart_COVID-19_SAVE_ALL_OUT_IGNORE_THE_REST_6lv4ll1f.flags -nstruct 10000 -cpu_run_time 28800 -watchdog -boinc:max_nstruct 1000 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -run::rng mt19937 -constant_seed -jran 1834344
**Starting watchdog...
**Watchdog active.
**Starting watchdog...
**Watchdog active.
**Starting watchdog...
**Watchdog active.
BOINC:: CPU time: 29394.3s, 14400s + 14400s[2020- 4-18 20:59:37:] :: BOINC 
**WARNING! cannot get file size for default.out.gz: could not open file.
**Output exists: default.out.gz Size: -1
InternalDecoyCount: 0 (GZ)
-----
0
-----
Stream information inconsistent.
Writing W_0000001
======================================================
DONE ::     1 starting structures  29394.3 cpu seconds
This process generated      1 decoys from       1 attempts
======================================================
20:59:37 (2109): called boinc_finish(0)

</stderr_txt>
]]>

This is one of the 20 credit tasks. The non-20 credit tasks do not show as the [**] marked stuff above.

Have a great day! :)

Siran
CAPT Siran d'Vel'nahr XO
USS Vre'kasht NCC-33187

"Logic is the cement of our civilization with which we ascend from chaos using reason as our guide." - T'Plana-hath
ID: 94889 · Rating: 0
Mod.Sense
Volunteer moderator
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 94899 - Posted: 19 Apr 2020, 16:00:01 UTC - in response to Message 94889.  
Last modified: 19 Apr 2020, 16:00:48 UTC

Normal tasks should show "Starting watchdog" and "Watchdog active", and there is usually a "watchdog shutting down" as well. I have always found that last message confusing, because it only means that the watchdog is ending at the end of the work unit; it does not mean the watchdog is ending the computation of the WU.

However...
BOINC:: CPU time: 29394.3s, 14400s + 14400s

This indicates that the watchdog did step in and end this thread. It may be an example of why they are extending the watchdog to 10hrs rather than the 4 hours (14400 seconds) you saw here.
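As a rough illustration of the check described above, here is a minimal Python sketch that pulls the three figures out of the `BOINC:: CPU time` line and compares the CPU time against the sum of the other two. Treating that sum as the limit is an assumption based on the reading given in this post, not a documented rule, and the function name `parse_cpu_time_line` is hypothetical.

```python
import re

# Line copied from the stderr output quoted earlier in the thread.
LINE = "BOINC:: CPU time: 29394.3s, 14400s + 14400s"

PATTERN = re.compile(r"CPU time:\s*([\d.]+)s,\s*([\d.]+)s\s*\+\s*([\d.]+)s")


def parse_cpu_time_line(line):
    """Extract the CPU time and the two following figures from a stderr line."""
    match = PATTERN.search(line)
    if match is None:
        return None
    cpu, first, second = (float(group) for group in match.groups())
    # Assumption: the two figures add up to the limit the watchdog enforces.
    limit = first + second
    return {"cpu_seconds": cpu, "limit_seconds": limit, "over_limit": cpu > limit}


result = parse_cpu_time_line(LINE)
print(result)
```

For the line above, the CPU time (29394.3 s) is larger than 14400 s + 14400 s = 28800 s, which is consistent with the watchdog having stepped in.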
Rosetta Moderator: Mod.Sense
ID: 94899 · Rating: 0
Siran d'Vel'nahr
Joined: 15 Nov 06
Posts: 72
Credit: 2,674,678
RAC: 0
Message 94981 - Posted: 20 Apr 2020, 10:21:45 UTC

Greetings,

It worked! It really, really worked!!! :)

I emptied my cache and reset the project. I got new tasks and set NNT. The ones running are using the x86_64 app. :) I got back from running World of Warcraft and all tasks continued where they left off. I also increased the CPU cores used to 9 and saved the others for SETI. :)

Thanks to all for the help. :)

Have a great day! :)

Siran
CAPT Siran d'Vel'nahr XO
USS Vre'kasht NCC-33187

"Logic is the cement of our civilization with which we ascend from chaos using reason as our guide." - T'Plana-hath
ID: 94981 · Rating: 0
JAMES
Joined: 5 May 07
Posts: 8
Credit: 275,386
RAC: 0
Message 95003 - Posted: 20 Apr 2020, 18:37:12 UTC

Is the project out of COVID19 work? About 3 weeks ago I rejoined this project after several years away. I presently have 12 cores on 3 machines running Rosetta and up till now I have been getting about 40% COVID19 WU’s. In the last 12 hours all new work has been non-COVID WU’s. Are they all gone? Will there be more?
ID: 95003 · Rating: 0
CIA
Joined: 3 May 07
Posts: 100
Credit: 21,059,812
RAC: 0
Message 95013 - Posted: 20 Apr 2020, 20:33:21 UTC - in response to Message 95003.  

COVID work comes and goes: researchers submit jobs, the jobs get crunched, and the researchers then look at the results to decide on the next best course of action. There will most likely be more at some point, once they determine what they need to study next.

It should be noted that all research here (even when not specific to COVID) involves principles that can possibly apply to COVID research, so not seeing "Covid" in the job name doesn't mean you aren't doing immensely helpful work.
ID: 95013 · Rating: 0
Sid Celery
Joined: 11 Feb 08
Posts: 1965
Credit: 38,174,417
RAC: 10,123
Message 95017 - Posted: 20 Apr 2020, 21:53:49 UTC - in response to Message 95013.  

Agree with this very much. I consider the non-COVID-specific tasks even more important as they'll benefit COVID tasks and whatever the next one is down the line.
I didn't even notice tasks with COVID in the name had reduced (they have). Plus, just because they don't have COVID in the name, that doesn't make me think they're non-COVID. It's just a name.
ID: 95017 · Rating: 0
Siran d'Vel'nahr
Joined: 15 Nov 06
Posts: 72
Credit: 2,674,678
RAC: 0
Message 95342 - Posted: 25 Apr 2020, 10:29:56 UTC

Greetings,

Since I started I had not seen a "Validation pending"... until this morning! Woohoo! :) I have 1.

Have a great day! :)

Siran
CAPT Siran d'Vel'nahr XO
USS Vre'kasht NCC-33187

"Logic is the cement of our civilization with which we ascend from chaos using reason as our guide." - T'Plana-hath
ID: 95342 · Rating: 0
CIA
Joined: 3 May 07
Posts: 100
Credit: 21,059,812
RAC: 0
Message 95353 - Posted: 25 Apr 2020, 15:48:23 UTC
Last modified: 25 Apr 2020, 15:50:14 UTC

Quick question about changing work time. I have a few 24/7 machines running. Will changing my time setting affect RAM use? For example, my 8-core machines currently seem to crunch fine with 16 or 32 GB of RAM using the default 8-hour time limit. I never get "Waiting for memory" notices on them. If I swap them to 24-hour tasks, will they use 3 times the RAM since the tasks will now run ~3 times longer? Or will RAM use remain about the same even though time to complete has tripled?

/edit I would drop my work cache down to 0 for these machines, so basically 1 in when 1 goes out, but they would hit the server less since each task would take a day to complete.
ID: 95353 · Rating: 0
Mod.Sense
Volunteer moderator
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 95359 - Posted: 25 Apr 2020, 19:03:58 UTC - in response to Message 95353.  

ID: 95359 · Rating: 0
Siran d'Vel'nahr
Joined: 15 Nov 06
Posts: 72
Credit: 2,674,678
RAC: 0
Message 95407 - Posted: 26 Apr 2020, 20:06:19 UTC

Greetings,

Here's a good question for y'all:

I have 8 tasks on my Error list. 6 of those tasks have credit attached to them. How is that possible?

Have a great day! :)

Siran
CAPT Siran d'Vel'nahr XO
USS Vre'kasht NCC-33187

"Logic is the cement of our civilization with which we ascend from chaos using reason as our guide." - T'Plana-hath
ID: 95407 · Rating: 0
Admin
Project administrator
Joined: 1 Jul 05
Posts: 4805
Credit: 0
RAC: 0
Message 95410 - Posted: 26 Apr 2020, 21:57:37 UTC - in response to Message 95407.  

We are awarding credit now even for jobs that were canceled, with errors, and invalid.
ID: 95410 · Rating: 0
Sid Celery
Joined: 11 Feb 08
Posts: 1965
Credit: 38,174,417
RAC: 10,123
Message 95414 - Posted: 27 Apr 2020, 7:14:32 UTC - in response to Message 95410.  

Awarding credit for Validation errors makes sense as it's not the fault of users if jobs completed but something went wrong at validation.

And for jobs with Computation errors, the cause may be, but just as often isn't, an error on the host system, so again it seems right that credit is awarded for the CPU time completed.

What I didn't expect, but actually makes sense now I think about it, is what looks like a default 10 credits being awarded for jobs cancelled by server.
Meaning they've been downloaded in good faith but, for some reason at the project end, need to be withdrawn before running. That shows good will at least.

If you carry on like this, there'll be nothing for us to moan about. Fair play.

Wondering what to do with myself now...
ID: 95414 · Rating: 0
Siran d'Vel'nahr
Joined: 15 Nov 06
Posts: 72
Credit: 2,674,678
RAC: 0
Message 95416 - Posted: 27 Apr 2020, 9:49:43 UTC - in response to Message 95410.  

Hi Admin,

Thanks! That answers my question. :)

Have a great day! :)

Siran
CAPT Siran d'Vel'nahr XO
USS Vre'kasht NCC-33187

"Logic is the cement of our civilization with which we ascend from chaos using reason as our guide." - T'Plana-hath
ID: 95416 · Rating: 0
Siran d'Vel'nahr
Joined: 15 Nov 06
Posts: 72
Credit: 2,674,678
RAC: 0
Message 95468 - Posted: 28 Apr 2020, 9:40:05 UTC

Greetings,

Holy moley! What is up with this task? That is a BOAT load of errors!

Have a great day! :)

Siran
CAPT Siran d'Vel'nahr XO
USS Vre'kasht NCC-33187

"Logic is the cement of our civilization with which we ascend from chaos using reason as our guide." - T'Plana-hath
ID: 95468 · Rating: 0
Siran d'Vel'nahr
Joined: 15 Nov 06
Posts: 72
Credit: 2,674,678
RAC: 0
Message 95479 - Posted: 28 Apr 2020, 14:37:32 UTC

Greetings,

Uh oh! I believe I just discovered a problem:

Ok, I cannot figure out how to copy/paste from the tasks tab in BOINC. CTRL-C does not work and there is no pop up that shows when right clicking. So I will have to do my best to explain what I found.

I have 9 tasks running. Tasks 2 through 9 are showing the progress bar with various %s and I can see the numbers increase as they run. The 1st, however, seems to be stuck in a causality loop. It's been running for 2 hrs and 48 mins yet the progress bar is showing 0.455%. Task #2 has been running 10 minutes less than #1 and is at 53%. Do I need to abort task #1 or should I just let it run to see what happens?

Here we go, thought of another way to show:

The estimated remaining time is not decreasing either.

Have a great day! :)

Siran
CAPT Siran d'Vel'nahr XO
USS Vre'kasht NCC-33187

"Logic is the cement of our civilization with which we ascend from chaos using reason as our guide." - T'Plana-hath
ID: 95479 · Rating: 0
Siran d'Vel'nahr
Joined: 15 Nov 06
Posts: 72
Credit: 2,674,678
RAC: 0
Message 95485 - Posted: 28 Apr 2020, 15:50:42 UTC - in response to Message 95479.  

Greetings,

Well, I just discovered something new about that task that is not showing any progress:

It is showing a minute increase in progress. It's gone from 0.455% to 0.457%. And that remaining estimate has taken off. It is going UP. It is currently at 38 days 3 hours and counting.

This is weird! It should be running in PANIC mode since the deadline is this Friday May 1st. Should I abort or hope it settles down?
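One possible way to see why the remaining estimate climbs when progress barely moves: the sketch below assumes the estimate is a plain linear extrapolation from elapsed time and fraction done. That is an assumption for illustration only (the BOINC client's actual estimator may work differently), and `naive_remaining_seconds` is a hypothetical helper. The inputs are the figures reported for tasks #1 and #2 earlier in this thread.

```python
def naive_remaining_seconds(elapsed_seconds: float, fraction_done: float) -> float:
    """Linear extrapolation: remaining = elapsed * (1 - f) / f."""
    if not 0 < fraction_done <= 1:
        raise ValueError("fraction_done must be in (0, 1]")
    return elapsed_seconds * (1 - fraction_done) / fraction_done


# Task #1: 2 h 48 min elapsed at 0.455 % progress.
stuck = naive_remaining_seconds((2 * 60 + 48) * 60, 0.00455)
# Task #2: ten minutes less elapsed, at 53 % progress.
healthy = naive_remaining_seconds((2 * 60 + 38) * 60, 0.53)

print(f"task #1: about {stuck / 86400:.1f} days remaining")
print(f"task #2: about {healthy / 3600:.1f} hours remaining")
```

Under this assumption, with the fraction stuck near 0.455 %, every additional second of run time adds roughly 219 seconds to the estimate, so the figure keeps growing until progress picks up.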

Have a great day! :)

Siran
CAPT Siran d'Vel'nahr XO
USS Vre'kasht NCC-33187

"Logic is the cement of our civilization with which we ascend from chaos using reason as our guide." - T'Plana-hath
ID: 95485 · Rating: 0



©2024 University of Washington
https://www.bakerlab.org