Posts by Luigi R.

21) Message boards : Number crunching : Computation Error (Message 80299)
Posted 27 Jun 2016 by Luigi R.
Post:
Sorry, I don't know. It's weird some successes occurred too.
22) Message boards : Number crunching : Rosetta Mini for Android is not available for your type of computer. (Message 80297)
Posted 27 Jun 2016 by Luigi R.
Post:
I can meet the deadlines.

That's good, but a bit surprising if you hold a 7-day buffer but half the tasks have a deadline of 2 or 5 days rather than the more usual 14 days


Deadline of 2 is rare.
5 days is enough to complete 240 4-hour tasks.
If I set 7 days of work queue, it's not my mistake whether BOINC software downloads work that's expiring in 5 days. Indeed very few tasks were going to expire in 5 days.
As a last resort if I want to meet deadlines and I can't do it, I could switch to 1-hour configuration and update preferences. Then all the starting tasks will get cpu_run_time_pref=3600. In my case the required time to complete tasks would be divided by 4.

Finally I don't think my '24/7 effort' is comparable to server work to generate tasks and according to me a deadline of 5 days doesn't mean "hurry up!". Some administrators of other projects say "feel free to abort tasks if you need" too.
I don't know if a participation by this behaviour could damage Rosetta@home project, but I think not.
23) Message boards : Number crunching : Stuck on Uploading (Message 80290)
Posted 25 Jun 2016 by Luigi R.
Post:
I am running BOINC on Linux machines headless. After updating /etc/hosts with 128.95.160.145 srv1.bakerlab.org they still weren't able to upload recent work. I control BOINC through boinctui but couldn't find a way to force retrying of upload. The solution I found was to restart BOINC after the hosts file edit, then the uploads retried, and successfully uploaded. On a recent Debian system or derivative (Ubuntu and more), BOINC is restarted from the terminal by running sudo systemctl restart boinc.

Yes, on Xubuntu you didn't need restarting BOINC to get hosts modifications effective and I'm not running BOINC as a service.

You could have tried
$boinccmd --get_file_transfers

Then
$boinccmd --file_transfer http://boinc.bakerlab.org/rosetta $filename retry

where $boinccmd is your boinccmd command including path and $filename is the result you want try to reupload.

Or simply
$boinccmd --set_network_mode never
$boinccmd --set_network_mode auto

and retry should be automatic.

See boinccmd.


Without making any changes to any hosts files and simply waiting (While still crunching at least), all my completed tasks finally uploaded and I got credit for them. Looks like they fixed it.

That solution was recommended for people having near-deadline tasks. Some robetta (rb_*) tasks were expiring on 24/06.
24) Message boards : Number crunching : Computation Error (Message 80289)
Posted 25 Jun 2016 by Luigi R.
Post:
It looks like there is no db_set3_7res_android_d_c.20.10_0001_SAVE_ALL_OUT_344080_2094_1_0 file. Maybe no read permissions for some reason?
25) Message boards : Number crunching : Stuck on Uploading (Message 80275)
Posted 24 Jun 2016 by Luigi R.
Post:
99% of the users will forget that they have made this change an will later have another problems only caused by this solution!

Maybe not. These servers will be both online, so there could be no difference in the future. For forgetful users it could be fine to edit hosts file to upload only and to comment that line right after upload process.

I'm agree we should not do any special configuration. We should run BOINC only.

Not our problem, but we want our work to get validated too.
26) Message boards : Number crunching : Stuck on Uploading (Message 80273)
Posted 24 Jun 2016 by Luigi R.
Post:
..., we're all on the same team here man :)

No! That is wrong, they need us, we don't.
They get money for the job. I spend money, for hardware and for power. I spend my spare time for maintenance of my systems to keep them always crunching.
Later, in an hopefully no so far future, if the first results come to the market, pay i again to get the medicine or whatever.

I share this point, but - you know - BOINC is plenty of medicine projects. If Rosetta@home is not always reliable or doesn't meet your demands, you can choose another project to place side by side or to replace it.

For example I don't run Rosetta@home tasks very much cause of inefficiency, but it is off-topic. Anyway 3.73 app sounds more cpu-intensive to me.
27) Message boards : Number crunching : Stuck on Uploading (Message 80272)
Posted 24 Jun 2016 by Luigi R.
Post:
Change of hosts.txt is a stupid idea! There should be another way.

No, it's not stupid. It worked fine for me. I uploaded and got validated 14 4-hour tasks, that's 56 hours of computing.

On Linux is /etc/hosts.

Replace of "srv1.bakerlab.org" in client_state.xml is useless. Whenever i restart the boinc manager is it again there. I can't find from where boinc restore this.
i have it also replace in client_state_prev.xml.

Yep, that is what I was worried about.

Thanks your lazy guys have i now two WU's lost! 24 hours of work for nothing.

See above.
28) Message boards : Number crunching : Stuck on Uploading (Message 80267)
Posted 24 Jun 2016 by Luigi R.
Post:
Thanks Timo! I just tried it and it works for uploads.

Here is the line to add to Windows hosts file to work around the server that is not responding. If you don't know what to do with this information, it would be best to just wait for the issue to be resolved on the server side.

128.95.160.145 srv1.bakerlab.org

Thanks, it works on linux *buntu too.

Put that line in /etc/hosts.
29) Message boards : Number crunching : Stuck on Uploading (Message 80265)
Posted 24 Jun 2016 by Luigi R.
Post:
What about manually modifying client_state.xml? I'm wondering if it could be safe or have some effect something like...

***PLEASE DON'T DO THIS***
Replace
<upload_url>http://srv1.bakerlab.org/rosetta_cgi/file_upload_handler</upload_url>

with
<upload_url>http://srv4.bakerlab.org/rosetta_cgi/file_upload_handler</upload_url>
30) Message boards : Number crunching : Stuck on Uploading (Message 80232)
Posted 23 Jun 2016 by Luigi R.
Post:
Yes, I answered in your previous thread.

https://boinc.bakerlab.org/rosetta/forum_thread.php?id=6849&nowrap=true#80231
31) Message boards : Number crunching : Stuck on "Ready to Report" (Message 80231)
Posted 23 Jun 2016 by Luigi R.
Post:
Currently my host is not uploading any result.

23-Jun-2016 11:45:49 [rosetta@home] Started upload of FFD__462b8a0655b364e9452ad1a759651ef7_abinitioDocking_16_06_16_35_36_globalDocking_4_SAVE_ALL_OUT_383173_1_0_0
23-Jun-2016 11:47:49 [---] Project communication failed: attempting access to reference site
23-Jun-2016 11:47:49 [rosetta@home] Temporarily failed upload of FFD__462b8a0655b364e9452ad1a759651ef7_abinitioDocking_16_06_16_35_36_globalDocking_4_SAVE_ALL_OUT_383173_1_0_0: transient HTTP error
23-Jun-2016 11:47:49 [rosetta@home] Backing off 04:19:42 on upload of FFD__462b8a0655b364e9452ad1a759651ef7_abinitioDocking_16_06_16_35_36_globalDocking_4_SAVE_ALL_OUT_383173_1_0_0
23-Jun-2016 11:47:51 [---] Internet access OK - project servers may be temporarily down.
32) Message boards : Number crunching : Rosetta Mini for Android is not available for your type of computer. (Message 80228)
Posted 23 Jun 2016 by Luigi R.
Post:
I can meet the deadlines.
33) Message boards : Number crunching : Rosetta Mini for Android is not available for your type of computer. (Message 80221)
Posted 22 Jun 2016 by Luigi R.
Post:
Solution: Drop the support for smartphones and tablets and concentrate on enabling AVX2. This should have a much higher priority.

Totally agree.


Anyway a temporary "solution":
-block work request of all project except Rosetta
-increase queue days (e.g. 7)
-flood rosetta server for 2 hours (a request every 4 minutes)


Now I have got more than 400 tasks and my host will not get dried.
34) Message boards : Number crunching : Rosetta Mini for Android is not available for your type of computer. (Message 80218)
Posted 22 Jun 2016 by Luigi R.
Post:
Same problem here.

22-Jun-2016 14:51:23 [rosetta@home] Scheduler request completed: got 0 new tasks
22-Jun-2016 14:51:23 [rosetta@home] No work sent
22-Jun-2016 14:51:23 [rosetta@home] Rosetta Mini for Android is not available for your type of computer.
22-Jun-2016 14:51:23 [rosetta@home] (won't finish in time) BOINC runs 81.4% of time, computation enabled 72.7% of that


That's not true! Indeed, after manual update some minutes later...

22-Jun-2016 15:35:14 [rosetta@home] Scheduler request completed: got 25 new tasks


I need to use a small task queue.
Unfortunately this forces me to switch to another project.
35) Message boards : Number crunching : Weird deadline (Message 79212)
Posted 13 Dec 2015 by Luigi R.
Post:
Maybe they need the results ASAP, so they force "high priority" on client side. Also got one of them, not a real issue, unless someone just planned to turn his machine off for a couple of days.

EDIT: or this might be their way to fix something else.

Well, they sent it to my worst host, a computer that is slow and not often turned on. I would try to end within the deadline.
36) Message boards : Number crunching : Weird deadline (Message 79205)
Posted 13 Dec 2015 by Luigi R.
Post:
I've got a two-days-deadline WU. I'm just curious, how come?

Created: 13 Dec 2015 14:01:30 UTC
Sent: 13 Dec 2015 14:01:49 UTC
Report deadline: 15 Dec 2015 14:01:49 UTC

Link
37) Message boards : Number crunching : validate errors (Message 79191)
Posted 12 Dec 2015 by Luigi R.
Post:
Both of those WUs came from jobs that got cancelled by the researcher (see here and here - note the 'errors' field shows the job was cancelled) after your received the WU in your queue. Generally if you wait a couple of days the claimed credit will be granted on a future sweep of the validator program.

Ok! I thought granted credit was immediately equal to claimed credit when a WU failed and the only credit counter was updated after a couple of days. Glad to know I was wrong.

Situations like this are why David has been looking at tightening up the required 'average TAT' for some jobs. It doesn't appear to me that the BOINC client/server supports any kind of 'remote kill switch' for a given WU once it's been downloaded, so there's no way to prevent said WUs from needlessly running after queued if the job they belong to gets cancelled

I've seen many times workunits to get "cancelled by server" when are no longer needed or for other reasons.

Personally, to improve my average turn around time for my boxes and thus increase the chance that the crunching I'm doing is actually going to contribute to someone's query results, I've actually tightened my queue settings (I basically don't queue any tasks) and my target runtime is a more modest 8 hours.

I usually run default WUs. I was doing some tests and my queue's settings were "min/max reserve of work" = "0.5/0.6 days".
38) Message boards : Number crunching : validate errors (Message 79189)
Posted 11 Dec 2015 by Luigi R.
Post:
App version: Rosetta Mini 3.67

I'm still getting some validate errors from two different i7.

These 24 hours of computing have 0 granted credit... why???
https://boinc.bakerlab.org/rosetta/result.php?resultid=777361677
https://boinc.bakerlab.org/rosetta/result.php?resultid=777391503
39) Message boards : Number crunching : Minirosetta 3.62-3.65 (Message 78804)
Posted 16 Sep 2015 by Luigi R.
Post:
Got many validate errors. :(

[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]


Previous 20



©2024 University of Washington
https://www.bakerlab.org