Posts by QueueNut

1) Message boards : Number crunching : Temporarily failed upload of (WU name):HTTP error (Message 66826)
Posted 9 Jul 2010 by QueueNut
Post:
I did not deinstall/reinstall the BOINC Manager (BTW does that wipe out all completed and "upload pending" WUs?)


That's a good question. I didn't dare trying to udpade the BOINC-Manager as long as there are tasks.
I would like to know as well, if this is an option somce this would save me the time it takes to let Rosetta run out of work before updating the BOINC-Manager.

Back to your problem: Did you change anything in your router or add any hosts-entries to the Windows hosts file? Any other changes made to the system?

If this happens again, please try to ping and tracert boinc.bakerlab.org
and srv4.bakerlab.org. This should at least indicate wether this is a local problem or a problem with your provider (sometimes DNS-caching gets messed up).

cu

Joe


Hi Joe -

No changes to hosts file, no changes to router, for >1 year.
Installed Western Digital (WD) disk diagnostics and Acronis True Image WD Edition. Without other evidence, must conclude this software is irrelevant to BOINC and Rosetta.

Both bakerlab.org FQDNs you mention ping successfully, as expected.

Update on behavior last night: After aborting transfer (upload) of a handful of completed WUs, BOINC uploaded many but not all completed WUs. It stopped before the last ~50+ were uploaded.

Update late this afternoon: Completed WUs are piling up again, 100+ now. Retry of transfers does not respond successfully. Messages once again state:

"Started upload of (WU name)"
"Project communication failed: attempting access to reference site"
"Temporarily failed uploaded of (WU name):HTTP error"
"Backing off (time delta) on upload of (WU name)"
"Internet access OK - project servers may be temporarily down."

Pinged those two bakerlab.org FQDNs again, successfully.

Aborted a transfer of a completed WU. Retry Transfer. Failed again, same error messages as above. Tried this same sequence one at a time, 8 WUs, no change.

Also observed no new WUs are being downloaded. Only 5 WUs remain to be processed - yet preferences are set for a queue of 4 days worth (Core i7 920 processor).

Commanded an Update at Project tab. This reported 3 completed WUs in the Messages log. Remaining are 100 completed WUs to upload, in Retry status.


QueueNut
2) Message boards : Number crunching : Temporarily failed upload of (WU name):HTTP error (Message 66811)
Posted 8 Jul 2010 by QueueNut
Post:
Under Transfers are those 200+ completed WUs - in perpetual Retry in nn:nn:nn with a project backoff time.

I aborted transfer of first one and then a second WU. BOINC reattempts the upload, fails and injects a gradually increasing project backup time delta for each retry and failure.

"Retry..Upload Pending..Uploading.." then backoff to retry again later (which gradually increases).

Finally after aborted 6-8 of these 200+ pending WU uploads, BOINC began to successfully upload one at a time.

Once again, this problem behavior did not change after system reboot.

I did not deinstall/reinstall the BOINC Manager (BTW does that wipe out all completed and "upload pending" WUs?)


QueueNut
3) Message boards : Number crunching : Temporarily failed upload of (WU name):HTTP error (Message 66810)
Posted 8 Jul 2010 by QueueNut
Post:
Which OS? Try disabling the firewall temporarily, if any is running. Just to make sure, it's not the firewall causing the problems.


The operating system is Windows XP SP3.

After disabling the Windows firewall, commanded an update in BOINC. Messages state:

"update requested by user"
"Sending scheduler request: Requested by user."
"Not reporting or requesting tasks"
"Scheduler request completed"

No change, no completed WUs uploaded (now >200 of them).

The firewall would not have been expected to be causing a problem. Rosetta has been working fine for a long time.

QueueNut


Might sound archaic, but have you tried rebooting the PC?




See the first post in the thread,
"This continues following system reboots."

QueueNut
4) Message boards : Number crunching : Temporarily failed upload of (WU name):HTTP error (Message 66806)
Posted 8 Jul 2010 by QueueNut
Post:
Which OS? Try disabling the firewall temporarily, if any is running. Just to make sure, it's not the firewall causing the problems.


The operating system is Windows XP SP3.

After disabling the Windows firewall, commanded an update in BOINC. Messages state:

"update requested by user"
"Sending scheduler request: Requested by user."
"Not reporting or requesting tasks"
"Scheduler request completed"

No change, no completed WUs uploaded (now >200 of them).

The firewall would not have been expected to be causing a problem. Rosetta has been working fine for a long time.

QueueNut
5) Message boards : Number crunching : Temporarily failed upload of (WU name):HTTP error (Message 66802)
Posted 7 Jul 2010 by QueueNut
Post:
BOINC Manager shows 183 completed Rosetta work units in a status of Uploading yet they do not upload.

Rosetta is the only project this system executes.

The Transfers tab shows these 183 completed WUs are in Upload Pending status.

Messages tab show the following log events individually for each of the 183 WUs. This started July 1 and has continued ever since through today, July 7. The oldest completed WUs start to expire early tomorrow morning.

t0 "Started download of (WU name)"
t0+22 sec "Project communication failed: attempting
access to reference site"
t0+22 sec "Temporarily failed upload of (WU name):
HTTP error"
t0+22 sec "Backing off (delta time) on upload of (WU
name)
t0+24 sec "Internet access OK - porject servers may
be temporarily down."

This continues following system reboots.

All other internet access is AOK, no problems.

How might this problem be further investigated, and remedied? Can this be solved without deleting all or most of the completed WUs?

Thanks much


QueueNut
6) Message boards : Number crunching : Scheduling request completed: got 0 new tasks (Message 64646)
Posted 30 Dec 2009 by QueueNut
Post:
Same situation here. No work units downloaded. Server status on Rosetta page shows multiple servers "not running". I've experienced problems like this before on holidays when nobody is monitoring the servers.



The heck with this project.

I'm going skiing.

7) Message boards : Number crunching : Scheduling request completed: got 0 new tasks (Message 64641)
Posted 30 Dec 2009 by QueueNut
Post:
BOINC Manager 6.10.18

Rebooted system, Windows XP SP3 32-bit.

The Rosetta/BOINC Scheduler is not sending any tasks when requested, multiple times over the past ~30 minutes.

Just prior to this there were 8 tasks (Core i7 920) executing.

Checked preferences. No changes, large memory & disk space limit, unlimited network bandwidth. >2GB available physical memory.


Are you running other projects too on the same machine? If so Boinc could just be balancing the load out and getting more work from other projects that it thinks need them. Each project has a percentage that you assign it, over the long term Boinc will try and maintain or get to that percentage. Over the short term however Boinc is not a very good time manager.

You could also read this thread: http://boinc.bakerlab.org/rosetta/forum_thread.php?id=5199 They are talking about the same thing you are, no new work.



The system receiving 0 new tasks only runs Rosetta, no other science projects.
Another system that is receiving tasks also runs only Rosetta.

Tried stopping BOINC (no tasks available to run), deleting the BOINC cookie, restart BOINC/Rosetta. No new tasks.
8) Message boards : Number crunching : Scheduling request completed: got 0 new tasks (Message 64620)
Posted 29 Dec 2009 by QueueNut
Post:
BOINC Manager 6.10.18

Rebooted system, Windows XP SP3 32-bit.

The Rosetta/BOINC Scheduler is not sending any tasks when requested, multiple times over the past ~30 minutes.

Just prior to this there were 8 tasks (Core i7 920) executing.

Checked preferences. No changes, large memory & disk space limit, unlimited network bandwidth. >2GB available physical memory.
9) Message boards : Number crunching : Problems with web site (Message 63354)
Posted 15 Sep 2009 by QueueNut
Post:
About 24 hours ago I brought a new Core i7 system online with BOINC/Rosetta@home (6.6.36 for windows_intelx86). Message log shows a number of work units downloaded, computed and uploaded. No changes in individual user average or total credit scores.

Another Core2 system was down since middle of last week. Brought it up at the same time with 6.6.36, ~24 hours ago. It, too, has been computing work unit results. No change of score from it, either.






©2024 University of Washington
https://www.bakerlab.org