Failed download

Sailor
Joined: 19 Mar 07
Posts: 75
Credit: 89,192
RAC: 0
Message 39386 - Posted: 15 Apr 2007, 10:32:39 UTC

Just raise your CPU target time :)
ID: 39386 · Rating: 0
Michael.L
Joined: 12 Nov 06
Posts: 67
Credit: 31,295
RAC: 0
Message 39387 - Posted: 15 Apr 2007, 11:14:02 UTC

Hope that someone gets to the Rosie office today! I have suspended internet connections until the current problems are cleared, because my screen is full of 'failed connections'.
ID: 39387 · Rating: 0
milw0rm
Joined: 10 Dec 05
Posts: 22
Credit: 6,212,194
RAC: 118
Message 39388 - Posted: 15 Apr 2007, 11:20:57 UTC

15/04/2007 12:16:42|rosetta@home|[file_xfer] Started download of file frags83_1bk2_.fasta.gz
15/04/2007 12:17:03||Project communication failed: attempting access to reference site
15/04/2007 12:17:03|rosetta@home|[file_xfer] Temporarily failed download of frags83_1bk2_.fasta.gz: system connect
15/04/2007 12:17:03|rosetta@home|Backing off 1 hr 6 min 12 sec on download of file frags83_1bk2_.fasta.gz
15/04/2007 12:17:06||Access to reference site succeeded - project servers may be temporarily down.

I am glad it is not just me :D

RESTART!
ID: 39388 · Rating: 0
Henry Huff
Joined: 31 May 06
Posts: 6
Credit: 2,298,502
RAC: 0
Message 39393 - Posted: 15 Apr 2007, 13:59:34 UTC

Also same problem - Myrtle Beach SC
ID: 39393 · Rating: 0
Babbit
Joined: 8 Feb 06
Posts: 1
Credit: 615,634
RAC: 0
Message 39398 - Posted: 15 Apr 2007, 15:06:10 UTC

4/15/2007 11:03:39 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:03:55 AM||Project communication failed: attempting access to reference site
4/15/2007 11:03:55 AM|rosetta@home|[file_xfer] Temporarily failed download of aa1k9kA03_05.200_v1_3.gz: system connect
4/15/2007 11:03:55 AM|rosetta@home|[file_xfer] Started download of file 5croA.psipred_ss2.gz
4/15/2007 11:03:56 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:03:59 AM||Project communication failed: attempting access to reference site
4/15/2007 11:03:59 AM|rosetta@home|[file_xfer] Temporarily failed download of 5croA.fasta: system connect
4/15/2007 11:03:59 AM|rosetta@home|[file_xfer] Started download of file 5cro.pdb.gz
4/15/2007 11:04:00 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:04:16 AM||Project communication failed: attempting access to reference site
4/15/2007 11:04:16 AM|rosetta@home|[file_xfer] Temporarily failed download of 5croA.psipred_ss2.gz: system connect
4/15/2007 11:04:16 AM|rosetta@home|[file_xfer] Started download of file BAR_R16H_cc5croA03_05.200_v1_3.gz
4/15/2007 11:04:17 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:04:20 AM||Project communication failed: attempting access to reference site
4/15/2007 11:04:20 AM|rosetta@home|[file_xfer] Temporarily failed download of 5cro.pdb.gz: system connect
4/15/2007 11:04:20 AM|rosetta@home|[file_xfer] Started download of file BAR_R16H_cc5croA09_05.200_v1_3.gz
4/15/2007 11:04:21 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:04:37 AM||Project communication failed: attempting access to reference site
4/15/2007 11:04:37 AM|rosetta@home|[file_xfer] Temporarily failed download of BAR_R16H_cc5croA03_05.200_v1_3.gz: system connect
4/15/2007 11:04:37 AM|rosetta@home|[file_xfer] Started download of file ccfrags200.txt
4/15/2007 11:04:38 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:04:42 AM||Project communication failed: attempting access to reference site
4/15/2007 11:04:42 AM|rosetta@home|[file_xfer] Temporarily failed download of BAR_R16H_cc5croA09_05.200_v1_3.gz: system connect
4/15/2007 11:04:42 AM|rosetta@home|[file_xfer] Started download of file 5croA_R16H_cheat.bar
4/15/2007 11:04:43 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:04:58 AM||Project communication failed: attempting access to reference site
4/15/2007 11:04:58 AM|rosetta@home|[file_xfer] Temporarily failed download of ccfrags200.txt: system connect
4/15/2007 11:04:59 AM|rosetta@home|[file_xfer] Started download of file rosetta_5.59_windows_intelx86.exe
4/15/2007 11:05:00 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:05:03 AM||Project communication failed: attempting access to reference site
4/15/2007 11:05:03 AM|rosetta@home|[file_xfer] Temporarily failed download of 5croA_R16H_cheat.bar: system connect
4/15/2007 11:05:04 AM|rosetta@home|[file_xfer] Started download of file 1k9k.pdb.gz
4/15/2007 11:05:05 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:05:21 AM||Project communication failed: attempting access to reference site
4/15/2007 11:05:21 AM|rosetta@home|[file_xfer] Temporarily failed download of rosetta_5.59_windows_intelx86.exe: system connect
4/15/2007 11:05:21 AM|rosetta@home|[file_xfer] Started download of file 1k9k.loop
4/15/2007 11:05:23 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:05:26 AM||Project communication failed: attempting access to reference site
4/15/2007 11:05:26 AM|rosetta@home|[file_xfer] Temporarily failed download of 1k9k.pdb.gz: system connect
4/15/2007 11:05:26 AM|rosetta@home|[file_xfer] Started download of file 1k9k.fasta
4/15/2007 11:05:28 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:05:43 AM||Project communication failed: attempting access to reference site
4/15/2007 11:05:43 AM|rosetta@home|[file_xfer] Temporarily failed download of 1k9k.loop: system connect
4/15/2007 11:05:43 AM|rosetta@home|[file_xfer] Started download of file aa1k9kA09_05.200_v1_3.gz
4/15/2007 11:05:45 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:05:48 AM||Project communication failed: attempting access to reference site
4/15/2007 11:05:48 AM|rosetta@home|[file_xfer] Temporarily failed download of 1k9k.fasta: system connect
4/15/2007 11:05:48 AM|rosetta@home|[file_xfer] Started download of file 1k9k.description.txt
4/15/2007 11:05:50 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:06:05 AM||Project communication failed: attempting access to reference site
4/15/2007 11:06:05 AM|rosetta@home|[file_xfer] Temporarily failed download of aa1k9kA09_05.200_v1_3.gz: system connect
4/15/2007 11:06:05 AM|rosetta@home|[file_xfer] Started download of file aa1k9kA03_05.200_v1_3.gz
4/15/2007 11:06:06 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:06:10 AM||Project communication failed: attempting access to reference site
4/15/2007 11:06:10 AM|rosetta@home|[file_xfer] Temporarily failed download of 1k9k.description.txt: system connect
4/15/2007 11:06:10 AM|rosetta@home|[file_xfer] Started download of file 5croA.fasta
4/15/2007 11:06:11 AM||Access to reference site succeeded - project servers may be temporarily down.


Same problem here. I'm sure they will get to it when they can.
ID: 39398 · Rating: -2
[B@H] Ray
Joined: 20 Sep 05
Posts: 118
Credit: 100,251
RAC: 0
Message 39399 - Posted: 15 Apr 2007, 15:15:56 UTC - in response to Message 39385.  

Not everyone has a large cache or a fall back project, so perhaps there are concerns of wasted (idle) CPU time.

I'm crunching Proteins@Home when my cache is empty.

For crunching protein projects when the queue is empty of Rosetta, I find that Tanpaku is better than Proteins@Home.

Proteins@Home writes to your hard drive all the time rather than only at checkpoints like Rosetta and Tanpaku. That was mentioned on the Proteins@Home message boards last year, and the people running it said that they wanted it that way and run it like that on all their systems. The way it ran raised the temperature of my hard drive a few degrees, but I don't remember how much. I would rather not put the added work on the HD. That is only my opinion about why Tanpaku over Proteins@Home.
ID: 39399 · Rating: 0
alpha
Joined: 4 Nov 06
Posts: 27
Credit: 1,550,107
RAC: 0
Message 39400 - Posted: 15 Apr 2007, 15:35:24 UTC - in response to Message 39399.  

Not everyone has a large cache or a fall back project, so perhaps there are concerns of wasted (idle) CPU time.

I'm crunching Proteins@Home when my cache is empty.

For crunching protein projects when the queue is empty of Rosetta, I find that Tanpaku is better than Proteins@Home.

Proteins@Home writes to your hard drive all the time rather than only at checkpoints like Rosetta and Tanpaku. That was mentioned on the Proteins@Home message boards last year, and the people running it said that they wanted it that way and run it like that on all their systems. The way it ran raised the temperature of my hard drive a few degrees, but I don't remember how much. I would rather not put the added work on the HD. That is only my opinion about why Tanpaku over Proteins@Home.


I've contributed to TANPAKU briefly in the past, but I dislike the lack of communication from the project admin(s).

Thanks for the heads-up about Proteins@Home, though. I've confirmed what you said using Sysinternals FileMonitor. Perhaps I'll look into running BOINC from a RAM disk; then the problem will no longer exist.
ID: 39400 · Rating: 0
Greg_BE
Joined: 30 May 06
Posts: 5664
Credit: 5,711,666
RAC: 1,996
Message 39402 - Posted: 15 Apr 2007, 16:00:43 UTC - in response to Message 39398.  
Last modified: 15 Apr 2007, 16:01:06 UTC

See comments below

4/15/2007 11:03:39 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:03:55 AM||Project communication failed: attempting access to reference site
4/15/2007 11:03:55 AM|rosetta@home|[file_xfer] Temporarily failed download of aa1k9kA03_05.200_v1_3.gz: system connect
4/15/2007 11:03:55 AM|rosetta@home|[file_xfer] Started download of file 5croA.psipred_ss2.gz
4/15/2007 11:03:56 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:03:59 AM||Project communication failed: attempting access to reference site
4/15/2007 11:03:59 AM|rosetta@home|[file_xfer] Temporarily failed download of 5croA.fasta: system connect
4/15/2007 11:03:59 AM|rosetta@home|[file_xfer] Started download of file 5cro.pdb.gz
4/15/2007 11:04:00 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:04:16 AM||Project communication failed: attempting access to reference site
4/15/2007 11:04:16 AM|rosetta@home|[file_xfer] Temporarily failed download of 5croA.psipred_ss2.gz: system connect
4/15/2007 11:04:16 AM|rosetta@home|[file_xfer] Started download of file BAR_R16H_cc5croA03_05.200_v1_3.gz
4/15/2007 11:04:17 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:04:20 AM||Project communication failed: attempting access to reference site
4/15/2007 11:04:20 AM|rosetta@home|[file_xfer] Temporarily failed download of 5cro.pdb.gz: system connect
4/15/2007 11:04:20 AM|rosetta@home|[file_xfer] Started download of file BAR_R16H_cc5croA09_05.200_v1_3.gz
4/15/2007 11:04:21 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:04:37 AM||Project communication failed: attempting access to reference site
4/15/2007 11:04:37 AM|rosetta@home|[file_xfer] Temporarily failed download of BAR_R16H_cc5croA03_05.200_v1_3.gz: system connect
4/15/2007 11:04:37 AM|rosetta@home|[file_xfer] Started download of file ccfrags200.txt
4/15/2007 11:04:38 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:04:42 AM||Project communication failed: attempting access to reference site
4/15/2007 11:04:42 AM|rosetta@home|[file_xfer] Temporarily failed download of BAR_R16H_cc5croA09_05.200_v1_3.gz: system connect
4/15/2007 11:04:42 AM|rosetta@home|[file_xfer] Started download of file 5croA_R16H_cheat.bar
4/15/2007 11:04:43 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:04:58 AM||Project communication failed: attempting access to reference site
4/15/2007 11:04:58 AM|rosetta@home|[file_xfer] Temporarily failed download of ccfrags200.txt: system connect
4/15/2007 11:04:59 AM|rosetta@home|[file_xfer] Started download of file rosetta_5.59_windows_intelx86.exe
4/15/2007 11:05:00 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:05:03 AM||Project communication failed: attempting access to reference site
4/15/2007 11:05:03 AM|rosetta@home|[file_xfer] Temporarily failed download of 5croA_R16H_cheat.bar: system connect
4/15/2007 11:05:04 AM|rosetta@home|[file_xfer] Started download of file 1k9k.pdb.gz
4/15/2007 11:05:05 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:05:21 AM||Project communication failed: attempting access to reference site
4/15/2007 11:05:21 AM|rosetta@home|[file_xfer] Temporarily failed download of rosetta_5.59_windows_intelx86.exe: system connect
4/15/2007 11:05:21 AM|rosetta@home|[file_xfer] Started download of file 1k9k.loop
4/15/2007 11:05:23 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:05:26 AM||Project communication failed: attempting access to reference site
4/15/2007 11:05:26 AM|rosetta@home|[file_xfer] Temporarily failed download of 1k9k.pdb.gz: system connect
4/15/2007 11:05:26 AM|rosetta@home|[file_xfer] Started download of file 1k9k.fasta
4/15/2007 11:05:28 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:05:43 AM||Project communication failed: attempting access to reference site
4/15/2007 11:05:43 AM|rosetta@home|[file_xfer] Temporarily failed download of 1k9k.loop: system connect
4/15/2007 11:05:43 AM|rosetta@home|[file_xfer] Started download of file aa1k9kA09_05.200_v1_3.gz
4/15/2007 11:05:45 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:05:48 AM||Project communication failed: attempting access to reference site
4/15/2007 11:05:48 AM|rosetta@home|[file_xfer] Temporarily failed download of 1k9k.fasta: system connect
4/15/2007 11:05:48 AM|rosetta@home|[file_xfer] Started download of file 1k9k.description.txt
4/15/2007 11:05:50 AM||Access to reference site succeeded - project servers may be temporarily down.
4/15/2007 11:06:05 AM||Project communication failed: attempting access to reference site
4/15/2007 11:06:05 AM|rosetta@home|[file_xfer] Temporarily failed download of aa1k9kA09_05.200_v1_3.gz: system connect
4/15/2007 11:06:05 AM|rosetta@home|[file_xfer] Started download of file aa1k9kA03_05.200_v1_3.gz
4/15/2007 11:06:06 AM||Access to reference site succeeded - project servers may be temporarily down.


4/15/2007 11:06:10 AM||Project communication failed: attempting access to reference site
4/15/2007 11:06:10 AM|rosetta@home|[file_xfer] Temporarily failed download of 1k9k.description.txt: system connect
4/15/2007 11:06:10 AM|rosetta@home|[file_xfer] Started download of file 5croA.fasta
4/15/2007 11:06:11 AM||Access to reference site succeeded - project servers may be temporarily down.

Same problem here. I'm sure they will get to it when they can.


Yep, same joy here: connected to the project but no data exchange, so I'm suspending network for a while.

ID: 39402 · Rating: 0
Sailor
Joined: 19 Mar 07
Posts: 75
Credit: 89,192
RAC: 0
Message 39407 - Posted: 15 Apr 2007, 16:14:24 UTC - in response to Message 39400.  



Proteins@Home writes to your hard drive all the time rather than only at checkpoints like Rosetta and Tanpaku. That was mentioned on the Proteins@Home message boards last year, and the people running it said that they wanted it that way and run it like that on all their systems. The way it ran raised the temperature of my hard drive a few degrees, but I don't remember how much. I would rather not put the added work on the HD. That is only my opinion about why Tanpaku over Proteins@Home.


True, constant activity on your hard disk lowers its lifetime a lot...

And @ all the log posters: I'm not sure, but I don't think anyone needs 50 times 50 copies of the failed download message :) A short notice seems to be enough :)
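
For anyone who wants to post a short notice instead of the full log, here is a minimal Python sketch that condenses BOINC message-log lines into a few counted buckets. It assumes the `timestamp|project|message` layout seen in the logs pasted in this thread; the function name `summarize_boinc_log` is made up for illustration and is not part of BOINC.

```python
from collections import Counter


def summarize_boinc_log(lines):
    """Condense BOINC message-log lines into (message, count) pairs.

    Assumes each line looks like 'timestamp|project|message', as in the
    logs pasted in this thread. Lines in any other shape are skipped.
    """
    counts = Counter()
    for line in lines:
        parts = line.strip().split("|", 2)
        if len(parts) != 3:
            continue  # not a timestamp|project|message line
        message = parts[2]
        if "Temporarily failed download of" in message:
            # Fold every per-file failure into one bucket keyed by the reason.
            reason = message.rsplit(": ", 1)[-1]
            message = f"Temporarily failed download ({reason})"
        elif "Started download of file" in message:
            message = "Started download"
        counts[message] += 1
    return counts.most_common()


if __name__ == "__main__":
    sample = [
        "4/15/2007 11:03:55 AM||Project communication failed: attempting access to reference site",
        "4/15/2007 11:03:55 AM|rosetta@home|[file_xfer] Temporarily failed download of aa1k9kA03_05.200_v1_3.gz: system connect",
        "4/15/2007 11:03:59 AM|rosetta@home|[file_xfer] Temporarily failed download of 5croA.fasta: system connect",
    ]
    for message, count in summarize_boinc_log(sample):
        print(f"{count} x {message}")
```

Pasting the handful of summary lines this prints gets the same point across as the full log.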
ID: 39407 · Rating: 0
BarryAZ
Joined: 27 Dec 05
Posts: 153
Credit: 30,843,285
RAC: 24
Message 39411 - Posted: 15 Apr 2007, 16:50:54 UTC

I am hoping that an announcement will be made on the home page once the download server dysfunction is resolved. The approach I've elected to employ is to simply shut off 'allow new work' on each workstation that encounters the download problem from Rosetta. That is draining out existing work (my other projects can pick up the slack), but it is also avoiding the constant pinging activity to Rosetta (probably good for both the project servers and my workstations).


ID: 39411 · Rating: 0
Greg_BE
Joined: 30 May 06
Posts: 5664
Credit: 5,711,666
RAC: 1,996
Message 39413 - Posted: 15 Apr 2007, 17:07:31 UTC

Seems the solution for now is to suspend network activity and let the system keep crunching on existing WUs.

I download 2.5-3 days of work at a time so I can report back to the system later if need be to get more work.

I just tried to reconnect again but the servers are still down.

End of story for now... check back in a few hours when they wake up over there.
It is Sunday after all, and computer guys do need at least one day to do laundry.
ID: 39413 · Rating: 0
CelluX
Joined: 12 Apr 07
Posts: 1
Credit: 1,447
RAC: 0
Message 39418 - Posted: 15 Apr 2007, 17:54:24 UTC - in response to Message 39413.  


I download 2.5-3 days of work at a time so I can report back to the system later if need be to get more work.


You download new tasks, do the work, and after 3 days you download new tasks again?

Where do I adjust this?

Kind regards, CelluX
ID: 39418 · Rating: 0
John McCallum
Joined: 8 Jan 06
Posts: 12
Credit: 6,882,960
RAC: 4,471
Message 39420 - Posted: 15 Apr 2007, 18:10:46 UTC

15/04/2007 18:37:51|rosetta@home|[file_xfer] Temporarily failed download of aa1k9kA09_05.200_v1_3.gz: system connect
15/04/2007 18:37:51|rosetta@home|Backing off 2 hr 29 min 33 sec on download of file aa1k9kA09_05.200_v1_3.gz
15/04/2007 18:37:51|rosetta@home|[file_xfer] Temporarily failed download of 1k9k.description.txt: system connect
15/04/2007 18:37:52||Access to reference site succeeded - project servers may be temporarily down.
Been going on since 1550 BST.
If you can't take a joke you should never have joined.
ID: 39420 · Rating: 0
Greg_BE
Joined: 30 May 06
Posts: 5664
Credit: 5,711,666
RAC: 1,996
Message 39423 - Posted: 15 Apr 2007, 18:32:18 UTC - in response to Message 39418.  

Click on Participants, scroll down to General preferences, then scroll down to Network preferences, click on the edit link, and adjust your connect-to-network interval to however many days of work you want to hold. This lets BOINC Manager download about 7 WUs for your system to chew on. It will still connect to upload results and report work done every time a WU ends. In cases like this, when no communication is available, you can just continue crunching the work you have with network activity suspended until the server comes back on; then you can resume network activity, report, and upload/download more work. In this case you have 2.5 days of work at 8 hrs per work unit.
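
As a rough check on the "about 7 WUs" figure, the arithmetic can be sketched in a few lines of Python. This is only an estimate under stated assumptions: the machine crunches around the clock, every work unit runs for exactly the target run time, and the helper name `work_units_in_buffer` is made up for illustration (it is not a BOINC setting or API).

```python
def work_units_in_buffer(buffer_days, hours_per_wu, cores=1):
    """Rough count of work units needed to fill a buffer of `buffer_days`.

    Assumes 24 h/day of crunching and a fixed run time per work unit.
    """
    return buffer_days * 24 * cores / hours_per_wu


# 2.5 days of work at 8 hours per work unit on a single core
print(work_units_in_buffer(2.5, 8))  # 7.5
```

The real number BOINC fetches also depends on its own run-time estimates and on how many hours per day the machine is actually on.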


I download 2.5-3 days of work at a time so I can report back to the system later if need be to get more work.


You download new tasks, do the work, and after 3 days you download new tasks again?

Where do I adjust this?

Kind regards, CelluX


ID: 39423 · Rating: 0
Sailor
Joined: 19 Mar 07
Posts: 75
Credit: 89,192
RAC: 0
Message 39425 - Posted: 15 Apr 2007, 19:19:20 UTC - in response to Message 39418.  


I download 2.5-3 days of work at a time so I can report back to the system later if need be to get more work.


You download new tasks, do the work, and after 3 days you download new tasks again?

Where do I adjust this?

Kind regards, CelluX


Actually, it keeps a buffer of 3 days (when you have network access all the time, of course). I have it set to 1 day, because my PC is not running 24/7, but when my PC is on, I have internet access too. So the client is constantly requesting new work units. Pretty useful, so I'm not running "dry" in situations like these. I changed these settings after the first server downtime after I started with Rosetta, when I was kind of disappointed that I couldn't crunch.

Maybe some info would be nice on what causes these downtimes every once in a while.
Too much traffic? If that's the case, we should all increase our target CPU run time. Changing the run time from 3 to 6 hours would save the Rosetta server 50% of upload traffic; this is a huge amount of data if many people changed it.
Of course, it only makes sense for heavy crunchers; people who use their machines less risk running past the deadline before they finish crunching. But people with machines running 24/7 shouldn't have a problem changing their run time per unit to 24 hours? (Correct me if I'm wrong here.)
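
The 50% figure can be checked with a small Python sketch. It rests on an assumption that may not hold in practice: each work unit costs the server a fixed amount of transfer no matter how long it runs (a longer run may well return a larger result file, which would shrink the saving). The function names are made up for illustration.

```python
def results_per_day(target_hours, crunch_hours_per_day=24):
    """Work units finished per day at a given target CPU run time."""
    return crunch_hours_per_day / target_hours


def transfer_reduction(old_hours, new_hours):
    """Fraction of per-work-unit transfers saved by a longer run time.

    Assumes every work unit causes the same amount of server traffic,
    regardless of how long it runs.
    """
    return 1 - results_per_day(new_hours) / results_per_day(old_hours)


print(transfer_reduction(3, 6))   # 0.5
print(transfer_reduction(3, 24))  # 0.875
```

Under that assumption, going from 3 to 6 hours halves the number of transfers, and going from 3 to 24 hours cuts them by 87.5%.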
ID: 39425 · Rating: 0
MattDavis
Joined: 22 Sep 05
Posts: 206
Credit: 1,377,748
RAC: 0
Message 39427 - Posted: 15 Apr 2007, 19:22:59 UTC

People.

Leave BOINC alone.

It will automatically get new work when work is available.
ID: 39427 · Rating: 0
Greg_BE
Joined: 30 May 06
Posts: 5664
Credit: 5,711,666
RAC: 1,996
Message 39430 - Posted: 15 Apr 2007, 19:39:41 UTC - in response to Message 39427.  
Last modified: 15 Apr 2007, 19:42:28 UTC

Data level is not the issue; it's just one of those IT problems that happen.
The place I work at, with a custom-written program, goes to heck every time IT does an upgrade, so issues like this are no surprise. Things happen. Maybe a bit too frequently, but this is a work in progress, so just take it in stride. And yeah, if you want to have a bunch of work stacked up for dry spells, then just do what I suggested. When server communication is back on, reconnect and let it do its thing.

Note: still no comm, so I will check back for my own computer in the morning, as it's 9:41 pm here now. No worries, just letting things run for now.

People.

Leave BOINC alone.

It will automatically get new work when work is available.


ID: 39430 · Rating: 0
BarryAZ
Joined: 27 Dec 05
Posts: 153
Credit: 30,843,285
RAC: 24
Message 39431 - Posted: 15 Apr 2007, 19:41:26 UTC - in response to Message 39427.  

Actually, I'm leaving BOINC alone, I am just shutting off new work from Rosetta and leaving all my other projects happy.

With the structure of the Rosetta work units, there are a bunch of files downloading (or trying to download) for each work unit. When the download server is sick (or communications from the download server are sick), all of those files get set up for retries (at various times). That can add a LOT to the noise level (for my workstation, for my proxy server and for the Rosetta download server). While some might view the noise and extra traffic as a salubrious event, I just don't see it that way.

Since I have three or more projects on each of my workstations, shutting down new work with Rosetta simply shifts work to other projects which are not engaged in DSSSS (Download Server System Stress Syndrome).

Not sure what threshold was crossed in the past several days to suddenly make the problem as bad as it is. I sort of doubt it is an 'incremental workload' issue -- rather I suspect there is something else at play here. Perhaps something similar to issues identified and resolved by both the SETI and Einstein folks earlier this year.

That being said, it sure would be nice to get some description of the detail of the problem, as, encountering the problem, then checking an all green board on server status is just annoying. Sort of like the 'only good news' handling of events in our political process.



People.

Leave BOINC alone.

It will automatically get new work when work is available.


ID: 39431 · Rating: 0
The_Bad_Penguin
Joined: 5 Jun 06
Posts: 2751
Credit: 4,271,025
RAC: 0
Message 39433 - Posted: 15 Apr 2007, 19:46:49 UTC - in response to Message 39427.  
Last modified: 15 Apr 2007, 19:48:21 UTC

Wish this were true. If I do not manually reset other projects on the laptop where Rosetta is stalled, no d/c work is being done at all. CPU cycles are being wasted for hours at a time, due to no work. This was the second time in 24 hours that I checked and was forced to reset projects...

Don't know why this is. I know it is not supposed to be this way. But it is...

People.

Leave BOINC alone.

It will automatically get new work when work is available.

ID: 39433 · Rating: 0
mhhall
Joined: 28 Mar 06
Posts: 7
Credit: 10,188,899
RAC: 1
Message 39435 - Posted: 15 Apr 2007, 20:02:53 UTC

I presume that the download server is currently hung.
My dual processor system finished two jobs, had one
remaining to be reported (which appears to have been
accepted), but all downloads to my system are showing
as downloading or "download pending".
ID: 39435 · Rating: 0



©2024 University of Washington
https://www.bakerlab.org