Problems and Technical Issues with Rosetta@home

P . P . L .
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 76795 - Posted: 2 Jun 2014, 23:08:29 UTC
Last modified: 2 Jun 2014, 23:22:06 UTC

Hi SBF-GODS-STONE.

You seem to be mixing up CPU cache with system RAM. Your system is showing about
3 gigabytes of RAM ( Memory 2989.52 MB ).

Rosetta can sometimes use 1 gig of RAM or more per task/wu; there is no way
that can fit in the CPU's cache.

You said - This CPU chip has for L2 and L3 cache storage 1 and 2 gig's of memory on the chip.
====================================================
See the specs for your CPU from the CPU-World site.

AMD FX-4350

Frequency: 4200 MHz
Turbo frequency: 4300 MHz

Level 1 cache size: 2 x 64 KB shared instruction caches, 4 x 16 KB data caches
Level 2 cache size: 2 x 2 MB shared exclusive caches
Level 3 cache size: 8 MB shared cache



Memory controller: The number of controllers: 1
Memory channels: 2
Supported memory: DDR3-1866
======================================================

i.m.h.o. - Sorry you need more RAM!
ID: 76795 · Rating: 0
SBF-GODS-STONE
Joined: 6 Nov 05
Posts: 15
Credit: 44,784
RAC: 0
Message 76796 - Posted: 3 Jun 2014, 3:33:59 UTC - in response to Message 76795.  

This machine has 8 gigs of DDR3-1866 (XP can only use 3 gigs). The version of the AMD CPU is the FX 4350 Black, 12 MB cache for each core (unlocked).

At any rate, the issue was not how much work the CPUs could do; it was the fragments on the work (BOINC) disk, which is now 10 gigs, and that seems to have cleared the problem.

thks.
ID: 76796 · Rating: 0
Sid Celery
Joined: 11 Feb 08
Posts: 2146
Credit: 41,570,180
RAC: 8,210
Message 76797 - Posted: 3 Jun 2014, 4:25:08 UTC - in response to Message 76796.  
Last modified: 3 Jun 2014, 4:26:14 UTC

I run a different AMD machine, but each task here is using approx 500 MB. On my 8-core machine that's about 4 GB. Fortunately I have a 64-bit OS so I can access all 8 GB of RAM. In my task manager I can see 6 GB of RAM is in use in total, including Windows and other applications, so I'm fine.

On your 4-core machine you'll be using 2 GB less, but 4 GB is more than the 3 GB of RAM your 32-bit OS can access, so it does make sense that a lot of data is getting thrown down to disk.

Reserving more space for BOINC does seem to be helping you from what you've said, but your problems may return unless you can find a way of only running 2 Rosetta tasks at a time and letting other projects use your other 2 cores. Ideally, you need to use a 64-bit version of Windows to get the full benefit of the RAM you already have installed.
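
As a rough sketch of the arithmetic above (not anything BOINC itself runs), the snippet below estimates total memory demand and how many tasks fit in the RAM the OS can address. The 500 MB per task and 2989.52 MB figures come from this thread; the function names and the 1024 MB set aside for Windows and other applications are assumptions made only for illustration.

```python
def total_demand_mb(tasks: int, per_task_mb: float) -> float:
    """Memory needed if every task uses per_task_mb at the same time."""
    return tasks * per_task_mb


def max_tasks_in_ram(usable_mb: float, per_task_mb: float,
                     reserved_mb: float = 0.0) -> int:
    """Largest number of tasks that fit in RAM after setting aside reserved_mb."""
    free_mb = usable_mb - reserved_mb
    if free_mb <= 0 or per_task_mb <= 0:
        return 0
    return int(free_mb // per_task_mb)


if __name__ == "__main__":
    # Figures quoted in this thread.
    print(total_demand_mb(8, 500))  # 8 tasks at roughly 500 MB each
    print(total_demand_mb(4, 500))  # 4 tasks at roughly 500 MB each
    # Hypothetical reserve: 1024 MB kept back for the OS and other programs.
    print(max_tasks_in_ram(2989.52, 500, reserved_mb=1024))
```

Under these assumptions the per-task figure matters a lot: raising `per_task_mb` towards 1 gig shrinks the number of tasks that fit, which is why limiting the number of simultaneous Rosetta tasks can help on a 32-bit OS.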

I'm sorry I can't be of more help.
ID: 76797 · Rating: 0
Sid Celery
Joined: 11 Feb 08
Posts: 2146
Credit: 41,570,180
RAC: 8,210
Message 76970 - Posted: 7 Jul 2014, 20:55:46 UTC

Download files getting truncated

07/07/2014 21:41:01 | rosetta@home | Sending scheduler request: To report completed tasks.
07/07/2014 21:41:01 | rosetta@home | Reporting 1 completed tasks
07/07/2014 21:41:01 | rosetta@home | Requesting new tasks for CPU
07/07/2014 21:41:03 | rosetta@home | Scheduler request completed: got 1 new tasks
07/07/2014 21:41:05 | rosetta@home | Started download of flags_rb_07_07_49373_95018__t000__2_C1_robetta
07/07/2014 21:41:05 | rosetta@home | Started download of input_rb_07_07_49373_95018__t000__2_C1_robetta.zip
07/07/2014 21:41:07 | rosetta@home | Incomplete read of 1754.000000 < 5KB for flags_rb_07_07_49373_95018__t000__2_C1_robetta - truncating
07/07/2014 21:41:07 | rosetta@home | Incomplete read of 1754.000000 < 5KB for input_rb_07_07_49373_95018__t000__2_C1_robetta.zip - truncating
07/07/2014 21:41:07 | rosetta@home | Finished download of flags_rb_07_07_49373_95018__t000__2_C1_robetta
07/07/2014 21:41:07 | rosetta@home | Finished download of input_rb_07_07_49373_95018__t000__2_C1_robetta.zip
07/07/2014 21:41:07 | rosetta@home | [error] File flags_rb_07_07_49373_95018__t000__2_C1_robetta has wrong size: expected 842, got 0
07/07/2014 21:41:07 | rosetta@home | [error] Checksum or signature error for flags_rb_07_07_49373_95018__t000__2_C1_robetta
07/07/2014 21:41:07 | rosetta@home | [error] File input_rb_07_07_49373_95018__t000__2_C1_robetta.zip has wrong size: expected 7511515, got 0
07/07/2014 21:41:07 | rosetta@home | [error] Checksum or signature error for input_rb_07_07_49373_95018__t000__2_C1_robetta.zip


This task then displays as "download failed" in the BOINC Tasks window.
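
For anyone wanting to pull these failures out of a longer Event Log, here is a minimal sketch. It assumes the log has been copied out as text in the `timestamp | project | message` layout shown above; the function names and the regular expression are illustrative and not part of BOINC.

```python
import re

# Matches the "[error] File <name> has wrong size" messages seen in the log.
WRONG_SIZE = re.compile(r"File (\S+) has wrong size: expected (\d+), got (\d+)")


def parse_line(line):
    """Split one log line into (timestamp, project, message), or None."""
    parts = re.split(r"\s*\|\s*", line.strip(), maxsplit=2)
    if len(parts) != 3:
        return None
    return tuple(parts)


def wrong_size_files(lines):
    """Return (filename, expected_bytes, got_bytes) for each size error."""
    found = []
    for line in lines:
        parsed = parse_line(line)
        if parsed is None:
            continue
        match = WRONG_SIZE.search(parsed[2])
        if match:
            found.append((match.group(1), int(match.group(2)), int(match.group(3))))
    return found


# Three lines copied from the excerpt above.
SAMPLE = [
    "07/07/2014 21:41:07 | rosetta@home | Finished download of flags_rb_07_07_49373_95018__t000__2_C1_robetta",
    "07/07/2014 21:41:07 | rosetta@home | [error] File flags_rb_07_07_49373_95018__t000__2_C1_robetta has wrong size: expected 842, got 0",
    "07/07/2014 21:41:07 | rosetta@home | [error] File input_rb_07_07_49373_95018__t000__2_C1_robetta.zip has wrong size: expected 7511515, got 0",
]

if __name__ == "__main__":
    for name, expected, got in wrong_size_files(SAMPLE):
        print(name, expected, got)
```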

I also have one or two files that are failing to upload after the first 1.71 KB of 166.19 KB, for example frxtrimer_5_0979_b5r3_fold_SAVE_ALL_OUT_171800_974_0. As I checked these numbers I saw an upload of over 1 MB go through without problem.

I also seem to have massive problems with downloading

Me

Anyone else seeing this or only me?

ID: 76970 · Rating: 0
Mod.Sense
Volunteer moderator
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 76986 - Posted: 9 Jul 2014, 16:43:03 UTC

Sid, have you white-listed R@h in your AV software?
Rosetta Moderator: Mod.Sense
ID: 76986 · Rating: 0
Sid Celery
Joined: 11 Feb 08
Posts: 2146
Credit: 41,570,180
RAC: 8,210
Message 76991 - Posted: 10 Jul 2014, 2:15:44 UTC - in response to Message 76986.  
Last modified: 10 Jul 2014, 2:23:09 UTC

Sid, have you white-listed R@h in your AV software?

I did since posting, but with no improvement. I can't see anything in my history indicating an issue since the start of the month either.

It also started happening on my laptop shortly after, which may well indicate it's an issue with the AV getting updated later, but I just can't see what.

That said, I'm on my travels again and have just opened my laptop and looked to see if there was any improvement. The event log included this excerpt directly on startup:

10/07/2014 01:00:31 | | Resetting file projects/boinc.bakerlab.org_rosetta/ANK12_A.pdb_ANK12_B.pdb_global_docking.xml: wrong size
10/07/2014 01:00:31 | | Resetting file projects/boinc.bakerlab.org_rosetta/ANK12_A.pdb_ANK12_B.pdb_fakenative.pdb: wrong size
10/07/2014 01:00:31 | | Resetting file projects/boinc.bakerlab.org_rosetta/fold_and_dock_140602.5._data.zip: wrong size
10/07/2014 01:00:31 | | Resetting file projects/boinc.bakerlab.org_rosetta/140602.5._fold_and_dock_flags: wrong size
10/07/2014 01:00:31 | | Resetting file projects/boinc.bakerlab.org_rosetta/Exp02-020a_optimization_127_117_1_1_Input_0030_S01_Input_0010_data.zip: wrong size
10/07/2014 01:00:31 | | Resetting file projects/boinc.bakerlab.org_rosetta/ANK12_A_ANK12_B_patchdock_split_01.pdb: wrong size
10/07/2014 01:00:31 | | Resetting file projects/boinc.bakerlab.org_rosetta/ANK12_A_ANK12_B_patchdock_split_01.patchdock: wrong size
10/07/2014 01:00:31 | | Resetting file projects/boinc.bakerlab.org_rosetta/syIL2v3noM_4_data.zip: wrong size
10/07/2014 01:00:31 | | Resetting file projects/boinc.bakerlab.org_rosetta/rb_07_09_47791_93530_ab_stage0_h002___robetta_h002_.200.10mers.gz: wrong size
10/07/2014 01:00:31 | | Resetting file projects/boinc.bakerlab.org_rosetta/rb_07_09_47791_93530_ab_stage0_h002___robetta_h002_.200.4mers.gz: wrong size

Lo and behold, when I manually retried uploading results they went through first time and I've uploaded results and returned tasks perfectly and downloaded 5 more with complete success.

10/07/2014 01:06:46 | rosetta@home | Started upload of rb_06_20_47426_92932__t000__4_C1_SAVE_ALL_OUT_IGNORE_THE_REST_170261_537_0_0
10/07/2014 01:06:56 | rosetta@home | Finished upload of rb_06_20_47426_92932__t000__4_C1_SAVE_ALL_OUT_IGNORE_THE_REST_170261_537_0_0
10/07/2014 01:07:01 | rosetta@home | Started upload of hc_centroids_1lu4_4_0.25_06-01-14_SAVE_ALL_OUT_168127_2103_0_0
10/07/2014 01:07:05 | rosetta@home | Finished upload of hc_centroids_1lu4_4_0.25_06-01-14_SAVE_ALL_OUT_168127_2103_0_0
10/07/2014 01:07:12 | rosetta@home | update requested by user
10/07/2014 01:07:17 | rosetta@home | Sending scheduler request: Requested by user.
10/07/2014 01:07:17 | rosetta@home | Reporting 2 completed tasks
10/07/2014 01:07:17 | rosetta@home | Requesting new tasks for CPU
10/07/2014 01:07:20 | rosetta@home | Scheduler request completed: got 5 new tasks
10/07/2014 01:07:23 | rosetta@home | Started download of 1L-5E-4L-5E-2L-12H-1L_1-2.A.0_0087_data.zip
10/07/2014 01:07:23 | rosetta@home | Started download of ANK12_A.pdb_ANK12_B.pdb_global_docking.xml
10/07/2014 01:07:25 | rosetta@home | Finished download of ANK12_A.pdb_ANK12_B.pdb_global_docking.xml
10/07/2014 01:07:25 | rosetta@home | Started download of ANK12_A_ANK12_B_patchdock_split_08.pdb
10/07/2014 01:07:27 | rosetta@home | Finished download of 1L-5E-4L-5E-2L-12H-1L_1-2.A.0_0087_data.zip
10/07/2014 01:07:27 | rosetta@home | Started download of ANK12_A_ANK12_B_patchdock_split_08.patchdock
10/07/2014 01:07:29 | rosetta@home | Finished download of ANK12_A_ANK12_B_patchdock_split_08.pdb
10/07/2014 01:07:29 | rosetta@home | Started download of ANK12_A.pdb_ANK12_B.pdb_fakenative.pdb
10/07/2014 01:07:30 | rosetta@home | Finished download of ANK12_A_ANK12_B_patchdock_split_08.patchdock
10/07/2014 01:07:30 | rosetta@home | Started download of 1L-6E-4L-6E-3L-15H-1L_1-2.A.0_0044_data.zip
10/07/2014 01:07:32 | rosetta@home | Finished download of ANK12_A.pdb_ANK12_B.pdb_fakenative.pdb
10/07/2014 01:07:32 | rosetta@home | Started download of frxtrimer_10_0465_b5r3_data.zip
10/07/2014 01:07:34 | rosetta@home | Finished download of 1L-6E-4L-6E-3L-15H-1L_1-2.A.0_0044_data.zip
10/07/2014 01:07:34 | rosetta@home | Started download of flags_rb_06_09_45642_94507__t000__2_C1_robetta
10/07/2014 01:07:36 | rosetta@home | Finished download of flags_rb_06_09_45642_94507__t000__2_C1_robetta
10/07/2014 01:07:36 | rosetta@home | Started download of input_rb_06_09_45642_94507__t000__2_C1_robetta.zip
10/07/2014 01:07:37 | rosetta@home | Finished download of frxtrimer_10_0465_b5r3_data.zip
10/07/2014 01:07:45 | rosetta@home | Finished download of input_rb_06_09_45642_94507__t000__2_C1_robetta.zip

Whether this is something that's changed at R@H or something to do with me, I can't tell. Whether it's also going to automatically start working on my desktop at home, I won't be able to discover until I return on Sunday.

I had updated and rebooted the desktop a few times as well and rebooted the router for good measure. Nothing like the above happened when I did. It's really like something just changed at R@H in the last few hours.

That said, no-one else has backed up my report, which points the finger back at me...

<confused>

Edit: Just to add, occasional tasks (1 in 15?) are downloading ok and the majority (but not all - 9 of 10?) are uploading ok, so it's partial success in both directions, not complete failure. I don't know why this is either...
ID: 76991 · Rating: 0
Sid Celery
Joined: 11 Feb 08
Posts: 2146
Credit: 41,570,180
RAC: 8,210
Message 77006 - Posted: 13 Jul 2014, 0:50:21 UTC - in response to Message 76970.  

Download files getting truncated

07/07/2014 21:41:01 | rosetta@home | Sending scheduler request: To report completed tasks.
07/07/2014 21:41:01 | rosetta@home | Reporting 1 completed tasks
07/07/2014 21:41:01 | rosetta@home | Requesting new tasks for CPU
07/07/2014 21:41:03 | rosetta@home | Scheduler request completed: got 1 new tasks
07/07/2014 21:41:05 | rosetta@home | Started download of flags_rb_07_07_49373_95018__t000__2_C1_robetta
07/07/2014 21:41:05 | rosetta@home | Started download of input_rb_07_07_49373_95018__t000__2_C1_robetta.zip
07/07/2014 21:41:07 | rosetta@home | Incomplete read of 1754.000000 < 5KB for flags_rb_07_07_49373_95018__t000__2_C1_robetta - truncating
07/07/2014 21:41:07 | rosetta@home | Incomplete read of 1754.000000 < 5KB for input_rb_07_07_49373_95018__t000__2_C1_robetta.zip - truncating
07/07/2014 21:41:07 | rosetta@home | Finished download of flags_rb_07_07_49373_95018__t000__2_C1_robetta
07/07/2014 21:41:07 | rosetta@home | Finished download of input_rb_07_07_49373_95018__t000__2_C1_robetta.zip
07/07/2014 21:41:07 | rosetta@home | [error] File flags_rb_07_07_49373_95018__t000__2_C1_robetta has wrong size: expected 842, got 0
07/07/2014 21:41:07 | rosetta@home | [error] Checksum or signature error for flags_rb_07_07_49373_95018__t000__2_C1_robetta
07/07/2014 21:41:07 | rosetta@home | [error] File input_rb_07_07_49373_95018__t000__2_C1_robetta.zip has wrong size: expected 7511515, got 0
07/07/2014 21:41:07 | rosetta@home | [error] Checksum or signature error for input_rb_07_07_49373_95018__t000__2_C1_robetta.zip


This task then displays as "download failed" in the Boinc Tasks window

I also have one or two files that are failing to upload after the first 1.71 KB of 166.19 KB, for example frxtrimer_5_0979_b5r3_fold_SAVE_ALL_OUT_171800_974_0. As I checked these numbers I saw an upload of over 1 MB go through without problem.

I also seem to have massive problems with downloading

Me

Anyone else seeing this or only me?

Ok, I'm still not home yet, but I note the task quoted above has finally reported back, 5 days after completing, along with several others similarly stuck. I also note tasks stopped downloading 4 days ago and no more have come down even after this backlog cleared.

I'll report back tomorrow what the Event Log showed. I wasn't there to have done anything, unless a reboot was forced somehow.

Still completely mystified as it's been my most reliable computer up until now.

ID: 77006 · Rating: 0
Sid Celery
Joined: 11 Feb 08
Posts: 2146
Credit: 41,570,180
RAC: 8,210
Message 77007 - Posted: 13 Jul 2014, 4:51:56 UTC

And here's the relevant excerpt from the Event Log:
12/07/2014 16:02:10 | | Running CPU benchmarks
12/07/2014 16:02:10 | | Suspending computation - CPU benchmarks in progress
12/07/2014 16:02:42 | | Benchmark results:
12/07/2014 16:02:42 | | Number of CPUs: 8
12/07/2014 16:02:42 | | 2866 floating point MIPS (Whetstone) per CPU
12/07/2014 16:02:42 | | 8953 integer MIPS (Dhrystone) per CPU
12/07/2014 16:02:43 | | Resuming computation
12/07/2014 16:11:04 | rosetta@home | Started upload of rb_07_06_47737_93704__t000__1_C1_SAVE_ALL_OUT_IGNORE_THE_REST_173218_501_0_0
12/07/2014 16:11:04 | rosetta@home | Started upload of dbtriangle14_fold_SAVE_ALL_OUT_171750_2818_0_0
12/07/2014 16:11:09 | rosetta@home | Finished upload of dbtriangle14_fold_SAVE_ALL_OUT_171750_2818_0_0
12/07/2014 16:11:09 | rosetta@home | Started upload of rb_07_09_47791_93530_ab_stage0_t000___robetta_IGNORE_THE_REST_04_13_173782_6_0_0
12/07/2014 16:11:15 | rosetta@home | Finished upload of rb_07_06_47737_93704__t000__1_C1_SAVE_ALL_OUT_IGNORE_THE_REST_173218_501_0_0
12/07/2014 16:11:15 | rosetta@home | Started upload of frxtrimer_5_0979_b5r3_fold_SAVE_ALL_OUT_171800_974_0_0
12/07/2014 16:11:20 | rosetta@home | Finished upload of frxtrimer_5_0979_b5r3_fold_SAVE_ALL_OUT_171800_974_0_0
12/07/2014 16:11:20 | rosetta@home | Started upload of rb_07_06_47737_93704__t000__1_C1_SAVE_ALL_OUT_IGNORE_THE_REST_173218_33_0_0
12/07/2014 16:11:21 | rosetta@home | Finished upload of rb_07_09_47791_93530_ab_stage0_t000___robetta_IGNORE_THE_REST_04_13_173782_6_0_0
12/07/2014 16:11:21 | rosetta@home | Started upload of 140604.39._fold_and_dock_SAVE_ALL_OUT_168494_3996_0_0
12/07/2014 16:11:31 | rosetta@home | Finished upload of rb_07_06_47737_93704__t000__1_C1_SAVE_ALL_OUT_IGNORE_THE_REST_173218_33_0_0
12/07/2014 16:11:31 | rosetta@home | Started upload of dbtriangle14_fold_SAVE_ALL_OUT_171750_2140_0_0
12/07/2014 16:11:35 | rosetta@home | Finished upload of dbtriangle14_fold_SAVE_ALL_OUT_171750_2140_0_0
12/07/2014 16:11:40 | rosetta@home | Finished upload of 140604.39._fold_and_dock_SAVE_ALL_OUT_168494_3996_0_0
12/07/2014 16:11:41 | rosetta@home | Sending scheduler request: To report completed tasks.
12/07/2014 16:11:41 | rosetta@home | Reporting 7 completed tasks
12/07/2014 16:11:41 | rosetta@home | Not requesting tasks: some task is suspended via Manager
12/07/2014 16:11:44 | rosetta@home | Scheduler request completed
12/07/2014 16:30:43 | rosetta@home | Started upload of benchmark_0024_alex_metric_332d631779a7456b4ee7d35a7bbca2b0522dcada_brian_997258_0008_contact_opt_iteration_6_8026cb396f1048c895cf54d1496c0f3a_fold_SAVE_ALL_OUT_173209_2761_0_0
12/07/2014 16:30:47 | rosetta@home | Finished upload of benchmark_0024_alex_metric_332d631779a7456b4ee7d35a7bbca2b0522dcada_brian_997258_0008_contact_opt_iteration_6_8026cb396f1048c895cf54d1496c0f3a_fold_SAVE_ALL_OUT_173209_2761_0_0
12/07/2014 16:30:51 | rosetta@home | Sending scheduler request: To report completed tasks.
12/07/2014 16:30:51 | rosetta@home | Reporting 1 completed tasks
12/07/2014 16:30:51 | rosetta@home | Not requesting tasks: some task is suspended via Manager
12/07/2014 16:30:53 | rosetta@home | Scheduler request completed
12/07/2014 18:15:57 | rosetta@home | Started upload of benchmark_0024_alex_metric_332d631779a7456b4ee7d35a7bbca2b0522dcada_brian_997258_0008_contact_opt_iteration_6_8026cb396f1048c895cf54d1496c0f3a_fold_SAVE_ALL_OUT_173209_2888_0_0
12/07/2014 18:16:02 | rosetta@home | Finished upload of benchmark_0024_alex_metric_332d631779a7456b4ee7d35a7bbca2b0522dcada_brian_997258_0008_contact_opt_iteration_6_8026cb396f1048c895cf54d1496c0f3a_fold_SAVE_ALL_OUT_173209_2888_0_0
12/07/2014 18:16:02 | rosetta@home | Sending scheduler request: To report completed tasks.
12/07/2014 18:16:02 | rosetta@home | Reporting 1 completed tasks
12/07/2014 18:16:02 | rosetta@home | Not requesting tasks: some task is suspended via Manager
12/07/2014 18:16:05 | rosetta@home | Scheduler request completed

So, just new CPU benchmarks? That makes no sense to me. My AV did an update a few hours before - I suppose it's possible that cleared the blockage.

"Not requesting tasks: some task is suspended via Manager" also makes no sense to me. A manual update immediately delivered 6 tasks.

Bottom line is everything's back working. Panic over. Now to clear a stack of WCG jobs dl'd in the meantime before I fully get back on track here at Rosetta.
ID: 77007 · Rating: 0
Sid Celery
Joined: 11 Feb 08
Posts: 2146
Credit: 41,570,180
RAC: 8,210
Message 77028 - Posted: 18 Jul 2014, 3:33:29 UTC - in response to Message 76970.  

I also seem to have massive problems with downloading

Me

Anyone else seeing this or only me?

Happening to me again... :(
ID: 77028 · Rating: 0
Sid Celery
Joined: 11 Feb 08
Posts: 2146
Credit: 41,570,180
RAC: 8,210
Message 77044 - Posted: 20 Jul 2014, 2:30:58 UTC - in response to Message 77028.  

I also seem to have massive problems with downloading

Me

Anyone else seeing this or only me?

Happening to me again... :(

And sorted itself out again 2 days later. Again on a Saturday afternoon, like last week.

I wish I had the first clue what was going wrong and how it's righting itself without any intervention along the way. But I don't :(
ID: 77044 · Rating: 0
Murasaki
Joined: 20 Apr 06
Posts: 303
Credit: 511,418
RAC: 0
Message 77047 - Posted: 20 Jul 2014, 15:25:10 UTC - in response to Message 77044.  

And sorted itself out again 2 days later. Again on a Saturday afternoon, like last week.

I wish I had the first clue what was going wrong and how it's righting itself without any intervention along the way. But I don't :(


A pattern suggests a setting in BOINC or your system could be causing an issue.

Do you have any system maintenance tasks that are scheduled to occur on a Saturday?

If your problem is caused by the system being confused about available space in memory or hard disk then one of the maintenance tools may be prompting the system to realise that the space is available.


However, as a first step if the issue recurs, I would set the entry in the BOINC Manager Activity menu to "Network activity always available". If the problem resolves itself straight away then it must be an issue with your BOINC preferences.
ID: 77047 · Rating: 0
Sid Celery
Joined: 11 Feb 08
Posts: 2146
Credit: 41,570,180
RAC: 8,210
Message 77050 - Posted: 21 Jul 2014, 0:03:31 UTC - in response to Message 77047.  

And sorted itself out again 2 days later. Again on a Saturday afternoon, like last week.

I wish I had the first clue what was going wrong and how it's righting itself without any intervention along the way. But I don't :(

A pattern suggests a setting in BOINC or your system could be causing an issue.

Do you have any system maintenance tasks that are scheduled to occur on a Saturday?

If your problem is caused by the system being confused about available space in memory or hard disk then one of the maintenance tools may be prompting the system to realise that the space is available.

However, as a first step if the issue recurs, I would set the entry in the BOINC Manager Activity menu to "Network activity always available". If the problem resolves itself straight away then it must be an issue with your BOINC preferences.

Rather than the problem beginning on Saturday afternoon, that's when it resolves itself. I agree it may be some unknown scheduled task that's putting things right, but it may also be a coincidence. Two occasions is hardly representative.

Just stepped through the task scheduler - nothing ran around that time.

I've adjusted the Network activity option as you've suggested, just in case it's relevant. There shouldn't be any restrictions, but it's always possible I've missed something. Thanks.
ID: 77050 · Rating: 0
Sid Celery
Joined: 11 Feb 08
Posts: 2146
Credit: 41,570,180
RAC: 8,210
Message 77069 - Posted: 28 Jul 2014, 2:18:50 UTC

Not that I expect it's of any relevance to anyone except me, but everything ran smoothly this week.

None the wiser, mind...
ID: 77069 · Rating: 0
Ananas
Joined: 1 Jan 06
Posts: 232
Credit: 752,471
RAC: 0
Message 77072 - Posted: 29 Jul 2014, 4:03:04 UTC
Last modified: 29 Jul 2014, 4:05:47 UTC

http://srv4.bakerlab.org/rosetta_cgi/cgi gives me timeouts, both in BOINC and when I try it in the browser. Uploads need several attempts too. Someone standing on the cable?

p.s.: no trouble connecting to the Rosetta web site and the database seems to be fast also, so it must be only this one server that is in trouble.
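
One way to tell a single unreachable server apart from a general network problem is a plain TCP connection test against the host in the URL. The sketch below uses only the Python standard library; the function names and the 10 second timeout are assumptions, and a successful connect only shows that the port is open, not that the CGI behind it is answering.

```python
import socket
from urllib.parse import urlsplit


def host_and_port(url: str):
    """Extract (hostname, port) from a URL, defaulting to 80 or 443."""
    parts = urlsplit(url)
    port = parts.port or (443 if parts.scheme == "https" else 80)
    return parts.hostname, port


def can_connect(host: str, port: int, timeout: float = 10.0) -> bool:
    """Return True if a TCP connection to host:port opens within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and DNS lookup failures.
        return False
```

Calling `can_connect(*host_and_port("http://srv4.bakerlab.org/rosetta_cgi/cgi"))` and then the same for the main web site would show whether only the one host fails to accept connections.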
ID: 77072 · Rating: 0
P . P . L .
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 77073 - Posted: 29 Jul 2014, 4:23:52 UTC

I'm not able to upload or download either.

ID: 77073 · Rating: 0
P . P . L .
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 77074 - Posted: 29 Jul 2014, 5:45:02 UTC

This is what boinc is showing in messages now.

Tue 29 Jul 2014 15:13:08 EST | rosetta@home | Started upload of tube9_25_A_tube9_25_B_patchdock_split_00_140727_SAVE_ALL_OUT__179884_95_0_0
Tue 29 Jul 2014 15:15:08 EST | | Project communication failed: attempting access to reference site
Tue 29 Jul 2014 15:15:08 EST | rosetta@home | Temporarily failed upload of tube9_25_A_tube9_25_B_patchdock_split_00_140727_SAVE_ALL_OUT__179884_95_0_0: transient HTTP error
Tue 29 Jul 2014 15:15:08 EST | rosetta@home | Backing off 3 min 7 sec on upload of tube9_25_A_tube9_25_B_patchdock_split_00_140727_SAVE_ALL_OUT__179884_95_0_0
Tue 29 Jul 2014 15:15:10 EST | | Internet access OK - project servers may be temporarily down.

ID: 77074 · Rating: 0
Dougb
Joined: 29 Nov 07
Posts: 1
Credit: 5,189,007
RAC: 877
Message 77077 - Posted: 29 Jul 2014, 9:28:15 UTC

My PCs can't upload or report either; I get this error:

29/07/2014 5:26:25 PM | rosetta@home | Started upload of rb_07_27_48552_95186_ab_stage0_t000___robetta_IGNORE_THE_REST_12_13_179905_3_0_0
29/07/2014 5:26:25 PM | rosetta@home | Started upload of rb_07_27_48552_95186_ab_stage0_h002___robetta_IGNORE_THE_REST_09_15_179904_3_0_0
29/07/2014 5:26:47 PM | rosetta@home | Temporarily failed upload of rb_07_27_48552_95186_ab_stage0_h002___robetta_IGNORE_THE_REST_09_15_179904_3_0_0: connect() failed
29/07/2014 5:26:47 PM | rosetta@home | Backing off 00:13:42 on upload of rb_07_27_48552_95186_ab_stage0_h002___robetta_IGNORE_THE_REST_09_15_179904_3_0_0
29/07/2014 5:26:47 PM | rosetta@home | Started upload of tj_7_11_2helix_highRadius_X18_GB_16_DDD_3_e_fb_fragments_abinitio_SAVE_ALL_OUT_174854_765_0_0
29/07/2014 5:26:48 PM | rosetta@home | Temporarily failed upload of rb_07_27_48552_95186_ab_stage0_t000___robetta_IGNORE_THE_REST_12_13_179905_3_0_0: connect() failed
29/07/2014 5:26:48 PM | rosetta@home | Backing off 00:11:34 on upload of rb_07_27_48552_95186_ab_stage0_t000___robetta_IGNORE_THE_REST_12_13_179905_3_0_0
29/07/2014 5:26:48 PM | rosetta@home | Started upload of HELFOLD1376_5_fold_SAVE_ALL_OUT_179896_257_0_0
29/07/2014 5:26:52 PM | | Project communication failed: attempting access to reference site
29/07/2014 5:26:53 PM | | Internet access OK - project servers may be temporarily down.
ID: 77077 · Rating: 0
JohnH
Joined: 25 Mar 13
Posts: 43
Credit: 2,319,355
RAC: 0
Message 77078 - Posted: 29 Jul 2014, 10:20:22 UTC

Copy that ... no up/download this morning.
Typical event log.

7/29/2014 11:10:14 AM | | Project communication failed: attempting access to reference site
7/29/2014 11:10:14 AM | rosetta@home | Temporarily failed upload of HELFOLD1376_6_fold_SAVE_ALL_OUT_179898_984_0_0: connect() failed
7/29/2014 11:10:14 AM | rosetta@home | Backing off 01:10:18 on upload of HELFOLD1376_6_fold_SAVE_ALL_OUT_179898_984_0_0
7/29/2014 11:10:14 AM | rosetta@home | Temporarily failed upload of HELFOLD1376_7_fold_SAVE_ALL_OUT_179899_1090_0_0: connect() failed
7/29/2014 11:10:14 AM | rosetta@home | Backing off 00:56:15 on upload of HELFOLD1376_7_fold_SAVE_ALL_OUT_179899_1090_0_0
7/29/2014 11:10:16 AM | | Internet access OK - project servers may be temporarily down.


ID: 77078 · Rating: 0
Eric Detheridge
Joined: 26 Aug 12
Posts: 2
Credit: 1,975,060
RAC: 0
Message 77079 - Posted: 29 Jul 2014, 11:22:44 UTC

No uploads for me either, with similar errors.

Tue 29 Jul 2014 06:14:55 AM CDT | | Project communication failed: attempting access to reference site
Tue 29 Jul 2014 06:14:55 AM CDT | rosetta@home | Temporarily failed upload of rb_07_28_48577_95038_ab_stage0_h001___robetta_IGNORE_THE_REST_11_13_179916_5_0_0: connect() failed
Tue 29 Jul 2014 06:14:55 AM CDT | rosetta@home | Backing off 24 min 40 sec on upload of rb_07_28_48577_95038_ab_stage0_h001___robetta_IGNORE_THE_REST_11_13_179916_5_0_0
Tue 29 Jul 2014 06:14:55 AM CDT | rosetta@home | Temporarily failed upload of rb_07_28_48577_95038_ab_stage0_h002___robetta_IGNORE_THE_REST_11_11_179917_4_0_0: connect() failed
Tue 29 Jul 2014 06:14:55 AM CDT | rosetta@home | Backing off 12 min 9 sec on upload of rb_07_28_48577_95038_ab_stage0_h002___robetta_IGNORE_THE_REST_11_11_179917_4_0_0
Tue 29 Jul 2014 06:14:56 AM CDT | | Internet access OK - project servers may be temporarily down.
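
The back-off delays in these logs appear in two layouts, `24 min 40 sec` here and `00:13:42` in some of the posts above. Below is a small sketch for converting either to seconds, assuming only these two layouts occur (other BOINC versions may print something else); the function name is illustrative.

```python
import re
from typing import Optional

# "Backing off 00:13:42 on upload of ..."
_CLOCK = re.compile(r"Backing off (\d+):(\d{2}):(\d{2}) on ")
# "Backing off 3 min 7 sec on upload of ..." (minutes treated as optional)
_WORDS = re.compile(r"Backing off (?:(\d+) min )?(\d+) sec on ")


def backoff_seconds(message: str) -> Optional[int]:
    """Return the back-off delay in seconds, or None if no delay is found."""
    match = _CLOCK.search(message)
    if match:
        hours, minutes, seconds = (int(g) for g in match.groups())
        return hours * 3600 + minutes * 60 + seconds
    match = _WORDS.search(message)
    if match:
        minutes = int(match.group(1) or 0)
        return minutes * 60 + int(match.group(2))
    return None


if __name__ == "__main__":
    print(backoff_seconds("Backing off 24 min 40 sec on upload of example_file"))
    print(backoff_seconds("Backing off 00:13:42 on upload of example_file"))
```

Comparing these values across clients shows how widely the retry delays vary, which is expected if each failed transfer is backed off independently.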
ID: 77079 · Rating: 0
Greg_BE
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 77081 - Posted: 29 Jul 2014, 13:08:10 UTC
Last modified: 29 Jul 2014, 13:08:47 UTC

Same here. I thought it might be a bug in my system for uploading, but even task requests hit a wall. (Times are CEST, GMT+2.)

7/29/2014 3:04:23 PM | rosetta@home | Requesting new tasks for CPU and NVIDIA
7/29/2014 3:04:45 PM | rosetta@home | Scheduler request failed: Couldn't connect to server
7/29/2014 3:04:46 PM | | Project communication failed: attempting access to reference site
7/29/2014 3:04:47 PM | | Internet access OK - project servers may be temporarily down.
ID: 77081 · Rating: 0



©2024 University of Washington
https://www.bakerlab.org