minirosetta 2.03

Message boards : Number crunching : minirosetta 2.03

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · Next

AuthorMessage
LogixGeer

Send message
Joined: 29 Jan 09
Posts: 1
Credit: 453,984
RAC: 0
Message 64508 - Posted: 15 Dec 2009, 15:17:25 UTC

Same here:

Message from server: Server error: can't attach shared memory
ID: 64508 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Kristaps

Send message
Joined: 11 Jun 07
Posts: 1
Credit: 81,256
RAC: 0
Message 64509 - Posted: 15 Dec 2009, 15:29:36 UTC

Tue 15 Dec 2009 05:24:30 PM EET|rosetta@home|Message from server: Server error: can't attach shared memory
ID: 64509 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Panoramix

Send message
Joined: 4 Dec 07
Posts: 1
Credit: 12,900,351
RAC: 0
Message 64511 - Posted: 15 Dec 2009, 16:52:04 UTC

Same here, all computer respond with:
Message from server: Server error: can't attach shared memory
ID: 64511 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
BarryAZ

Send message
Joined: 27 Dec 05
Posts: 153
Credit: 30,219,950
RAC: 1,855
Message 64512 - Posted: 15 Dec 2009, 17:00:06 UTC - in response to Message 64511.  

OK --so anyone reporting can confirm the error. Haven't seen anything in the way of an acknowledging response from the project though.

Same here, all computer respond with:
Message from server: Server error: can't attach shared memory


ID: 64512 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
MrWizard

Send message
Joined: 30 Oct 05
Posts: 3
Credit: 123,787
RAC: 0
Message 64515 - Posted: 15 Dec 2009, 17:06:03 UTC - in response to Message 64512.  

OK --so anyone reporting can confirm the error. Haven't seen anything in the way of an acknowledging response from the project though.

Same here, all computer respond with:
Message from server: Server error: can't attach shared memory



It's 9:05am at their location now. Should get some action soon...
ID: 64515 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mike Tyka

Send message
Joined: 20 Oct 05
Posts: 96
Credit: 2,190
RAC: 0
Message 64516 - Posted: 15 Dec 2009, 17:07:55 UTC

Hi! 9.07 here now :)

Not sure what the problem is - we're looking into it now. THis version
worked fine on RALPH, so it we suspect something went awry during the actual update.


http://beautifulproteins.blogspot.com/
http://www.miketyka.com/
ID: 64516 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Darmok

Send message
Joined: 4 Sep 09
Posts: 6
Credit: 222,205
RAC: 0
Message 64517 - Posted: 15 Dec 2009, 17:08:44 UTC - in response to Message 64515.  

OK --so anyone reporting can confirm the error. Haven't seen anything in the way of an acknowledging response from the project though.

Same here, all computer respond with:
Message from server: Server error: can't attach shared memory



It's 9:05am at their location now. Should get some action soon...


I was just going to say that. It's now been 15 hours w/out comm. They will let us know soon...
ID: 64517 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
MrWizard

Send message
Joined: 30 Oct 05
Posts: 3
Credit: 123,787
RAC: 0
Message 64518 - Posted: 15 Dec 2009, 17:25:54 UTC - in response to Message 64516.  

Hi! 9.07 here now :)

Not sure what the problem is - we're looking into it now. THis version
worked fine on RALPH, so it we suspect something went awry during the actual update.


I'm under the impression it's a server problem not an application problem. Correct me if I'm wrong...
ID: 64518 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
bill brandt-gasuen

Send message
Joined: 9 Jun 09
Posts: 1
Credit: 6,250,117
RAC: 37,032
Message 64520 - Posted: 15 Dec 2009, 18:05:32 UTC

So how do we remedy this situation? Is there something we can do on our end or do we just sit tight in the rowboat waiting to be rescued? If WUs were passengers, I've got a cruise ship full of passengers that need evacuation! Googling brings to light past similar occurrences where tinkering directly with the WUs solved the problem, but what I'm picking up here doesn't seem to indicate a physical server switch as much as a software issue.
ID: 64520 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 1338
Credit: 24,634,659
RAC: 11,928
Message 64521 - Posted: 15 Dec 2009, 18:07:39 UTC - in response to Message 64518.  

I'm under the impression it's a server problem not an application problem. Correct me if I'm wrong...

Sounds correct to me. Every post in this thread has nothing to do with 2.03 yet.

My uploads have all gone through now. Now for the big fight over new WUs!
ID: 64521 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mike Tyka

Send message
Joined: 20 Oct 05
Posts: 96
Credit: 2,190
RAC: 0
Message 64522 - Posted: 15 Dec 2009, 18:12:14 UTC

Ok, it appears we had to many old application backlogged. It was indeed a server problem - it should be resolved now :) - sorry for the hick up.
http://beautifulproteins.blogspot.com/
http://www.miketyka.com/
ID: 64522 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Aroundomaha

Send message
Joined: 11 Sep 08
Posts: 14
Credit: 54,472,973
RAC: 13,531
Message 64525 - Posted: 15 Dec 2009, 20:31:43 UTC - in response to Message 64478.  

THis version fixes a stackoverflow error that we didn't catch in 2.02.

Please post issues here, thanks !


I'm seeing work units rolling in again. Thank you to the Rosetta team for a quick resolution.
ID: 64525 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Speedy
Avatar

Send message
Joined: 25 Sep 05
Posts: 161
Credit: 695,102
RAC: 177
Message 64527 - Posted: 15 Dec 2009, 22:59:12 UTC

Is the stack overflow issue the reason why mix_score13_env_rlbd_1hz6__IGNORE_THE_RESTlr8_DECOY_16523_77_09 made 129 decoys from 129 attempts?

Have a crunching good day!!
ID: 64527 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 1338
Credit: 24,634,659
RAC: 11,928
Message 64529 - Posted: 16 Dec 2009, 1:15:39 UTC

One machine, having run out of work, started a 2.03 WU ver yquickly and closed just as quickly after just 12 decoys. A validate error, but no error messages within the task details:

mix_score13_hb_rlbd_1shf__IGNORE_THE_RESTlr13_DECOY_16324_352_1
ID: 64529 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mike Tyka

Send message
Joined: 20 Oct 05
Posts: 96
Credit: 2,190
RAC: 0
Message 64530 - Posted: 16 Dec 2009, 2:37:52 UTC - in response to Message 64527.  

Is the stack overflow issue the reason why mix_score13_env_rlbd_1hz6__IGNORE_THE_RESTlr8_DECOY_16523_77_09 made 129 decoys from 129 attempts?


No, but 129 decoys is a good thing - isn't it ?

http://beautifulproteins.blogspot.com/
http://www.miketyka.com/
ID: 64530 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
AMD_is_logical

Send message
Joined: 20 Dec 05
Posts: 299
Credit: 31,460,681
RAC: 0
Message 64531 - Posted: 16 Dec 2009, 4:30:22 UTC

This broker_idealclose_kic_in20_hb_t308__IGNORE_THE_REST_16512_810 WU gave an error for both crunchers.
ID: 64531 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Speedy
Avatar

Send message
Joined: 25 Sep 05
Posts: 161
Credit: 695,102
RAC: 177
Message 64532 - Posted: 16 Dec 2009, 5:40:52 UTC - in response to Message 64530.  
Last modified: 16 Dec 2009, 5:44:38 UTC



No, but 129 decoys is a good thing - isn't it ?

Absolutely. Reason I asked was because I thought there was a 100 decoy limit for tasks that had a high model count. I think it was limited because there were upload problems for tasks that had over 100 decoys.
Have a crunching good day!!
ID: 64532 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
jjwhalen
Avatar

Send message
Joined: 20 Dec 06
Posts: 4
Credit: 399,398
RAC: 0
Message 64537 - Posted: 16 Dec 2009, 18:34:18 UTC

(Hint) It sure would be great if someone from project administration would comment in this forum about this issue, even if just to say "we're looking at the problem."
Best wishes:)

ID: 64537 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
svincent

Send message
Joined: 30 Dec 05
Posts: 217
Credit: 8,755,202
RAC: 5,851
Message 64538 - Posted: 16 Dec 2009, 19:24:34 UTC

304932456 (lr8_combine_smooth_torsion_it00_rama02_A_rlbd_2hng_IGNORE_THE_REST_DECOY_14887_678_2) failed on Mac OS X 10.6

Unpacking data: ../../projects/boinc.bakerlab.org_rosetta/yfsong_lr8_combine_smooth_torsion_it00_rama02_A.zip
Unpacking data: ../../projects/boinc.bakerlab.org_rosetta/lr8_2hng.out.zip
Setting database description ...
Setting up checkpointing ...
Setting up graphics native ...
std::cerr: Exception was thrown:
failure to read decoy F_00003_0004346_0 from silent-file lr8_2hng.out

</stderr_txt>
]]>
ID: 64538 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Caius Corp.

Send message
Joined: 10 Dec 05
Posts: 1
Credit: 327,656
RAC: 0
Message 64539 - Posted: 16 Dec 2009, 20:30:26 UTC

I get an issue on this workunit during CPU benchmarks:
mer. 16 déc. 2009 19:57:17 CET||Running CPU benchmarks
mer. 16 déc. 2009 19:57:17 CET||Suspending computation - running CPU benchmarks
mer. 16 déc. 2009 19:57:28 CET|rosetta@home|Task mix_score13_env_rlbd_2apb__IGNORE_THE_RESTlr10_DECOY_16523_731_0: no shared memory segment
mer. 16 déc. 2009 19:57:28 CET|rosetta@home|Task mix_score13_env_rlbd_2apb__IGNORE_THE_RESTlr10_DECOY_16523_731_0 exited with zero status but no 'finished' file
mer. 16 déc. 2009 19:57:28 CET|rosetta@home|If this happens repeatedly you may need to reset the project.
mer. 16 déc. 2009 19:57:49 CET||Benchmark results:
mer. 16 déc. 2009 19:57:49 CET|| Number of CPUs: 2
mer. 16 déc. 2009 19:57:49 CET|| 2217 floating point MIPS (Whetstone) per CPU
mer. 16 déc. 2009 19:57:49 CET|| 6149 integer MIPS (Dhrystone) per CPU
mer. 16 déc. 2009 19:57:50 CET||Resuming computation


But now the task is still running well on Rosetta mini 2.03, no other message.
ID: 64539 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 · 2 · 3 · 4 · Next

Message boards : Number crunching : minirosetta 2.03



©2020 University of Washington
https://www.bakerlab.org