Doesn't work after

Message boards : Number crunching : Doesn't work after

To post messages, you must log in.

AuthorMessage
Oscar Curero

Send message
Joined: 11 Oct 05
Posts: 2
Credit: 20,060
RAC: 0
Message 1855 - Posted: 27 Oct 2005, 22:29:19 UTC
Last modified: 27 Oct 2005, 22:29:52 UTC

Hi,

I use rosetta 14 hours a day (between 23:00 and 13:00). After the pause, rosetta doesn't use the cpu, although the boinc client is up, running and loggin. Here is the log:

2005-10-26 02:30:41 [---] Starting BOINC client version 5.2.4 for i686-pc-linux-gnu
2005-10-26 02:30:41 [---] libcurl/7.14.0 OpenSSL/0.9.8 zlib/1.2.3
2005-10-26 02:30:41 [---] Data directory: /usr/share/BOINC
2005-10-26 02:30:41 [---] Processor: 1 AuthenticAMD AMD Athlon(tm) XP 3000+
2005-10-26 02:30:41 [---] Memory: 1035 MB physical, 250.98 MB virtual
2005-10-26 02:30:41 [---] Disk: 10.00 GB total, 1.73 GB free
2005-10-26 02:30:41 [rosetta@home] Computer ID: 34189; location: home; project prefs: default
2005-10-26 02:30:41 [---] General prefs: from rosetta@home (last modified 2005-10-24 01:58:17)
2005-10-26 02:30:41 [---] General prefs: no separate prefs for home; using your defaults
2005-10-26 02:30:41 [---] Remote control not allowed; using loopback address
2005-10-26 02:30:41 [rosetta@home] Resuming computation for result 1hz6A_abrelaxmode_random_gauss_sim_aneal_17658_1 using rosetta version 478
2005-10-26 09:50:55 [---] request_reschedule_cpus: process exited
2005-10-26 09:50:55 [rosetta@home] Computation for result 1hz6A_abrelaxmode_random_gauss_sim_aneal_17658_1 finished
2005-10-26 09:50:55 [rosetta@home] Starting result 1hz6A_abrelaxmode_random_length10_34680_0 using rosetta version 478
2005-10-26 09:50:57 [rosetta@home] Started upload of 1hz6A_abrelaxmode_random_gauss_sim_aneal_17658_1_0
2005-10-26 09:51:09 [rosetta@home] Finished upload of 1hz6A_abrelaxmode_random_gauss_sim_aneal_17658_1_0
2005-10-26 09:51:09 [rosetta@home] Throughput 8709 bytes/sec
2005-10-26 10:56:48 [rosetta@home] Sending scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi
2005-10-26 10:56:48 [rosetta@home] Reason: To fetch work
2005-10-26 10:56:48 [rosetta@home] Requesting 9075 seconds of new work, and reporting 1 results
2005-10-26 10:56:53 [rosetta@home] Scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi succeeded
2005-10-26 10:56:54 [---] request_reschedule_cpus: files downloaded
2005-10-26 10:56:58 [rosetta@home] Deferring communication with project for 1 seconds
2005-10-26 12:48:12 [---] request_reschedule_cpus: process exited
2005-10-26 12:48:12 [rosetta@home] Computation for result 1hz6A_abrelaxmode_random_length10_34680_0 finished
2005-10-26 12:48:12 [rosetta@home] Starting result 1hz6A_abrelaxmode_random_length05_jitter10_03956_0 using rosetta version 478
2005-10-26 12:48:15 [rosetta@home] Started upload of 1hz6A_abrelaxmode_random_length10_34680_0_0
2005-10-26 12:48:22 [rosetta@home] Finished upload of 1hz6A_abrelaxmode_random_length10_34680_0_0
2005-10-26 12:48:22 [rosetta@home] Throughput 8496 bytes/sec
2005-10-26 13:00:00 [---] Suspending computation and network activity - time of day
2005-10-26 13:00:00 [rosetta@home] Pausing result 1hz6A_abrelaxmode_random_length05_jitter10_03956_0 (removed from memory)
2005-10-26 23:00:00 [---] Resuming computation and network activity
2005-10-26 23:00:00 [---] request_reschedule_cpus: Resuming activities
2005-10-27 12:48:28 [rosetta@home] Sending scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi
2005-10-27 12:48:28 [rosetta@home] Reason: To report results
2005-10-27 12:48:28 [rosetta@home] Reporting 1 results
2005-10-27 12:48:33 [rosetta@home] Scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi succeeded
2005-10-27 12:48:38 [rosetta@home] Deferring communication with project for 1 seconds
2005-10-27 13:00:00 [---] Suspending computation and network activity - time of day
2005-10-27 13:00:00 [rosetta@home] Pausing result 1hz6A_abrelaxmode_random_length05_jitter10_03956_0 (removed from memory)
2005-10-27 23:00:00 [---] Resuming computation and network activity
2005-10-27 23:00:01 [---] request_reschedule_cpus: Resuming activities


If I restart the boinc client, eveything is again normal: boinc uses the cpu (about 90%). Anyone else with this problem?

Thanks



ID: 1855 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
eberndl
Avatar

Send message
Joined: 17 Sep 05
Posts: 47
Credit: 2,593,821
RAC: 2,420
Message 1857 - Posted: 27 Oct 2005, 23:22:46 UTC - in response to Message 1855.  


2005-10-26 13:00:00 [---] Suspending computation and network activity - time of day
2005-10-26 13:00:00 [rosetta@home] Pausing result 1hz6A_abrelaxmode_random_length05_jitter10_03956_0 (removed from memory)
2005-10-26 23:00:00 [---] Resuming computation and network activity
2005-10-26 23:00:00 [---] request_reschedule_cpus: Resuming activities


From your message, it implies that you'd rather crunch 24/7, so what you have to do is go to your account and go to you General Preferances, and change
"Do work only between the hours of" so that the start time and end time are equal. You will then be up and crunching full time.

Good luck!



Questions? Try the Wiki!
Take a look inside my brain
ID: 1857 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Tern
Avatar

Send message
Joined: 25 Oct 05
Posts: 575
Credit: 4,507,064
RAC: 1,089
Message 1863 - Posted: 28 Oct 2005, 7:30:46 UTC - in response to Message 1855.  

2005-10-27 13:00:00 [---] Suspending computation and network activity - time of day
2005-10-27 13:00:00 [rosetta@home] Pausing result 1hz6A_abrelaxmode_random_length05_jitter10_03956_0 (removed from memory)
2005-10-27 23:00:00 [---] Resuming computation and network activity
2005-10-27 23:00:01 [---] request_reschedule_cpus: Resuming activities[/code]

If I restart the boinc client, eveything is again normal: boinc uses the cpu (about 90%). Anyone else with this problem?


If I understand your question, you're saying that when 23:00 comes and it's supposed to start working again, it doesn't...

I see you're running 5.2.4 Linux, and removing from memory. I would first try leaving the app in memory (if you can) and see if that solves it. Next, being Linux, I'd probably write a script to kill BOINC entirely at 13:30, and just restart the whole thing at 23:00 - that's a "workaround" that you could use until the bug is found/fixed.

Keep us informed if "leave in memory" fixes it; if it does, this should be reported on the BOINC board as a bug. (Doesn't sound like a Rosetta issue. I'd be curious if you saw the same thing with another project.) If it doesn't, it could be any number of things, someone more Linux-literate than I will have to step in to help.

ID: 1863 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Oscar Curero

Send message
Joined: 11 Oct 05
Posts: 2
Credit: 20,060
RAC: 0
Message 2024 - Posted: 2 Nov 2005, 0:24:36 UTC - in response to Message 1863.  

Keep us informed if "leave in memory" fixes it; if it does, this should be reported on the BOINC board as a bug. (Doesn't sound like a Rosetta issue. I'd be curious if you saw the same thing with another project.) If it doesn't, it could be any number of things, someone more Linux-literate than I will have to step in to help.


Leaving it in memory fixed the problem! :))))))

Where's the BOINC bug tracker?


(Big Thanks!)

ID: 2024 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Tern
Avatar

Send message
Joined: 25 Oct 05
Posts: 575
Credit: 4,507,064
RAC: 1,089
Message 2025 - Posted: 2 Nov 2005, 1:21:59 UTC - in response to Message 2024.  
Last modified: 2 Nov 2005, 1:24:12 UTC


Leaving it in memory fixed the problem! :))))))

Where's the BOINC bug tracker?


In general, BOINC Manager problems are handled at BOINC boards - but in this case, there's no need to post there as bug #408 has been entered for this on the developer's bug database, and this Rosetta thread address given as the source. The text is:

When prefs set to "do work between" two times, work will stop correctly, but when it's time to restart, gives "Resuming computation and network activity" and "equest_reschedule_cpus: Resuming activities" messages, but no project begins running. Restarting Manager starts work. "Leave in memory" solves problem.

ID: 2025 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Number crunching : Doesn't work after



©2024 University of Washington
https://www.bakerlab.org