Waiting to run

Questions and Answers : Unix/Linux : Waiting to run

To post messages, you must log in.

AuthorMessage
hillhopper.new

Send message
Joined: 9 May 14
Posts: 10
Credit: 115,488
RAC: 0
Message 76722 - Posted: 14 May 2014, 14:42:57 UTC

I had a power supply failure and the boinc client/manager shut down abruptly. Upon completing repairs, the system booted successfully, but the rosetta@home client seems stuck on a task. It is stopped and states, "Waiting to run" after having done 5.257%. Anyone recommend abort and reinstall? Or, what?
ID: 76722 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
hillhopper.new

Send message
Joined: 9 May 14
Posts: 10
Credit: 115,488
RAC: 0
Message 76726 - Posted: 15 May 2014, 1:07:59 UTC - in response to Message 76722.  

I had a power supply failure and the boinc client/manager shut down abruptly. Upon completing repairs, the system booted successfully, but the rosetta@home client seems stuck on a task. It is stopped and states, "Waiting to run" after having done 5.257%. Anyone recommend abort and reinstall? Or, what?

I just reset the project and it is back to crunching. Happy crunching!
ID: 76726 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
hillhopper.new

Send message
Joined: 9 May 14
Posts: 10
Credit: 115,488
RAC: 0
Message 76731 - Posted: 16 May 2014, 1:18:10 UTC - in response to Message 76726.  

I had a power supply failure and the boinc client/manager shut down abruptly. Upon completing repairs, the system booted successfully, but the rosetta@home client seems stuck on a task. It is stopped and states, "Waiting to run" after having done 5.257%. Anyone recommend abort and reinstall? Or, what?

I just reset the project and it is back to crunching. Happy crunching!


It's waiting again. There was a disturbance in the system hardware setup and reboots of both dsl modem and computer. I'll do the same as last time. I'll abort the task at its current progress of 3.667%, and continue with the next and following tasks. If there is a better way to overcome these halts, please post a reply.
ID: 76731 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 76732 - Posted: 16 May 2014, 2:57:02 UTC
Last modified: 16 May 2014, 3:00:30 UTC

When BOINC Manager reports a status of "waiting to run" it is an indication that it has intentionally stopped running the task at the moment. This happens when tasks from other projects need to run, or when running the task would push your system outside of the CPU or memory limits you have configured.

It looks like your machine has 4 CPUs, but is reporting only 1GB of memory? If that reported memory is accurate, you do not have enough memory available to consistently run 4 tasks at the same time if this is what you are trying to do.

So what may be happening is that when you first reset the project, all WUs are new. BOINC Manager starts up 4 tasks (assuming you've configured it to use all of the CPUs), and at first the tasks don't need much memory. They have to do some initialization work before they really get to crunching. So they run for a while. But then as they complete the initialization, they start to each use more and more memory. Eventually the sum of the 4 exceeds both your physical memory and configured memory that BOINC is allowed to use. So BOINC Manager suspends one of the tasks. Thus reducing required memory of the remaining 3 tasks. Then, it is likely the same happens again. The 3 active tasks grow to exceed your memory and another is suspended.

The BOINC Manager will get back to running the task when resources are available to do so. But if your machine really only has 1GB of memory, I'd suggest that Rosetta tasks are too memory intensive to run well for you. If you wish to keep all 4 CPUs humming, I'd suggest you look for BOINC projects with lower memory requirements. World Community Grid seems to often have subprojects with small memory requirements (<100MB per active task in many cases, some less than 50MB).
Rosetta Moderator: Mod.Sense
ID: 76732 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
hillhopper.new

Send message
Joined: 9 May 14
Posts: 10
Credit: 115,488
RAC: 0
Message 76734 - Posted: 16 May 2014, 21:22:27 UTC - in response to Message 76732.  

When BOINC Manager reports a status of "waiting to run" it is an indication that it has intentionally stopped running the task at the moment. This happens when tasks from other projects need to run, or when running the task would push your system outside of the CPU or memory limits you have configured.

It looks like your machine has 4 CPUs, but is reporting only 1GB of memory? If that reported memory is accurate, you do not have enough memory available to consistently run 4 tasks at the same time if this is what you are trying to do.

So what may be happening is that when you first reset the project, all WUs are new. BOINC Manager starts up 4 tasks (assuming you've configured it to use all of the CPUs), and at first the tasks don't need much memory. They have to do some initialization work before they really get to crunching. So they run for a while. But then as they complete the initialization, they start to each use more and more memory. Eventually the sum of the 4 exceeds both your physical memory and configured memory that BOINC is allowed to use. So BOINC Manager suspends one of the tasks. Thus reducing required memory of the remaining 3 tasks. Then, it is likely the same happens again. The 3 active tasks grow to exceed your memory and another is suspended.

The BOINC Manager will get back to running the task when resources are available to do so. But if your machine really only has 1GB of memory, I'd suggest that Rosetta tasks are too memory intensive to run well for you. If you wish to keep all 4 CPUs humming, I'd suggest you look for BOINC projects with lower memory requirements. World Community Grid seems to often have subprojects with small memory requirements (<100MB per active task in many cases, some less than 50MB).


I have two comments. Why does it not use swap memory on the HDD? If I buy more DDR chips and increase my live memory, it should perform as necessary. This board has four slots and can handle a max of 16GB. When the funds are available, I'll add memory.
ID: 76734 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 76736 - Posted: 17 May 2014, 2:29:09 UTC

Well, any task that runs uses some swap space. But, unless you've changed them, the BOINC preferences don't push things to the point of trying to run like that. This is because the hard drive would quickly wear out, and the processing would be very slow. To the point that you'd make better progress just running a single task, and staying more or less within the physical memory of the machine.

It is rather unusual to see a 4 CPU system with less than 4GB of memory. Did something get fried with your power supply? I mean Rosetta isn't the only application that will have problems running on 4 CPUs all sharing 1GB.

In the meantime, you can configure BOINC Manager to only run on a single CPU. Or you can let it try to run what it can, and accept that much of the time, you will only see a single task getting CPU. Be sure you set the flag to keep suspended tasks in "memory" (swap space actually). Otherwise their work in progress gets lost when they have to be suspended, and in such an environment, BOINC Manager will be suspending tasks frequently.
Rosetta Moderator: Mod.Sense
ID: 76736 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
hillhopper.new

Send message
Joined: 9 May 14
Posts: 10
Credit: 115,488
RAC: 0
Message 76738 - Posted: 17 May 2014, 12:44:48 UTC - in response to Message 76736.  

Well, any task that runs uses some swap space. But, unless you've changed them, the BOINC preferences don't push things to the point of trying to run like that. This is because the hard drive would quickly wear out, and the processing would be very slow. To the point that you'd make better progress just running a single task, and staying more or less within the physical memory of the machine.

It is rather unusual to see a 4 CPU system with less than 4GB of memory. Did something get fried with your power supply? I mean Rosetta isn't the only application that will have problems running on 4 CPUs all sharing 1GB.

In the meantime, you can configure BOINC Manager to only run on a single CPU. Or you can let it try to run what it can, and accept that much of the time, you will only see a single task getting CPU. Be sure you set the flag to keep suspended tasks in "memory" (swap space actually). Otherwise their work in progress gets lost when they have to be suspended, and in such an environment, BOINC Manager will be suspending tasks frequently.


I did what you stated and it's working one task at a time. I did fry the power supply, but had a spare in another machine to have it back up in an hour or so. Meanwhile I'm waiting to get another stick of memory. I have hyperx 1066 mhz installed and the match is pricey. But low budget perseverance usually wins. Thanks for the assistance.
ID: 76738 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
hillhopper.new

Send message
Joined: 9 May 14
Posts: 10
Credit: 115,488
RAC: 0
Message 76741 - Posted: 17 May 2014, 16:14:00 UTC - in response to Message 76736.  

Well, any task that runs uses some swap space. But, unless you've changed them, the BOINC preferences don't push things to the point of trying to run like that. This is because the hard drive would quickly wear out, and the processing would be very slow. To the point that you'd make better progress just running a single task, and staying more or less within the physical memory of the machine.

It is rather unusual to see a 4 CPU system with less than 4GB of memory. Did something get fried with your power supply? I mean Rosetta isn't the only application that will have problems running on 4 CPUs all sharing 1GB.

In the meantime, you can configure BOINC Manager to only run on a single CPU. Or you can let it try to run what it can, and accept that much of the time, you will only see a single task getting CPU. Be sure you set the flag to keep suspended tasks in "memory" (swap space actually). Otherwise their work in progress gets lost when they have to be suspended, and in such an environment, BOINC Manager will be suspending tasks frequently.


Juggling the speed, volume, and temps of the system, here's a snap of my desktop:http://s25.postimg.org/t8rzc400f/Screenshot_05172014_12_00_25_PM.png
ID: 76741 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
hillhopper.new

Send message
Joined: 9 May 14
Posts: 10
Credit: 115,488
RAC: 0
Message 76742 - Posted: 17 May 2014, 17:14:33 UTC - in response to Message 76741.  

Well, any task that runs uses some swap space. But, unless you've changed them, the BOINC preferences don't push things to the point of trying to run like that. This is because the hard drive would quickly wear out, and the processing would be very slow. To the point that you'd make better progress just running a single task, and staying more or less within the physical memory of the machine.

It is rather unusual to see a 4 CPU system with less than 4GB of memory. Did something get fried with your power supply? I mean Rosetta isn't the only application that will have problems running on 4 CPUs all sharing 1GB.

In the meantime, you can configure BOINC Manager to only run on a single CPU. Or you can let it try to run what it can, and accept that much of the time, you will only see a single task getting CPU. Be sure you set the flag to keep suspended tasks in "memory" (swap space actually). Otherwise their work in progress gets lost when they have to be suspended, and in such an environment, BOINC Manager will be suspending tasks frequently.


Juggling the speed, volume, and temps of the system, here's a snap of my desktop:http://s25.postimg.org/t8rzc400f/Screenshot_05172014_12_00_25_PM.png


acpitz-virtual-0
Adapter: Virtual device
temp1: +51.0°C (crit = +127.0°C)

k10temp-pci-00c3
Adapter: PCI adapter
temp1: +57.0°C (high = +70.0°C)

nouveau-pci-0100
Adapter: PCI adapter
temp1: +54.0°C (high = +100.0°C, crit = +120.0°C)
ID: 76742 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
hillhopper.new

Send message
Joined: 9 May 14
Posts: 10
Credit: 115,488
RAC: 0
Message 76743 - Posted: 17 May 2014, 17:22:15 UTC - in response to Message 76742.  

Well, any task that runs uses some swap space. But, unless you've changed them, the BOINC preferences don't push things to the point of trying to run like that. This is because the hard drive would quickly wear out, and the processing would be very slow. To the point that you'd make better progress just running a single task, and staying more or less within the physical memory of the machine.

It is rather unusual to see a 4 CPU system with less than 4GB of memory. Did something get fried with your power supply? I mean Rosetta isn't the only application that will have problems running on 4 CPUs all sharing 1GB.

In the meantime, you can configure BOINC Manager to only run on a single CPU. Or you can let it try to run what it can, and accept that much of the time, you will only see a single task getting CPU. Be sure you set the flag to keep suspended tasks in "memory" (swap space actually). Otherwise their work in progress gets lost when they have to be suspended, and in such an environment, BOINC Manager will be suspending tasks frequently.


Juggling the speed, volume, and temps of the system, here's a snap of my desktop:http://s25.postimg.org/t8rzc400f/Screenshot_05172014_12_00_25_PM.png


acpitz-virtual-0
Adapter: Virtual device
temp1: +51.0°C (crit = +127.0°C)

k10temp-pci-00c3
Adapter: PCI adapter
temp1: +57.0°C (high = +70.0°C)

nouveau-pci-0100
Adapter: PCI adapter
temp1: +54.0°C (high = +100.0°C, crit = +120.0°C)


So, now I have three cores running three tasks, and 81% memory, 44%swap in use.
ID: 76743 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
hillhopper.new

Send message
Joined: 9 May 14
Posts: 10
Credit: 115,488
RAC: 0
Message 76750 - Posted: 19 May 2014, 17:52:04 UTC - in response to Message 76736.  
Last modified: 19 May 2014, 17:55:22 UTC

Well, any task that runs uses some swap space. But, unless you've changed them, the BOINC preferences don't push things to the point of trying to run like that. This is because the hard drive would quickly wear out, and the processing would be very slow. To the point that you'd make better progress just running a single task, and staying more or less within the physical memory of the machine.

It is rather unusual to see a 4 CPU system with less than 4GB of memory. Did something get fried with your power supply? I mean Rosetta isn't the only application that will have problems running on 4 CPUs all sharing 1GB.

In the meantime, you can configure BOINC Manager to only run on a single CPU. Or you can let it try to run what it can, and accept that much of the time, you will only see a single task getting CPU. Be sure you set the flag to keep suspended tasks in "memory" (swap space actually). Otherwise their work in progress gets lost when they have to be suspended, and in such an environment, BOINC Manager will be suspending tasks frequently.


It's waiting again. Log file:
Mon 19 May 2014 12:35:24 PM EDT | | Starting BOINC client version 7.0.27 for x86_64-pc-linux-gnu
Mon 19 May 2014 12:35:24 PM EDT | | log flags: file_xfer, sched_ops, task
Mon 19 May 2014 12:35:24 PM EDT | | Libraries: libcurl/7.26.0 OpenSSL/1.0.1e zlib/1.2.7 libidn/1.25 libssh2/1.4.2 librtmp/2.3
Mon 19 May 2014 12:35:24 PM EDT | | Data directory: /var/lib/boinc-client
Mon 19 May 2014 12:35:24 PM EDT | | Processor: 4 AuthenticAMD AMD Phenom(tm) II X4 955 Processor [Family 16 Model 4 Stepping 3]
Mon 19 May 2014 12:35:24 PM EDT | | Processor: 512.00 KB cache
Mon 19 May 2014 12:35:24 PM EDT | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm 3dnowext 3dnow constant_tsc rep_good nopl nonstop_tsc extd_apicid pni monitor cx16 popcnt lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt hw_pstate npt lbrv svm_lock nrip_save
Mon 19 May 2014 12:35:24 PM EDT | | OS: Linux: 3.2.0-4-amd64
Mon 19 May 2014 12:35:24 PM EDT | | Memory: 1002.36 MB physical, 2.00 GB virtual
Mon 19 May 2014 12:35:24 PM EDT | | Disk: 71.37 GB total, 64.22 GB free
Mon 19 May 2014 12:35:24 PM EDT | | Local time is UTC -4 hours
Mon 19 May 2014 12:35:24 PM EDT | | No usable GPUs found
Mon 19 May 2014 12:35:24 PM EDT | | Config: GUI RPC allowed from:
Mon 19 May 2014 12:35:24 PM EDT | | A new version of BOINC is available. <a href=http://boinc.berkeley.edu/download.php>Download it.</a>
Mon 19 May 2014 12:35:24 PM EDT | rosetta@home | URL https://boinc.bakerlab.org/rosetta/; Computer ID 1764067; resource share 100
Mon 19 May 2014 12:35:24 PM EDT | | No general preferences found - using defaults
Mon 19 May 2014 12:35:24 PM EDT | | Reading preferences override file
Mon 19 May 2014 12:35:24 PM EDT | | Preferences:
Mon 19 May 2014 12:35:24 PM EDT | | max memory usage when active: 751.77MB
Mon 19 May 2014 12:35:24 PM EDT | | max memory usage when idle: 922.17MB
Mon 19 May 2014 12:35:24 PM EDT | | max disk usage: 20.00GB
Mon 19 May 2014 12:35:24 PM EDT | | suspend work if non-BOINC CPU load exceeds 10 %
Mon 19 May 2014 12:35:24 PM EDT | | (to change preferences, visit the web site of an attached project, or select Preferences in the Manager)
Mon 19 May 2014 12:35:24 PM EDT | | Not using a proxy
Mon 19 May 2014 12:45:02 PM EDT | rosetta@home | project resumed by user
Mon 19 May 2014 01:42:13 PM EDT | rosetta@home | update requested by user
Mon 19 May 2014 01:42:14 PM EDT | rosetta@home | Sending scheduler request: Requested by user.
Mon 19 May 2014 01:42:14 PM EDT | rosetta@home | Not reporting or requesting tasks
Mon 19 May 2014 01:42:15 PM EDT | rosetta@home | Scheduler request completed

More memory due this coming Friday.
ID: 76750 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
hillhopper.new

Send message
Joined: 9 May 14
Posts: 10
Credit: 115,488
RAC: 0
Message 76810 - Posted: 6 Jun 2014, 6:42:11 UTC - in response to Message 76750.  

Well, any task that runs uses some swap space. But, unless you've changed them, the BOINC preferences don't push things to the point of trying to run like that. This is because the hard drive would quickly wear out, and the processing would be very slow. To the point that you'd make better progress just running a single task, and staying more or less within the physical memory of the machine.

It is rather unusual to see a 4 CPU system with less than 4GB of memory. Did something get fried with your power supply? I mean Rosetta isn't the only application that will have problems running on 4 CPUs all sharing 1GB.

In the meantime, you can configure BOINC Manager to only run on a single CPU. Or you can let it try to run what it can, and accept that much of the time, you will only see a single task getting CPU. Be sure you set the flag to keep suspended tasks in "memory" (swap space actually). Otherwise their work in progress gets lost when they have to be suspended, and in such an environment, BOINC Manager will be suspending tasks frequently.


It's waiting again. Log file:
Mon 19 May 2014 12:35:24 PM EDT | | Starting BOINC client version 7.0.27 for x86_64-pc-linux-gnu
Mon 19 May 2014 12:35:24 PM EDT | | log flags: file_xfer, sched_ops, task
Mon 19 May 2014 12:35:24 PM EDT | | Libraries: libcurl/7.26.0 OpenSSL/1.0.1e zlib/1.2.7 libidn/1.25 libssh2/1.4.2 librtmp/2.3
Mon 19 May 2014 12:35:24 PM EDT | | Data directory: /var/lib/boinc-client
Mon 19 May 2014 12:35:24 PM EDT | | Processor: 4 AuthenticAMD AMD Phenom(tm) II X4 955 Processor [Family 16 Model 4 Stepping 3]
Mon 19 May 2014 12:35:24 PM EDT | | Processor: 512.00 KB cache
Mon 19 May 2014 12:35:24 PM EDT | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm 3dnowext 3dnow constant_tsc rep_good nopl nonstop_tsc extd_apicid pni monitor cx16 popcnt lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt hw_pstate npt lbrv svm_lock nrip_save
Mon 19 May 2014 12:35:24 PM EDT | | OS: Linux: 3.2.0-4-amd64
Mon 19 May 2014 12:35:24 PM EDT | | Memory: 1002.36 MB physical, 2.00 GB virtual
Mon 19 May 2014 12:35:24 PM EDT | | Disk: 71.37 GB total, 64.22 GB free
Mon 19 May 2014 12:35:24 PM EDT | | Local time is UTC -4 hours
Mon 19 May 2014 12:35:24 PM EDT | | No usable GPUs found
Mon 19 May 2014 12:35:24 PM EDT | | Config: GUI RPC allowed from:
Mon 19 May 2014 12:35:24 PM EDT | | A new version of BOINC is available. <a href=http://boinc.berkeley.edu/download.php>Download it.</a>
Mon 19 May 2014 12:35:24 PM EDT | rosetta@home | URL https://boinc.bakerlab.org/rosetta/; Computer ID 1764067; resource share 100
Mon 19 May 2014 12:35:24 PM EDT | | No general preferences found - using defaults
Mon 19 May 2014 12:35:24 PM EDT | | Reading preferences override file
Mon 19 May 2014 12:35:24 PM EDT | | Preferences:
Mon 19 May 2014 12:35:24 PM EDT | | max memory usage when active: 751.77MB
Mon 19 May 2014 12:35:24 PM EDT | | max memory usage when idle: 922.17MB
Mon 19 May 2014 12:35:24 PM EDT | | max disk usage: 20.00GB
Mon 19 May 2014 12:35:24 PM EDT | | suspend work if non-BOINC CPU load exceeds 10 %
Mon 19 May 2014 12:35:24 PM EDT | | (to change preferences, visit the web site of an attached project, or select Preferences in the Manager)
Mon 19 May 2014 12:35:24 PM EDT | | Not using a proxy
Mon 19 May 2014 12:45:02 PM EDT | rosetta@home | project resumed by user
Mon 19 May 2014 01:42:13 PM EDT | rosetta@home | update requested by user
Mon 19 May 2014 01:42:14 PM EDT | rosetta@home | Sending scheduler request: Requested by user.
Mon 19 May 2014 01:42:14 PM EDT | rosetta@home | Not reporting or requesting tasks
Mon 19 May 2014 01:42:15 PM EDT | rosetta@home | Scheduler request completed

More memory due this coming Friday.


8GB memory running four tasks simultaneously and temps in the 50's C. All's well that is well.
ID: 76810 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Questions and Answers : Unix/Linux : Waiting to run



©2024 University of Washington
https://www.bakerlab.org