All tasks in scheduler state uninitialized

Questions and Answers : Unix/Linux : All tasks in scheduler state uninitialized

To post messages, you must log in.

Previous · 1 · 2

AuthorMessage
Profile SGAI-CSIC

Send message
Joined: 4 Apr 20
Posts: 19
Credit: 15,069,615
RAC: 0
Message 94544 - Posted: 15 Apr 2020, 14:23:42 UTC - in response to Message 94542.  
Last modified: 15 Apr 2020, 14:24:02 UTC

I think the problem it's the boinc-client in CentOS, I have created another vm with Debian installation and the task are running:

======= Tasks ========
1) -----------
   name: bhfe_s01_SAVE_ALL_OUT_IGNORE_THE_REST_1iy1rc1g_913393_2_0
   WU name: bhfe_s01_SAVE_ALL_OUT_IGNORE_THE_REST_1iy1rc1g_913393_2
   project URL: https://boinc.bakerlab.org/rosetta/
   report deadline: Sat Apr 18 16:15:23 2020
   ready to report: no
   got server ack: no
   final CPU time: 0.000000
   state: downloaded
   scheduler state: scheduled
   exit_status: 0
   signal: 0
   suspended via GUI: no
   active_task_state: EXECUTING
   app version num: 415
   checkpoint CPU time: 0.000000
   current CPU time: 21.956000
   fraction done: 0.000187
   swap size: 469 MB
   working set size: 215 MB
   estimated CPU time remaining: 21512.713905
2) -----------
   name: Junior_HalfRoid_design4_COVID-19_SAVE_ALL_OUT_IGNORE_THE_REST_6kl5yf9y_913404_1_0
   WU name: Junior_HalfRoid_design4_COVID-19_SAVE_ALL_OUT_IGNORE_THE_REST_6kl5yf9y_913404_1
   project URL: https://boinc.bakerlab.org/rosetta/
   report deadline: Sat Apr 18 16:15:34 2020
   ready to report: no
   got server ack: no
   final CPU time: 0.000000
   state: downloaded
   scheduler state: scheduled
   exit_status: 0
   signal: 0
   suspended via GUI: no
   active_task_state: EXECUTING
   app version num: 415
   checkpoint CPU time: 0.000000
   current CPU time: 14.716000
   fraction done: 0.000110
   swap size: 364 MB
   working set size: 116 MB
   estimated CPU time remaining: 21514.369500
ID: 94544 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile SGAI-CSIC

Send message
Joined: 4 Apr 20
Posts: 19
Credit: 15,069,615
RAC: 0
Message 94565 - Posted: 15 Apr 2020, 18:31:29 UTC - in response to Message 94542.  
Last modified: 15 Apr 2020, 18:33:42 UTC

That's right Mod.Sense, it seems a problem between the balance of CPU and memory, I have increased a little the RAM memory (6GB) and the task are running. I will continue to observe it ...
ID: 94565 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1644
Credit: 16,927,756
RAC: 16,270
Message 94576 - Posted: 15 Apr 2020, 23:25:59 UTC - in response to Message 94536.  

Thank you for your help, I try to stop boinc daemon a restart again and I see these messages in the log, perhaps it's a bug of the boinc-client on CentOS?


# systemctl stop boinc-client 
# systemctl start boinc-client
# boinccmd --get_messages

1:  15-Apr-2020 14:33:14 (low) [] cc_config.xml not found - using defaults
2: 15-Apr-2020 14:33:14 (low) [] Starting BOINC client version 7.16.1 for x86_64-pc-linux-gnu
3: 15-Apr-2020 14:33:14 (low) [] log flags: file_xfer, sched_ops, task
4: 15-Apr-2020 14:33:14 (low) [] Libraries: libcurl/7.29.0 NSS/3.44 zlib/1.2.7 libidn/1.28 libssh2/1.8.0
5: 15-Apr-2020 14:33:14 (low) [] Data directory: /var/lib/boinc
6: 15-Apr-2020 14:33:14 (low) [] No usable GPUs found
7: 15-Apr-2020 14:33:14 (low) [] [libc detection] gathered: 2.17, GNU libc
8: 15-Apr-2020 14:33:14 (low) [] Host name: rosetta3
9: 15-Apr-2020 14:33:14 (low) [] Processor: 4 GenuineIntel QEMU Virtual CPU version 2.5+ [Family 6 Model 13 Stepping 3]
10: 15-Apr-2020 14:33:14 (low) [] Processor features: fpu de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pse36 clflush mmx fxsr sse sse2 syscall nx lm rep_good nopl xtopology eagerfpu pni cx16 x2apic hypervisor lahf_lm
11: 15-Apr-2020 14:33:14 (low) [] OS: Linux CentOS Linux: CentOS Linux 7 (Core) [3.10.0-1062.18.1.el7.x86_64|libc 2.17 (GNU libc)]
12: 15-Apr-2020 14:33:14 (low) [] Memory: 3.70 GB physical, 1.20 GB virtual
13: 15-Apr-2020 14:33:14 (low) [] Disk: 10.22 GB total, 7.85 GB free
14: 15-Apr-2020 14:33:14 (low) [] Local time is UTC +2 hours
15: 15-Apr-2020 14:33:14 (low) [Rosetta@home] General prefs: from Rosetta@home (last modified 15-Apr-2020 09:58:03)
16: 15-Apr-2020 14:33:14 (low) [Rosetta@home] Computer location: work
17: 15-Apr-2020 14:33:14 (low) [] General prefs: using separate prefs for work
18: 15-Apr-2020 14:33:14 (low) [] Preferences:
19: 15-Apr-2020 14:33:14 (low) [] max memory usage when active: 1894.49 MB
20: 15-Apr-2020 14:33:14 (low) [] max memory usage when idle: 3410.09 MB
21: 15-Apr-2020 14:33:14 (low) [] max disk usage: 7.56 GB
22: 15-Apr-2020 14:33:14 (low) [] (to change preferences, visit a project web site or select Preferences in the Manager)
23: 15-Apr-2020 14:33:14 (low) [] Setting up project and slot directories
24: 15-Apr-2020 14:33:14 (low) [] Checking active tasks
25: 15-Apr-2020 14:33:14 (low) [Rosetta@home] URL https://boinc.bakerlab.org/rosetta/; Computer ID 4062012; resource share 100
26: 15-Apr-2020 14:33:14 (low) [] Setting up GUI RPC socket
27: 15-Apr-2020 14:33:14 (low) [] Checking presence of 23 project files



Unfortunately the bottom is missing from the log there, but it looks like you got it sorted.
The bottom of the log may (or may not) have had a message about insufficient RAM, but certainly there should have been more messages relating to starting the Tasks.
Grant
Darwin NT
ID: 94576 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1644
Credit: 16,927,756
RAC: 16,270
Message 94577 - Posted: 15 Apr 2020, 23:28:39 UTC - in response to Message 94565.  
Last modified: 15 Apr 2020, 23:29:05 UTC

That's right Mod.Sense, it seems a problem between the balance of CPU and memory, I have increased a little the RAM memory (6GB) and the task are running. I will continue to observe it ...
You need to allow for up to 1.3GB RAM per Task, most of the current one use much less, but that's the highest requirement i've seen so far (over 3 weeks or so). And those large RAM requirement Tasks also like around 1GB of storage space as well- the Event log will get messages about low DIsk space if it occurs.
Grant
Darwin NT
ID: 94577 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile SGAI-CSIC

Send message
Joined: 4 Apr 20
Posts: 19
Credit: 15,069,615
RAC: 0
Message 94595 - Posted: 16 Apr 2020, 7:52:17 UTC - in response to Message 94577.  

Thanks to all for the help, now all it's running.

Ok, I see what you say, some task demand for memory that others, for example:



Some task demand ~ 20 % of mem (1.2 GB) others less ~ 12% ( 700 MB). I understand then that in order to executing the task
boinc makes sure have enough expected memory for the sum of all tasks and if it is not fulfilled the tasks do not initialize?
ID: 94595 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1644
Credit: 16,927,756
RAC: 16,270
Message 94599 - Posted: 16 Apr 2020, 8:52:02 UTC - in response to Message 94595.  
Last modified: 16 Apr 2020, 8:53:31 UTC

I understand then that in order to executing the task boinc makes sure have enough expected memory for the sum of all tasks and if it is not fulfilled the tasks do not initialize?
Usually as many as are able to will run. If the next Task to start needs more memory than is available, it won't run.
If the memory available is less than the minimum requirement for a single, low memory requirement Task, then none will be able to run.
Grant
Darwin NT
ID: 94599 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 94623 - Posted: 16 Apr 2020, 17:49:29 UTC - in response to Message 94595.  

Previously you had posted these messages from BOINC startup
19: 15-Apr-2020 14:33:14 (low) [] max memory usage when active: 1894.49 MB
20: 15-Apr-2020 14:33:14 (low) [] max memory usage when idle: 3410.09 MB


This indicates that BOINC is configured to only allowed to use about 50% of memory when the machine is active. Depending on how often the machine is used for other things, this could limit things significantly so far as getting BOINC WUs completed.
Rosetta Moderator: Mod.Sense
ID: 94623 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1644
Credit: 16,927,756
RAC: 16,270
Message 94642 - Posted: 16 Apr 2020, 23:21:42 UTC - in response to Message 94623.  

Previously you had posted these messages from BOINC startup
19: 15-Apr-2020 14:33:14 (low) [] max memory usage when active: 1894.49 MB
20: 15-Apr-2020 14:33:14 (low) [] max memory usage when idle: 3410.09 MB
This indicates that BOINC is configured to only allowed to use about 50% of memory when the machine is active. Depending on how often the machine is used for other things, this could limit things significantly so far as getting BOINC WUs completed.



So, SGAI-CSIC
If you go to to your Account, Computing preferences, and change the available memory there to
Memory
    When computer is in use, use at most 90 %
When computer is not in use, use at most 95 %
(or higher), and that will let BOINC crunch more work.
Grant
Darwin NT
ID: 94642 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile SGAI-CSIC

Send message
Joined: 4 Apr 20
Posts: 19
Credit: 15,069,615
RAC: 0
Message 94662 - Posted: 17 Apr 2020, 8:08:27 UTC - in response to Message 94642.  

Ok, thanks, I will try :-)
ID: 94662 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 · 2

Questions and Answers : Unix/Linux : All tasks in scheduler state uninitialized



©2024 University of Washington
https://www.bakerlab.org