Posts by [AF>Le_Pommier] Jerome_C2005

1) Message boards : Number crunching : Excessive workunit fetch (Message 105860)
Posted 7 Apr 2022 by Profile [AF>Le_Pommier] Jerome_C2005
Post:
If I can read github correctly, it is said that this patch will be published in boinc 7.20.0 ??
2) Message boards : Number crunching : Excessive workunit fetch (Message 103702)
Posted 4 Dec 2021 by Profile [AF>Le_Pommier] Jerome_C2005
Post:
Thanks for this input Mad Max.

I may give it a try, next year when FB is over :)
3) Message boards : Number crunching : Excessive workunit fetch (Message 103343)
Posted 16 Nov 2021 by Profile [AF>Le_Pommier] Jerome_C2005
Post:
Well whatever the amount of memory really used or "reserved" or whatever, what is sure is that I had over 13 Gb of free memory available at that time.

Also I don't run 20 tasks on that machine : I have a linux VM with 10 thread for other boinc projects + I only let 9 threads available for boinc in macOS, so it was 8 regular rosetta + 1 python running in parallel, but now if you "officially" need 80 Gb of RAM for such a setup AND rosetta doesn't propose any setup in the preferences to limit this in any way, this is just a nonsense.
4) Message boards : Number crunching : Excessive workunit fetch (Message 103338)
Posted 16 Nov 2021 by Profile [AF>Le_Pommier] Jerome_C2005
Post:
On macOS those python VB tasks take much much less than 8 Gb (I would 13 Gb free when it was running) + as I explained, I had limited them and only 1 was running at a time (the rest was regular rosetta).
5) Message boards : Number crunching : Excessive workunit fetch (Message 103276)
Posted 14 Nov 2021 by Profile [AF>Le_Pommier] Jerome_C2005
Post:
I'm not sure if you mean "don't bother too much and change project for now" or "there are useful tips in that other thread" ? I had a (very) quick look and it's long and complicated, not even sure if the problems they mention are similar ?

In any case it'll have to wait, for now I have dropped all those many tasks I had (running or not) and I'm focusing on other projects for the moment.

Thanks !
6) Message boards : Number crunching : Excessive workunit fetch (Message 103258)
Posted 14 Nov 2021 by Profile [AF>Le_Pommier] Jerome_C2005
Post:
Oh well, I realize that there were a *second* python task that has already calculated 15 hours and was suspended hiding in the bushes, and now that I cancelled big sister it started to run exactly the same way, it has now ran 16 hours with a slooooowly progression %, and is is 2 days overdue !!!

That's too many problem with one project and 0 support - apart from you good people, thank you for this, but the admin on here doesn't give a damn.

I'm done with rosetta for now, we'll see in the future if they propose options that make sense and listen to their community.
7) Message boards : Number crunching : Excessive workunit fetch (Message 103255)
Posted 14 Nov 2021 by Profile [AF>Le_Pommier] Jerome_C2005
Post:
I decided to cancel that 4 days running task (probably never going to complete)

Dim 14 nov 12:18:37 2021 |  | [work_fetch] Request work fetch: Core client configuration
Dim 14 nov 12:18:40 2021 |  | choose_project(): 1636888720.569206
Dim 14 nov 12:18:40 2021 |  | [work_fetch] ------- start work fetch state -------
Dim 14 nov 12:18:40 2021 |  | [work_fetch] target work buffer: 43200.00 + 86400.00 sec
Dim 14 nov 12:18:40 2021 |  | [work_fetch] --- project states ---
Dim 14 nov 12:18:40 2021 | Rosetta@home | [work_fetch] REC 3869.097 prio -0.841 can't request work: too many runnable tasks
Dim 14 nov 12:18:40 2021 |  | [work_fetch] --- state for CPU ---
Dim 14 nov 12:18:40 2021 |  | [work_fetch] shortfall 1059135.02 nidle 7.00 saturated 0.00 busy 0.00
Dim 14 nov 12:18:40 2021 | Rosetta@home | [work_fetch] share 0.000
Dim 14 nov 12:18:40 2021 |  | [work_fetch] --- state for AMD/ATI GPU ---
Dim 14 nov 12:18:40 2021 |  | [work_fetch] shortfall 1889.00 nidle 0.00 saturated 127711.00 busy 0.00
Dim 14 nov 12:18:40 2021 | Rosetta@home | [work_fetch] share 0.000 no applications
Dim 14 nov 12:18:40 2021 |  | [work_fetch] ------- end work fetch state -------
Dim 14 nov 12:18:40 2021 | Rosetta@home | skip: too many runnable tasks
Dim 14 nov 12:18:40 2021 |  | [work_fetch] No project chosen for work fetch
Dim 14 nov 12:18:45 2021 | Rosetta@home | task boinc_cages_IL_2727241_2142_0 aborted by user
Dim 14 nov 12:18:45 2021 | Rosetta@home | [sched_op] Deferring communication for 00:01:38

Dim 14 nov 12:18:45 2021 | Rosetta@home | [sched_op] Reason: Unrecoverable error for task boinc_cages_IL_2727241_2142_0
Dim 14 nov 12:18:45 2021 | | [work_fetch] Request work fetch: result aborted by user
Dim 14 nov 12:18:45 2021 |  | choose_project(): 1636888725.591459
Dim 14 nov 12:18:45 2021 |  | [work_fetch] ------- start work fetch state -------
Dim 14 nov 12:18:45 2021 |  | [work_fetch] target work buffer: 43200.00 + 86400.00 sec
Dim 14 nov 12:18:45 2021 |  | [work_fetch] --- project states ---
Dim 14 nov 12:18:45 2021 | Rosetta@home | [work_fetch] REC 3869.131 prio -0.841 can't request work: scheduler RPC backoff (97.99 sec)
Dim 14 nov 12:18:45 2021 |  | [work_fetch] --- state for CPU ---
Dim 14 nov 12:18:45 2021 |  | [work_fetch] shortfall 1048535.58 nidle 7.00 saturated 0.00 busy 0.00
Dim 14 nov 12:18:45 2021 | Rosetta@home | [work_fetch] share 0.000
Dim 14 nov 12:18:45 2021 |  | [work_fetch] --- state for AMD/ATI GPU ---
Dim 14 nov 12:18:45 2021 |  | [work_fetch] shortfall 1893.78 nidle 0.00 saturated 127706.22 busy 0.00
Dim 14 nov 12:18:45 2021 | Rosetta@home | [work_fetch] share 0.000 no applications
Dim 14 nov 12:18:45 2021 |  | [work_fetch] ------- end work fetch state -------
Dim 14 nov 12:18:45 2021 | Rosetta@home | skip: scheduler RPC backoff
Dim 14 nov 12:18:45 2021 |  | [work_fetch] No project chosen for work fetch
Dim 14 nov 12:18:53 2021 |  | [work_fetch] Request work fetch: application exited
Dim 14 nov 12:18:53 2021 | Rosetta@home | Computation for task boinc_cages_IL_2727241_2142_0 finished
Dim 14 nov 12:18:55 2021 |  | choose_project(): 1636888735.875360
Dim 14 nov 12:18:55 2021 |  | [work_fetch] ------- start work fetch state -------
Dim 14 nov 12:18:55 2021 |  | [work_fetch] target work buffer: 43200.00 + 86400.00 sec
Dim 14 nov 12:18:55 2021 |  | [work_fetch] --- project states ---
Dim 14 nov 12:18:55 2021 | Rosetta@home | [work_fetch] REC 3869.176 prio -0.802 can't request work: scheduler RPC backoff (87.70 sec)
Dim 14 nov 12:18:55 2021 |  | [work_fetch] --- state for CPU ---
Dim 14 nov 12:18:55 2021 |  | [work_fetch] shortfall 1048547.57 nidle 7.00 saturated 0.00 busy 0.00
Dim 14 nov 12:18:55 2021 | Rosetta@home | [work_fetch] share 0.000
Dim 14 nov 12:18:55 2021 |  | [work_fetch] --- state for AMD/ATI GPU ---
Dim 14 nov 12:18:55 2021 |  | [work_fetch] shortfall 1903.34 nidle 0.00 saturated 127696.66 busy 0.00
Dim 14 nov 12:18:55 2021 | Rosetta@home | [work_fetch] share 0.000 no applications
Dim 14 nov 12:18:55 2021 |  | [work_fetch] ------- end work fetch state -------
Dim 14 nov 12:18:55 2021 | Rosetta@home | skip: scheduler RPC backoff
Dim 14 nov 12:18:55 2021 |  | [work_fetch] No project chosen for work fetch
Dim 14 nov 12:19:56 2021 |  | choose_project(): 1636888796.737827
Dim 14 nov 12:19:56 2021 |  | [work_fetch] ------- start work fetch state -------
Dim 14 nov 12:19:56 2021 |  | [work_fetch] target work buffer: 43200.00 + 86400.00 sec
Dim 14 nov 12:19:56 2021 |  | [work_fetch] --- project states ---
Dim 14 nov 12:19:56 2021 | Rosetta@home | [work_fetch] REC 3869.459 prio -0.802 can't request work: scheduler RPC backoff (26.84 sec)
Dim 14 nov 12:19:56 2021 |  | [work_fetch] --- state for CPU ---
Dim 14 nov 12:19:56 2021 |  | [work_fetch] shortfall 1048619.42 nidle 7.00 saturated 0.00 busy 0.00
Dim 14 nov 12:19:56 2021 | Rosetta@home | [work_fetch] share 0.000
Dim 14 nov 12:19:56 2021 |  | [work_fetch] --- state for AMD/ATI GPU ---
Dim 14 nov 12:19:56 2021 |  | [work_fetch] shortfall 1960.34 nidle 0.00 saturated 127639.66 busy 0.00
Dim 14 nov 12:19:56 2021 | Rosetta@home | [work_fetch] share 0.000 no applications
Dim 14 nov 12:19:56 2021 |  | [work_fetch] ------- end work fetch state -------
Dim 14 nov 12:19:56 2021 | Rosetta@home | skip: scheduler RPC backoff
Dim 14 nov 12:19:56 2021 |  | [work_fetch] No project chosen for work fetch
Dim 14 nov 12:20:24 2021 |  | [work_fetch] Request work fetch: Backoff ended for Rosetta@home
Dim 14 nov 12:20:27 2021 | Rosetta@home | piggyback_work_request()
Dim 14 nov 12:20:27 2021 |  | [work_fetch] ------- start work fetch state -------
Dim 14 nov 12:20:27 2021 |  | [work_fetch] target work buffer: 43200.00 + 86400.00 sec
Dim 14 nov 12:20:27 2021 |  | [work_fetch] --- project states ---
Dim 14 nov 12:20:27 2021 | Rosetta@home | [work_fetch] REC 3869.459 prio -0.841 can request work
Dim 14 nov 12:20:27 2021 |  | [work_fetch] --- state for CPU ---
Dim 14 nov 12:20:27 2021 |  | [work_fetch] shortfall 1048655.18 nidle 7.00 saturated 0.00 busy 0.00
Dim 14 nov 12:20:27 2021 | Rosetta@home | [work_fetch] share 1.000
Dim 14 nov 12:20:27 2021 |  | [work_fetch] --- state for AMD/ATI GPU ---
Dim 14 nov 12:20:27 2021 |  | [work_fetch] shortfall 1989.37 nidle 0.00 saturated 127610.63 busy 0.00
Dim 14 nov 12:20:27 2021 | Rosetta@home | [work_fetch] share 0.000 no applications
Dim 14 nov 12:20:27 2021 |  | [work_fetch] ------- end work fetch state -------
Dim 14 nov 12:20:27 2021 | Rosetta@home | piggyback: resource CPU
Dim 14 nov 12:20:27 2021 | Rosetta@home | piggyback: collatz can't fetch work
Dim 14 nov 12:20:27 2021 | Rosetta@home | piggyback: Cosmology@Home can't fetch work
Dim 14 nov 12:20:27 2021 | Rosetta@home | piggyback: GPUGRID can't fetch work
Dim 14 nov 12:20:27 2021 | Rosetta@home | piggyback: iThena can't fetch work
Dim 14 nov 12:20:27 2021 | Rosetta@home | piggyback: Kryptos@Home can't fetch work
Dim 14 nov 12:20:27 2021 | Rosetta@home | piggyback: LHC@home can't fetch work
Dim 14 nov 12:20:27 2021 | Rosetta@home | piggyback: lhcathome-dev can't fetch work
Dim 14 nov 12:20:27 2021 | Rosetta@home | piggyback: Moo! Wrapper can't fetch work
Dim 14 nov 12:20:27 2021 | Rosetta@home | piggyback: PrimeGrid can't fetch work
Dim 14 nov 12:20:27 2021 | Rosetta@home | piggyback: QuChemPedIA@home can't fetch work
Dim 14 nov 12:20:27 2021 | Rosetta@home | piggyback: ralph@home can't fetch work
Dim 14 nov 12:20:27 2021 | Rosetta@home | piggyback: SRBase can't fetch work
Dim 14 nov 12:20:27 2021 | Rosetta@home | piggyback: TN-Grid Platform can't fetch work
Dim 14 nov 12:20:27 2021 | Rosetta@home | piggyback: yoyo@home can't fetch work
Dim 14 nov 12:20:27 2021 | Rosetta@home | piggyback: WUProp@Home can't fetch work
Dim 14 nov 12:20:27 2021 | Rosetta@home | [work_fetch] using MC shortfall 119018.758792 instead of shortfall 1048655.175950
Dim 14 nov 12:20:27 2021 | Rosetta@home | [work_fetch] set_request() for CPU: ninst 9 nused_total 1000.00 nidle_now 7.00 fetch share 1.00 req_inst 0.00 req_secs 119018.76
Dim 14 nov 12:20:27 2021 | Rosetta@home | piggyback: resource AMD/ATI GPU
Dim 14 nov 12:20:27 2021 | Rosetta@home | piggyback: can't fetch AMD/ATI GPU: no applications
Dim 14 nov 12:20:27 2021 | Rosetta@home | [sched_op] Starting scheduler request
Dim 14 nov 12:20:27 2021 | Rosetta@home | [work_fetch] request: CPU (119018.76 sec, 0.00 inst) AMD/ATI GPU (0.00 sec, 0.00 inst)
Dim 14 nov 12:20:28 2021 | Rosetta@home | Sending scheduler request: To report completed tasks.
Dim 14 nov 12:20:28 2021 | Rosetta@home | Reporting 1 completed tasks
Dim 14 nov 12:20:28 2021 | Rosetta@home | Requesting new tasks for CPU
Dim 14 nov 12:20:28 2021 | Rosetta@home | [sched_op] CPU work request: 119018.76 seconds; 0.00 devices
Dim 14 nov 12:20:28 2021 | Rosetta@home | [sched_op] AMD/ATI GPU work request: 0.00 seconds; 0.00 devices
Dim 14 nov 12:20:36 2021 | Rosetta@home | Scheduler request completed: got 1 new tasks
Dim 14 nov 12:20:36 2021 | Rosetta@home | [sched_op] Server version 707
Dim 14 nov 12:20:36 2021 | Rosetta@home | Project requested delay of 31 seconds
Dim 14 nov 12:20:36 2021 | Rosetta@home | [sched_op] estimated total CPU task duration: 29571 seconds
Dim 14 nov 12:20:36 2021 | Rosetta@home | [sched_op] estimated total AMD/ATI GPU task duration: 0 seconds
Dim 14 nov 12:20:36 2021 | Rosetta@home | [sched_op] handle_scheduler_reply(): got ack for task boinc_cages_IL_2727241_2142_0
Dim 14 nov 12:20:36 2021 | Rosetta@home | [sched_op] Deferring communication for 00:00:31
Dim 14 nov 12:20:36 2021 | Rosetta@home | [sched_op] Reason: requested by project
Dim 14 nov 12:20:36 2021 |  | [work_fetch] Request work fetch: RPC complete
Dim 14 nov 12:20:38 2021 | Rosetta@home | Started download of aaam-FPR-mALA_pp-SAR-AMACBEN2_pp_14.gz
Dim 14 nov 12:20:41 2021 |  | choose_project(): 1636888841.122885
Dim 14 nov 12:20:41 2021 |  | [work_fetch] ------- start work fetch state -------
Dim 14 nov 12:20:41 2021 |  | [work_fetch] target work buffer: 43200.00 + 86400.00 sec
Dim 14 nov 12:20:41 2021 |  | [work_fetch] --- project states ---
Dim 14 nov 12:20:41 2021 | Rosetta@home | [work_fetch] REC 3869.459 prio -0.803 can't request work: too many runnable tasks (25.88 sec)
Dim 14 nov 12:20:41 2021 |  | [work_fetch] --- state for CPU ---
Dim 14 nov 12:20:41 2021 |  | [work_fetch] shortfall 1048671.92 nidle 7.00 saturated 0.00 busy 0.00
Dim 14 nov 12:20:41 2021 | Rosetta@home | [work_fetch] share 0.000
Dim 14 nov 12:20:41 2021 |  | [work_fetch] --- state for AMD/ATI GPU ---
Dim 14 nov 12:20:41 2021 |  | [work_fetch] shortfall 2002.86 nidle 0.00 saturated 127597.14 busy 0.00
Dim 14 nov 12:20:41 2021 | Rosetta@home | [work_fetch] share 0.000 no applications
Dim 14 nov 12:20:41 2021 |  | [work_fetch] ------- end work fetch state -------
Dim 14 nov 12:20:41 2021 | Rosetta@home | skip: too many runnable tasks
Dim 14 nov 12:20:41 2021 |  | [work_fetch] No project chosen for work fetch
Dim 14 nov 12:20:45 2021 | Rosetta@home | Finished download of aaam-FPR-mALA_pp-SAR-AMACBEN2_pp_14.gz
Dim 14 nov 12:21:07 2021 |  | [work_fetch] Request work fetch: Backoff ended for Rosetta@home
Dim 14 nov 12:21:11 2021 |  | choose_project(): 1636888871.599030
Dim 14 nov 12:21:11 2021 |  | [work_fetch] ------- start work fetch state -------
Dim 14 nov 12:21:11 2021 |  | [work_fetch] target work buffer: 43200.00 + 86400.00 sec
Dim 14 nov 12:21:11 2021 |  | [work_fetch] --- project states ---
Dim 14 nov 12:21:11 2021 | Rosetta@home | [work_fetch] REC 3869.693 prio -0.841 can't request work: too many runnable tasks
Dim 14 nov 12:21:11 2021 |  | [work_fetch] --- state for CPU ---
Dim 14 nov 12:21:11 2021 |  | [work_fetch] shortfall 1048707.53 nidle 7.00 saturated 0.00 busy 0.00
Dim 14 nov 12:21:11 2021 | Rosetta@home | [work_fetch] share 0.000
Dim 14 nov 12:21:11 2021 |  | [work_fetch] --- state for AMD/ATI GPU ---
Dim 14 nov 12:21:11 2021 |  | [work_fetch] shortfall 2030.80 nidle 0.00 saturated 127569.20 busy 0.00
Dim 14 nov 12:21:11 2021 | Rosetta@home | [work_fetch] share 0.000 no applications
Dim 14 nov 12:21:11 2021 |  | [work_fetch] ------- end work fetch state -------
Dim 14 nov 12:21:11 2021 | Rosetta@home | skip: too many runnable tasks
Dim 14 nov 12:21:11 2021 |  | [work_fetch] No project chosen for work fetch


I hope in all this you can see useful information.
8) Message boards : Number crunching : Excessive workunit fetch (Message 103254)
Posted 14 Nov 2021 by Profile [AF>Le_Pommier] Jerome_C2005
Post:
Dim 14 nov 12:11:56 2021 |  | [work_fetch] Request work fetch: Core client configuration
Dim 14 nov 12:12:00 2021 |  | [work_fetch] Request work fetch: application exited
Dim 14 nov 12:12:00 2021 |  | choose_project(): 1636888320.744567
Dim 14 nov 12:12:00 2021 |  | [work_fetch] ------- start work fetch state -------
Dim 14 nov 12:12:00 2021 |  | [work_fetch] target work buffer: 43200.00 + 86400.00 sec
Dim 14 nov 12:12:00 2021 |  | [work_fetch] --- project states ---
Dim 14 nov 12:12:00 2021 | Rosetta@home | [work_fetch] REC 3867.255 prio -0.841 can't request work: too many runnable tasks
Dim 14 nov 12:12:00 2021 |  | [work_fetch] --- state for CPU ---
Dim 14 nov 12:12:00 2021 |  | [work_fetch] shortfall 1058742.41 nidle 7.00 saturated 0.00 busy 0.00
Dim 14 nov 12:12:00 2021 | Rosetta@home | [work_fetch] share 0.000
Dim 14 nov 12:12:00 2021 |  | [work_fetch] --- state for AMD/ATI GPU ---
Dim 14 nov 12:12:00 2021 |  | [work_fetch] shortfall 1503.29 nidle 0.00 saturated 128096.71 busy 0.00
Dim 14 nov 12:12:00 2021 | Rosetta@home | [work_fetch] share 0.000 no applications
Dim 14 nov 12:12:00 2021 |  | [work_fetch] ------- end work fetch state -------
Dim 14 nov 12:12:00 2021 | Rosetta@home | skip: too many runnable tasks
Dim 14 nov 12:12:00 2021 |  | [work_fetch] No project chosen for work fetch
Dim 14 nov 12:12:05 2021 | Rosetta@home | update requested by user
Dim 14 nov 12:12:05 2021 |  | [work_fetch] Request work fetch: project updated by user
Dim 14 nov 12:12:05 2021 | Rosetta@home | [sched_op] sched RPC pending: Requested by user
Dim 14 nov 12:12:05 2021 | Rosetta@home | piggyback_work_request()
Dim 14 nov 12:12:05 2021 |  | [work_fetch] ------- start work fetch state -------
Dim 14 nov 12:12:05 2021 |  | [work_fetch] target work buffer: 43200.00 + 86400.00 sec
Dim 14 nov 12:12:05 2021 |  | [work_fetch] --- project states ---
Dim 14 nov 12:12:05 2021 | Rosetta@home | [work_fetch] REC 3867.255 prio -0.841 can't request work: too many runnable tasks
Dim 14 nov 12:12:05 2021 |  | [work_fetch] --- state for CPU ---
Dim 14 nov 12:12:05 2021 |  | [work_fetch] shortfall 1058747.99 nidle 7.00 saturated 0.00 busy 0.00
Dim 14 nov 12:12:05 2021 | Rosetta@home | [work_fetch] share 0.000
Dim 14 nov 12:12:05 2021 |  | [work_fetch] --- state for AMD/ATI GPU ---
Dim 14 nov 12:12:05 2021 |  | [work_fetch] shortfall 1507.87 nidle 0.00 saturated 128092.13 busy 0.00
Dim 14 nov 12:12:05 2021 | Rosetta@home | [work_fetch] share 0.000 no applications
Dim 14 nov 12:12:05 2021 |  | [work_fetch] ------- end work fetch state -------
Dim 14 nov 12:12:05 2021 | Rosetta@home | [sched_op] Starting scheduler request
Dim 14 nov 12:12:05 2021 | Rosetta@home | [work_fetch] request: CPU (0.00 sec, 0.00 inst) AMD/ATI GPU (0.00 sec, 0.00 inst)
Dim 14 nov 12:12:07 2021 | Rosetta@home | Sending scheduler request: Requested by user.
Dim 14 nov 12:12:07 2021 | Rosetta@home | Not requesting tasks: too many runnable tasks
Dim 14 nov 12:12:07 2021 | Rosetta@home | [sched_op] CPU work request: 0.00 seconds; 0.00 devices
Dim 14 nov 12:12:07 2021 | Rosetta@home | [sched_op] AMD/ATI GPU work request: 0.00 seconds; 0.00 devices
Dim 14 nov 12:12:14 2021 | Rosetta@home | Scheduler request completed
Dim 14 nov 12:12:14 2021 | Rosetta@home | [sched_op] Server version 707
Dim 14 nov 12:12:14 2021 | Rosetta@home | Project requested delay of 31 seconds
Dim 14 nov 12:12:14 2021 | Rosetta@home | [sched_op] Deferring communication for 00:00:31
Dim 14 nov 12:12:14 2021 | Rosetta@home | [sched_op] Reason: requested by project
Dim 14 nov 12:12:14 2021 |  | [work_fetch] Request work fetch: RPC complete
Dim 14 nov 12:12:19 2021 |  | choose_project(): 1636888339.635691
Dim 14 nov 12:12:19 2021 |  | [work_fetch] ------- start work fetch state -------
Dim 14 nov 12:12:19 2021 |  | [work_fetch] target work buffer: 43200.00 + 86400.00 sec
Dim 14 nov 12:12:19 2021 |  | [work_fetch] --- project states ---
Dim 14 nov 12:12:19 2021 | Rosetta@home | [work_fetch] REC 3867.255 prio -0.802 can't request work: too many runnable tasks (25.94 sec)
Dim 14 nov 12:12:19 2021 |  | [work_fetch] --- state for CPU ---
Dim 14 nov 12:12:19 2021 |  | [work_fetch] shortfall 1058762.19 nidle 7.00 saturated 0.00 busy 0.00
Dim 14 nov 12:12:19 2021 | Rosetta@home | [work_fetch] share 0.000
Dim 14 nov 12:12:19 2021 |  | [work_fetch] --- state for AMD/ATI GPU ---
Dim 14 nov 12:12:19 2021 |  | [work_fetch] shortfall 1521.56 nidle 0.00 saturated 128078.44 busy 0.00
Dim 14 nov 12:12:19 2021 | Rosetta@home | [work_fetch] share 0.000 no applications
Dim 14 nov 12:12:19 2021 |  | [work_fetch] ------- end work fetch state -------
Dim 14 nov 12:12:19 2021 | Rosetta@home | skip: too many runnable tasks
Dim 14 nov 12:12:19 2021 |  | [work_fetch] No project chosen for work fetch
Dim 14 nov 12:12:36 2021 |  | [work_fetch] Request work fetch: application exited
Dim 14 nov 12:12:39 2021 |  | choose_project(): 1636888359.985780
Dim 14 nov 12:12:39 2021 |  | [work_fetch] ------- start work fetch state -------
Dim 14 nov 12:12:39 2021 |  | [work_fetch] target work buffer: 43200.00 + 86400.00 sec
Dim 14 nov 12:12:39 2021 |  | [work_fetch] --- project states ---
Dim 14 nov 12:12:39 2021 | Rosetta@home | [work_fetch] REC 3867.423 prio -0.802 can't request work: too many runnable tasks (5.59 sec)
Dim 14 nov 12:12:39 2021 |  | [work_fetch] --- state for CPU ---
Dim 14 nov 12:12:39 2021 |  | [work_fetch] shortfall 1058781.68 nidle 7.00 saturated 0.00 busy 0.00
Dim 14 nov 12:12:39 2021 | Rosetta@home | [work_fetch] share 0.000
Dim 14 nov 12:12:39 2021 |  | [work_fetch] --- state for AMD/ATI GPU ---
Dim 14 nov 12:12:39 2021 |  | [work_fetch] shortfall 1544.87 nidle 0.00 saturated 128055.13 busy 0.00
Dim 14 nov 12:12:39 2021 | Rosetta@home | [work_fetch] share 0.000 no applications
Dim 14 nov 12:12:39 2021 |  | [work_fetch] ------- end work fetch state -------
Dim 14 nov 12:12:39 2021 | Rosetta@home | skip: too many runnable tasks
Dim 14 nov 12:12:39 2021 |  | [work_fetch] No project chosen for work fetch
Dim 14 nov 12:12:46 2021 |  | [work_fetch] Request work fetch: Backoff ended for Rosetta@home
Dim 14 nov 12:12:50 2021 |  | choose_project(): 1636888370.128871
Dim 14 nov 12:12:50 2021 |  | [work_fetch] ------- start work fetch state -------
Dim 14 nov 12:12:50 2021 |  | [work_fetch] target work buffer: 43200.00 + 86400.00 sec
Dim 14 nov 12:12:50 2021 |  | [work_fetch] --- project states ---
Dim 14 nov 12:12:50 2021 | Rosetta@home | [work_fetch] REC 3867.423 prio -0.841 can't request work: too many runnable tasks
Dim 14 nov 12:12:50 2021 |  | [work_fetch] --- state for CPU ---
Dim 14 nov 12:12:50 2021 |  | [work_fetch] shortfall 1058792.27 nidle 7.00 saturated 0.00 busy 0.00
Dim 14 nov 12:12:50 2021 | Rosetta@home | [work_fetch] share 0.000
Dim 14 nov 12:12:50 2021 |  | [work_fetch] --- state for AMD/ATI GPU ---
Dim 14 nov 12:12:50 2021 |  | [work_fetch] shortfall 1555.47 nidle 0.00 saturated 128044.53 busy 0.00
Dim 14 nov 12:12:50 2021 | Rosetta@home | [work_fetch] share 0.000 no applications
Dim 14 nov 12:12:50 2021 |  | [work_fetch] ------- end work fetch state -------
Dim 14 nov 12:12:50 2021 | Rosetta@home | skip: too many runnable tasks
Dim 14 nov 12:12:50 2021 |  | [work_fetch] No project chosen for work fetch
Dim 14 nov 12:12:59 2021 |  | Re-reading cc_config.xml
Dim 14 nov 12:12:59 2021 |  | Config: don't compute while Diablo III is running
Dim 14 nov 12:12:59 2021 |  | Config: don't use GPUs while Diablo III is running
Dim 14 nov 12:12:59 2021 |  | log flags: file_xfer, sched_ops, task
Dim 14 nov 12:12:59 2021 | Rosetta@home | Found app_config.xml


I understand this is because there are now too many, so it doesn't help ?

I would have to wait for one task to finish and be there "just at this moment" to monitor ?
Almost impossible for me... last 2 days I didn't check at all and I realize the bloody python task has now 4 days of processing stuck at 99,999% with 2 days over the deadline.......
9) Message boards : Number crunching : Excessive workunit fetch (Message 103199)
Posted 12 Nov 2021 by Profile [AF>Le_Pommier] Jerome_C2005
Post:
I never got such a problem on any other project

I’ve had this problem several times on World Community Grid. I’ve stopped using max_concurrent on WCG because of this.

You are not lucky, I've been using this in many occasions in the past.
10) Message boards : Number crunching : Excessive workunit fetch (Message 103198)
Posted 12 Nov 2021 by Profile [AF>Le_Pommier] Jerome_C2005
Post:
I don't understand what you mean with "how much work Boinc requested", boinc just asks "new tasks", it never asks for any given amount of work... ??

Ven 12 nov 17:56:56 2021 | Rosetta@home | Sending scheduler request: To fetch work.
Ven 12 nov 17:56:56 2021 | Rosetta@home | Reporting 1 completed tasks
Ven 12 nov 17:56:56 2021 | Rosetta@home | Requesting new tasks for CPU
Ven 12 nov 17:57:04 2021 | Rosetta@home | Scheduler request completed: got 1 new tasks
Ven 12 nov 17:57:04 2021 | Rosetta@home | Project requested delay of 31 seconds
Ven 12 nov 17:57:06 2021 | Rosetta@home | Started download of aaap-ACPhenC_pp-mHPS-PHE-AMACBEN3_8.gz
Ven 12 nov 17:57:13 2021 | Rosetta@home | Finished download of aaap-ACPhenC_pp-mHPS-PHE-AMACBEN3_8.gz
Ven 12 nov 18:02:31 2021 | Rosetta@home | 1001 UT CPU (992 non démarées, 9 en cours, 0 terminées)
Or do you mean "how often it does it" ?
Ven 12 nov 18:27:10 2021 | Rosetta@home | Sending scheduler request: Requested by project.
Ven 12 nov 18:27:10 2021 | Rosetta@home | Not requesting tasks: too many runnable tasks
Ven 12 nov 18:27:19 2021 | Rosetta@home | Scheduler request completed
Ven 12 nov 18:27:19 2021 | Rosetta@home | Project requested delay of 31 seconds
Ven 12 nov 18:57:22 2021 | Rosetta@home | Sending scheduler request: Requested by project.
Ven 12 nov 18:57:22 2021 | Rosetta@home | Not requesting tasks: too many runnable tasks
Ven 12 nov 18:57:28 2021 | Rosetta@home | Scheduler request completed
Ven 12 nov 18:57:28 2021 | Rosetta@home | Project requested delay of 31 seconds

I (and you) can see there was a 30mn delay between last 2 requests from boinc, but last one didn't even succeed.
If I look up in the history I confirm it is every 30mn.

All these tasks will start to reach the deadline between Saturday and Monday, so I guess my boinc (and / or the rosetta server) is going to cancel them massively ? so next step is that rosetta is going to send new ones massively again ?? at some point the server should start to blacklist my machine I suppose ?

So there is no solution to that problem, since rosetta doesn't propose any setup option on the website (to block the python tasks) and above all since rosetta python tasks are causing so many problems that I can't run many of them at the same time ?

Even with my run limitation I see the currently running task has now *more than 2 days of execution*, it is now a few hours away from the deadline, the completion % is moving up extremely slowly (+ 0,001% after a long time, it is now 99,951%, earlier in the afternoon I remember it was 99,950%).

So probably the only real solution is to stop rosetta :(
11) Message boards : Number crunching : Excessive workunit fetch (Message 103170)
Posted 11 Nov 2021 by Profile [AF>Le_Pommier] Jerome_C2005
Post:
Hi,

I have the exact same problem : I was forced to limit the python tasks to 1 at a time using app_config (recommended in another thread on this forum) because I would get many problems with those tasks on my iMac (with lots of resources : core i9 + 40 Gb RAM, but they stall / don't stop to crunch after almost 2 days and go over the deadline, ... ) and now it downloaded almost 1000 tasks with 2 days deadline (regular rosetta, not python where it only got 9 waiting to crunch).

So this is "not rosetta's fault" ? I never got such a problem on any other project and it is not the first time I limit number of tasks (I used to do it on LCH back in the days when they didn't have much options in project setup).

And rosetta offers no setup whatsoever with project selection.

I feel much more that *it is* "rosetta's fault" actually.
12) Message boards : Number crunching : Rosetta 4.0+ (Message 96301)
Posted 9 May 2020 by Profile [AF>Le_Pommier] Jerome_C2005
Post:
@Jonathan & Grant : " Edit your Boinc preferences on the project to use 8 cpus out of 24. Roughly 33%"

I obviously don't want to do this, I want the 24 cores to be used, not only 8 out of 24 (I wouldn't rent such a host in that case).

I limit rosetta via an app_config to 6 now (I found out even 8 was too much for the 8 GB of the machine...) and all the rest is crunching with universe tasks at the moment. I suspect this might be the reason why the rosetta cache is too big, maybe it actually calculates a required number with 24 cores and not 8 or 6 ? but still, with the very small cache I have set it doesn't make much sense.

But I assume it will self-regulate after some time, now it has 118 on-going tasks (and 95 recently canceled for deadline), this is much less than the 1000 I had at the very beginning (when I had a bigger cache). And anyway it is a standard boinc behavior to cancel unprocessed tasks at the deadline, so let it be.
13) Message boards : Number crunching : Rosetta 4.0+ (Message 96265)
Posted 8 May 2020 by Profile [AF>Le_Pommier] Jerome_C2005
Post:
As expected it canceled hundreds of tasks.
The cache instructions seem to be followed, I don't have hundreds of tasks anymore.

I'm afraid I spoke too soon : it started to request to many tasks again without changing anything to my small cache, i still have 120 waiting to run and it already canceled 160 again because of the deadline in the past few days... with only 17 valid tasks in the log...

So it is still requesting tasks way above the cache setting :(
14) Message boards : Number crunching : Rosetta 4.0+ (Message 96162)
Posted 6 May 2020 by Profile [AF>Le_Pommier] Jerome_C2005
Post:
As expected it canceled hundreds of tasks.

The cache instructions seem to be followed, I don't have hundreds of tasks anymore.

I did remove the app_info and I'm now getting 4.20 tasks.

Obviously it killed all the tasks that were currently running when I removed + restarted boinc after removing that file, but I suspected this would happen anyway...

Thanks.
15) Message boards : Number crunching : Rosetta 4.0+ (Message 96034)
Posted 4 May 2020 by Profile [AF>Le_Pommier] Jerome_C2005
Post:
This is a good news for me then, I'll be able to remove that app_info and go back to fully automated mode !

Thanks for the info.
16) Message boards : Number crunching : Rosetta 4.0+ (Message 95993)
Posted 4 May 2020 by Profile [AF>Le_Pommier] Jerome_C2005
Post:
Thank you !

Actually now there is nothing I can do but wait, I see the server has started to reclaim a few tasks and I suppose it is going to do it at a larger scale soon.

Since you are here, if you look at my earlier posts a few weeks ago I had a problem with all the mini tasks on this host (I have posted the kind of error I got then) so I was forced to block the mini tasks using an app_info file to declare only the rosetta app. It is quite tedious since I have to upgrade the file and also download the application versions manually (mine are sill 4.15 but I see I must now go to 4.20). But on the other end I don't want to risk to block several cores with unlimited wasted CPU cycles again with those mini that this machine really doesn't like...

Do you have any idea of where it may come from ? any library version or something like that ?

Thanks.
17) Message boards : Number crunching : Rosetta 4.0+ (Message 95915)
Posted 3 May 2020 by Profile [AF>Le_Pommier] Jerome_C2005
Post:
OK so now I decided to give it a go again because of Pentathlon who chose (what a surprise) Rosetta as the main project.

On that same machine, my cache is now

work_buf_min_days = 0
work_buf_additional_days = 0.2

(I could verify it in the global_prefs.xml file and the global_pref_override.xml is empty)

Quite reasonable, isn't it ?

Also it is limited to 8 tasks (using app_config.xml) because of the reduced RAM of that machine.
And I am still blocking the mini task (using app_info.xml) because I don't feel like trying, and fighting, again.

And guess what, it has downloaded MORE THAN 1000 TASKS on the machine !!!!

Who is to blame ? not me ! Hundreds of tasks are going to be cancelled by the server within a few days...

(I still think it should normally not be a big problems for the project itself, but apparently all of your scholarly demonstrations above tend to show the contrary, so I hope "everybody" is not going to be angry at me again here...)
18) Message boards : Number crunching : Rosetta 4.0+ (Message 95355)
Posted 25 Apr 2020 by Profile [AF>Le_Pommier] Jerome_C2005
Post:
OK I get your point, rosetta requires a short cache.

We'll see in the future if I put back this machine to run on it.
19) Message boards : Number crunching : Rosetta 4.0+ (Message 95287)
Posted 24 Apr 2020 by Profile [AF>Le_Pommier] Jerome_C2005
Post:
@Grand + Sid : I don't understand very well what is the problem of having tasks (whatever the number) cancelled by the server because the deadline is reached ? are they not sent back to other crunchers ? the calculation will be done at then, and no resource will actually be "wasted", correct ? or is it just about the "error count" ? it should only affect me finally, not the project... ?

Regarding rosetta deadline I had not noticed is was so short indeed. But my cache is not "rosetta only", I've always been a multi-projects boincer, but it's true it's an old habit when internet was not so stable, and when projects would often come short of tasks, having a cache was always a pleasant idea.

But again : this was absolutely not the problem I faced with the mini tasks (see all the history of my explanations above). And again, I "solved" it by blocking the mini on that machine, it was enough for me and was not doing any harm to the project research.

Thanks.
20) Message boards : Number crunching : Rosetta 4.0+ (Message 95217)
Posted 23 Apr 2020 by Profile [AF>Le_Pommier] Jerome_C2005
Post:
The problems I had with mini tasks had *nothing* to do with missed deadline.

The canceled by they server" mini tasks had not started to run = sent back to other users, no problem.

The mini tasks that started to run on that machine never terminated / succeeded due to the problems I documented before.

Besides I had solved the problem on that host by blocking mini tasks execution (documented also, app_info = anonymous platform = no problem).

Rosetta tasks were running fine.

For the moment I have turned this host on another project, so less problem even :)


Next 20



©2022 University of Washington
https://www.bakerlab.org