Allow callers to define the scaling strategy of the FLAME pool #51

nickdichev-firework · 2024-08-18T20:13:40Z

I've been evaluating FLAME to replace the lambda architecture we are using for one of our services. We are currently using Kubernetes jobs as the runner. We have explicitly chosen to make the job/lambda call one-to-one due to the constraints of our application's requirements.

As such, in my first few experiments with FLAME I set the :max_concurrency option to 1 and :single_use to true. This works okay, however, due to the logic of the existing "max concurrency per runner" strategy, it causes us to get penalized waiting for a new k8s pod to spin up more often than we can tolerate.

The strategy we are trying to implement is "constant number of overprovisioned workers". I think the behavior I've extracted is flexible enough for all callers to be able to define their own strategies, however, I am definitely looking for feedback there.

There are still some rough edges in the PR, but I was hoping to see what you all think about this general direction.

For reference, here's the strategy module I've implemented in my app: https://gist.github.com/nickdichev-firework/e24530a6a36833c9c5aff8fb9ed8f970

nickdichev-firework · 2024-08-19T21:47:51Z

One issue I've realized with my approach is that the initial booted runners don't take into account our desired "overprovisioned" count, this one can be worked around by taking that into account in our :min config.

Similarly, I there's a problem where the min workers can be shutdown due since we use single_use: true and as such the pool can be scaled down past the :min config if there is no waiting callers due to the has_unmet_servicable_demand? check in the :DOWN handler. I'm still trying to write a test to exhibit the behavior, but I observed it in an actual deployment.

I thought about also moving that callback to the behavior as well, since I think my implementation would like to do it slightly differently than is implemented in the existing Pool implementation.

nickdichev-firework · 2024-08-28T17:26:13Z

Hi @chrismccord I'm curious if you had a chance to look at this one.

I saw some of the code I touched here has changed in main, so hoping to catch you while that's fresh in your mind.

… runners

…premptively scale without waiting

…and?/2

nickdichev-firework mentioned this pull request Aug 18, 2024

Add hotstart_threshold to flame.pool #32

Open

nickdichev-firework added 12 commits September 3, 2024 10:57

Replace :max_concurrency option with :strategy

c6ddcf9

Update tests to use :strategy instead of :max_concurrency

d0a47e0

Add FLAME.Pool.Strategy

989d193

Implement the existing "max concurrency per worker" strategy

460ff2a

Add :checkout_and_scale action

8753a6e

Seperate the pending count from the runner count

58538d3

Hand the strategy implementations a closure to pop waiters and assign…

afe8535

… runners

Change checkout_runner API to return a list of actions so caller can …

d216f51

…premptively scale without waiting

Add t() for Pool.WaitingState

e8a0df2

Rename PerRunnerMaxConcurrencyStrategy filename to match module

0c68dfe

Allow strategy implementations to implement has_unment_servicable_dem…

992b4a6

…and?/2

Add desired_count/2 to the behavior, it was forgotten to be added

0270d43

nickdichev-firework force-pushed the ndichev/scaling-strategies branch from e9c6d10 to 0270d43 Compare September 3, 2024 18:42

This was referenced Sep 5, 2024

Unexpected interaction between :single_use pool option and :min runners #60

Open

Never scaling past 1 runner #56

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow callers to define the scaling strategy of the FLAME pool #51

Allow callers to define the scaling strategy of the FLAME pool #51

nickdichev-firework commented Aug 18, 2024

nickdichev-firework commented Aug 19, 2024 •

edited

Loading

nickdichev-firework commented Aug 28, 2024

Allow callers to define the scaling strategy of the FLAME pool #51

Are you sure you want to change the base?

Allow callers to define the scaling strategy of the FLAME pool #51

Conversation

nickdichev-firework commented Aug 18, 2024

nickdichev-firework commented Aug 19, 2024 • edited Loading

nickdichev-firework commented Aug 28, 2024

nickdichev-firework commented Aug 19, 2024 •

edited

Loading