Add `aserve` utility for serving multiple flows from an asynchronous context #15972

GitAlexxx · 2024-11-11T14:17:28Z

Before executing serve() - when running configured deployments in bulk via a script - it is often necessary to perform an initialization function (create a block, create/reset a concurrency tag, and others). It is normal that it can be asynchronous (in addition to the fact that you want to have a completely asynchronous code, get_client c sync_client=True is not functionally equal to asynchronous get_client). The current implementation does not allow you to run the described case in main as asyncio.run(main()).

codspeed-hq · 2024-11-11T14:26:12Z

CodSpeed Performance Report

Merging #15972 will not alter performance

_{Comparing GitAlexxx:feat/async_start_of_serve (c66923b) with main (42ea12a)}

Summary

✅ 3 untouched benchmarks

desertaxle · 2024-11-11T14:50:48Z

Thanks for opening a PR @GitAlexxx! Can you elaborate on the situation you're looking to improve? Even though serve is a sync function, you can still call it from an async context. A short script where you're seeing a problem would be handy.

GitAlexxx · 2024-11-11T15:41:28Z

Thanks for opening a PR @GitAlexxx! Can you elaborate on the situation you're looking to improve? Even though serve is a sync function, you can still call it from an async context. A short script where you're seeing a problem would be handy.

Hi @desertaxle !

Yes, serve() can be called in any context, but the commit refers more to the context itself - now it is not possible to perform preparatory asynchronous functions BEFORE calling serve(), now this can only be done by explicitly calling the Runner object (in fact, simply rewriting the existing serve). Previously, in perfect 2.X, serve() was asynchronous and did not implement work with event loop internally, there is not even any transitional compatibility with the codebase + I would like to use uvloop.run() for known performance reasons (but more on that in another commit).
Commit does not change the existing approach, but expands it.

An example of the case described in the commit description (just an example, the point is to be able to run asynchronous functions before serve()):

import asyncio

from prefect import serve, get_client, flow


@flow(name='clustering-flow', log_prints=True)
async def clustering_flow():
    print('Clustering results')


async def init() -> None:
    await create_heavy_concurrency_limit()
    await reset_heavy_concurrency_limit()


async def create_heavy_concurrency_limit() -> None:
    async with get_client() as client:
        await client.create_concurrency_limit(tag='heavy-tag', concurrency_limit=1)


async def reset_heavy_concurrency_limit() -> None:
    async with get_client() as client:
        await client.reset_concurrency_limit_by_tag(tag='heavy-tag')


async def main() -> None:
    await init()  # this won't work in the current prefect implementation.

    clustering_deploy = await clustering_flow.to_deployment(name='clustering-deployment')

    await serve(clustering_deploy)


if __name__ == '__main__':
    asyncio.run(main())

desertaxle · 2024-11-11T15:54:43Z

Thanks for the example @GitAlexxx!

In 3.0, we decided to make serve only synchronous because our @sync_compatible hides typing information on decorated functions, and since serve runs indefinitely, we thought that there was less utility in having an async interface for serve.

I can think of two ways we could approach this:

We expose a loop kwarg on serve, allowing users to provide a running event loop. I think that will allow you to use uvloop instead of asyncio, but I'm not entirely sure.
We add a aserve with is an explicitly async version of serve.

If approach 1 works, I would prefer it over approach 2 to avoid duplicating implementation between serve and aserve, but let me know if approach 1 would work for your usecase!

GitAlexxx · 2024-11-12T09:32:34Z

Thanks for the example @GitAlexxx!

In 3.0, we decided to make serve only synchronous because our @sync_compatible hides typing information on decorated functions, and since serve runs indefinitely, we thought that there was less utility in having an async interface for serve.

I can think of two ways we could approach this:

We expose a loop kwarg on serve, allowing users to provide a running event loop. I think that will allow you to use uvloop instead of asyncio, but I'm not entirely sure.

We add a aserve with is an explicitly async version of serve.

If approach 1 works, I would prefer it over approach 2 to avoid duplicating implementation between serve and aserve, but let me know if approach 1 would work for your usecase!

@desertaxle, thanks for the clarifications regarding serve() and the reasons for its complete synchronicity, this is really important.
I also note that a simple call to serve() is not always enough for more complex deployment scenarios (including using asynchronous initialization functions), of course there are fewer such cases, but this is not a reason not to consider them.

I have considered both suggested options: the first option solves only the problem with the event loop used (asyncio/uvloop), the second option (creating aserve) solves both problems: setting up an asynchronous context and using its own event loop.
I can't completely agree about the duplication of functionality. By design, they may look similar, but they solve completely different tasks, aserve() not only adds compatibility with the existing user code base, but allows you to configure deployments more flexibly in complex scenarios.

In this regard, I am sending a commit to add aserve().

desertaxle

@GitAlexxx that approach makes sense! Could you add some test coverage to the new aserve function? Also, I think it'd make sense to remove the event loop handling from serve and instead raise an error when serve is used from a synchronous context that directs users to use aserve instead.

Let me know if you have any questions about those requests!

GitAlexxx · 2024-11-16T19:22:54Z

@GitAlexxx that approach makes sense! Could you add some test coverage to the new aserve function? Also, I think it'd make sense to remove the event loop handling from serve and instead raise an error when serve is used from a synchronous context that directs users to use aserve instead.

Let me know if you have any questions about those requests!

@desertaxle, thanks for the idea, it really differentiates the use of serv/aserv more. I am sending a commit with changes and test coverage.

During the writing of tests (as it happens)) I noticed the serve in the Flow class (I do not use such a launch myself in projects - a separate function is preferable). In fact, it is now repeating serve before the change - with event loop processing and no asynchronous version.

Do you think it makes sense to change it? It seems to me that this method makes no sense at all, given that it simply duplicates the existing serve and does not solve any new task (in the new implementation, separate serve and aserve cover all possible launch cases).

desertaxle · 2024-11-16T19:31:06Z

During the writing of tests (as it happens)) I noticed the serve in the Flow class (I do not use such a launch myself in projects - a separate function is preferable). In fact, it is now repeating serve before the change - with event loop processing and no asynchronous version.

Do you think it makes sense to change it? It seems to me that this method makes no sense at all, given that it simply duplicates the existing serve and does not solve any new task (in the new implementation, separate serve and aserve cover all possible launch cases.

Yes, I think it makes sense to update Flow.serve since it is a useful, convenient interface for users. We should update it to have an explicit sync and aysnc interface and delegate execution to serve and aserve, but that can be handled in a seperate PR and doesn't need to be addressed here.

desertaxle

Left a few stylistic nits, but once those are addressed this should be good to merge!

src/prefect/flows.py

Co-authored-by: Alexander Streed <[email protected]>

Alexander Sidorenko added 2 commits November 11, 2024 16:53

refactor: Run pre-commit

15b27e6

GitAlexxx requested review from cicdw, desertaxle and zzstoatzz as code owners November 11, 2024 14:17

feat: Add aserve method

c5fda6c

GitAlexxx requested review from aaazzam and chrisguidry as code owners November 12, 2024 09:28

desertaxle requested changes Nov 12, 2024

View reviewed changes

Alexander Sidorenko added 2 commits November 16, 2024 19:21

feat: Fix serve method

60d5d08

feat: Add tests for serve and aserve

d0a8312

desertaxle requested changes Nov 20, 2024

View reviewed changes

src/prefect/flows.py Outdated Show resolved Hide resolved

src/prefect/flows.py Outdated Show resolved Hide resolved

desertaxle added the enhancement An improvement of an existing feature label Nov 20, 2024

desertaxle changed the title ~~Feat/async start of serve~~ Add aserve methods for serving multiple flows from an asynchronous context Nov 20, 2024

desertaxle changed the title ~~Add aserve methods for serving multiple flows from an asynchronous context~~ Add aserve utility for serving multiple flows from an asynchronous context Nov 20, 2024

GitAlexxx and others added 2 commits November 20, 2024 20:58

Update flows.py

c07da62

Co-authored-by: Alexander Streed <[email protected]>

Update flows.py

c66923b

Co-authored-by: Alexander Streed <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `aserve` utility for serving multiple flows from an asynchronous context #15972

Add `aserve` utility for serving multiple flows from an asynchronous context #15972

GitAlexxx commented Nov 11, 2024

codspeed-hq bot commented Nov 11, 2024 •

edited

Loading

desertaxle commented Nov 11, 2024

GitAlexxx commented Nov 11, 2024 •

edited

Loading

desertaxle commented Nov 11, 2024

GitAlexxx commented Nov 12, 2024

desertaxle left a comment

GitAlexxx commented Nov 16, 2024 •

edited

Loading

desertaxle commented Nov 16, 2024

desertaxle left a comment

Add aserve utility for serving multiple flows from an asynchronous context #15972

Are you sure you want to change the base?

Add aserve utility for serving multiple flows from an asynchronous context #15972

Conversation

GitAlexxx commented Nov 11, 2024

codspeed-hq bot commented Nov 11, 2024 • edited Loading

CodSpeed Performance Report

Merging #15972 will not alter performance

Summary

desertaxle commented Nov 11, 2024

GitAlexxx commented Nov 11, 2024 • edited Loading

desertaxle commented Nov 11, 2024

GitAlexxx commented Nov 12, 2024

desertaxle left a comment

Choose a reason for hiding this comment

GitAlexxx commented Nov 16, 2024 • edited Loading

desertaxle commented Nov 16, 2024

desertaxle left a comment

Choose a reason for hiding this comment

Add `aserve` utility for serving multiple flows from an asynchronous context #15972

Add `aserve` utility for serving multiple flows from an asynchronous context #15972

codspeed-hq bot commented Nov 11, 2024 •

edited

Loading

GitAlexxx commented Nov 11, 2024 •

edited

Loading

GitAlexxx commented Nov 16, 2024 •

edited

Loading