Added concurrency helpers to retrieve task results #890

Draft
agronholm wants to merge 28 commits into master from enhanced-taskgroup

Conversation

@agronholm
Owner

Changes

Adds an enhanced version of the task group that allows task-by-task cancellation as well as awaiting on the results of individual tasks. Two convenience functions are also provided:

  • amap(): calls the given one-parameter coroutine function with each item from the given iterable of arguments and runs them concurrently in a task group
  • race(): launches all the given coroutines as tasks in a task group and returns the return value of whichever task completes first

Concurrency limiting and rate limiting are provided by both functions.
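To make the semantics concrete, here is a rough stdlib-asyncio sketch of what amap() and race() are meant to do. This is illustrative only, not the PR's implementation; the PR's versions additionally support concurrency and rate limits:

```python
import asyncio


async def amap(func, iterable):
    """Call func on each item concurrently; results come back in input order."""
    return await asyncio.gather(*(func(item) for item in iterable))


async def race(*coros):
    """Return the result of whichever coroutine finishes first; cancel the rest."""
    tasks = [asyncio.ensure_future(c) for c in coros]
    done, pending = await asyncio.wait(tasks, return_when=asyncio.FIRST_COMPLETED)
    for task in pending:
        task.cancel()
    return next(iter(done)).result()


async def demo():
    async def double(x):
        await asyncio.sleep(0.01)
        return x * 2

    return await amap(double, [1, 2, 3])


print(asyncio.run(demo()))  # [2, 4, 6]
```

Note that a failing child task would surface its exception through gather() or result() here; the discussion below covers how the real helpers should handle that case.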

Checklist

If this is a user-facing code change, like a bugfix or a new feature, please ensure that
you've fulfilled the following conditions (where applicable):

  • You've added tests (in tests/) which would fail without your patch
  • You've updated the documentation (in docs/, in case of behavior changes or new
    features)
  • You've added a new changelog entry (in docs/versionhistory.rst).

If this is a trivial change, like a typo fix or a code reformatting, then you can ignore
these instructions.

Updating the changelog

If there are no entries after the last release, use **UNRELEASED** as the version.
If, say, your patch fixes issue #123, the entry should look like this:

- Fix big bad boo-boo in task groups
  (`#123 <https://github.com/agronholm/anyio/issues/123>`_; PR by @yourgithubaccount)

If there's no issue linked, just link to your pull request instead by updating the
changelog after you've created the PR.

return get_async_backend().create_task_group()


class TaskHandle(Generic[T]):
Collaborator

It looks like a TaskHandle could be used to implement a create_future() on the EnhancedTaskGroup, to have something like asyncio's Future.
What do you think?

Owner Author

What exact addition are you suggesting?

Collaborator

I was going to say something like this:

class EnhancedTaskGroup:
    def create_future(self) -> TaskHandle[T]:
        handle = TaskHandle[T]()
        return handle

But now I realize a Future is just a TaskHandle, so there's nothing to do?

Owner Author

Futures are a lower level primitive, not tied to any task, unlike TaskHandle. And a Future reports the exception it receives verbatim while TaskHandle handles base exceptions in a special manner.

@davidbrochart
Collaborator

A bit far-fetched, but what do you think of a free function create_task() that would use the current EnhancedTaskGroup if there is one, and errors out otherwise?
On one hand this is going against structured concurrency, but on the other hand it removes the need to pass a task group down the stack, when you know there must be one.

kwargs: Mapping[str, Any] | None = None,
) -> TaskHandle[T]:
handle = TaskHandle[T]()
handle._start_value = await self._task_group.start(
Collaborator

If we want to be able to await the start value in the TaskHandle, I guess we should wrap start() with a create_task()?

Owner Author

I don't understand what you mean. The start value will already be available in the TaskHandle once start() returns.

Collaborator

Ah indeed 👍

Collaborator

I thought that the TaskHandle was returned immediately, but it's returned only when the task has started. Which means we cannot e.g. cancel the task before it has started. Not sure if that should be allowed?

Owner Author

If you really need to do that, you can just cancel an outer cancel scope.

@smurfix
Collaborator

smurfix commented Mar 19, 2025

it removes the need to pass a task group down the stack,

On the other hand, that's the point of passing down the taskgroup -- when it's not there we know that the method we're calling won't start things it doesn't wait for.

If you want to circumvent that, for whatever reason, there's already a fairly-high-performance way to do it -- set a contextvar. So IMHO "explicit is better than implicit" and thus we shouldn't support that natively.

@agronholm changed the title from "Added EnhancedTaskGroup, amap() and race()" to "Added EnhancedTaskGroup, amap() and as_completed()" on Mar 20, 2025
@agronholm force-pushed the enhanced-taskgroup branch from 6a9bec2 to 33f6919 on March 20, 2025
@agronholm
Owner Author

A bit far-fetched, but what do you think of a free function create_task() that would use the current EnhancedTaskGroup if there is one, and errors out otherwise? On one hand this is going against structured concurrency, but on the other hand it removes the need to pass a task group down the stack, when you know there must be one.

I'm -1 on an implicit task group.

@dhirschfeld

The one thing I'm after is a nursery/TaskGroup where I can start tasks and then iterate over the results asynchronously as soon as they become available, so that I can interleave work and don't have to wait for the slowest task before starting the next part of the pipeline.

With this API, race is trivial to implement: you just cancel the TaskGroup as soon as the first result is made available. gather just collects all results into a list of length n_inputs, in the order they were provided, and returns the list when the TaskGroup exits.

@agronholm
Owner Author

With this API, race is trivial to implement

And how do we deal with exceptions occurring in the child tasks?

@dhirschfeld

dhirschfeld commented Apr 15, 2025

With this API, race is trivial to implement

And how do we deal with exceptions occurring in the child tasks?

In my implementation I actually return Outcome instances, so the user processes the Outcomes as they're made available. It's then up to the user to decide what to do with any errors. If they blindly unwrap an error, it will raise and cancel the TaskGroup.

It would of course be nice to have this built-in, so I'm lurking here to see if I can replace my own custom solution. For me it's important to be able to process tasks as soon as they're finished and to not have to wait for the slowest.
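The "process Outcomes as soon as they finish" model described here can be approximated with stdlib asyncio, using (ok, value) pairs in place of a real Outcome type. iter_outcomes is a hypothetical helper, not the PR's API:

```python
import asyncio


async def iter_outcomes(coros):
    """Yield (ok, value_or_exception) pairs as each task finishes."""
    tasks = [asyncio.ensure_future(c) for c in coros]
    for fut in asyncio.as_completed(tasks):
        try:
            yield True, await fut
        except Exception as exc:
            yield False, exc


async def demo():
    async def work(delay, value, fail=False):
        await asyncio.sleep(delay)
        if fail:
            raise ValueError(value)
        return value

    results = []
    async for ok, value in iter_outcomes([
        work(0.02, "slow"),
        work(0.0, "fast"),
        work(0.01, "boom", fail=True),
    ]):
        # Record the value on success, the exception type on failure.
        results.append((ok, value if ok else type(value).__name__))
    return results


print(asyncio.run(demo()))  # [(True, 'fast'), (False, 'ValueError'), (True, 'slow')]
```

The key property is that the error does not cancel the other tasks; the consumer decides what to do with each outcome.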

@agronholm
Owner Author

Ok, so help me understand. What do you suggest race() returns then? What if the first result from a child task is an exception? Do you want to return the Outcome (or equivalent) of that?

@dhirschfeld

I think that depends on the use-case. If amap returned an asynchronous iterator that returned Outcome wrapped results as soon as they were ready I'd leave it at that and let users implement race themselves with the semantics that made sense for their problem.

Given the amap primitive, race becomes trivial to implement however you want so there's no need to provide an implementation which may not be ideal for all use-cases.

@agronholm
Owner Author

agronholm commented Apr 21, 2025

I think that depends on the use-case. If amap returned an asynchronous iterator that returned Outcome wrapped results as soon as they were ready I'd leave it at that and let users implement race themselves with the semantics that made sense for their problem.

Given the amap primitive, race becomes trivial to implement however you want so there's no need to provide an implementation which may not be ideal for all use-cases.

But as_completed() does exactly that (with TaskHandles), doesn't it? I don't think amap() is suitable for implementing race().

@dhirschfeld

I think that depends on the use-case. If amap returned an asynchronous iterator that returned Outcome wrapped results as soon as they were ready I'd leave it at that and let users implement race themselves with the semantics that made sense for their problem.
Given the amap primitive, race becomes trivial to implement however you want so there's no need to provide an implementation which may not be ideal for all use-cases.

But as_completed() does exactly that (with TaskHandles), doesn't it? I don't think amap() is suitable for implementing race().

Ah yep, I missed the as_completed implementation - that does look like it does what I'm after!

@dhirschfeld

I'm actually pretty excited about this functionality as it will let me replace a bunch of custom code, so I'm curious if there are plans to land this in a release sometime soonish?

@agronholm
Owner Author

I think I'll stop spamming this PR and make a separate one for rate limiting, as it's not tied to the improved concurrency API.

@agronholm
Owner Author

For interested parties, here it is: #989

@agronholm
Owner Author

agronholm commented Oct 3, 2025

Here's what I understand our options are:

1. Expose a new task group class (EnhancedTaskGroup or whatever)

Pros:

  • It's a new API and we're free to shape it however we like

Cons:

  • It's a new API which we have to support for the foreseeable future
  • Having two distinct task groups complicates things for users too

2. Expose only the concurrency helpers but not a new task group API

Pros:

  • Users get increased convenience without any duplication

Cons:

  • The convenience functions don't cater to all use cases
  • Only being able to retrieve task results via these concurrency helpers feels odd from an API design perspective

3. Shoehorn task handles into the existing API

Pros:

  • The existing task groups gain low-level APIs that can actually return task results

Cons:

  • Backwards compatibility will be tricky, particularly with the start() method which will likely require an extra keyword argument to return a task handle instead of the task_status.started() value
  • On Trio, we always end up wrapping the target coroutine function in start_soon() as there is no other way to get the return value
  • If we mess this up, it will hurt both us and the users

@agronholm
Owner Author

For option 3, it would probably look something like this:

class TaskGroup: # never mind the name or inheritance here
    def start_soon(
        self,
        func: Callable[[Unpack[PosArgsT]], Awaitable[T]],
        *args: Unpack[PosArgsT],
        name: object = None,
    ) -> TaskHandle[T]:
        ...

    @overload
    async def start(
        self,
        func: Callable[..., Awaitable[Any]],
        *args: object,
        name: object = None,
        return_handle: Literal[False] = False,
    ) -> Any:
        ...

    @overload
    async def start(
        self,
        func: Callable[..., Awaitable[T]],
        *args: object,
        name: object = None,
        return_handle: Literal[True],
    ) -> TaskHandle[T]:
        ...

@Graeme22
Contributor

Graeme22 commented Oct 3, 2025

My order of preference would be 2, 3, 1. I bet the majority of users who are asking for ways to collect task results would be satisfied with some combination of gather/amap/whatever. It's also a very low-cost option that wouldn't break anything and can be adapted to future API changes easily.

@agronholm
Owner Author

My own order of preference is 3, 2, 1. @Graeme22 what objections do you have against option 3? I feel that 2) is only kicking the can down the road. We need a final solution at some point.

@Graeme22
Contributor

Graeme22 commented Oct 3, 2025

My own order of preference is 3, 2, 1. @Graeme22 what objections do you have against option 3? I feel that 2) is only kicking the can down the road. We need a final solution at some point.

I don't have any strong objections to 3. If it's likely to be a breaking change anyway, maybe start could just return a tuple[Any, TaskHandle[T]]?

Something like this could also be considered, sort of a combination of 1 and 3 that maintains backwards compatibility:

@overload
@classmethod
def create_task_group(cls, collect_results: Literal[False] = False) -> TaskGroup: ...

@overload
@classmethod
def create_task_group(cls, collect_results: Literal[True]) -> ResultCollectingTaskGroup: ...

class ResultCollectingTaskGroup:
    def start_soon(
        self,
        func: Callable[[Unpack[PosArgsT]], Awaitable[T]],
        *args: Unpack[PosArgsT],
        name: object = None,
    ) -> TaskHandle[T]:
        ...

    async def start(
        self,
        func: Callable[..., Awaitable[Any]],
        *args: object,
        name: object = None,
    ) -> tuple[Any, TaskHandle[T]]:
        ...

Regardless, I think the utility functions should definitely be present (maybe this was implied already idk)

@agronholm
Owner Author

My own order of preference is 3, 2, 1. @Graeme22 what objections do you have against option 3? I feel that 2) is only kicking the can down the road. We need a final solution at some point.

I don't have any strong objections to 3. If it's likely to be a breaking change anyway, maybe start could just return a tuple[Any, TaskHandle[T]]?

Wait, what? Who said anything about a breaking change?

@Graeme22
Contributor

Graeme22 commented Oct 3, 2025

My own order of preference is 3, 2, 1. @Graeme22 what objections do you have against option 3? I feel that 2) is only kicking the can down the road. We need a final solution at some point.

I don't have any strong objections to 3. If it's likely to be a breaking change anyway, maybe start could just return a tuple[Any, TaskHandle[T]]?

Wait, what? Who said anything about a breaking change?

I just assumed it was possible when you said backwards compatibility would be tricky.

@agronholm
Owner Author

agronholm commented Oct 3, 2025

My own order of preference is 3, 2, 1. @Graeme22 what objections do you have against option 3? I feel that 2) is only kicking the can down the road. We need a final solution at some point.

I don't have any strong objections to 3. If it's likely to be a breaking change anyway, maybe start could just return a tuple[Any, TaskHandle[T]]?

Wait, what? Who said anything about a breaking change?

I just assumed it was possible when you said backwards compatibility would be tricky

What I meant was that the API becomes clunkier with the shoehorning of this functionality to what was basically copied from Trio's nurseries. But in no event do I want to compromise the compatibility guarantees.

@agronholm
Owner Author

One more thing: TaskHandle is already capable of storing the start value of a task, so a tuple is not necessary.

@smurfix
Collaborator

smurfix commented Oct 4, 2025

If it's likely to be a breaking change anyway

There's a difference between "we need to change some detail and there's a deprecation notice and grace period where both work and everything" (like we had with Event.set and friends) and "we want to change something major, it's suddenly incompatible and you need to update your complete codebase all at once".

The latter is not happening. Unless it's for a very important overriding reason, which this clearly is not.

@agronholm
Owner Author

agronholm commented Oct 4, 2025

What still bothers me a lot is the start() mechanism, where we cannot establish type safety for the start value. I have been unable to figure out a clearly better alternative to the task_status.started() call. The only idea I came up with is a contextvar-based mechanism where you call a free function that sets an event stored in the contextvar, but there is a downside to that: it would allow starting functions that have no intention of ever calling that free function. Plus, it would still not give us type safety.

If async generators could return a value like normal generators can, that would make for an interesting alternative.

@agronholm
Copy link
Owner Author

OTOH, if we assume that we don't need the return value from such a task, then async generators would be a viable alternative, as you can still return from them without a value. I'll whip up a test branch for that.

@smurfix
Collaborator

smurfix commented Oct 4, 2025

The Trio people have been discussing this for more than two years now. python-trio/trio#2633

Though I have to say that our discussion here feels a bit more constructive.

@smurfix
Collaborator

smurfix commented Oct 4, 2025

Another advantage of option 3 is (most probably) that it's the least amount of additional work. Option 1 will lead to either some code duplication or some additional method calls (including annoying wrappers to catch the results; while the user doesn't see them any more, they'll still be there and slow things down). Option 2 definitely requires task wrappers.

@agronholm
Owner Author

agronholm commented Oct 7, 2025

I have a new branch, concurrency-helpers, which contains a reimagined implementation of the task group. It deals solely with coroutine objects and offers start()-like functionality using an async generator rather than a keyword argument, thus offering a type-safe way to provide a start value.

Additionally, its start_task() method takes a concurrency limiter and a rate limiter, thus adding native support for these features. It still provides a synchronous create_task() method which works just like its asyncio counterpart.

@agronholm
Owner Author

Another advantage of option 3 is (most probably) that it's the least amount of additional work. Option 1 will lead to either some code duplication or some additional method calls (including annoying wrappers to catch the results; while the user doesn't see them any more, they'll still be there and slow things down). Option 2 definitely requires task wrappers.

No matter how we implement it, task wrappers will be needed as there is otherwise no way to extract the return value on Trio.

@smurfix
Collaborator

smurfix commented Oct 8, 2025

No matter how we implement it, task wrappers will be needed as there is otherwise no way to extract the return value on Trio.

… assuming the Trio people don't also go that route.

python-trio/trio#2633

Maybe you want to weigh in there.

@agronholm
Owner Author

No matter how we implement it, task wrappers will be needed as there is otherwise no way to extract the return value on Trio.

… assuming the Trio people don't also go that route.

python-trio/trio#2633

Maybe you want to weigh in there.

Frankly I don't think this is going to be resolved upstream any time soon. But if Trio gets this feature, we have the option of changing our implementation behind the scenes. Plus my latest change would still need the wrapper for releasing the concurrency limiter.

@smurfix
Collaborator

smurfix commented Oct 8, 2025

Frankly I don't think this is going to be resolved upstream any time soon

You may have a point here. On the other hand, maybe they'll follow your lead for once …

@agronholm agronholm modified the milestones: 4.12, 4.13 Nov 28, 2025
@Graeme22
Contributor

Graeme22 commented Dec 29, 2025

Any updates on this? It seems like option 3 was acceptable to everyone who's commented thus far; it could look like:

def start_soon(
    self,
    func: Callable[[Unpack[PosArgsT]], Awaitable[T]],
    *args: Unpack[PosArgsT],
    name: object = None,
) -> Future[T]:
    ...

@overload
async def start(
    self,
    func: Callable[..., Awaitable[Any]],
    *args: object,
    name: object = None,
    return_future: Literal[False] = False,
) -> Any:
    ...

@overload
async def start(
    self,
    func: Callable[..., Awaitable[T]],
    *args: object,
    name: object = None,
    return_future: Literal[True],
) -> Tuple[Future[T], Any]:
    ...

@agronholm
Owner Author

Any updates on this? It seems like option 3 was acceptable to everyone who's commented thus far, it could look like:

def start_soon(
    self,
    func: Callable[[Unpack[PosArgsT]], Awaitable[T]],
    *args: Unpack[PosArgsT],
    name: object = None,
) -> Future[T]:
    ...

@overload
async def start(
    self,
    func: Callable[..., Awaitable[Any]],
    *args: object,
    name: object = None,
    return_future: Literal[False] = False,
) -> Any:
    ...

@overload
async def start(
    self,
    func: Callable[..., Awaitable[T]],
    *args: object,
    name: object = None,
    return_future: Literal[True],
) -> Tuple[Future[T], Any]:
    ...

So the thing is, after an extensive discussion, there was a loose consensus that we wanted to include rate limiting in these enhanced task groups as that is not properly doable from the outside. But that effort got stuck as there was no real consensus on how the rate limiters should work. I have a couple related PRs open. I'm not sure how to get unstuck with this.

@Graeme22
Contributor

So the thing is, after an extensive discussion, there was a loose consensus that we wanted to include rate limiting in these enhanced task groups as that is not properly doable from the outside. But that effort got stuck as there was no real consensus on how the rate limiters should work. I have a couple related PRs open. I'm not sure how to get unstuck with this.

Couldn't the rate limiter code be separated out so we can get the task results implementation worked out?

@agronholm
Owner Author

So the thing is, after an extensive discussion, there was a loose consensus that we wanted to include rate limiting in these enhanced task groups as that is not properly doable from the outside. But that effort got stuck as there was no real consensus on how the rate limiters should work. I have a couple related PRs open. I'm not sure how to get unstuck with this.

Couldn't the rate limiter code be separated out so we can get the task results implementation worked out?

If the rate limiters can be added later w/o breaking the API. There is also the possibility of using the rate limiting interface w/o a concrete implementation, though I think that would be an odd thing to provide.

@agronholm agronholm removed this from the 4.13 milestone Feb 9, 2026