gh-131466: `concurrent.futures.Executor.map`: avoid temporarily exceeding `buffersize` while collecting the next result #131467

ebonnal · 2025-03-19T15:57:43Z

Context recap:

If we have:

results: Iterator = executor.map(fn, iterable, buffersize=buffersize)

What happens when calling next(results):

1. fetch the next arg from iterable and put a task for fn(arg) in the buffer
2. wait for the next result to be available
3. yield the collected result

-> During step 2, there are buffersize + 1 buffered tasks.

This PR swaps steps 1. and 2. so that buffersize is never exceeded, even during next.
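
The effect of the swap can be illustrated with a small model. This is only a sketch, not the code in Lib/concurrent/futures/_base.py: the helper `drain`, its `submit_before_wait` flag, and the `square` function are hypothetical names introduced here. It keeps a buffer of futures and records how many are buffered while waiting for the next result, under both orderings.

```python
import collections
from concurrent.futures import ThreadPoolExecutor


def square(x):
    return x * x


def drain(executor, fn, iterable, buffersize, submit_before_wait):
    """Hypothetical model of a buffered map; returns (results, peak buffer length)."""
    args = iter(iterable)
    buffer = collections.deque()
    peak = 0
    results = []

    def submit_next():
        # Consume at most one argument and buffer its task.
        for arg in args:
            buffer.append(executor.submit(fn, arg))
            return

    # Initial fill: at most `buffersize` tasks.
    for _ in range(buffersize):
        submit_next()

    while buffer:
        if submit_before_wait:
            submit_next()  # old order: refill first, then wait
        peak = max(peak, len(buffer))  # tasks buffered while we wait
        result = buffer[0].result()
        buffer.popleft()
        if not submit_before_wait:
            submit_next()  # new order: wait first, then refill
        results.append(result)
    return results, peak


with ThreadPoolExecutor(max_workers=2) as pool:
    old_results, old_peak = drain(pool, square, range(6), 2, submit_before_wait=True)
    new_results, new_peak = drain(pool, square, range(6), 2, submit_before_wait=False)

# old_peak == 3 (buffersize + 1), new_peak == 2 (buffersize); results are identical.
```

Both orderings yield the same results in the same order; only the number of tasks alive during the wait differs.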

Issue: concurrent.futures.Executor.map temporarily exceeds its buffersize while collecting the next result #131466

ebonnal · 2025-03-22T19:07:41Z

Lib/concurrent/futures/_base.py

-                        yield _result_or_cancel(fs.pop(), end_time - time.monotonic())
+
+                    # Yield the awaited result
+                    yield fs.pop().result()


to be discussed: this could be replaced by a lighter yield fs.pop()._result because the prior call to _result_or_cancel guarantees that at this point the result is available.

Lib/test/test_concurrent_futures/executor.py

picnixz

While I understand that we could possibly exceed buffersize while collecting the next result, is there a real-world use case where it would really cause an issue? The reason I ask is that we access fs[-1] and then do fs.pop().

I see that we have a del fut in _result_or_cancel(), but can you confirm that it's sufficient to not hold any reference to the yet-to-be-popped future?

Lib/concurrent/futures/_base.py

picnixz · 2025-03-23T13:25:47Z

Asking Gregory as well since he's the mp expert c:

ebonnal · 2025-03-23T13:33:19Z

@picnixz sorry, I re-requested your review because you made me realize that we actually don't need _result_or_cancel anymore:

test_executor_map_current_future_cancel introduced in #95169 does not break anymore because now if the fs[-1].result() access fails, the future is still in fs (not popped out like before) and it will be properly cancelled as part of the result_iterator's finally block.
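
A minimal sketch of that behavior, assuming a simplified iterator rather than the real result_iterator: `iter_results` and the demo variables below are hypothetical. The future is awaited while it is still in fs, so a TimeoutError leaves it in the list and the finally block cancels it along with the remaining ones.

```python
import threading
from concurrent.futures import ThreadPoolExecutor, TimeoutError


def iter_results(fs, timeout):
    """Hypothetical simplified iterator; fs is reversed, so the next future is fs[-1]."""
    try:
        while fs:
            # Wait while the future is still referenced by fs.
            fs[-1].result(timeout)
            # The result is available now, so this call does not block.
            yield fs.pop().result()
    finally:
        # Anything still in fs, including a future that timed out, gets cancelled.
        for future in fs:
            future.cancel()


release = threading.Event()
timed_out = False
with ThreadPoolExecutor(max_workers=1) as pool:
    blocker = pool.submit(release.wait)  # saturates the single worker
    pending = [pool.submit(str, n) for n in (1, 2)]
    try:
        next(iter_results(pending[::-1], timeout=0))
    except TimeoutError:
        timed_out = True
    finally:
        release.set()  # let the worker finish so the pool can shut down

cancelled = [future.cancelled() for future in pending]
```

Here both pending futures end up cancelled, including the one whose result() call timed out, without a dedicated helper around the call.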

I'm digging deeper into the context of #95169 to check whether I'm missing any untested scenario, especially regarding this:

    finally:
        # Break a reference cycle with the exception in self._exception
        del fut

picnixz · 2025-03-23T13:52:11Z

> especially regarding this:

Yes, that's what I wanted to ask, but I'm not an expert here so I'll let you investigate first c:

ebonnal · 2025-03-23T22:53:59Z

Lib/concurrent/futures/_base.py

-            fut.cancel()
-    finally:
-        # Break a reference cycle with the exception in self._exception
-        del fut


Hi @graingert!
Context:
As a side effect, this PR may remove the need for _result_or_cancel (introduced in #95169). If fetching the next result raises a TimeoutError, its future will still be in fs and will be properly cancelled by the result_iterator's finally block.

Question:
Do you remember in which scenario the del fut was required? Removing it in the current main does not break any tests 🤔

graingert · 2025-03-24

It is needed when fut.result() raises an exception: there is then a reference cycle where fut.exception().__traceback__ -> fut.exception().

Probably worth adding a test; a git grep for no_other_refs will find a similar one.

ebonnal · 2025-03-24

Thank you! Will add the test 🫡

ebonnal · 2025-03-24

FYI, #131701 adds the test @graingert @picnixz 🙏🏻
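
The cycle can be reproduced outside of concurrent.futures. The sketch below is an illustration under assumptions, not the test added in #131701: `FailedFuture`, `fetch_result`, and `future_freed_without_gc` are hypothetical names, and the behavior relies on CPython's reference counting. The traceback keeps the frame of `fetch_result` alive, that frame keeps its local `fut`, and `fut` keeps the exception, which holds the traceback.

```python
import gc
import weakref


class FailedFuture:
    """Hypothetical stand-in for a future that stores and re-raises an exception."""

    def __init__(self, exception):
        self._exception = exception

    def result(self):
        try:
            raise self._exception
        finally:
            # Drop this frame's reference to self so it adds no cycle of its own.
            self = None


def fetch_result(fut, break_cycle):
    try:
        return fut.result()
    finally:
        if break_cycle:
            # Without this, the frame kept alive by the traceback keeps fut alive.
            del fut


def future_freed_without_gc(break_cycle):
    """Return True if reference counting alone frees the future."""
    gc.disable()
    try:
        box = [FailedFuture(ValueError("boom"))]
        ref = weakref.ref(box[0])
        try:
            # box.pop() passes the future without binding it to a local name here.
            fetch_result(box.pop(), break_cycle)
        except ValueError:
            pass
        return ref() is None
    finally:
        gc.enable()
        gc.collect()  # clean up the cycle left by the break_cycle=False run


freed_with_del = future_freed_without_gc(break_cycle=True)
freed_without_del = future_freed_without_gc(break_cycle=False)
```

With `del fut` the future is released as soon as the exception has been handled; without it, the object lingers until the cyclic garbage collector runs.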

Executor.map: avoid temporarily exceeding the buffersize while co…

233ccc1

…llecting the next result

bedevere-app bot added the awaiting review label Mar 19, 2025

bedevere-app bot mentioned this pull request Mar 19, 2025

concurrent.futures.Executor.map temporarily exceeds its buffersize while collecting the next result #131466

Open

ebonnal added 9 commits March 19, 2025 16:07

Merge remote-tracking branch 'cpython/main' into feat/executor-map-bu…

b864ef9

…ffer-after-yield

avoid keeping a ref to result for test_thread_pool.ThreadPoolExecutor…

72d7028

…Test.test_free_reference

update comment about not keeping references to popped future/result

2a30697

introduce current_timeout variable

ab4182b

comment on the necessity of the result container

7a1ae46

avoid container

268927d

remove current_timeout usage

de09aff

fix comments format

1814bfe

rephrase comments

7206321

ebonnal changed the title ~~gh-131466: concurrent.futures.Executor.map: avoid temporarily exceeding buffersize while collecting the next result~~ gh-131466: concurrent.futures.Executor.map: avoid temporarily exceeding buffersize while collecting the next result Mar 20, 2025

picnixz self-requested a review March 22, 2025 15:59

picnixz added the skip news label Mar 22, 2025

ebonnal commented Mar 22, 2025

picnixz reviewed Mar 22, 2025

Lib/test/test_concurrent_futures/executor.py (outdated, resolved)

order imports

162add1

ebonnal requested a review from picnixz March 23, 2025 01:02

picnixz reviewed Mar 23, 2025

Lib/concurrent/futures/_base.py (outdated, resolved)

format comments

f2c5fd0

picnixz approved these changes Mar 23, 2025

bedevere-app bot added awaiting merge and removed awaiting review labels Mar 23, 2025

picnixz requested a review from gpshead March 23, 2025 13:25

remove _result_or_cancel

9474769

ebonnal requested a review from picnixz March 23, 2025 13:30

access awaited result via _result attribute

2a2119e

ebonnal commented Mar 23, 2025

ebonnal added 2 commits March 24, 2025 13:23

break a reference cycle with fs[-1]._exception

3be6956

break other potential reference cycles with fs, not only the one caus…

0d70be9

…ed by fs[-1]

ebonnal mentioned this pull request Mar 24, 2025

concurrent.futures.Executor.map: test no reference cycle from failed future captured in its exception’s traceback #131701

Open

ebonnal added 2 commits March 25, 2025 00:34

lighter ref cycle break

f509097

move the ref cycle break into the finally block

d50dabd
