refactor: merge CoalesceAsyncExecInput into CoalesceBatches #18540
Conversation
…ty into CoalesceBatches
```rust
            plan,
            target_batch_size,
        ))))
    } else if let Some(async_exec) = plan_any.downcast_ref::<AsyncFuncExec>() {
```
Could we add it to the expression for `wrap_in_coalesce` before? E.g.

```rust
|| plan_any.downcast_ref::<AsyncFuncExec>().map(|f| ...) // etc.
```
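As a minimal sketch of what this suggestion would look like, the check can be folded into a single boolean expression. This is a toy model, not DataFusion's real code: `AsyncFuncExec` and `FilterExec` here are empty stand-in structs, and `other_conditions` is a placeholder for whatever the existing `wrap_in_coalesce` expression already checks.

```rust
use std::any::Any;

// Hypothetical stand-ins for the real operator types, just so the
// downcast compiles in isolation.
struct AsyncFuncExec;
struct FilterExec;

// Fold the AsyncFuncExec check into the existing boolean condition
// instead of keeping a separate else-if branch.
fn wants_coalesce(plan_any: &dyn Any, other_conditions: bool) -> bool {
    other_conditions || plan_any.downcast_ref::<AsyncFuncExec>().is_some()
}

fn main() {
    // An async node qualifies even when the other conditions don't hold.
    assert!(wants_coalesce(&AsyncFuncExec, false));
    // A non-async node falls back to the other conditions alone.
    assert!(!wants_coalesce(&FilterExec, false));
    println!("ok");
}
```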
I considered adding it at first, but I realized that inside AsyncFuncExec, the CoalesceBatchesExec is wrapped around the first child, whereas for the other operators it's wrapped around the entire plan.
I'm not sure whether changing the order inside AsyncFuncExec to match the others would have any impact, so I kept it as is for now.
I'll look into it further, but if anyone already has context on why it’s done this way, I’d appreciate any insights.
I believe the idea is that for async functions, we are specifically interested in batching together inputs to the function so that ideally it is not called as often (calls can be expensive for async functions). Whereas coalesce batches in general considers the output of a node and, if it is too small, wraps that node in a coalesce.
So for async, we consider the input to the async node for coalesce logic; for other node types we look at their output for coalesce logic.
We should copy the comment from `coalesce_async_exec_input.rs` here to not lose that context, e.g.

> Coalesce inputs to async functions to reduce number of async function invocations
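The input-vs-output distinction described above can be illustrated with a toy plan tree. This is a simplified model under stated assumptions, not DataFusion's actual plan representation: `Plan` is an invented enum, and `apply_coalesce` only mimics where the coalesce wrapper lands in each case.

```rust
// Toy plan tree (not DataFusion's real types) illustrating where the
// coalesce wrapper is placed for async vs. other operators.
#[derive(Debug, PartialEq)]
enum Plan {
    Scan,
    AsyncFunc(Box<Plan>),
    Filter(Box<Plan>),
    Coalesce(Box<Plan>),
}

fn apply_coalesce(plan: Plan) -> Plan {
    match plan {
        // Async functions: coalesce the *input*, so the (expensive)
        // async UDF is invoked on fewer, larger batches.
        Plan::AsyncFunc(child) => {
            Plan::AsyncFunc(Box::new(Plan::Coalesce(child)))
        }
        // Other operators: wrap the node itself, so its *output*
        // batches are coalesced.
        other => Plan::Coalesce(Box::new(other)),
    }
}

fn main() {
    // Coalesce ends up below the async node (wrapping its input)...
    let async_plan = apply_coalesce(Plan::AsyncFunc(Box::new(Plan::Scan)));
    assert_eq!(
        async_plan,
        Plan::AsyncFunc(Box::new(Plan::Coalesce(Box::new(Plan::Scan))))
    );

    // ...but above any other node (wrapping its output).
    let filter_plan = apply_coalesce(Plan::Filter(Box::new(Plan::Scan)));
    assert_eq!(
        filter_plan,
        Plan::Coalesce(Box::new(Plan::Filter(Box::new(Plan::Scan))))
    );
    println!("ok");
}
```

This matches the plan shown in the existing `async_udf.slt` test below, where `CoalesceBatchesExec` sits under `AsyncFuncExec` rather than above it.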
Thanks for the quick reply. Just added the comment.
Jefffrey left a comment:
Seems fine to me, apart from a minor comment about some documentation.
We have existing tests for this in `datafusion/datafusion/sqllogictest/test_files/async_udf.slt`, lines 28 to 41 in `f162fd3`:
```
query TT
explain select min(async_abs(x)) from data;
----
logical_plan
01)Aggregate: groupBy=[[]], aggr=[[min(async_abs(data.x))]]
02)--TableScan: data projection=[x]
physical_plan
01)AggregateExec: mode=Final, gby=[], aggr=[min(async_abs(data.x))]
02)--CoalescePartitionsExec
03)----AggregateExec: mode=Partial, gby=[], aggr=[min(async_abs(data.x))]
04)------RepartitionExec: partitioning=RoundRobinBatch(4), input_partitions=1
05)--------AsyncFuncExec: async_expr=[async_expr(name=__async_fn_0, expr=async_abs(x@0))]
06)----------CoalesceBatchesExec: target_batch_size=8192
07)------------DataSourceExec: partitions=1, partition_sizes=[1]
```
Since they pass, it looks like this refactor works.
Which issue does this PR close?
Merge `CoalesceAsyncExecInput` physical optimizer rule into `CoalesceBatches` #18155.

Rationale for this change
What changes are included in this PR?
Merges the functionality of `CoalesceAsyncExecInput` into `CoalesceBatches` to remove redundant optimizer logic and simplify batch coalescing behavior.

Are these changes tested?
Behavior is covered by existing `CoalesceBatches` and optimizer tests.
Are there any user-facing changes?
No