Acero - Streaming Execution Engine#
Warning
Acero is experimental and a stable API is not yet guaranteed.
Acero is a streaming query engine, which allows the computation to be expressedas an “execution plan” (constructed using theDeclaration interface).This enables to create a computation composed ofpyarrow.compute functionsand to execute this efficiently in a batched manner.
| Helper class for declaring the nodes of an ExecPlan. |
Base class for the node options. | |
| A Source node which accepts a table. |
| A Source node which yields batches from a Dataset scan. |
| Make a node which excludes some rows from batches passed through it. |
| Make a node which executes expressions on input batches, producing batches of the same length with new columns. |
| Make a node which aggregates input batches, optionally grouped by keys. |
| Make a node which applies a new ordering to the data. |
| Make a node which implements join operation using hash join strategy. |
See also
- Substrait
Alternative way to run Acero from a standardized Substrait plan.

