pyarrow.acero.HashJoinNodeOptions#
- classpyarrow.acero.HashJoinNodeOptions(join_type,left_keys,right_keys,left_output=None,right_output=None,output_suffix_for_left='',output_suffix_for_right='',filter_expression=None)#
Bases:
_HashJoinNodeOptionsMake a node which implements join operation using hash join strategy.
This is the option class for the “hashjoin” node factory.
- Parameters:
- join_type
str Type of join. One of “left semi”, “right semi”, “left anti”,“right anti”, “inner”, “left outer”, “right outer”, “full outer”.
- left_keys
str,Expressionorlist Key fields from left input. Each key can be a string column nameor a field expression, or a list of such field references.
- right_keys
str,Expressionorlist Key fields from right input. Seeleft_keys for details.
- left_output
list, optional List of output fields passed from left input. If left and rightoutput fields are not specified, all valid fields from both left andright input will be output. Each field can be a string column nameor a field expression.
- right_output
list, optional List of output fields passed from right input. If left and rightoutput fields are not specified, all valid fields from both left andright input will be output. Each field can be a string column nameor a field expression.
- output_suffix_for_left
str Suffix added to names of output fields coming from left input(used to distinguish, if necessary, between fields of the samename in left and right input and can be left empty if there areno name collisions).
- output_suffix_for_right
str Suffix added to names of output fields coming from right input,seeoutput_suffix_for_left for details.
- filter_expression
pyarrow.compute.Expression Residual filter which is applied to matching row.
- join_type
- __init__(self,join_type,left_keys,right_keys,left_output=None,right_output=None,output_suffix_for_left='',output_suffix_for_right='',filter_expression=None)#
Methods
__init__(self, join_type, left_keys, right_keys)

