pyarrow.acero.HashJoinNodeOptions#

classpyarrow.acero.HashJoinNodeOptions(join_type,left_keys,right_keys,left_output=None,right_output=None,output_suffix_for_left='',output_suffix_for_right='',filter_expression=None)#

Bases:_HashJoinNodeOptions

Make a node which implements join operation using hash join strategy.

This is the option class for the “hashjoin” node factory.

Parameters:
join_typestr

Type of join. One of “left semi”, “right semi”, “left anti”,“right anti”, “inner”, “left outer”, “right outer”, “full outer”.

left_keysstr,Expression orlist

Key fields from left input. Each key can be a string column nameor a field expression, or a list of such field references.

right_keysstr,Expression orlist

Key fields from right input. Seeleft_keys for details.

left_outputlist, optional

List of output fields passed from left input. If left and rightoutput fields are not specified, all valid fields from both left andright input will be output. Each field can be a string column nameor a field expression.

right_outputlist, optional

List of output fields passed from right input. If left and rightoutput fields are not specified, all valid fields from both left andright input will be output. Each field can be a string column nameor a field expression.

output_suffix_for_leftstr

Suffix added to names of output fields coming from left input(used to distinguish, if necessary, between fields of the samename in left and right input and can be left empty if there areno name collisions).

output_suffix_for_rightstr

Suffix added to names of output fields coming from right input,seeoutput_suffix_for_left for details.

filter_expressionpyarrow.compute.Expression

Residual filter which is applied to matching row.

__init__(self,join_type,left_keys,right_keys,left_output=None,right_output=None,output_suffix_for_left='',output_suffix_for_right='',filter_expression=None)#

Methods

__init__(self, join_type, left_keys, right_keys)