bigframes.pandas.DataFrame.duplicated#

DataFrame.duplicated(subset=None,keep:str='first')Series[source]#

Return boolean Series denoting duplicate rows.

Considering certain columns is optional.

Parameters:
  • subset (column label orsequence oflabels,optional) – Only consider certain columns for identifying duplicates, bydefault use all of the columns.

  • keep ({'first','last',False},default 'first') –

    Determines which duplicates (if any) to mark.

    • first : Mark duplicates asTrue except for the first occurrence.

    • last : Mark duplicates asTrue except for the last occurrence.

    • False : Mark all duplicates asTrue.

Returns:

Boolean series for each duplicated rows.

Return type:

bigframes.pandas.Series

This Page