- API reference
- Series
- pandas.Serie...
pandas.Series.duplicated#
- Series.duplicated(keep='first')[source]#
Indicate duplicate Series values.
Duplicated values are indicated as
Truevalues in the resultingSeries. Either all duplicates, all except the first or all except thelast occurrence of duplicates can be indicated.- Parameters:
- keep{‘first’, ‘last’, False}, default ‘first’
Method to handle dropping duplicates:
‘first’ : Mark duplicates as
Trueexcept for the firstoccurrence.‘last’ : Mark duplicates as
Trueexcept for the lastoccurrence.False: Mark all duplicates asTrue.
- Returns:
- Series[bool]
Series indicating whether each value has occurred in thepreceding values.
See also
Index.duplicatedEquivalent method on pandas.Index.
DataFrame.duplicatedEquivalent method on pandas.DataFrame.
Series.drop_duplicatesRemove duplicate values from Series.
Examples
By default, for each set of duplicated values, the first occurrence isset on False and all others on True:
>>>animals=pd.Series(['llama','cow','llama','beetle','llama'])>>>animals.duplicated()0 False1 False2 True3 False4 Truedtype: bool
which is equivalent to
>>>animals.duplicated(keep='first')0 False1 False2 True3 False4 Truedtype: bool
By using ‘last’, the last occurrence of each set of duplicated valuesis set on False and all others on True:
>>>animals.duplicated(keep='last')0 True1 False2 True3 False4 Falsedtype: bool
By setting keep on
False, all duplicates are True:>>>animals.duplicated(keep=False)0 True1 False2 True3 False4 Truedtype: bool