pandas.Index.duplicated
- Index.duplicated(keep='first')
Indicate duplicate index values.
Duplicated values are indicated as True values in the resulting array. Either all duplicates, all except the first, or all except the last occurrence of duplicates can be indicated.
- Parameters:
- keep : {'first', 'last', False}, default 'first'
Determines which occurrence in each set of duplicates, if any, is left unmarked (False in the result).
'first' : Mark duplicates as True except for the first occurrence.
'last' : Mark duplicates as True except for the last occurrence.
False : Mark all duplicates as True.
- Returns:
- np.ndarray[bool]
See also
Series.duplicated : Equivalent method on pandas.Series.
DataFrame.duplicated : Equivalent method on pandas.DataFrame.
Index.drop_duplicates : Remove duplicate values from Index.
Examples
By default, for each set of duplicated values, the first occurrence is set to False and all others to True:
>>> idx = pd.Index(['lama', 'cow', 'lama', 'beetle', 'lama'])
>>> idx.duplicated()
array([False, False,  True, False,  True])
which is equivalent to
>>> idx.duplicated(keep='first')
array([False, False,  True, False,  True])
By using 'last', the last occurrence of each set of duplicated values is set to False and all others to True:
>>> idx.duplicated(keep='last')
array([ True, False,  True, False, False])
By setting keep to False, all duplicates are True:
>>> idx.duplicated(keep=False)
array([ True, False,  True, False,  True])
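Because the result is a boolean NumPy array, it can be used directly as a mask on the index. The following is a minimal sketch of that usage, not part of the reference examples above; the variable names (mask, first_seen, repeated, n_extra) are illustrative only.

```python
import numpy as np
import pandas as pd

idx = pd.Index(['lama', 'cow', 'lama', 'beetle', 'lama'])

# Boolean mask: True for every repeat after the first occurrence.
mask = idx.duplicated(keep='first')

# Inverting the mask keeps only the first occurrence of each value,
# which matches what Index.drop_duplicates() returns by default.
first_seen = idx[~mask]

# keep=False flags every member of a duplicated set, so this selects
# the values that occur more than once (deduplicated with unique()).
repeated = idx[idx.duplicated(keep=False)].unique()

# Summing the mask counts the surplus entries.
n_extra = int(mask.sum())

print(first_seen)
print(repeated)
print(n_extra)
```

Here first_seen holds 'lama', 'cow' and 'beetle', repeated holds only 'lama', and n_extra is 2. When only the deduplicated index is needed, Index.drop_duplicates is the more direct call; the mask is mainly useful when the positions of the duplicates matter.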