Indicate duplicate Series values.
Duplicated values are indicated as True
values in the resulting Series. Either all duplicates, all except the first or all except the last occurrence of duplicates can be indicated.
Method to handle dropping duplicates:
âfirstâ : Mark duplicates as True
except for the first occurrence.
âlastâ : Mark duplicates as True
except for the last occurrence.
False
: Mark all duplicates as True
.
Series indicating whether each value has occurred in the preceding values.
Examples
By default, for each set of duplicated values, the first occurrence is set on False and all others on True:
>>> animals = pd.Series(['llama', 'cow', 'llama', 'beetle', 'llama']) >>> animals.duplicated() 0 False 1 False 2 True 3 False 4 True dtype: bool
which is equivalent to
>>> animals.duplicated(keep='first') 0 False 1 False 2 True 3 False 4 True dtype: bool
By using âlastâ, the last occurrence of each set of duplicated values is set on False and all others on True:
>>> animals.duplicated(keep='last') 0 True 1 False 2 True 3 False 4 False dtype: bool
By setting keep on False
, all duplicates are True:
>>> animals.duplicated(keep=False) 0 True 1 False 2 True 3 False 4 True dtype: bool
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4