Which Python method can be used to Remove duplicates by Data scientist?
The drop_duplicates() method removes duplicate rows.
dataframe.drop_duplicates(subset, keep, inplace, ignore_index)
Remove duplicate rows from the DataFrame:
1. import pandas as pd
2. data = {
3. 'name': ['Peter', 'Mary', 'John', 'Mary'],
4. 'age': [50, 40, 30, 40],
5. 'qualified': [True, False, False, False]
6. }
7.
8. df = pd.DataFrame(data)
9. newdf = df.drop_duplicates()
Emelda
7 months agoBulah
6 months agoKirby
6 months agoFannie
6 months agoDolores
7 months agoVincent
7 months agoSolange
7 months agoBambi
7 months agoDenna
7 months agoMy
7 months agoGlory
7 months agoAudry
7 months agoDulce
7 months agoCandra
8 months agoSylvie
8 months agoPilar
7 months agoEdgar
7 months agoAmalia
7 months agoBarbra
8 months agoIvette
8 months agoLynelle
8 months agoKimberely
7 months agoDominic
7 months agoDominga
7 months agoHuey
7 months ago