Which Python method can a data scientist use to remove duplicates?
The pandas drop_duplicates() method removes duplicate rows from a DataFrame.
dataframe.drop_duplicates(subset, keep, inplace, ignore_index)
Remove duplicate rows from the DataFrame:
import pandas as pd

data = {
    'name': ['Peter', 'Mary', 'John', 'Mary'],
    'age': [50, 40, 30, 40],
    'qualified': [True, False, False, False]
}

df = pd.DataFrame(data)
# The second 'Mary' row repeats the first in every column, so it is dropped.
newdf = df.drop_duplicates()
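
The four parameters in the syntax line above can be illustrated with a short sketch. It reuses the same sample data. The variable names (keep_last, no_repeats, by_flag, renumbered, in_place) are only illustrative and not part of the original answer.

```python
import pandas as pd

df = pd.DataFrame({
    'name': ['Peter', 'Mary', 'John', 'Mary'],
    'age': [50, 40, 30, 40],
    'qualified': [True, False, False, False],
})

# keep='last' keeps the last occurrence of each duplicated row
# (the default, keep='first', keeps the first occurrence).
keep_last = df.drop_duplicates(keep='last')

# keep=False drops every row that has a duplicate, leaving only unique rows.
no_repeats = df.drop_duplicates(keep=False)

# subset limits the comparison to the listed columns.
by_flag = df.drop_duplicates(subset=['qualified'])

# ignore_index=True renumbers the result 0, 1, 2, ...
renumbered = df.drop_duplicates(keep='last', ignore_index=True)

# inplace=True modifies the DataFrame itself and returns None.
in_place = df.copy()
result = in_place.drop_duplicates(inplace=True)

print(keep_last)
print(no_repeats)
print(by_flag)
print(renumbered)
print(in_place)
```

Without inplace=True the original DataFrame is left unchanged and a new one is returned, which is usually the safer choice when the raw data is still needed.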