Unique values based on multiple columns

Question

I have 2 columns which contains duplicate entries. See example below. I want to remove duplicates from both columns Original Column

MatchN  Striker
1000887 DA Warner
1000887 DA Warner
1000887 TM Head
1000887 TM Head

I would like to finally get the result as

MatchN  Striker
1000887 DA Warner
1000887 TM Head

I tried using

np.df[["MatchN"],["Striker"]].unique()

but it does not work.

Can anyone please suggest best way to get to the desired result?

MaxU - stand with Ukraine · Accepted Answer · 2017-06-28 17:32:52Z

4

In [69]: df = df.drop_duplicates(['MatchN','Striker'])

In [70]: df
Out[70]:
    MatchN    Striker
0  1000887  DA Warner
2  1000887    TM Head

answered Jun 28, 2017 at 17:27

212k37 gold badges402 silver badges436 bronze badges

Sign up to request clarification or add additional context in comments.

3 Comments

df.drop_duplicates(["MatchN"],["Striker"]) does not work

@AnoopMahajan, you should have posted a reproducible data set... Pleaase check updated answer

@AnoopMahajan, glad i could help :)