How to compare columns of two different data frames and keep the common values

Question

I have two data frames with same column but different values, out of which some are same and some are different. I want to compare both columns and keep the common values.

df1 :

df2 :

This is what I am expecting after comparison

df2 :

Perhaps this can be useful? stackoverflow.com/questions/26921943/… — Ardweaden
– Ardweaden, Commented Jul 26, 2019 at 11:53
I guess you want to use something like merging df1 and df2 based on the column A — Jules
– Jules, Commented Jul 26, 2019 at 11:56
I don't want to merge them, I want to save the common points only. — Bhavishya
– Bhavishya, Commented Jul 26, 2019 at 12:20

anky · Accepted Answer · 2019-07-26 11:56:15Z

1

You can use pd.Index.intersection() to find the matching columns and do a inner merge finally reindex() to keep df2.columns:

match=df2.columns.intersection(df1.columns).tolist() #finds matching cols in both df
df2.merge(df1,on=match).reindex(df2.columns,axis=1) #merge and reindex to df2.columns

answered Jul 26, 2019 at 11:56

anky

75.3k11 gold badges46 silver badges76 bronze badges

Sign up to request clarification or add additional context in comments.

3 Comments

Bhavishya Over a year ago

Using this, I am getting the merged values and not the common ones.

anky Over a year ago

@Bhavishya what is the difference? can you elaborate, this gets exactly what you posted as desired answer

Bhavishya Over a year ago

I have around 250k entries in df1 and around 1.1m entries in df2, so after keeping just the common values I should get less than or equal to 250k entries in df2 but I'm getting around 1m entries instead.

Collectives™ on Stack Overflow

How to compare columns of two different data frames and keep the common values

1 Answer 1

3 Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

3 Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related