Remove rows of one Dataframe based on one column of another dataframe

Question

I got two DataFrame and want remove rows in df1 where we have same value in column 'a' in df2. Moreover one common value in df2 will only remove one row.

df1 = pd.DataFrame({'a':[1,1,2,3,4,4],'b':[1,2,3,4,5,6],'c':[6,5,4,3,2,1]})
df2 = pd.DataFrame({'a':[2,4,2],'b':[1,2,3],'c':[6,5,4]})
result = pd.DataFrame({'a':[1,1,3,4],'b':[1,2,4,6],'c':[6,5,3,1]})

Check with isin ~

BENY
– BENY

2020-08-24 16:08:37 +00:00
Commented Aug 24, 2020 at 16:08 — BENY
– BENY, Commented Aug 24, 2020 at 16:08

Shubham Sharma · Accepted Answer · 2020-08-24 16:36:34Z

3

Use Series.isin + Series.duplicated to create a boolean mask and use this mask to filter the rows from df1:

m = df1['a'].isin(df2['a']) & ~df1['a'].duplicated()
df = df1[~m]

Result:

answered Aug 24, 2020 at 16:36

Shubham Sharma

71.8k6 gold badges26 silver badges58 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

EMiller · Accepted Answer · 2020-08-24 16:08:45Z

0

Try This:

import pandas as pd
df1=pd.DataFrame({'a':[1,1,2,3,4,4],'b':[1,2,3,4,5,6],'c':[6,5,4,3,2,1]})
df2=pd.DataFrame({'a':[2,4,2],'b':[1,2,3],'c':[6,5,4]})
df2a = df2['a'].tolist()
def remove_df2_dup(x):
    if x in df2a:
        df2a.remove(x)
        return False
    return True
df1[df1.a.apply(remove_df2_dup)]

It creates a list from df2['a'], then checks that list against each value of df1['a'], removing values from the list each time there's a match in df1

answered Aug 24, 2020 at 16:08

EMiller

8391 gold badge7 silver badges21 bronze badges

Comments

Kuldip Chaudhari · Accepted Answer · 2020-08-24 16:22:03Z

0

try this

df1=pd.DataFrame({'a':[1,1,2,3,4,4],'b':[1,2,3,4,5,6],'c':[6,5,4,3,2,1]})
df2=pd.DataFrame({'a':[2,4,2],'b':[1,2,3],'c':[6,5,4]})

for x in df2.a:
    if x in df1.a:
        df1.drop(df1[df1.a==x].index[0], inplace=True)

print(df1)

answered Aug 24, 2020 at 16:22

Kuldip Chaudhari

1,1146 silver badges8 bronze badges

Collectives™ on Stack Overflow

Remove rows of one Dataframe based on one column of another dataframe

3 Answers 3

Comments

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related