Delete rows with a certain value in Python and Pandas

Question

I want to delete rows that contain certain values. The values I want to remove contain a "+", for example:

cooperative+parallel
passive+prosocial

My dataset consists of 900,000 rows, and about 2,000 of them contain values like the ones above.

I would like code along these lines:

df = df[df.columnname != '+']

The above is for one column (it's not working well), but I would also like an example for the whole dataset.

I would prefer a solution in pandas.

Many thanks

jezrael · Accepted Answer · 2020-11-19 08:37:02Z

3

Use Series.str.contains and invert the mask with ~. Escape the +, because it is a special regex character. To cover every column, select the object (string) columns with DataFrame.select_dtypes, run the test on each of them with DataFrame.apply, and use DataFrame.any(axis=1) to flag rows with at least one match:

df1 = df[~df.select_dtypes(object).apply(lambda x: x.str.contains(r'\+')).any(axis=1)]

Or use regex=False, in which case the + is matched literally and must not be escaped:

df1 = df[~df.select_dtypes(object).apply(lambda x: x.str.contains('+', regex=False)).any(axis=1)]
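To make the whole-DataFrame version concrete, here is a minimal sketch on a small made-up frame. The column names and values are hypothetical, the text columns are built with dtype=object so that select_dtypes picks them up as in the answer, and na=False is an added assumption so that missing values count as "no match" rather than propagating as NaN.

```python
import pandas as pd

# Hypothetical sample data; column names and values are illustrative only.
df = pd.DataFrame({
    "style": pd.Series(["cooperative+parallel", "passive", "prosocial", None], dtype=object),
    "label": pd.Series(["a", "b+c", "d", "e"], dtype=object),
    "score": [1, 2, 3, 4],
})

# Restrict the test to object (string) columns; .str is not available on numeric ones.
text_cols = df.select_dtypes(include="object")

# Boolean frame: True where a cell contains a literal "+".
# na=False (an assumption, not in the answer) treats missing values as non-matching.
has_plus = text_cols.apply(lambda col: col.str.contains("+", regex=False, na=False))

# Keep only the rows where no text column contains "+".
df1 = df[~has_plus.any(axis=1)]

print(df1)
```

Rows are dropped if any text column contains a "+", so a row can be removed because of a column other than the one originally in question.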

answered Nov 19, 2020 at 8:31

1 Comment

Thanks, this works for one column. Could you please provide me an example for the full dataset also?

Artyom Akselrod · Accepted Answer · 2020-11-19 08:31:23Z

0

df = df[~df['columnname'].str.contains('+', regex=False)]
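As a rough illustration of the single-column case, the sketch below uses invented data in a column named columnname (taken from the question) plus a hypothetical helper column n. It also shows why the comparison in the question removes nothing: != '+' tests whether the entire value equals "+", not whether it contains one. The na=False argument is an added assumption to handle missing values.

```python
import pandas as pd

# Hypothetical sample data; "n" is an illustrative extra column.
df = pd.DataFrame({
    "columnname": pd.Series(
        ["cooperative+parallel", "passive", None, "passive+prosocial"], dtype=object
    ),
    "n": [1, 2, 3, 4],
})

# The attempt from the question compares whole values with "+",
# so every row passes and nothing is removed.
whole_value_test = df["columnname"] != "+"

# Substring test for a literal "+"; na=False treats missing values as non-matching.
mask = df["columnname"].str.contains("+", regex=False, na=False)

# Keep rows whose value does not contain "+".
filtered = df[~mask]

print(filtered)
```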

answered Nov 19, 2020 at 8:31
