Remove '()' from a pandas DataFrame in Python

Question

I have a DataFrame that contains values like:

the change (♠)
and the new (⦻)

My desired output is:

the change
and the new

I have tried to use

df.columns = df.columns.str.strip(' ()')
df=df.replace('\()','',regex=False)

but neither worked. Can anyone help? Thanks

Try df = df.replace(r'\(\)', '', regex=True)

– Vedant Vasishtha, Commented Jun 8, 2021 at 16:18
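
For context, here is a minimal sketch of why the attempts in the question leave the data unchanged. The frame and the column name `text` are made up for illustration. `df.columns.str.strip(...)` only touches the column labels, and `replace` with `regex=False` only swaps cells whose entire value equals the search string, so a "()" inside a longer string is never matched.

```python
import pandas as pd

# Illustrative data; the column name "text" is an assumption.
df = pd.DataFrame({"text": ["the change ()", "and the new ()"]})

# regex=False compares whole cell values, so a substring is never touched.
unchanged = df.replace("()", "", regex=False)

# regex=True substitutes inside each string; the parentheses must be escaped,
# and a raw string keeps the backslashes intact.
cleaned = df.replace(r"\s*\(\)", "", regex=True)

print(unchanged["text"].tolist())  # ['the change ()', 'and the new ()']
print(cleaned["text"].tolist())    # ['the change', 'and the new']
```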

Andrej Kesely · Accepted Answer · 2021-06-09 09:14:36Z

2

If you have this DataFrame:

              col1           col2
0           value1  the change ()
1    the change ()         value3
2           value2         value4
3  () other change            NaN

You can remove the () across the whole DataFrame:

df = df.apply(lambda x: x.str.replace(r"\s*\(\)\s*", "", regex=True))
print(df)

Prints:

           col1        col2
0        value1  the change
1    the change      value3
2        value2      value4
3  other change         NaN

EDIT: If you have df:

              col1            col2
0           value1  the change (♠)
1   the change (⦻)          value3
2           value2          value4
3  () other change             NaN

Then:

df = df.apply(lambda x: x.str.replace(r"\s*\(.*?\)\s*", "", regex=True))
print(df)

Prints:

           col1        col2
0        value1  the change
1    the change      value3
2        value2      value4
3  other change         NaN

edited Jun 9, 2021 at 9:14

answered Jun 8, 2021 at 16:21

8 Comments

sdave Over a year ago

I have made a small update to the question. Can you help again? Thanks

sdave Over a year ago

Then I get this error: "AttributeError: Can only use .str accessor with string values!"

Andrej Kesely Over a year ago

@sdave You probably have some numeric columns. You can apply it to a single string column instead: df['column name'] = df['column name'].str.replace(r"\s*\(.*?\)\s*", "", regex=True)
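
One way to sidestep that AttributeError on a frame with mixed dtypes is to use the `.str` accessor only on columns that actually hold strings. The sketch below assumes a made-up frame with a numeric `count` column; the column names and the `text_cols` helper are illustrative, not from the question.

```python
import pandas as pd

# Illustrative frame with one numeric column; names are assumptions.
df = pd.DataFrame({
    "col1": ["value1", "the change (⦻)", "() other change"],
    "count": [1, 2, 3],
})

pattern = r"\s*\(.*?\)\s*"

# Keep only the columns whose values are strings.
text_cols = [c for c in df.columns if pd.api.types.is_string_dtype(df[c])]

# Use .str.replace column by column; numeric columns are left alone.
df[text_cols] = df[text_cols].apply(
    lambda col: col.str.replace(pattern, "", regex=True)
)

print(df)
```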

sdave Over a year ago

I already tried the way you suggested in the comment above. It worked this way: df = df.replace(r"\s*\(.*?\)\s*", '', regex=True, inplace=False). I am double-checking the results, since the table is huge, and will update afterwards. But I am not sure how inplace=False worked.
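
On the `inplace` question, a small sketch with made-up data: `inplace=False` is the default for `DataFrame.replace`, so the call returns a new frame and the assignment `df = ...` is what keeps the result. `DataFrame.replace` with `regex=True` also skips non-string cells, which would explain why it runs without the `.str` error on numeric columns.

```python
import pandas as pd

# Illustrative frame; column names are assumptions.
df = pd.DataFrame({"col1": ["the change (♠)", "value2"], "n": [1, 2]})
pattern = r"\s*\(.*?\)\s*"

# inplace=False (the default): df is untouched, the result is a new frame.
result = df.replace(pattern, "", regex=True, inplace=False)

# inplace=True: the frame itself is modified, so no assignment is needed.
modified = df.copy()
modified.replace(pattern, "", regex=True, inplace=True)

print(df["col1"].tolist())        # original still contains "(♠)"
print(result["col1"].tolist())    # cleaned copy
print(modified["col1"].tolist())  # cleaned in place
```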

Andrej Kesely Over a year ago

@sdave This matches all text inside the ( ). You can experiment with the regular expression here, for example: regex101.com/r/87WnG2/1
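
To illustrate what the `?` in `.*?` does, here is a short sketch using the standard `re` module on a made-up string with two bracketed groups. The single-space replacement and the final `strip()` are choices made here only to keep the words apart; the answers above replace with an empty string.

```python
import re

# Made-up sample with two bracketed groups in one string.
text = "the change (♠) and the new (⦻)"

# Lazy: .*? stops at the first closing parenthesis, so each group is
# removed separately.
lazy = re.sub(r"\s*\(.*?\)\s*", " ", text).strip()

# Greedy: .* runs to the last closing parenthesis, so everything between
# the first "(" and the last ")" is removed in one match.
greedy = re.sub(r"\s*\(.*\)\s*", " ", text).strip()

print(lazy)    # the change and the new
print(greedy)  # the change
```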

DumbCoder · Accepted Answer · 2021-06-08 16:48:32Z

1

You are almost there. Just set regex=True in your code and adjust the pattern so that it removes the surrounding spaces as well.

Input dataset

         col1             col2
0   change ()             val1
1        val2  samplestring ()
2  change 2()          val 5()

df.replace(r'\s*\(\s*\)\s*', '', regex = True, inplace = True)

Output dataset:

       col1          col2
0    change          val1
1      val2  samplestring
2  change 2         val 5

answered Jun 8, 2021 at 16:48

6 Comments

sdave Over a year ago

I have updated my question. Can you have a look, please?

sdave Over a year ago

In this case, we get an empty DataFrame :(

DumbCoder Over a year ago

Yes, because it was only looking for blank spaces between (). You can try this: df.replace(r'\s*\(.*?\)\s*', '', regex = True, inplace = True)
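
The difference between the two patterns can be seen on a small made-up frame (the column name `col1` and the values are illustrative): the first pattern only matches parentheses that are empty or contain whitespace, while the second also matches parentheses with content.

```python
import pandas as pd

# Illustrative data mixing empty and non-empty parentheses.
df = pd.DataFrame({"col1": ["change ()", "the change (♠)", "val2"]})

# Matches "()" or "( )" only, so "(♠)" is left in place.
only_empty = df.replace(r"\s*\(\s*\)\s*", "", regex=True)

# Matches parentheses with any content, including none.
any_content = df.replace(r"\s*\(.*?\)\s*", "", regex=True)

print(only_empty["col1"].tolist())   # ['change', 'the change (♠)', 'val2']
print(any_content["col1"].tolist())  # ['change', 'the change', 'val2']
```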

sdave Over a year ago

stackoverflow.com/questions/67907458/… can you have a look at this please

sdave Over a year ago

It worked this way: df = df.replace(r"\s*\(.*?\)\s*", '', regex=True, inplace=False). But I am not sure how inplace=False worked?
