Replace entire cell with string if it consists a particular string

Question

I have a Dataframe df:

name     rank
A    captain, general, soldier
B    general, foo, major
C    foo
D    captain, major
E    foo, foo, foo

I want to check if any cell in the column rank consists of foo and if it does replace the whole cell with foo.

Expected output:

name     rank
A    captain, general, soldier
B    foo
C    foo
D    captain, major
E    foo

How can I do this?

Does this answer your question? Check if string is in a pandas dataframe — pho
– pho, Commented Jun 20, 2022 at 5:31

BeRT2me · Accepted Answer · 2022-06-20 06:55:12Z

1

df['rank'].replace('.*foo.*', 'foo', regex=True, inplace=True)
# OR
df['rank'].mask(df['rank'].str.contains('foo'), 'foo', inplace=True)
# OR
df.loc[df['rank'].str.contains('foo'), 'rank'] = 'foo'

Output:

  name                       rank
0    A  captain, general, soldier
1    B                        foo
2    C                        foo
3    D             captain, major
4    E                        foo

edited Jun 20, 2022 at 6:55

answered Jun 20, 2022 at 6:41

BeRT2me

13.3k2 gold badges18 silver badges39 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

Whole Brain · Accepted Answer · 2022-06-20 07:00:13Z

1

You can apply a lambda function to the column :

df["rank"] = df["rank"].apply(lambda x: "foo" if "foo" in x.split(", ") else x)

Splitting on the separator allows to check for words. For example, the world "foobar" wouldn't trigger the transformation on its row.

Edit: Thanks to BeRT2me for suggesting to split by ', '.

edited Jun 20, 2022 at 7:00

answered Jun 20, 2022 at 6:32

Whole Brain

2,1772 gold badges13 silver badges20 bronze badges

Comments

Smaurya · Accepted Answer · 2022-06-20 07:02:54Z

0

mask = df['rank'].str.contains('foo')
df.loc[mask, 'rank'] = 'foo'

answered Jun 20, 2022 at 7:02

Smaurya

18710 bronze badges

Comments

new2cod3 · Accepted Answer · 2022-06-20 05:53:21Z

-1

if df['rank'].str.contains('foo').any():
 df['rank']='foo'

answered Jun 20, 2022 at 5:53

new2cod3

317 bronze badges

1 Comment

Yantra Logistics Over a year ago

wouldn't that just replace the whole series with foo

Collectives™ on Stack Overflow

Replace entire cell with string if it consists a particular string

4 Answers 4

Comments

Comments

Comments

1 Comment

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

4 Answers 4

Comments

Comments

Comments

1 Comment

Your Answer

Sign up or log in

Post as a guest

Linked

Related