Replacing values greater than a number in pandas dataframe

Question

I have a large dataframe which looks as:

df1['A'].ix[1:3]
2017-01-01 02:00:00    [33, 34, 39]
2017-01-01 03:00:00    [3, 43, 9]

I want to replace each element greater than 9 with 11.

So, the desired output for above example is:

df1['A'].ix[1:3]
2017-01-01 02:00:00    [11, 11, 11]
2017-01-01 03:00:00    [3, 11, 9]

Edit:

My actual dataframe has about 20,000 rows and each row has list of size 2000.

Is there a way to use numpy.minimum function for each row? I assume that it will be faster than list comprehension method?

So values are not in list? Ithink df[df > 9] = 11 solution is wrong. Or something missing? — jezrael
– jezrael, Commented Jan 8, 2019 at 14:58

Edouard Cuny · Accepted Answer · 2018-10-02 09:10:24Z

67

Very simply : df[df > 9] = 11

answered Oct 2, 2018 at 9:10

Edouard Cuny

1,2801 gold badge11 silver badges10 bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

Estatistics Over a year ago

in 2024, in python 3, dont work

jezrael · Accepted Answer · 2017-05-03 11:05:57Z

47

You can use apply with list comprehension:

df1['A'] = df1['A'].apply(lambda x: [y if y <= 9 else 11 for y in x])
print (df1)
                                A
2017-01-01 02:00:00  [11, 11, 11]
2017-01-01 03:00:00    [3, 11, 9]

Faster solution is first convert to numpy array and then use numpy.where:

a = np.array(df1['A'].values.tolist())
print (a)
[[33 34 39]
 [ 3 43  9]]

df1['A'] = np.where(a > 9, 11, a).tolist()
print (df1)
                                A
2017-01-01 02:00:00  [11, 11, 11]
2017-01-01 03:00:00    [3, 11, 9]

edited May 3, 2017 at 11:05

answered May 3, 2017 at 10:55

jezrael

868k103 gold badges1.4k silver badges1.3k bronze badges

2 Comments

obscuredbyclouds Over a year ago

This method replaces nan values with the number following else which is not something I want to do.

Superdooperhero Over a year ago

First one gives me: TypeError: 'int' object is not iterable

tdy · Accepted Answer · 2021-03-28 00:07:08Z

42

I know this is an old post, but pandas now supports DataFrame.where directly. In your example:

df.where(df <= 9, 11, inplace=True)

Please note that pandas' where is different than numpy.where. In pandas, when the condition == True, the current value in the dataframe is used. When condition == False, the other value is taken.

EDIT:

You can achieve the same for just a column with Series.where:

df['A'].where(df['A'] <= 9, 11, inplace=True)

edited Mar 28, 2021 at 0:07

tdy

42k42 gold badges124 silver badges125 bronze badges

answered Mar 27, 2021 at 23:31

kpetrou

4414 silver badges3 bronze badges

Comments

D.Griffiths · Accepted Answer · 2019-01-29 17:06:54Z

27

You can use numpy indexing, accessed through the .values function.

df['col'].values[df['col'].values > x] = y

where you are replacing any value greater than x with the value of y.

So for the example in the question:

df1['A'].values[df1['A'] > 9] = 11

answered Jan 29, 2019 at 17:06

D.Griffiths

2,3373 gold badges19 silver badges30 bronze badges

1 Comment

f.thorpe Over a year ago

This was the best solution I could find that worked as expected.

CFW · Accepted Answer · 2019-09-18 08:07:09Z

6

I came for a solution to replacing each element larger than h by 1 else 0, which has the simple solution:

df = (df > h) * 1

(This does not solve the OP's question as all df <= h are replaced by 0.)

answered Sep 18, 2019 at 8:07

CFW

3141 gold badge5 silver badges13 bronze badges

2 Comments

Geshode Over a year ago

Why do you write it as an answer, if it doesn't answer the OP's question?

CFW Over a year ago

Because the title (which led me and potentially others to come here) is imprecise and could imply this answer.

Collectives™ on Stack Overflow

Replacing values greater than a number in pandas dataframe

5 Answers 5

1 Comment

2 Comments

Comments

1 Comment

2 Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

1 Comment

2 Comments

Comments

1 Comment

2 Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related