Replace all inf, -inf values with NaN in a pandas dataframe

Question

I have a large dataframe with inf, -inf values in different columns. I want to replace all inf, -inf values with NaN

I can do so column by column. So this works:

df['column name'] = df['column name'].replace(np.inf, np.nan)

But my code to do so in one go across the dataframe does not.

df.replace([np.inf, -np.inf], np.nan)

The output does not replace the inf values

tdy · Accepted Answer · 2024-04-29 02:00:26Z

25

TL;DR

df.replace is fastest for replacing ±inf
but you can avoid replacing altogether by just setting mode.use_inf_as_na (deprecated in v2.1.0)

Replacing `inf` and `-inf`

df = df.replace([np.inf, -np.inf], np.nan)

Just make sure to assign the results back. (Don't use the inplace approach, which is being deprecated in PDEP-8.)

There are other df.applymap options, but df.replace is fastest:

df = df.applymap(lambda x: np.nan if x in [np.inf, -np.inf] else x)
df = df.applymap(lambda x: np.nan if np.isinf(x) else x)
df = df.applymap(lambda x: x if np.isfinite(x) else np.nan)

Setting `mode.use_inf_as_na` (deprecated)

Deprecated in pandas 2.1.0
Will be removed in pandas 3.0

Note that we don't actually have to modify df at all. Setting mode.use_inf_as_na will simply change the way inf and -inf are interpreted:

True means treat None, nan, -inf, inf as null
False means None and nan are null, but inf, -inf are not null (default)

Either enable globally

pd.set_option('mode.use_inf_as_na', True)

Or locally via context manager

with pd.option_context('mode.use_inf_as_na', True):
    ...

edited Apr 29, 2024 at 2:00

answered Apr 2, 2021 at 17:17

tdy

42k42 gold badges124 silver badges125 bronze badges

Sign up to request clarification or add additional context in comments.

3 Comments

Volkov Maxim Over a year ago

Use case: when I has set mode.use_inf_as_na I got error "ValueError: Input X contains infinity or a value too large for dtype('float64')." from MinMaxScaler. After it I was back to df.replace().

Bohdan Pylypenko Over a year ago

mode.use_inf_as_na changes only representation of np.inf and np.NINF. But under the hood it still stores them as ±inf. So, if you want to get rid of them, you need to use replace().

svenmk Over a year ago

mode.use_inf_as_na is flagged as deprecated (see: github.com/pandas-dev/pandas/issues/34093 and github.com/pandas-dev/pandas/issues/51684). So it is better to not use it anymore.

sophocles · Accepted Answer · 2021-12-23 22:22:27Z

6

pandas.Series.replace doesn't happen in-place.

So the problem with your code to replace the whole dataframe does not work because you need to assign it back or, add inplace=True as a parameter. That's also why your column by column works, because you are assigning it back to the column df['column name'] = ...

Therefore, change df.replace([np.inf, -np.inf], np.nan) to either:

df.replace([np.inf, -np.inf], np.nan,inplace=True)

Or assign back to a new dataframe:

df = df.replace([np.inf, -np.inf], np.nan)

edited Dec 23, 2021 at 22:22

answered Apr 2, 2021 at 17:18

sophocles

13.9k3 gold badges18 silver badges37 bronze badges

6 Comments

postcolonialist Over a year ago

Hmm...I am getting an TypeError: unhashable type: 'list' for both the choices that you gave.

sophocles Over a year ago

Very strange, I am currently running it on my machine and it works. What pandas version are you using?

postcolonialist Over a year ago

Version - Python 3.8.0

sophocles Over a year ago

and of pandas? pd. __version__ ?

sophocles Over a year ago

I believe it has something to do with your pandas version. I use 1.2.0. Maybe it's time to update it :). I posted a picture in my answer to illustrate.

|

Collectives™ on Stack Overflow

Replace all inf, -inf values with NaN in a pandas dataframe

2 Answers 2

TL;DR

Replacing `inf` and `-inf`

Setting `mode.use_inf_as_na` (deprecated)

3 Comments

6 Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

TL;DR

Replacing inf and -inf

Setting mode.use_inf_as_na (deprecated)

3 Comments

6 Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related

Replacing `inf` and `-inf`

Setting `mode.use_inf_as_na` (deprecated)