Replace Value when all content is digit python

Question

Below is a sample of my df

id  name 
01  1
02  23 2    
03  234     
04  23423   
05  24 H AUTOSERVICE    
06  25 SUNGLASS

The aim is to 'clean' the DF by replacing digits with NaN only if the whole value contains digits.

The expected output would look like this

id  name 
01  NaN
02  24 H AUTOSERVICE    
03  25 SUNGLASS

I was thinking about something like this. Besides, it would remove all digits even 24 H

 df['name'] = df['name'].replace(r'[0-9]', '')

Thanks for anyone helping!

jezrael · Accepted Answer · 2020-05-06 05:43:39Z

2

First step is with Series.str.contains with negative selection [] of numbers and also whitespace \s and Series.where:

df['name'] = df['name'].where(df['name'].str.contains('[^0-9\s]'))
print (df)
   id              name
0   1               NaN
1   2               NaN
2   3               NaN
3   4               NaN
4   5  24 H AUTOSERVICE
5   6       25 SUNGLASS

For remove consecutive NaNs:

m = df['name'].isna()
df = df[m.ne(m.shift()) | ~m]
print (df)
   id              name
0  01               NaN
4  05  24 H AUTOSERVICE
5  06       25 SUNGLASS

edited May 6, 2020 at 5:43

answered May 6, 2020 at 5:29

jezrael

868k103 gold badges1.4k silver badges1.3k bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

A2N15 Over a year ago

Using the Mask and isnumeric doesnt solve this issue when the value is : 123 456?

Collectives™ on Stack Overflow

Replace Value when all content is digit python

1 Answer 1

1 Comment

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

1 Comment

Your Answer

Sign up or log in

Post as a guest

Related