How to remove words containing only numbers in Python?

Question

I have some text in Python that is made up of numbers and letters. Something like this:

s = "12 word word2"

From the string s, I want to remove all the words that contain only digits.

So I want the result to be:

s = "word word2"

This is the regex I have, but it also acts on letters, i.e. it replaces each letter with a space.

re.sub('[\ 0-9\ ]+', ' ', line)

Can someone tell me what is wrong with it? Also, is there a more time-efficient way to do this than a regex?

Thanks!

anubhava · Accepted Answer · 2016-10-13 11:58:06Z

10

You can use this regex:

>>> import re
>>> s = "12 word word2"
>>> print(re.sub(r'\b[0-9]+\b\s*', '', s))
word word2

\b matches a word boundary, so [0-9]+ can only match a whole word made of digits, and \s* removes zero or more whitespace characters after that number word.
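To make the difference concrete, here is a small sketch that runs both patterns on a few inputs. The extra test strings and the name number_word are illustrative, not from the original post. The question's pattern is a character class of spaces and digits, so it also eats the digit in "word2" and the spaces between words. Note too that the \b version only removes whitespace after a number, so a number at the end of the string leaves a trailing space.

```python
import re

s = "12 word word2"

# Pattern from the question, written as a raw string: any run of spaces and/or digits.
print(repr(re.sub(r'[ 0-9]+', ' ', s)))              # ' word word '

# Pattern from this answer: a whole word of digits plus any whitespace after it.
number_word = re.compile(r'\b[0-9]+\b\s*')
print(repr(number_word.sub('', s)))                  # 'word word2'
print(repr(number_word.sub('', "word 12 word2")))    # 'word word2'
print(repr(number_word.sub('', "word 12")))          # 'word ' (trailing space remains)
```

If the trailing space matters, calling .strip() on the result is one simple way to drop it.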


2 Comments

silent_dev Over a year ago

Thanks for the answer. Can the regex be modified to also remove all punctuation/special symbols, as follows? re.sub(r'\b[!~`.,/<>]+\b\s*', '', s)

anubhava Over a year ago

Sure, you can use re.sub(r'\b[0-9]+\b\W*', '', s), since \W matches a space or any other non-word character.
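A quick sketch of what the \W* variant does, with made-up inputs. It strips punctuation only when it directly follows a number word; punctuation elsewhere is left alone. The pattern proposed in the comment above would match only punctuation sandwiched directly between two word characters, because \b needs a word character on one side. If the goal is to drop punctuation everywhere, one possible approach (an assumption about what "punctuation" should mean here) is to remove anything that is neither a word character nor whitespace.

```python
import re

# Number words plus any non-word characters that follow them.
after_number = re.compile(r'\b[0-9]+\b\W*')
print(after_number.sub('', "12, word word2!"))    # word word2!

# Assumption: "punctuation" = anything that is not a word character or whitespace.
punctuation = re.compile(r'[^\w\s]+')
print(punctuation.sub('', "hi, there! 12."))      # hi there 12
```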

Jon Clements · Accepted Answer · 2016-10-13 12:00:33Z

7

Using a regex is probably a bit overkill here, depending on whether you need to preserve whitespace:

s = "12 word word2"
s2 = ' '.join(word for word in s.split() if not word.isdigit())
# 'word word2'
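A short sketch of the trade-offs, using a hypothetical helper name drop_number_words and made-up inputs. Because split() with no argument splits on any run of whitespace, the original spacing (double spaces, tabs) is collapsed to single spaces. Also, isdigit() is true only for strings made entirely of digit characters, so signed or decimal numbers such as "-12" and "3.14" are kept.

```python
def drop_number_words(text):
    # Keep every whitespace-separated word that is not made up solely of digits.
    return ' '.join(word for word in text.split() if not word.isdigit())

print(drop_number_words("12  word\tword2"))    # word word2
print(drop_number_words("-12 3.14 42 ok"))     # -12 3.14 ok
```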


Stam Kaly · Accepted Answer · 2016-10-13 12:02:48Z

1

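Without importing any library, you could do:

stringToFormat = "12 word word2"
kept = []
for word in stringToFormat.split(" "):
    try:
        int(word)  # succeeds only for integer-like words, which are dropped
    except ValueError:
        kept.append(word)  # not an integer, so keep the word
words = " ".join(kept)  # joining avoids a trailing space
print(words)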
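On the speed part of the question, none of the answers include measurements, so the sketch below is only one way to check on your own data. The sample text, the repeat counts, and the function names with_regex and with_split are all assumptions for illustration. Results will vary with input size and Python version.

```python
import re
import timeit

# Hypothetical sample input, repeated so the timing is measurable.
text = "12 word word2 345 another 6789 " * 1000

pattern = re.compile(r'\b[0-9]+\b\s*')

def with_regex(s):
    # Regex approach: remove digit-only words and the whitespace after them.
    return pattern.sub('', s)

def with_split(s):
    # split/join approach: keep words that are not purely digits.
    return ' '.join(w for w in s.split() if not w.isdigit())

if __name__ == "__main__":
    for fn in (with_regex, with_split):
        seconds = timeit.timeit(lambda: fn(text), number=100)
        print(f"{fn.__name__}: {seconds:.3f}s")
```

The two functions are not exact equivalents: the regex version keeps the original spacing elsewhere and can leave a trailing space, while the split version normalizes all whitespace to single spaces.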
