Strip spaces/tabs/newlines - python

Question

I am trying to remove all spaces/tabs/newlines in python 2.7 on Linux.

I wrote this, that should do the job:

myString="I want to Remove all white \t spaces, new lines \n and tabs \t"
myString = myString.strip(' \n\t')
print myString

output:

I want to Remove all white   spaces, new lines 
 and tabs

It seems like a simple thing to do, yet I am missing here something. Should I be importing something?

Check out the answer to this related question: stackoverflow.com/questions/1185524/… strip() removes only leading and trailing characters, not ALL characters. — dckrooney
– dckrooney, Commented May 22, 2012 at 22:40
This worked for me, from the: [How to trim whitespace (including tabs)?][1] s = s.strip(' \t\n\r') [1]: stackoverflow.com/questions/1185524/… — stamat
– stamat, Commented Jun 29, 2013 at 18:35

Ashwini Chaudhary · Accepted Answer · 2013-10-25 19:25:59Z

185

Use str.split([sep[, maxsplit]]) with no sep or sep=None:

From docs:

If sep is not specified or is None, a different splitting algorithm is applied: runs of consecutive whitespace are regarded as a single separator, and the result will contain no empty strings at the start or end if the string has leading or trailing whitespace.

Demo:

>>> myString.split()
['I', 'want', 'to', 'Remove', 'all', 'white', 'spaces,', 'new', 'lines', 'and', 'tabs']

Use str.join on the returned list to get this output:

>>> ' '.join(myString.split())
'I want to Remove all white spaces, new lines and tabs'

edited Oct 25, 2013 at 19:25

answered May 22, 2012 at 22:42

Ashwini Chaudhary

252k60 gold badges478 silver badges519 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

MattH · Accepted Answer · 2012-05-22 22:40:43Z

86

If you want to remove multiple whitespace items and replace them with single spaces, the easiest way is with a regexp like this:

>>> import re
>>> myString="I want to Remove all white \t spaces, new lines \n and tabs \t"
>>> re.sub('\s+',' ',myString)
'I want to Remove all white spaces, new lines and tabs '

You can then remove the trailing space with .strip() if you want to.

answered May 22, 2012 at 22:40

MattH

38.4k11 gold badges85 silver badges84 bronze badges

1 Comment

Sushant Pachipulusu Over a year ago

This is the cleanest solution

skt7 · Accepted Answer · 2017-12-30 16:36:26Z

24

Use the re library

import re
myString = "I want to Remove all white \t spaces, new lines \n and tabs \t"
myString = re.sub(r"[\n\t\s]*", "", myString)
print myString

Output:

IwanttoRemoveallwhitespaces,newlinesandtabs

answered Dec 30, 2017 at 16:36

skt7

1,2351 gold badge9 silver badges21 bronze badges

2 Comments

Jesuisme Over a year ago

This is a correction of the original answer given by @TheGr8Adakron, not a duplicate

Simone Over a year ago

This does not preserve the spaces between the words rending the text useless for NLP.

Jesuisme · Accepted Answer · 2019-01-11 19:59:05Z

14

This will only remove the tab, newlines, spaces and nothing else.

import re
myString = "I want to Remove all white \t spaces, new lines \n and tabs \t"
output   = re.sub(r"[\n\t\s]*", "", myString)

OUTPUT:

IwantoRemoveallwhiespaces,newlinesandtabs

Good day!

edited Jan 11, 2019 at 19:59

Jesuisme

1,9212 gold badges33 silver badges43 bronze badges

answered Dec 12, 2017 at 9:49

The Gr8 Adakron

1,2491 gold badge13 silver badges15 bronze badges

1 Comment

Sajad Karim Over a year ago

Thanks for the solution - I think a minor correction is needed, it should be '+' instead of '*'.

Manish Mulani · Accepted Answer · 2012-12-31 11:32:23Z

11

import re

mystr = "I want to Remove all white \t spaces, new lines \n and tabs \t"
print re.sub(r"\W", "", mystr)

Output : IwanttoRemoveallwhitespacesnewlinesandtabs

answered Dec 31, 2012 at 11:32

Manish Mulani

7,40310 gold badges46 silver badges45 bronze badges

1 Comment

jan Over a year ago

this also removes ';'

rosstripi · Accepted Answer · 2019-05-01 20:09:55Z

11

The above solutions suggesting the use of regex aren't ideal because this is such a small task and regex requires more resource overhead than the simplicity of the task justifies.

Here's what I do:

myString = myString.replace(' ', '').replace('\t', '').replace('\n', '')

or if you had a bunch of things to remove such that a single line solution would be gratuitously long:

removal_list = [' ', '\t', '\n']
for s in removal_list:
  myString = myString.replace(s, '')

answered May 1, 2019 at 20:09

rosstripi

6041 gold badge11 silver badges22 bronze badges

1 Comment

mirekphd Over a year ago

Arguably this solution is the most readable and memorable.

sqqqrly · Accepted Answer · 2020-09-30 14:11:04Z

3

How about a one-liner using a list comprehension within join?

>>> foobar = "aaa bbb\t\t\tccc\nddd"
>>> print(foobar)
aaa bbb                 ccc
ddd

>>> print(''.join([c for c in foobar if c not in [' ', '\t', '\n']]))
aaabbbcccddd

answered Sep 30, 2020 at 14:11

sqqqrly

8931 gold badge7 silver badges10 bronze badges

Comments

JayRizzo · Accepted Answer · 2019-05-15 06:54:50Z

Since there is not anything else that was more intricate, I wanted to share this as it helped me out.

This is what I originally used:

import requests
import re

url = 'https://stackoverflow.com/questions/10711116/strip-spaces-tabs-newlines-python' # noqa
headers = {'user-agent': 'my-app/0.0.1'}
r = requests.get(url, headers=headers)
print("{}".format(r.content))

Undesired Result:

b'<!DOCTYPE html>\r\n\r\n\r\n    <html itemscope itemtype="http://schema.org/QAPage" class="html__responsive">\r\n\r\n    <head>\r\n\r\n        <title>string - Strip spaces/tabs/newlines - python - Stack Overflow</title>\r\n        <link

This is what I changed it to:

import requests
import re

url = 'https://stackoverflow.com/questions/10711116/strip-spaces-tabs-newlines-python' # noqa
headers = {'user-agent': 'my-app/0.0.1'}
r = requests.get(url, headers=headers)
regex = r'\s+'
print("CNT: {}".format(re.sub(regex, " ", r.content.decode('utf-8'))))

Desired Result:

<!DOCTYPE html> <html itemscope itemtype="http://schema.org/QAPage" class="html__responsive"> <head> <title>string - Strip spaces/tabs/newlines - python - Stack Overflow</title>

The precise regex that @MattH had mentioned, was what worked for me in fitting it into my code. Thanks!

Note: This is python3

Collectives™ on Stack Overflow

Strip spaces/tabs/newlines - python

8 Answers 8

Comments

1 Comment

2 Comments

1 Comment

1 Comment

1 Comment

Comments

Comments

Linked

Hot Network Questions

Collectives™ on Stack Overflow

8 Answers 8

Comments

1 Comment

2 Comments

1 Comment

1 Comment

1 Comment

Comments

Comments

Linked

Related