Python. Regular expression

Question

How to find everything which goes after symols #TR= and it is inside [ ] using re module. For example #TR=[ dfgg dfgddfg dgfgf dgdgdg dfgfg ]

Are you expecting nested [ ] inside your top level of [ ] ? — martineno
– martineno, Commented Nov 24, 2010 at 22:30

Marek Augustyn · Accepted Answer · 2010-11-25 00:07:02Z

import re
txt = '#TR=[ dfgg ] a kuku #TR=[ala ma kota]'

If you want to search for just the first occurrence of this pattern, use:

matches = re.search('#TR=\[([^\]]*)\]', txt)
if matches:
    print(repr(matches.group(1)))
' dfgg dfg '

If you want to find all occurrences in the text, use:

matches = re.findall('#TR=\[([^\]]*)\]', txt)
if matches:
    print(matches)
[' dfgg ', 'ala ma kota']

Remember to check whether the characters you are searching for have special meaning in regular expressions (like [ or ]). If they are special, escape them with the backslash: \[.

Also remember, that by default, regular expressions are "greedy" which means they try to get as much text to match the pattern as possible; so if you use .* (which means "match any character except newline"; details) instead of [^\]]* (which means "match until the ] is found, and stop before it"), too much text could be matched:

matches = re.findall('#TR=\[(.*)\]', txt)
if matches:
    print(matches)
[' dfgg ] a kuku #TR=[ala ma kota']

You can also use the "non-greedy" modifier ? in your pattern, after the qualifier (*, +) which enables the "the-less-characters-the-better" matching (use *?, +?). The result could be more readable:

'#TR=\[(.*?)\]'

instead of:

'#TR=\[([^\]]*)\]'

There's a great online tool to test your patterns as-you-type: RegExr by Grant Skinner.

Sasha Chedygov · Accepted Answer · 2010-11-24 22:30:42Z

1

import re
# compile the regex
exp = re.compile('.*\[(.*)\].*')
txt = r"#TR=[ dfgg dfgddfg dgfgf dgdgdg dfgfg ]"
match = exp.match(txt)
# grab the text between the square brackets
result = match.group(1)

edited Nov 24, 2010 at 22:30

Sasha Chedygov

132k27 gold badges107 silver badges117 bronze badges

answered Nov 24, 2010 at 22:28

John Keyes

5,6142 gold badges33 silver badges51 bronze badges

1 Comment

Sasha Chedygov Over a year ago

Sorry, I edited your answer by mistake, meant to edit my own. Reverted my change.

alpha-mouse · Accepted Answer · 2010-11-24 22:27:33Z

0

(?<=#TR=[)[^]]*(?=])

answered Nov 24, 2010 at 22:27

alpha-mouse

5,01328 silver badges38 bronze badges

Collectives™ on Stack Overflow

Python. Regular expression

3 Answers 3

Comments

1 Comment

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

Comments

1 Comment

Comments

Your Answer

Sign up or log in

Post as a guest

Related