Overlapping patterns regex

Question

with python and regex I attempt to match repeating/overlapping patterns/blocks like

04/00127-48
U 05062012
A: SAKARK
T_ Par.: dsfsd

in

04/00127-48
U 05062012
A: SAKARK
T_ Par.: dsfsd
04/00168-42
U 05062012
A: SAKARK
T_ Par.: fdfs
04/00168-43
U 05062012
A: SAKARK
T_ Par.: fdfs

I have tried

'(?=(\d+\/.*))'

this seem to work

'((\d+\/.*?)=?\d+\/)

but is there a better approach?

I'm confused, which patterns are you trying to extract? What is your intended result? — Joel Cornett
– Joel Cornett, Commented Jul 7, 2012 at 18:14
sorry about the bad question, I want to match the text blocks — user642897
– user642897, Commented Jul 7, 2012 at 18:23
See Marco de Wit's answer. Notice his usage of the re.DOTALL flag. — Joel Cornett
– Joel Cornett, Commented Jul 7, 2012 at 18:31

Marco de Wit · Accepted Answer · 2012-07-07 18:34:43Z

2

This answers your question:

re.findall(r'.+?(?=\d\d\/|$)',s,re.DOTALL)

re.DOTALL is needed to let the . match end-of-lines.

The r in front of the regex makes it a raw string so escapes with backslash are left as they are so the regex function will handle them. It is not needed here but still a good habit for regex's.

Your question is not very clear. Maybe this matches better what you want?

list(zip(*[iter(s.splitlines())]*4))

It gives a list with tuples.

edited Jul 7, 2012 at 18:34

answered Jul 7, 2012 at 18:20

Marco de Wit

2,8362 gold badges20 silver badges23 bronze badges

Sign up to request clarification or add additional context in comments.

Collectives™ on Stack Overflow

Overlapping patterns regex

1 Answer 1

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

Comments

Your Answer

Sign up or log in

Post as a guest

Related