Regular expression pattern

Question

Regular expressions are one of the things that still escape me. What I want is simple enough, but I have yet to be able to consistently match. The text I want to match is /ssl/checkoutstep1.aspx regardless of case.

This case seems too simple to use a regex. Just lowercase the string and check for equality. — jjnguy
– jjnguy, Commented Jul 19, 2011 at 16:28
The language is standard perl, but the implementation is not for a language but rather an A/B testing interface, so I just need the pattern itself. — S16
– S16, Commented Jul 19, 2011 at 16:28
My effort so far has been fruitless. As for this case being to simple, it's not a matter of finding the right tool for the match; it's a matter of being required to use regex. — S16
– S16, Commented Jul 19, 2011 at 16:29
jjnguy depending on the language, might have a case insensitive compare. — Yuriy Faktorovich
– Yuriy Faktorovich, Commented Jul 19, 2011 at 16:29

Svante · Accepted Answer · 2011-07-19 17:01:11Z

4

Instead of the default delimiter /, it's easier if you use a non-slash like pipe: |

if ($string =~ m|/ssl/checkoutstep1\.aspx|i) {
  print 'match';
} else {
  print 'no match';
}

I'm assuming you actually need Regex (because you want to learn it, or you are doing a path rewrite, or something). Your example could easilly be solved with simple case-insensitive indexof or contains.

edited Jul 19, 2011 at 17:01

Svante

51.8k11 gold badges84 silver badges127 bronze badges

answered Jul 19, 2011 at 16:39

agent-j

28k5 gold badges55 silver badges81 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

gpojd · Accepted Answer · 2011-07-19 17:18:55Z

1

Since it doesn't look like you really need a regular expression, you should consider eq or index.

if ( lc( $string ) eq '/ssl/checkoutstep1.aspx' ) { ... } ## for exact matches

or

if ( index( lc( $string ), '/ssl/checkoutstep1.aspx' ) != -1 ) { ... } ## for partial matches

This is faster and avoids the confusion of regular expressions. If you insist on regular expressions, agent-j's response is what you want, although I prefer {}.

if ( $string =~ m{\Q/ssl/checkoutstep1.aspx\E}i ) { ... } ## the \Q and \E escape the special chars between them

edited Jul 19, 2011 at 17:18

answered Jul 19, 2011 at 17:11

gpojd

23.2k8 gold badges45 silver badges71 bronze badges

5 Comments

hobbs Over a year ago

I'm not convinced that it's faster, since lc has to make a copy of a (potentially large) string, and m//i uses a reasonably quick bitmap-based method to do case-insensitive searches (at least when not in Unicode mode).

gpojd Over a year ago

I have benchmarked it before, but that was with an older version of perl. I think the larger the string, the slower the regex (the regex shown, not one with anchors).

gpojd Over a year ago

I benchmarked them again and index/eq almost always beat the regex. The only case that it doesn't is when the string is long AND the match is at the very beginning.

hobbs Over a year ago

oh, anchor the regex match! m{^\Q/fixed/string\E\z}i

gpojd Over a year ago

eq is about twice as fast as the anchored regex on my machine, according to my benchmark.

Collectives™ on Stack Overflow

Regular expression pattern

2 Answers 2

Comments

5 Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

Comments

5 Comments

Your Answer

Sign up or log in

Post as a guest

Related