Regex for stripping HTML tags and contents

Question

I've searched extensively, but I couldn't find a solution.

This is my current text:

Lorem ipsum <strong>dolor</strong> sit <i>amet</i>.

This is what I want:

Lorem ipsum sit.

I do not want to use an HTML parser. I just want to use a simple regex to remove HTML tags and their inner content.

Tommy Ivarsson · Accepted Answer · 2014-05-20 02:51:56Z

1

Used with the global flag, this regular expression matches a pair of HTML tags together with the text between them, as long as that text contains only letters, digits, and the characters . , ; :

<[\/\!]*?[^<>]*?>[A-Za-z0-9.,;:]*<[\/\!]*?[^<>]*?>
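As a rough sketch of how the expression above could be applied in PHP, assuming the sample text from the question: `preg_replace()` replaces every match by default, so no separate global flag is needed. The `~` delimiters and the whitespace cleanup at the end are illustrative additions, not part of the original answer.

```php
<?php
// Sample text from the question.
$text = 'Lorem ipsum <strong>dolor</strong> sit <i>amet</i>.';

// The expression from the answer, wrapped in "~" delimiters (an assumption
// made here so that "/" needs no additional escaping).
$pattern = '~<[\/\!]*?[^<>]*?>[A-Za-z0-9.,;:]*<[\/\!]*?[^<>]*?>~';

// preg_replace() replaces all matches, not just the first one.
$stripped = preg_replace($pattern, '', $text);

// The spaces around each removed element stay behind, so a hypothetical
// cleanup step collapses whitespace and drops spaces before punctuation.
$tidy = preg_replace('~\s+~', ' ', $stripped);
$tidy = preg_replace('~\s+([.,;:!?])~', '$1', $tidy);

echo $stripped, "\n"; // Lorem ipsum  sit .
echo $tidy, "\n";     // Lorem ipsum sit.
```

Because the middle character class allows no whitespace, an element such as `<b>dolor sit</b>` would not be matched by this pattern.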

edited May 20, 2014 at 2:51

answered May 20, 2014 at 2:37


4 Comments

user3650808 Over a year ago

strip_tags just unwraps the content. I want the content gone as well.

Tommy Ivarsson Over a year ago

Your question has already been answered here stackoverflow.com/questions/1516085/…

user3650808 Over a year ago

Both answers use HTML parsers, something I don't want to use.

Tommy Ivarsson Over a year ago

Edited the answer with a regular expression for you.

MrPizzaFace · Accepted Answer · 2014-05-20 03:19:31Z

0

Though @Tommy's answer works for you, that regex is far more complicated than it needs to be. You can simply do this:

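$str = "Lorem ipsum <strong>dolor</strong> sit <i>amet</i>.";
// Match a space, then "<", a greedy run of non-whitespace characters, then ">".
// Each element is removed in one go because it contains no whitespace.
$r = preg_replace("/ <\S*>/", "", $str);
echo $r;
#=> Lorem ipsum sit.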
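One caveat worth illustrating: the short expression relies on each element being preceded by a space and containing no whitespace. The sketch below shows what happens when that assumption breaks, next to one possible alternative that pairs the opening and closing tag with a backreference. The helper name `strip_elements` is hypothetical, and the pattern is still only a sketch: it does not handle nested elements of the same name, void elements, or comments.

```php
<?php
// Hypothetical helper: remove each element together with its contents.
function strip_elements(string $html): string
{
    // \s*                        optional whitespace before the element
    // <([a-z][a-z0-9]*)\b[^>]*>  opening tag, tag name captured as group 1
    // .*?                        contents, as few characters as possible
    // </\1>                      closing tag with the same name
    // Flags: i = case-insensitive, s = "." also matches newlines.
    $pattern = '~\s*<([a-z][a-z0-9]*)\b[^>]*>.*?</\1>~is';

    // Fall back to the input if the regex engine reports an error.
    return preg_replace($pattern, '', $html) ?? $html;
}

$spaced = 'Lorem <b>dolor sit</b> amet';
$short = preg_replace("/ <\S*>/", "", $spaced);

echo $short, "\n";                   // Loremdolor sit</b> amet
echo strip_elements($spaced), "\n";  // Lorem amet
echo strip_elements('Lorem ipsum <strong>dolor</strong> sit <i>amet</i>.'), "\n"; // Lorem ipsum sit.
```

For input that is not this regular, an HTML parser remains the more robust choice, even though the question rules it out.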

answered May 20, 2014 at 3:19


1 Comment

Tommy Ivarsson Over a year ago

Nice. I just took a regexp from the notes on the documentation for strip_tags(). This looks way nicer.

Todor Todorov · Accepted Answer · 2014-10-04 08:46:05Z

0

preg_replace('/(<.*?>)|(&.*?;)/', '', $string)

This one works pretty well for me. It strips all HTML tags and HTML entities (such as &amp;), but it keeps the text between the tags. Hope this helps.
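To make the behaviour of this expression concrete, here is a small sketch that runs it on the sample text from the question and on a second, made-up string containing an entity. For the tags themselves the result matches what `strip_tags()` returns: the markup goes away, but the inner text stays.

```php
<?php
// The expression from this answer: any "<...>" tag or any "&...;" entity.
$pattern = '/(<.*?>)|(&.*?;)/';

// Sample text from the question.
$plain = preg_replace($pattern, '', 'Lorem ipsum <strong>dolor</strong> sit <i>amet</i>.');

// Made-up example with an entity; note the double space left behind.
$withEntity = preg_replace($pattern, '', 'Tom &amp; Jerry <b>rock</b>');

echo $plain, "\n";      // Lorem ipsum dolor sit amet.
echo $withEntity, "\n"; // Tom  Jerry rock
```

So this variant does not produce the "Lorem ipsum sit." output the question asks for, since "dolor" and "amet" survive.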

answered Oct 4, 2014 at 8:46

