A regex that converts text lists to html in PHP

Question

I'm trying to code a regexp to convert a block of text:

* List item
* Another list item

to html:

<ul>
    <li>List item</li>
    <li>Another list item</li>
</ul>

I know there are snippets or classes to do this (Markdown, Textile, etc) but I think it's overkill: I really just want some basic functionality. So far I'm trying with:

$text = preg_replace("/\*+(.*)?/i","<li>$1</li>",$text);

But I don't know how to wrap everything in <ul> tags without using a separate replace, like so:

$text = preg_replace("/(\<li\>(.*)\<\/li\>\n*)+/is","<ul>\n$1\n</ul>\n",$text);

This interferes with other code, for example ordered lists. There must be a better way.

Thanks.

BrandonS · Accepted Answer · 2011-02-04 20:32:10Z

14

On this question, if you where talking about the fact that the code you used would wrap multiple sets of li tags in one ul tag even if there was suppose to be a break in there like so:

* line 1
* line 1
* line 1
this is not part of a list
* line 1
* line 1
* line 1

Would become:

<ul>
<li>line 1</li>
<li>line 1</li>
<li>line 1</li>
this is not part a the list
<li>line 1</li>
<li>line 1</li>
</ul>

Then I have a solution for you. You had 90% of it there, here is a solution I came up with (but I am sure you already solved it anyway):

$text = preg_replace("/\*+(.*)?/i","<ul><li>$1</li></ul>",$text);
$text = preg_replace("/(\<\/ul\>\n(.*)\<ul\>*)+/","",$text);

The solution does not mess with lists of any kind already on the page in the text or whatever and makes sure to separate multiple lists. Reason is that every match it finds where an asterisk was used to create a text list item it surrounds that with a ul and li then the 2nd line finds all of the back to back closing and opening ul tags and removes them.

answered Feb 4, 2011 at 20:32

BrandonS

9327 silver badges16 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

Reven Over a year ago

That's quite ingenious! And it would solve the problem. Thank you. I'll give it a RealWorld® spin and see how it works.

bart Over a year ago

The above regex gives problems when * appears in the middle of a phrase. If you run into that problem, then you can modify the regex to only match * when it's at the beginning of a line: preg_replace("/^*+(.*)?/im","<ul><li>$1</li></ul>",$text);

DCC · Accepted Answer · 2010-02-26 20:34:40Z

1

Why don't you store the first regex in an array with preg_match_all, and glue it like this:

$list='<ul><li>';
$list .= implode('</li><li>',$arr_regex);
$list .= '</li></ul>';

answered Feb 26, 2010 at 20:34

DCC

2491 silver badge5 bronze badges

1 Comment

user282320 Over a year ago

That would work if the text was the only element in the block of text, but there are things before and after.

Paulo Santos · Accepted Answer · 2010-02-26 20:24:53Z

0

Well, you could simply do

$text = "<ul>" . preg_replace("/\*+(.*)?/i","<li>$1</li>",$text) . "</ul>";

or, if you really want to use preg_replace

$text = preg_replace("/(\<li\>(.*?)\<\/li\>\n*)+/is","<ul>\n$1\n</ul>\n",$text);

answered Feb 26, 2010 at 20:24

Paulo Santos

11.7k5 gold badges46 silver badges67 bronze badges

1 Comment

user282320 Over a year ago

Again, maybe I didn't make this clear (sorry), but there are more things in $text, so adding <ul> won't work.

Ignacio Vazquez-Abrams · Accepted Answer · 2010-02-27 00:17:42Z

0

Perhaps you may find PHP Markdown useful.

answered Feb 27, 2010 at 0:17

Ignacio Vazquez-Abrams

804k160 gold badges1.4k silver badges1.4k bronze badges

1 Comment

user282320 Over a year ago

Was kind of trying to avoid using it, to be honest. Just need a couple of substitutions. My script is about 10Kb. Including a 40Kb script just to do that seems overkill.

Collectives™ on Stack Overflow

A regex that converts text lists to html in PHP

4 Answers 4

2 Comments

1 Comment

1 Comment

1 Comment

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

4 Answers 4

2 Comments

1 Comment

1 Comment

1 Comment

Your Answer

Sign up or log in

Post as a guest

Linked

Related