Strip out string from string in C#

Question

I have a string like this

orem ipsum dolor sit amet, consectetur adipiscing elit. Fusce rutrum, neque eu 
varius placerat, <p class="how-pkg"> leo diam viverra velit, </p> a commodo 
nibh metus nec orci. Nulla pharetra ut augue quis blandit.

I want to strip out a string value which is inside this  ------ 

Is there any way to accomplish this straight ahead?

without splitting the string multiple times.

Expected out put :leo diam viverra velit,

Do you only have one such tag in your string? Or can there be more? — germi
– germi, Commented Dec 20, 2013 at 7:20

Sergey Berezovskiy · Accepted Answer · 2013-12-20 08:00:10Z

4

use html agility pack and write

HtmlDocument doc = new HtmlDocument();
doc.LoadHtml(yourText);
var text = doc.DocumentNode.SelectNodes("/p[@class='how-pkg']").InnerText;

edited Dec 20, 2013 at 8:00

Sergey Berezovskiy

237k44 gold badges441 silver badges468 bronze badges

answered Dec 20, 2013 at 7:21

Kamil Budziewski

23.1k14 gold badges88 silver badges107 bronze badges

Sign up to request clarification or add additional context in comments.

4 Comments

Soner Gönül Over a year ago

doc is string in this case?

Kamil Budziewski Over a year ago

@SonerGönül nope, it's HtmlDocument from HtmlAgilityPack <- I've added creation of object to answer

Sergey Berezovskiy Over a year ago

+1 Also "/p[@class='how-pkg']" selector can be more appropriate if suddenly there will be other tags

Kamil Budziewski Over a year ago

@SergeyBerezovskiy thanks, added more specific selector

Ondrej Janacek · Accepted Answer · 2013-12-20 07:23:08Z

2

Using only string operations.

var searchForStart = "<p class=\"how-pkg\">";
int startIndex = input.IndexOf(searchForStart ) + searchFor.Length;
var searchForStop = "</p>";
int stopIndex = input.IndexIf(searchForStop, startIndex);

var output = text.Substring(startIndex, stopIndex - startIndex);

answered Dec 20, 2013 at 7:23

Ondrej Janacek

12.6k14 gold badges62 silver badges96 bronze badges

Comments

user176134 · Accepted Answer · 2013-12-20 07:30:44Z

1

string s = "orem ipsum dolor sit amet, consectetur adipiscing elit. Fusce rutrum, neque eu varius placerat, <p class=\"how-pkg\"> leo diam viverra velit, </p> a commodo nibh metus nec orci. Nulla pharetra ut augue quis blandit.";
int start = s.IndexOf("<p class=\"how-pkg\">") + 20;
int end = s.IndexOf("</p>", start);

string result = s.Substring(start, end - start);

answered Dec 20, 2013 at 7:30

user176134

Comments

athabaska · Accepted Answer · 2013-12-20 07:34:25Z

1

Assuming source is a your string:

var start = "<p class=\"how-pkg\">";
var p0 = source.IndexOf(start);
var p1 = source.IndexOf("</p>");
var s = source.Substring(p0 + start.Length, p1 - p0);

Something like that

edited Dec 20, 2013 at 7:34

answered Dec 20, 2013 at 7:23

athabaska

4553 silver badges23 bronze badges

3 Comments

Ondrej Janacek Over a year ago

It will actually include in the output.

Soner Gönül Over a year ago

This is wrong.. This gets output as a  leo diam viverra velit,

Ondrej Janacek Over a year ago

Now it won't work if there is  somewhere before  in the input.

Quinton Bernhardt · Accepted Answer · 2013-12-20 08:03:42Z

1

If your tag structure is always going to be the same then you can use regex to extract the value like this:

    var result = Regex.Match("<p class="how-pkg">hello</p>", "(?<=<p class="how-pkg">).*(?=</p>)").Value;

If your tag structure will change then you can capture both tag and values with named groups like this:

    <(?<tag>\.*)>(?<text>.*)</\k<tag>>

To capture just the value hello from <one>hello</one>:

    (?<=<.*>).*(?=</\w*>)

eg.

    var result = Regex.Match("<p class="how-pkg">hello</p>", "(?<=<.*>).*(?=</\w*>)").Value;

edited Dec 20, 2013 at 8:03

answered Dec 20, 2013 at 7:52

Quinton Bernhardt

4,81321 silver badges29 bronze badges

Comments

Sinatr · Accepted Answer · 2014-01-06 08:49:23Z

1

Simplest way:

search for <p (or <p class)
search for > after that - you found a tag (disregards of specified class) and opening point
(optinal) check if you support this class
search for  - you found result and the point where continue search (if necessary).

edited Jan 6, 2014 at 8:49

answered Dec 20, 2013 at 8:32

Sinatr

22.3k18 gold badges108 silver badges349 bronze badges

Collectives™ on Stack Overflow

Strip out string from string in C#

6 Answers 6

4 Comments

Comments

Comments

3 Comments

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

6 Answers 6

4 Comments

Comments

Comments

3 Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related