getting text content of specific element

Question

I'm trying to get element text content only ignoring element's descendants, for instance if you look at this HTML:

<p>hello <h1> World </H1> </p>

for element "P" the right output should be ONLY "hello ".

I have checked the function: "element.textContent" but this returns the textual content of a node and its descendants (in my example it will return "hello world").

Thanks,

Your markup is incorrect. <h1> element can't be inside <p>. — VisioN
– VisioN, Commented Sep 20, 2013 at 10:22
BTW your HTML code is broken, so most valid solutions posted below won't work. (you can't have block-level elements inside <p />). — pawel
– pawel, Commented Sep 20, 2013 at 10:22

pawel · Accepted Answer · 2013-09-20 10:14:07Z

3

Considering this HTML:

<div id="gettext">hello <p> not this </p> world?</div>

do you want to extract "hello" AND "world"? if yes, then:

var div = document.getElementById('gettext'), // get a reference to the element
    children = [].slice.call(div.childNodes), // get all the child nodes
                                              // and convert them to a real array  
    text = children.filter(function(node){
        return node.nodeType === 3;           // filter-out non-text nodes
    })
    .map(function( t ){ 
        return t.nodeValue;                   // convert nodes to strings 
    });    

console.log( text.join('') );                 // text is an array of strings.

http://jsfiddle.net/U7dcw/

answered Sep 20, 2013 at 10:14

pawel

37.1k7 gold badges59 silver badges54 bronze badges

Sign up to request clarification or add additional context in comments.

3 Comments

VisioN Over a year ago

You should also add a notice about browser support of filter and map methods.

pawel Over a year ago

IE9+ and every other sane browser. For IE8 and below you need to provide Array.map and .filter methods.

VisioN Over a year ago

+1. This is the only answer here that really deserves the upvote.

Netorica · Accepted Answer · 2013-09-20 10:04:40Z

1

well behind it is an explanation

 $("p").clone()   //clone element
        .children() //get all child elements
        .remove()   //remove all child elements
        .end()  //get back to the parent
        .text();

answered Sep 20, 2013 at 10:04

Netorica

19.5k19 gold badges78 silver badges109 bronze badges

1 Comment

pawel Over a year ago

The question isn't tagged jQuery so I wouldn't assume its presence ;)

MarsOne · Accepted Answer · 2013-09-20 10:19:31Z

1

The answer i have is the same provided in couple of other answer. However let me try and offer an explanation.

<p >hello<h1>World</h1> </p>

This line will be rendered as

hello

World

If you look at this code it will be as follow

<p>hello</p>
<h1>World</h1> 
<p></p>

With the <p> tag you do not necessarily need the closing </p> tag if the paragraph is followed by a element. Check this article

Now you can select the content of the first p tag simply by using the following code

var p = document.getElementsByTagName('p');
console.log(p[0].textContent);

JS FIDDLE

answered Sep 20, 2013 at 10:19

MarsOne

2,1875 gold badges30 silver badges54 bronze badges

12 Comments

VisioN Over a year ago

@BOTH The markup is incorrect. <h1> element can't be inside <p>. The browser tries to repair it pushing <h1> out of <p>. Replace <p> with <div> and you'll see the difference.

aleation Over a year ago

Then say it, instead of just saying Wrong. The spirit of SO is to learn stuff, if you just say wrong even when you are right, when the output is clearly the right string, people gets confused

aleation Over a year ago

I know, AFTER commenting on various answers saying just "Wrong, it will output everything"

aleation Over a year ago

@VisioN Are YOU ok? xD If you noticed, after you provided the explanation, I deleted my own answer as I realized it's wrong, it's you who can't accept some constructive criticism because of your ego, we are just saying that if you go around just saying "You are wrong" even when they are getting the correct output (in a wrong way) people get confused xD

MarsOne Over a year ago

@VisioN, you one downvote doesnt make a difference buddy, three other ppl have upvoted my answer so guess that spoils your party doesnt it. think about it. what did you really achieve after all this? food for thought?

|

Simone · Accepted Answer · 2013-09-20 10:04:26Z

0

You can use the childNodes property, i.e.:

var p = document.querySelector('p');
p.childNodes[0]; // => hello

jsFiddle

answered Sep 20, 2013 at 10:04

Simone

21.6k15 gold badges84 silver badges112 bronze badges

1 Comment

pawel Over a year ago

how about <p>hello <h1> World </H1> THIS? </p> :)

Sara · Accepted Answer · 2013-09-20 10:08:11Z

0

Change your html to

<p id="id1">hello <h1> World </h1> </p>

Use this script,

alert(document.getElementById("id1").firstChild.nodeValue);

answered Sep 20, 2013 at 10:08

Sara

2721 gold badge4 silver badges13 bronze badges

Comments

user2700307 · Accepted Answer · 2013-09-20 10:10:29Z

0

Try to provide id for the element which you want to do some operation with that.

Below is the working example, it show output as "hello" as you expected.


<!DOCTYPE html>
<html>
<head>
<script type="text/javascript">
function showParagraph()
{
   alert(document.getElementById('test').innerHTML);

}
</script>
</head>

<body>
<p id="test">hello <h1> World </H1> </p>
<input type="button" onclick="showParagraph()" value="show paragraph" />
</body>

</html>

answered Sep 20, 2013 at 10:10

user2700307

1301 silver badge5 bronze badges

Comments

zaerymoghaddam · Accepted Answer · 2013-09-20 10:28:58Z

0

Plain texts are considered as nodes named #text. You can use childNodes property of element p and check the nodeName property of each item in it. You can iterate over them and select just #text nodes.

The function below loops over all element in document and prints just #text items

function myFunction()
{
    var txt="";
    var c=document.body.childNodes;
    for (i=0; i<c.length; i++)
    {
        if(c[i].nodeName == "#text")
            txt=txt + c[i].nodeName + "<br>";
    };
    return txt;
}

EDIT:

As @VisioN said in comments, using nodeType is much more safer (for browser compatibility) and recommended.

edited Sep 20, 2013 at 10:28

answered Sep 20, 2013 at 10:06

zaerymoghaddam

3,1671 gold badge33 silver badges37 bronze badges

2 Comments

VisioN Over a year ago

Is #text browser consistent? I'd better go for .nodeType === 3.

zaerymoghaddam Over a year ago

Yes you'r right. It would be much better to use nodeType for this purpose

Collectives™ on Stack Overflow

getting text content of specific element

7 Answers 7

3 Comments

1 Comment

World

12 Comments

1 Comment

Comments

Comments

2 Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

7 Answers 7

3 Comments

1 Comment

World

12 Comments

1 Comment

Comments

Comments

2 Comments

Your Answer

Sign up or log in

Post as a guest

Related