how to get data between td tags in unix shell script

Question

I want to get data between td tags in unix shell script in a generalize way.

for example in the following

<td style="padding:3px;" align="center">123.456</td>

how to retrieve 123.456 in a generalize way.

Thanks

What system are you working with? Can you start/install XML Shell (xmlsh)? — likeitlikeit
– likeitlikeit, Commented Apr 25, 2013 at 11:04

sat · Accepted Answer · 2013-04-25 11:02:04Z

2

You can try with sed,

sat:~# cat file
<td style="padding:3px;" align="center">123.456</td>
<td>sat</td>
sat:~#  
sat:~# sed 's/<td\(.*[^<>]\+\?>\)\(.*\)<\/td>/\2/g' file
123.456
sat
sat:~#

I hope it will help you.

answered Apr 25, 2013 at 11:02

sat

15k7 gold badges49 silver badges69 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

Sidharth C. Nadhan · Accepted Answer · 2013-04-25 11:04:09Z

0

sed 's/^.*<td.*>\(.*\)<.*$/\1/' file

answered Apr 25, 2013 at 11:04

Sidharth C. Nadhan

2,2832 gold badges18 silver badges18 bronze badges

Comments

Fredrik Pihl · Accepted Answer · 2013-04-25 11:07:28Z

0

For a proper solution and in a generalized way use a proper parser like html-xml-utils

for a non-proper and non-gerneralized way, use sed

sed 's/^.*>\([0-9.]*\)<.*$/\1/'

answered Apr 25, 2013 at 11:07

Fredrik Pihl

45.9k7 gold badges89 silver badges133 bronze badges

Comments

Kent · Accepted Answer · 2013-04-25 11:12:51Z

0

If for some reason you cannot use a xml parser,

grep was born to extract things. :)

grep -Po '(?<=>)[^<]*'

answered Apr 25, 2013 at 11:12

Kent

197k36 gold badges248 silver badges317 bronze badges

Collectives™ on Stack Overflow

how to get data between td tags in unix shell script

4 Answers 4

Comments

Comments

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

4 Answers 4

Comments

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related