Python: Getting specific list elements

Question

So I made a list of elements from a HTML-Page and counted the frequency of these elements. But I just need some specific elements like "bb" and "nw". So I don't know what position they'll have in the list and I'm not sure how to seperate them from the other elements.

This is my code so far:

from bs4 import BeautifulSoup
import urllib2
import re
import operator
from collections import Counter
from string import punctuation

source_code = urllib2.urlopen('https://de.wikipedia.org/wiki/Liste_von_Angriffen_auf_Fl%C3%BCchtlinge_und_Fl%C3%BCchtlingsunterk%C3%BCnfte_in_Deutschland/bis_2014')
html = source_code.read()
soup = BeautifulSoup(html, "html.parser")

text = (''.join(s.findAll(text=True))for s in soup.findAll('a'))

c = Counter((x.rstrip(punctuation).lower() for y in text for x in y.split()))

bb,nw=operator.itemgetter(1,2)(c.most_common())
print(bb,nw)

Thank you for your help and any hints.

What do you mean by you need only specific elements? Do you mean that you need their frequency? — Peaceful
– Peaceful, Commented Mar 28, 2016 at 20:11

user2390182 · Accepted Answer · 2016-03-28 20:19:01Z

2

You could use a filter:

relevant_items = ('bb', 'nw')
items = filter(lambda x: x[0] in relevant_items, c.most_common())

Alternatively, you can already filter in the comprehension:

c = Counter((x.rstrip(punctuation).lower() for y in text for x in y.split() if x in relevant_items))

edited Mar 28, 2016 at 20:19

answered Mar 28, 2016 at 20:12

user2390182

73.7k6 gold badges71 silver badges95 bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

Kendel Ventonda Over a year ago

Thanks a lot. This was exactly what I was looking vor.

Collectives™ on Stack Overflow

Python: Getting specific list elements

1 Answer 1

1 Comment

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

1 Comment

Your Answer

Sign up or log in

Post as a guest

Related