How to count elements in a list of lists of strings

Question

If a have a list like this:

[['welcome','a1'],['welcome','a1'],['hello','a2'],['hello','a3']]

and I want to return something like this:

[['welcome','a1', 2],['hello','a2', 1],['hello','a3', 1]]

If the same pair of strings in a sublist is encountered, increment the count

What I have so far:

counter = 0
for i in mylist:
  counter += 1 
  if i[0]== i[0]:
    if i[1] == i[1]:
        counter -= 1
 ouptut.append([mylist, counter])

I'm new at this and I appreciate your help!

It sounds like a dictionary problem but the question is kinda poorly stated... — jdero
– jdero, Commented Aug 10, 2013 at 1:05
Shouldn't the output of the example be [['welcome', 'a1', 2], ['hello', 'a3', 1], ['hello', 'a2', 1]]? — arshajii
– arshajii, Commented Aug 10, 2013 at 1:11
sorry I edited the question, I need to avoid the repetitions by comparing the 1st elem of each sublist and if it is alike compare the 2nd element (code) to chk the difference, if so count them separately otherwise just count it once. — JPP
– JPP, Commented Aug 10, 2013 at 1:19

Ashwini Chaudhary · Accepted Answer · 2013-08-10 01:37:18Z

1

Use a set here to get only unique items:

>>> lis = [['welcome','a1'],['welcome','a1'],['hello','a2'],['hello','a3']] 
>>> [list(x) + [1] for x in set(map(tuple, lis))]
>>> [['welcome', 'a1', 1], ['hello', 'a3', 1], ['hello', 'a2', 1]]

Explanation:

Set always returns unique items from an iterable or iterator, but as sets can only contain immutable item so you should convert them to a tuple first. A verbose version of the above code, only difference is that will also preserve the original or

>>> lis = [['welcome','a1'],['welcome','a1'],['hello','a2'],['hello','a3']] 
>>> s = set()
>>> for item in lis:
...     tup = tuple(item)  #covert to tuple
...     s.add(tup)
>>> s
set([('welcome', 'a1'), ('hello', 'a3'), ('hello', 'a2')])

Now use a list comprehension to get the expected output:

>>> [list(item) + [1] for item in s]
[['welcome', 'a1', 1], ['hello', 'a3', 1], ['hello', 'a2', 1]]

If the order of items matter(sets don't preserve order), then use this:

>>> seen = set()
>>> ans = []
>>> for item in lis:
...     tup = tuple(item)
...     if tup not in seen:
...         ans.append(item + [1])
...         seen.add(tup)
...         
>>> ans
[['welcome', 'a1', 1], ['hello', 'a2', 1], ['hello', 'a3', 1]]

I am not sure what's the point of using 1 here.

edited Aug 10, 2013 at 1:37

answered Aug 10, 2013 at 1:14

Ashwini Chaudhary

252k60 gold badges478 silver badges519 bronze badges

Sign up to request clarification or add additional context in comments.

8 Comments

arshajii Over a year ago

I was just about to edit my answer to include this exact thing, +1

JPP Over a year ago

It seems that it work but I don't understand fully your code to do it... but thanks

JPP Over a year ago

How you can edit the loop so as to get [['hello', 'a1',1], ['hello', 'a1',1]]; return ['hello', 'a1',2],... This is just to understand the logic better. Thank you for your replies! :)

Ashwini Chaudhary Over a year ago

@JPP Have a look at collections.Counter

JPP Over a year ago

Should I change if tup not in seen: to => if tup in seen: ??

|

Collectives™ on Stack Overflow

How to count elements in a list of lists of strings

1 Answer 1

8 Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

8 Comments

Your Answer

Sign up or log in

Post as a guest

Related