concatenate an arbitrary number of lists in a function in Python

Question

I want to write a join_lists function that takes an arbitrary number of lists and concatenates them. For example, if the inputs are

m = [1, 2, 3]
n = [4, 5, 6]
o = [7, 8, 9]

then when I call print join_lists(m, n, o), it will return [1, 2, 3, 4, 5, 6, 7, 8, 9]. I realize I should use *args as the argument in join_lists, but I'm not sure how to concatenate an arbitrary number of lists. Thanks.

There's no need to write this function; just use from itertools import chain.
– Derek Veit, Commented Jan 31, 2018 at 3:16

Marcin · Accepted Answer · 2013-09-05 18:26:26Z

19

Although you can use something which invokes __add__ sequentially, that is very much the wrong thing (for starters you end up creating as many new lists as there are lists in your input, which ends up having quadratic complexity).

The standard tool is itertools.chain:

import itertools

def concatenate(*lists):
    return itertools.chain(*lists)

or

def concatenate(*lists):
    return itertools.chain.from_iterable(lists)

This returns an iterator that yields each element of the lists in sequence. If you need the result as a list, wrap it in list: list(itertools.chain.from_iterable(lists))
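
A minimal sketch of how the asker's join_lists could be built on chain.from_iterable, assuming a real list (rather than a lazy iterator) is wanted as the result. The sample values m, n and o are taken from the question.

```python
from itertools import chain


def join_lists(*lists):
    """Concatenate any number of lists into one new list."""
    # chain.from_iterable lazily walks each input in order;
    # list() materialises the result in a single pass.
    return list(chain.from_iterable(lists))


m = [1, 2, 3]
n = [4, 5, 6]
o = [7, 8, 9]
print(join_lists(m, n, o))  # [1, 2, 3, 4, 5, 6, 7, 8, 9]
```

The input lists are left untouched, and calling join_lists() with no arguments returns an empty list.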

~~If you insist on doing this "by hand", then use extend:~~

~~def concatenate(*lists): newlist = [] for l in lists: newlist.extend(l) return newlist~~

Actually, don't use extend like that - it's still inefficient, because it has to keep extending the original list. The "right" way (it's still really the wrong way):

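def concatenate(*lists):
    # Materialise the lengths: in Python 3, map() is a one-shot iterator.
    lengths = [len(l) for l in lists]
    newlen = sum(lengths)
    newlist = [None]*newlen  # preallocate the result
    start = 0
    end = 0
    for l,n in zip(lists,lengths):
        end+=n
        newlist[start:end] = l  # copy this list into its slot
        start+=n
    return newlist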

http://ideone.com/Mi3UyL

You'll note that this still ends up doing as many copy operations as there are total slots in the lists. So, this isn't any better than using list(chain.from_iterable(lists)), and is probably worse, because list can make use of optimisations at the C level.
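
If you want to check this claim on your own machine, a rough timing harness along these lines may help. It is only a sketch: the function names and the workload (1,000 lists of 100 integers) are assumptions chosen for illustration, and no particular timings are implied.

```python
import timeit
from itertools import chain


def concat_chain(*lists):
    # Let list() consume the chained iterator in one pass.
    return list(chain.from_iterable(lists))


def concat_prealloc(*lists):
    # Preallocate the result, then copy each input into its slice.
    lengths = [len(l) for l in lists]
    newlist = [None] * sum(lengths)
    start = 0
    for l, n in zip(lists, lengths):
        newlist[start:start + n] = l
        start += n
    return newlist


# Hypothetical workload: 1,000 lists of 100 integers each.
data = [list(range(100)) for _ in range(1000)]

# Both approaches must agree before it makes sense to time them.
assert concat_chain(*data) == concat_prealloc(*data)

for fn in (concat_chain, concat_prealloc):
    seconds = timeit.timeit(lambda: fn(*data), number=20)
    print(f"{fn.__name__}: {seconds:.4f}s for 20 runs")
```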

Finally, here's a version using extend (suboptimal) in one line, using reduce:

from functools import reduce  # reduce is no longer a builtin in Python 3
concatenate = lambda *lists: reduce((lambda a,b: a.extend(b) or a),lists,[])

edited Sep 5, 2013 at 18:26

answered Sep 5, 2013 at 17:34

Marcin

6 Comments

SethMMorton Over a year ago

Doesn't PEP8 state that though we can write a loop on one line, we shouldn't? I'm just being nit-picky.

Marcin Over a year ago

@SethMMorton It also states that a foolish consistency is the hobgoblin of small minds, as I recall. I certainly find this more readable for one single expression.

SethMMorton Over a year ago

Haha. That's true too.

Marcin Over a year ago

@AshwiniChaudhary Thanks. I think there's a badge for a certain number of answers with more votes than the accepted answer.

Random832 Over a year ago

What about adding up the length and doing [None]*newlen as in your example then doing newlist[:] = itertools.chain.from_iterable(lists) ? C-level optimizations don't get you out of the fact that the list constructor cannot know the final length before iterating through everything.

Talia Stocks · Accepted Answer · 2017-12-04 20:36:09Z

16

One way would be this (using reduce) because I currently feel functional:

import operator
from functools import reduce
def concatenate(*lists):
    return reduce(operator.add, lists)
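
One caveat with this form, as far as I can tell: reduce raises a TypeError when it is called with no lists at all, because it has no initial value to fall back on. A hedged variant passes an empty list as the initializer (the name concatenate_safe is only for illustration):

```python
import operator
from functools import reduce


def concatenate_safe(*lists):
    # The third argument is the initial value, so an empty call returns [].
    # Each step still builds a new list via operator.add, so this remains
    # slow when there are many inputs.
    return reduce(operator.add, lists, [])


print(concatenate_safe([1, 2], [3], [4, 5]))  # [1, 2, 3, 4, 5]
print(concatenate_safe())                     # []
```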

However, a better functional method is given in Marcin's answer:

from itertools import chain
def concatenate(*lists):
    return chain(*lists)

although you might as well use itertools.chain(*iterable_of_lists) directly.

A procedural way:

def concatenate(*lists):
    new_list = []
    for i in lists:
        new_list.extend(i)
    return new_list

A golfed version: j=lambda*x:sum(x,[]) (do not actually use this).

edited Dec 4, 2017 at 20:36

Talia Stocks

answered Sep 5, 2013 at 17:29

rlms

6 Comments

alittleboy Over a year ago

thanks! How about not importing any modules and just use the basic tools?

Marcin Over a year ago

The first form will have quadratic complexity, and create n intermediate lists.

Marcin Over a year ago

The second form is still inefficient, because it extends the list n times.

Marcin Over a year ago

Finally, I'd like to note that there's a one-line version using extend

rlms Over a year ago

@Marcin oh, I assumed it was different, and that was why you gave it. Is itertools.chain.from_iterable(lists) efficient?

Andrew Clark · Accepted Answer · 2013-09-05 17:30:44Z

2

You can use sum() with an empty list as the start argument:

def join_lists(*lists):
    return sum(lists, [])

For example:

>>> join_lists([1, 2, 3], [4, 5, 6])
[1, 2, 3, 4, 5, 6]
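
A small check of how sum() treats its start argument, written as a sketch with illustrative variable names (start, parts): sum() combines values with ordinary addition, so the start list and the inputs are not modified, at the cost of building a new list at every step.

```python
def join_lists(*lists):
    return sum(lists, [])


start = []
parts = [[1, 2, 3], [4, 5, 6]]

result = sum(parts, start)
print(result)  # [1, 2, 3, 4, 5, 6]
print(start)   # [] (the start list is left unchanged)
print(parts)   # [[1, 2, 3], [4, 5, 6]] (inputs are unchanged too)
```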

answered Sep 5, 2013 at 17:30

Andrew Clark

4 Comments

rlms Over a year ago

Why? I think this looks fine.

Frerich Raabe Over a year ago

This solution is only good for very very short lists, it has quadratic complexity.

Duncan Over a year ago

Interesting question. Does sum() internally use + or += to accumulate the result? With one of those this will have quadratic complexity, with the other it is linear.

Duncan Over a year ago

...and to answer my own question, @FrerichRaabe is correct. If sum() used inplace addition it would mutate the starting value which could break things, so it has to use ordinary addition with quadratic complexity.

Joyfulgrind · Accepted Answer · 2013-09-05 17:43:38Z

0

Another way:

>>> m = [1, 2, 3]
>>> n = [4, 5, 6]
>>> o = [7, 8, 9]
>>> p = []
>>> for (i, j, k) in (m, n, o):
...     p.append(i)
...     p.append(j)
...     p.append(k)
... 
>>> p
[1, 2, 3, 4, 5, 6, 7, 8, 9]
>>>

answered Sep 5, 2013 at 17:43

Joyfulgrind

Comments

AlexH · Accepted Answer · 2014-01-25 23:48:24Z

0

This seems to work just fine:

def join_lists(*args):
    output = []
    for lst in args:
        output += lst
    return output

It returns a new list with all the items of the previous lists. Is using + not appropriate for this kind of list processing?
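
On that closing question, here is a small sketch of the difference as I understand it: output += lst extends the existing list in place, while output = output + lst builds a new list on every iteration. The helper names join_inplace and join_with_plus are made up for this comparison.

```python
def join_inplace(*args):
    output = []
    for lst in args:
        output += lst          # in place: the same list object grows
    return output


def join_with_plus(*args):
    output = []
    for lst in args:
        output = output + lst  # allocates a new list each time
    return output


a = [1, 2]
b = [3, 4]
print(join_inplace(a, b))    # [1, 2, 3, 4]
print(join_with_plus(a, b))  # [1, 2, 3, 4]
print(a, b)                  # [1, 2] [3, 4] (inputs are untouched)

# An alias shows whether the original list object survives.
x = []
alias = x
x += [1]
print(x is alias)  # True: += mutated the same object
x = x + [2]
print(x is alias)  # False: + created a new list
```

Both helpers return the same result; the in-place form simply avoids creating an intermediate list per input.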

edited Jan 25, 2014 at 23:48

answered Jan 25, 2014 at 23:41

AlexH

Comments

xxlovesmilie · Accepted Answer · 2013-10-29 15:04:28Z

-1

Or you could work through it step by step: make a variable (here 'z') refer to the first list passed to the join_lists function, copy the items in that list (not the list itself) into a new list, and then add the elements of the other lists to that new list:

m = [1, 2, 3]
n = [4, 5, 6]
o = [7, 8, 9]

def join_lists(*x):
    z = x[0]  # the first list passed in
    new_list = list(z)  # copy its items so the caller's list is not mutated
    for item in x[1:]:  # the remaining lists
        new_list += item
    return new_list

then

print(join_lists(m, n, o))

would output:

[1, 2, 3, 4, 5, 6, 7, 8, 9]

answered Oct 29, 2013 at 15:04

xxlovesmilie
