Python list crossing efficiency: generator or filter ()?

Question

Python list crossing efficiency: generator or filter ()?

I would like to cross two lists in Python (2.7). I need the result to be iterable:

list1 = [1,2,3,4] list2 = [3,4,5,6] result = (3,4) # any kind of iterable

Ensuring a full iteration will be performed primarily after crossing, which of the following functions is more efficient?

Using a generator:

 result = (x for x in list1 if x in list2)

Using filter ():

 result = filter(lambda x: x in list2, list1)

Other offers?

Thanks in advance,
Amnon

+4

python list python-2.7 intersection intersect

Amnon grossman Jun 16 '11 at 9:13

source share

3 answers

Your solution has complexity O(m*n) , where m and n are the corresponding lengths of two lists. You can improve O(m+n) complexity by using a set for one of the lists:

 s = set(list1) result = [x for x in list2 if x in s]

In cases where speed matters more than readability (i.e. almost never), you can also use

 result = filter(set(a).__contains__, b)

which is about 20% faster than other solutions on my machine.

+7

Sven marnach Jun 16 '11 at 9:16

source share

for a list of lists, the most efficient way is to use:

 result = set(list1).intersection(list2)

as mentioned, but for numpy arrays the intersection1d function is more efficient:

 import numpy as np result = np.intersection1d(list1, list2)

In particular, when you know that lists have no duplicate values, you can use it like:

 result = np.intersection1d(list1, list2, assume_unique=True)

0

ses Jul 14 '17 at 18:21

source share

Daniel Roseman · Accepted Answer · 2011-06-16T09:16:52+0000

None of them. The best way is to use kits.

 list1 = [1,2,3,4] list2 = [3,4,5,6] result = set(list1).intersection(list2)

Sets are repeatable, so there is no need to convert the result to anything.

Python list crossing efficiency: generator or filter ()?

More articles: