NumPy vectorization of a double Python for loop

Question

V is an (n, p) NumPy array with typical dimensions n ~ 10, p ~ 20,000.

The code I have now looks like this:

A = np.zeros(p)
for i in xrange(n):
    for j in xrange(i+1):
        A += F[i,j] * V[i,:] * V[j,:]

How can I rewrite this to avoid the double Python for loop?

+4

python numpy linear-algebra

user1984528 Nov 20 '13 at 10:52

3 answers

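The difficult part of this problem is that you only want to sum the terms where j <= i. Without that restriction, you could do the following: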

M = (V.reshape(n, 1, p) * V.reshape(1, n, p)) * F.reshape(n, n, 1)
A = M.sum(0).sum(0)

If F is symmetric (that is, F[i,j] == F[j,i]), you can exploit the symmetry of M as follows:

D = M[range(n), range(n)].sum(0)
A = (M.sum(0).sum(0) - D) / 2.0 + D

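That said, this is not a great candidate for vectorization: since n << p, the for-loops have little effect on the speed of the computation.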

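Alternatively, you can first set the elements of F that you do not want to use to zero, after which M.sum(0).sum(0) gives the result you want.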
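The zeroing approach can be checked against the original double loop. The following is a minimal sketch, not code from the answer: the function names `loop_sum` and `masked_broadcast_sum`, the seed, and the small array sizes are assumptions chosen for illustration, and `range` is used in place of `xrange` so that it runs on Python 3.

```python
import numpy as np

def loop_sum(V, F):
    # Reference implementation: the double loop from the question (j <= i only).
    n, p = V.shape
    A = np.zeros(p)
    for i in range(n):
        for j in range(i + 1):
            A += F[i, j] * V[i, :] * V[j, :]
    return A

def masked_broadcast_sum(V, F):
    # Zero the entries of F with j > i, then sum the full (n, n, p) product.
    n, p = V.shape
    F_lower = np.tril(F)  # keeps the diagonal and everything below it
    M = (V.reshape(n, 1, p) * V.reshape(1, n, p)) * F_lower.reshape(n, n, 1)
    return M.sum(0).sum(0)

rng = np.random.default_rng(0)  # arbitrary seed
V = rng.random((4, 6))          # small sizes, for illustration only
F = rng.random((4, 4))
print(np.allclose(masked_broadcast_sum(V, F), loop_sum(V, F)))
```

`np.tril` returns a copy, so the caller's F is left unchanged.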

+7

Isaac Nov 20 '13 at 23:11

The sum can also be computed with broadcasting, using the np.newaxis construct:

na = np.newaxis
X = (np.tri(n)*F)[:,:,na]*V[:,na,:]*V[na,:,:]
A = X.sum(axis=1).sum(axis=0)

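Here a 3D array X[i,j,p] is constructed, and its first two axes are then summed, which yields the 1D array A[p]. F is multiplied by a triangular matrix, np.tri(n), to restrict the summation to j <= i, as the problem requires.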
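To see what the mask and the broadcasting do, here is a small sketch. The tiny sizes and the seed are assumptions made so that the arrays are easy to inspect; they are not taken from the answer.

```python
import numpy as np

n, p = 3, 5          # tiny sizes, for illustration only
na = np.newaxis

mask = np.tri(n)     # ones where j <= i, zeros elsewhere
print(mask)

rng = np.random.default_rng(0)  # arbitrary seed
V = rng.random((n, p))
F = rng.random((n, n))

# Shapes (n, n, 1), (n, 1, p) and (1, n, p) broadcast to (n, n, p).
X = (mask * F)[:, :, na] * V[:, na, :] * V[na, :, :]
A = X.sum(axis=1).sum(axis=0)
print(X.shape, A.shape)
```

Multiplying F by the mask zeroes every entry above the diagonal, so those (i, j) pairs contribute nothing when the first two axes are summed.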

+1

flonk Nov 21 '13 at 12:46

Jaime · Accepted Answer · 2013-11-21T01:35:12+0000

While Isaac's answer seems promising, as it removes the two nested for loops, it requires creating an intermediate array M that is n times the size of your original array V. Python for loops are not cheap, but memory access is not free either:

import numpy as np

n = 10
p = 20000
V = np.random.rand(n, p)
F = np.random.rand(n, n)

def op_code(V, F):
    n, p = V.shape
    A = np.zeros(p)
    for i in xrange(n):
        for j in xrange(i+1):
            A += F[i,j] * V[i,:] * V[j,:]
    return A

def isaac_code(V, F):
    n, p = V.shape
    F = F.copy()
    F[np.triu_indices(n, 1)] = 0
    M = (V.reshape(n, 1, p) * V.reshape(1, n, p)) * F.reshape(n, n, 1)
    return M.sum((0, 1))
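The size of the intermediate array can be checked directly. This is a rough sketch that uses the same n and p as above and assumes NumPy's default float64 dtype; it only builds the broadcast product, which already has the full (n, n, p) shape.

```python
import numpy as np

n, p = 10, 20000
V = np.random.rand(n, p)

# The broadcast product has shape (n, n, p): n times as many elements as V.
M = V.reshape(n, 1, p) * V.reshape(1, n, p)

print(V.nbytes / 1e6, "MB for V")
print(M.nbytes / 1e6, "MB for M")
print(M.nbytes // V.nbytes, "times larger")
```

The loop version, by contrast, only ever holds temporaries of length p.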

If you now take both for a test drive:

In [20]: np.allclose(isaac_code(V, F), op_code(V, F))
Out[20]: True

In [21]: %timeit op_code(V, F)
100 loops, best of 3: 3.18 ms per loop

In [22]: %timeit isaac_code(V, F)
10 loops, best of 3: 24.3 ms per loop

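So removing the for loops makes the code roughly 8x slower. Not a very good thing... At this point, you should seriously consider whether a function that takes about 3 ms to evaluate needs any further optimization. In case it does, a small improvement can be had by using np.einsum: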

def einsum_code(V, F):
    n, p = V.shape
    F = F.copy()
    F[np.triu_indices(n, 1)] = 0
    return np.einsum('ij,ik,jk->k', F, V, V)

Checking and timing it:

In [23]: np.allclose(einsum_code(V, F), op_code(V, F))
Out[23]: True

In [24]: %timeit einsum_code(V, F)
100 loops, best of 3: 2.53 ms per loop

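So that is roughly a 20% speed-up, bought with code that may well be less readable than your for loops. I would say it is not worth it...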
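For readers unfamiliar with the subscript string 'ij,ik,jk->k', the sketch below spells it out as explicit loops. The helper name `einsum_as_loops`, the seed, and the small sizes are assumptions for illustration only; the triangular restriction comes from zeroing the upper part of F beforehand, not from einsum itself.

```python
import numpy as np

def einsum_as_loops(F, V):
    # Explicit form of np.einsum('ij,ik,jk->k', F, V, V):
    # out[k] = sum over i and j of F[i, j] * V[i, k] * V[j, k]
    n, p = V.shape
    out = np.zeros(p)
    for i in range(n):
        for j in range(n):
            out += F[i, j] * V[i, :] * V[j, :]
    return out

rng = np.random.default_rng(0)   # arbitrary seed
V = rng.random((3, 7))           # small sizes, for illustration only
F = np.tril(rng.random((3, 3)))  # upper triangle zeroed, as in einsum_code
print(np.allclose(np.einsum('ij,ik,jk->k', F, V, V), einsum_as_loops(F, V)))
```

The indices i and j appear in the inputs but not in the output, so einsum sums over them; k appears in the output, so it is kept as the axis of the result.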
