Slicing an n-dimensional array using a list of indices

Question

Slicing an n-dimensional array using a list of indices

Let's say I have a three-dimensional numpy array:

np.random.seed(1145) A = np.random.random((5,5,5))

and I have two lists of indexes corresponding to the 2nd and 3rd dimensions:

 second = [1,2] third = [3,4]

and I want to select the elements in the numpy array matching

 A[:][second][third]

therefore, the shape of the truncated array will be (5,2,2) and

 A[:][second][third].flatten()

will be equivalent to:

 In [226]: for i in range(5): for j in second: for k in third: print A[i][j][k] 0.556091074129 0.622016249651 0.622530505868 0.914954716368 0.729005532319 0.253214472335 0.892869371179 0.98279375528 0.814240066639 0.986060321906 0.829987410941 0.776715489939 0.404772469431 0.204696635072 0.190891168574 0.869554447412 0.364076117846 0.04760811817 0.440210532601 0.981601369658

Is there a way to trim a numpy array this way? While I'm trying A[:][second][third] to get IndexError: index 3 is out of bounds for axis 0 with size 2 , because [:] for the first dimension seems to be ignored.

+7

python arrays numpy

pbreach Aug 9 '14 at 21:04

source share

3 answers

One way is to use np.ix_ :

 >>> out = A[np.ix_(range(A.shape[0]),second, third)] >>> out.shape (5, 2, 2) >>> manual = [A[i,j,k] for i in range(5) for j in second for k in third] >>> (out.ravel() == manual).all() True

Downside is that you must explicitly specify the missing coordinate ranges, but you can wrap this in a function.

+3

DSM Aug 9 '14 at 21:25

source share

I think there are three problems with your approach:

Both second and third should be slices
Since the 'to' index is exclusive, it must go from 1 to 3 and from 3 to 5
Instead of A[:][second][third] you should use A[:,second,third]

Try the following:

 >>> np.random.seed(1145) >>> A = np.random.random((5,5,5)) >>> second = slice(1,3) >>> third = slice(3,5) >>> A[:,second,third].shape (5, 2, 2) >>> A[:,second,third].flatten() array([ 0.43285482, 0.80820122, 0.64878266, 0.62689481, 0.01298507, 0.42112921, 0.23104051, 0.34601169, 0.24838564, 0.66162209, 0.96115751, 0.07338851, 0.33109539, 0.55168356, 0.33925748, 0.2353348 , 0.91254398, 0.44692211, 0.60975602, 0.64610556])

+1

tobias_k Aug 9 '14 at 21:41

source share

Warren weckesser · Accepted Answer · 2014-08-09T21:45:47+0000

Numpy uses multiple indexing, so instead of A[1][2][3] you can - and should - use A[1,2,3] .

Then you might think that you can do A[:, second, third] , but the numpy indices are passed, and the second and third broadcasts (two one-dimensional sequences) end as the numpy-equivalent zip , so the result has the form (5, 2) .

What you really want is to index, in fact, the second and third external product. You can do this using translation by doing one of them, say second into a two-dimensional array with the form (2.1). Then the form that occurs as a result of the translation of second and third is (2,2) .

For example:

 In [8]: import numpy as np In [9]: a = np.arange(125).reshape(5,5,5) In [10]: second = [1,2] In [11]: third = [3,4] In [12]: s = a[:, np.array(second).reshape(-1,1), third] In [13]: s.shape Out[13]: (5, 2, 2)

Note that in this particular example, the values in second and third are sequential. If this is typical, you can simply use slices:

 In [14]: s2 = a[:, 1:3, 3:5] In [15]: s2.shape Out[15]: (5, 2, 2) In [16]: np.all(s == s2) Out[16]: True

There are a couple of very important differences in these two methods.

The first method will also work with indexes that are not equivalent to slices. For example, it will work if second = [0, 2, 3] . (Sometimes you will see this indexing style called fancy indexing.)
In the first method (using broadcast and "fantastic indexing"), the data is a copy of the original array. In the second method (using only slices), the s2 array is a representation in the same memory block as a . Changes in place in one will change them both.

Slicing an n-dimensional array using a list of indices

More articles: