Feature Extraction: Dense SURF, PCA Whitening, Advanced Fisher and GMM Vectors

Question

Feature Extraction: Dense SURF, PCA Whitening, Advanced Fisher and GMM Vectors

I am trying to implement the classifier discussed in this document . I implemented everything except function allocation. In section 5.1, the author writes:

“For each superpixel, two types of objects are extracted: dense surfers, which are converted using signed quadrature values and Lab color values. In our experiments, it was also useful to extract objects around the superpixels precisely within its bounding box to include more context. Both values for surfing and colors are encoded using the improved Fisher vectors implemented in VlFeat and gmm with 64 modes . We perform pca-whitening on both channels of functions . At the end, two vectors are coded traits are concatenated, creating a dense vector with 8576 values . "

A lot is going on here, and I got confused in what order I should follow the steps, and also in which part of the data set.

Here is my interpretation in pseudo python:

def getFeatures(images): surfs_arr = [] colors_arr = [] for image in images: superpixels = findSuperpixels for superpixel in superpixels: box = boundingBox(superpixel) surfs = findDenseSURFs(box) colors = findColorValues(box) surfs_arr.append(surfs) colors_arr.append(colors) surfs_sample = (randomly choose X samples from surfs_arr) colors_sample = (randomly choose Y samples from colors_arr) #or histogram? # gmm has covariances, means properties gmm_surf = GMM(modes=64, surfs_sample) gmm_color = GMM(modes=64, colors_sample) surfs_as_fisher_vectors = IFV(gmm_surf, surfs_arr) colors_as_fisher_vectors = IFV(gmm_color, color_arr) pca_surfs = PCA(ifv_surfs, whiten, n_components = 64) pca_colors = PCA(ifv_colors, whiten, n_components = 64 features = concatenate((pca_surfs, pca_colors), axis=1) return features

my questions:

I am. should whitening the ATP before creating a GMM? (for example, as an example )

II. Do I have to remove the set of surfs_sample and colors_sample from surfs_arr and colors_arr respectively before they are encoded as Fisher Vectors?

III. As for the description of color values, is it better to leave them as they are or create a histogram?

IV. The author claims that he uses Dense SURF, but does not mention how dense. Do you recommend a specific starting point? 4x4, 16x16? Do I really not understand this?

v. Any idea when the author comes up with a "dense vector with 8576 values"? To get a consistent number of functions with different sizes of superpixels, it seems to me that it should be

1) using a histogram to represent color values, and either

2a) resizing each superpixel or

2b), changing the density of its SURF network.

I work in python w / numpy, opencv, scikit-learn, mahotas and vector ported from VLFeat.

Thanks.

+5

python image-processing opencv machine-learning computer-vision

sawyer Jul 21 '15 at 23:01

source share

No one has answered this question yet.

See related questions:

31