It really depends on the application script you have. Very briefly, if you are dealing with data in which the actual difference in attribute values is important, go with Euclidean Distance. If you are looking for trend or shape similarities, move on to correlation. Also note that if you normalize the z-score in each object, Euclidean Distance behaves similarly to the Pearson correlation coefficient. Pearson is not sensitive to linear data transformations. There are other types of correlation coefficients that take into account only series of values that are insensitive to both linear and nonlinear transformations. Note that the usual use of correlation as a dissimilarity is 1 — a correlation that does not take into account all the rules for metric distance.
There are several studies that select a proximity criterion for a specific application, for example:
Pablo A. Jaskowiak, Ricardo JGB Campello, Ivan G. Costa Filho, “Approximations for Clustering Gene Subtraction Data: Validation Data and Comparative Analysis”, IEEE / ACM Transactions on Computational Biology and Bioinformatics, vol. 99, no. PrePrints, p. 1, 2013
John d
source share