Definition of points from a set of pairwise distances

Question

Definition of points from a set of pairwise distances

Given the matrix of distances between points, is there an algorithm for determining the set of n-dimensional points having these distances? (or at least minimize error)

sort of like an n-dimensional version of a trunk problem.

The best I can come up with is using multidimensional scaling.

+6

algorithm matrix geometry

mat kelcey Oct 08 '08 at 10:43

source share

4 answers

You can cheat and use an iterative numerical method. First, take all the points in some “random” positions, and then scroll through them, moving them apart in proportion to the required distance. This will prefer some points, but taking the average number of steps before applying them, applying the average will fix this problem. This is an O (n²) algorithm, but very simple to implement and understand. In Example 2 below, the error is <10%, although this may not behave as well if the indicated distances are unrealistic.

C ++ example:

#include <conio.h> #include <math.h> #include <stdio.h> #include <stdlib.h> #define DAMPING_FACTOR 0.99f class point { public: float x; float y; public: point() : x(0), y(0) {} }; // symmetric matrix with distances float matrix[5][5] = { { 0.0f, 4.5f, 1.5f, 2.0f, 4.0f }, { 4.5f, 0.0f, 4.0f, 3.0f, 3.5f }, { 1.5f, 4.0f, 0.0f, 1.0f, 5.0f }, { 2.0f, 3.0f, 1.0f, 0.0f, 4.5f }, { 4.0f, 3.5f, 5.0f, 4.5f, 0.0f } }; int main(int argc, char** argv) { point p[5]; for(unsigned int i = 0; i < 5; ++i) { p[i].x = (float)(rand()%100)*0.1f; p[i].y = (float)(rand()%100)*0.1f; } // do 1000 iterations float dx = 0.0f, dy = 0.0f, d = 0.0f; float xmoves[5], ymoves[5]; for(unsigned int c = 0; c < 1000; ++c) { for(unsigned int i = 0; i < 5; ++i) xmoves[i] = ymoves[i] = 0.0f; // iterate across each point x each point to work out the results of all of the constraints in the matrix // collect moves together which are slightly less than enough (DAMPING_FACTOR) to correct half the distance between each pair of points for(unsigned int i = 0; i < 5; ++i) for(unsigned int j = 0; j < 5; ++j) { if(i==j) continue; dx = p[i].x - p[j].x; dy = p[i].y - p[j].y; d = sqrt(dx*dx + dy*dy); dx /= d; dy /= d; d = (d - matrix[i][j])*DAMPING_FACTOR*0.5f*0.2f; xmoves[i] -= d*dx; ymoves[i] -= d*dy; xmoves[j] += d*dx; ymoves[j] += d*dy; } // apply all at once for(unsigned int i = 0; i < 5; ++i) { p[i].x += xmoves[i]; p[i].y += ymoves[i]; } } // output results printf("Result:\r\n"); for(unsigned int i = 0; i < 5; ++i) { for(unsigned int j = 0; j < 5; ++j) { dx = p[i].x - p[j].x; dy = p[i].y - p[j].y; printf("%f ", sqrt(dx*dx + dy*dy)); } printf("\r\n"); } printf("\r\nDesired:\r\n"); for(unsigned int i = 0; i < 5; ++i) { for(unsigned int j = 0; j < 5; ++j) { printf("%f ", matrix[i][j]); } printf("\r\n"); } printf("Absolute difference:\r\n"); for(unsigned int i = 0; i < 5; ++i) { for(unsigned int j = 0; j < 5; ++j) { dx = p[i].x - p[j].x; dy = p[i].y - p[j].y; printf("%f ", abs(sqrt(dx*dx + dy*dy) - matrix[i][j])); } printf("\r\n"); } printf("Press any key to continue..."); while(!_kbhit()); return 0; }

+3

jheriko Feb 01 '09 at 4:46

source share

There is an algorithm for this in collective intelligence programming , p. 49 "View data in two dimensions", which can be adapted for n-measures.

Hey is multi-dimensional scaling - so I think you're on the right track.

+2

Lou franco Oct 08 '08 at 11:48

source share

I cannot edit the original because I do not have enough rep, but I tried to repeat the problem here.

The OP has an input NxN distance matrix. He wants to create an output array, the size of N, N-dimensional coordinates representing the points where the distance between each point is stored in the input matrix.

Note that this is not solvable in the general case:

Suppose I have such a matrix

  ABCA x 1 2  
 B x 0  
 C x

A is 1 unit of distance (for example, 1 meter) from B, and A is one meter from C. But B and C are in the same place.

In this particular case, the minimum sum of errors is 1 meter, and there are an infinite number of solutions that achieve this result.

+1

Airsource ltd Oct 08 '08 at 11:47

source share

Vebjorn ljosa · Accepted Answer · 2008-10-09T16:31:25+0000

You are on the right track with multi-dimensional scaling (MDS), but MDS is impractical for large datasets, as its temporal complexity is quadratic in the number of points. You might want to take a look at FastMap, which has linear time complexity and is better suited for indexing. Cm:

Christos Falutos and King-ip Lin: "FastMap: Fast Algorithm for Indexing, Data Processing and Visualization of Traditional and Multimedia Datasets, in Proc. SIGMOD , 1995, doi: 10.1145 / 223784.223812

Definition of points from a set of pairwise distances

More articles: