predictas @EdChum suggested, can be used for invisible data. This method (and, moreover, the method transform) is useful when the k-tool is used to extract traits in semi-server training: you cluster a large set of samples, then use the nearest centroid / centroid distance as a function for subsequent supervised training problem. When using the result for forecasting, you get samples that were not visible using k-means.
source
share