I want to perform hierarchical clustering with the fastcluster module. When I am the default distance metric (Euclidean), it works fine:
import fastcluster import scipy.cluster.hierarchy distance = spatial.distance.pdist(data) linkage = fastcluster.linkage(distance,method="complete")
But the problem is that I want to use the "cosine similarity" as a distance metric:
distance = spatial.distance.pdist(data,'cosine') linkage = fastcluster.linkage(distance,method="complete")
Output:
Traceback (most recent call last): File "C:\djcode\mysite\mysite\scipytest.py", line 52, in <module> linkage = fastcluster.linkage(distance,method="complete") File "C:\Python33\lib\site-packages\fastcluster.py", line 245, in linkage linkage_wrap(N, X, Z, mthidx[method]) FloatingPointError: NaN dissimilarity value.
python scipy hierarchical-clustering
user1680859
source share