Closed patryk-kowalski95 closed 2 years ago
The current function definition is as follows:

    """Euclidean distance dissimilarity function"""
    if np.isnan(a).any() or np.isnan(b).any():
        raise ValueError("Missing values detected in numerical columns.")
    return np.sum((a - b) ** 2, axis=1)
whereas the Euclidean distance requires a square root. It should be:

    """Euclidean distance dissimilarity function"""
    if np.isnan(a).any() or np.isnan(b).any():
        raise ValueError("Missing values detected in numerical columns.")
    return np.sqrt(np.sum((a - b) ** 2, axis=1))
This is a deliberate choice, see:
Thank you for clarifying
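For readers following along: the linked explanation is not reproduced in this thread, so the following is only an assumption about the usual reasoning. The square root is monotonic, so dropping it does not change which centroid is closest to a point. A minimal sketch, with hypothetical helper names and made-up data:

```python
import numpy as np


def squared_euclidean(a, b):
    """Squared Euclidean distance between each row of a and the point b."""
    return np.sum((a - b) ** 2, axis=1)


def euclidean(a, b):
    """True Euclidean distance: the square root of the squared form."""
    return np.sqrt(squared_euclidean(a, b))


# Hypothetical data: three candidate centroids and a single point.
centroids = np.array([[0.0, 0.0], [3.0, 4.0], [1.0, 1.0]])
point = np.array([1.0, 2.0])

sq = squared_euclidean(centroids, point)
d = euclidean(centroids, point)

# Both measures rank the centroids identically, so the nearest one is the same.
nearest_sq = int(np.argmin(sq))
nearest_d = int(np.argmin(d))
```

The distance values themselves differ, so any code that reports or thresholds on the actual distance would still need the square root.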