Closed geek-yang closed 2 years ago
One more thing, spotted by @semvijverberg: the returned clustered values (`dataarray`) include `latitude` and `longitude` coordinates, which are actually the centers of each cluster. But this information is not explicitly shown to the user. We can add an attribute to the `dataarray` that mentions that these values are the cluster centers.
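A minimal sketch of what such an attribute could look like, using a small made-up `xarray.DataArray` in place of the real clustered output. The attribute name `cluster_coordinates_note`, its wording, and the toy values are assumptions for illustration, not what RGDR currently does:

```python
import numpy as np
import xarray as xr

# Hypothetical stand-in for the clustered output; the real RGDR result may differ.
clustered = xr.DataArray(
    np.zeros((2, 3)),
    dims=("cluster_labels", "anchor_year"),
    coords={
        "cluster_labels": [1, 2],
        "anchor_year": [2000, 2001, 2002],
        # One latitude/longitude pair per cluster (toy values).
        "latitude": ("cluster_labels", [10.5, -20.0]),
        "longitude": ("cluster_labels", [30.0, 120.25]),
    },
)

# Attach a note to the array so users can see what the coordinates mean.
# The attribute name is an assumption chosen for this example.
clustered.attrs["cluster_coordinates_note"] = (
    "latitude and longitude are the centers of each cluster"
)

print(clustered.attrs)
```

Because `attrs` shows up in the array's repr, the note would be visible whenever the user prints the result.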
Currently the RGDR module returns a `dataarray` with dimensions `[cluster_labels, anchor_year]` after calling `transform` (e.g. `rgdr.transform(precursor_field)`). In most of the popular machine learning packages, e.g. `scikit-learn`, the output of a dimensionality reduction method has the shape `[samples, features]` (e.g. PCA in sklearn), and the models also need their input to be organized in this way (e.g. gradient boosting machines in sklearn). It would be nice for the output of RGDR to be returned with the shape `[anchor_year, cluster_labels]`, which is compatible with sklearn models.
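Until that changes, the reordering can be done on the user side. Below is a rough sketch, assuming the output is an `xarray.DataArray` with exactly the two dimensions named above. The array `reduced` is a made-up stand-in for the result of `rgdr.transform(precursor_field)`:

```python
import numpy as np
import xarray as xr

# Hypothetical stand-in for the output of rgdr.transform(precursor_field),
# laid out as [cluster_labels, anchor_year].
reduced = xr.DataArray(
    np.arange(6.0).reshape(2, 3),
    dims=("cluster_labels", "anchor_year"),
    coords={"cluster_labels": [1, 2], "anchor_year": [2000, 2001, 2002]},
)

# Reorder so rows are samples (anchor years) and columns are features (clusters).
features = reduced.transpose("anchor_year", "cluster_labels")

# Plain NumPy array in the [samples, features] layout.
X = features.values

print(features.dims, X.shape)
```

An array laid out like `X` can be passed as the feature matrix to estimators that expect `[samples, features]` input.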