mlr-org / mlr3pipelines

Dataflow Programming for Machine Learning in R
https://mlr3pipelines.mlr-org.com/
GNU Lesser General Public License v3.0

Support for metric MDS #757

Open m-muecke opened 6 months ago

m-muecke commented 6 months ago

Implementation: stats::cmdscale

mb706 commented 1 month ago

@m-muecke do you know if there is a way to "predict" for new data, i.e. to apply the same transformation to new points that was applied to the training data? This should be independent of the size of the prediction data, i.e. when "predicting" for a dataset, the result should be the same as the rbind() of each row in that dataset predicted independently.
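The row-independence requirement above can be written down as a small check. This is only a sketch in base R: `is_row_independent()`, `predict_fixed()` and `predict_batch()` are hypothetical names made up for illustration, not part of mlr3pipelines.

```r
# Hypothetical helper: TRUE if predicting the whole batch gives the same
# result as rbind()-ing the predictions of each row on its own.
is_row_independent <- function(predict_fun, newdata, tol = 1e-8) {
  batch <- predict_fun(newdata)
  by_row <- do.call(rbind, lapply(seq_len(nrow(newdata)), function(i) {
    predict_fun(newdata[i, , drop = FALSE])
  }))
  max(abs(batch - by_row)) < tol
}

set.seed(1)
x_train <- matrix(rnorm(40), nrow = 10, ncol = 4)
x_new <- matrix(rnorm(20), nrow = 5, ncol = 4)

# State learned from the training data only.
train_center <- colMeans(x_train)

# Uses only training state: row independent.
predict_fixed <- function(newdata) sweep(newdata, 2, train_center)

# Re-centers on whatever batch it is given: depends on the batch.
predict_batch <- function(newdata) scale(newdata, center = TRUE, scale = FALSE)

ok_fixed <- is_row_independent(predict_fixed, x_new)
ok_batch <- is_row_independent(predict_batch, x_new)
```

Any transformation that recomputes its state from the prediction batch, as re-running an embedding on the new data would, fails this check.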

m-muecke commented 2 weeks ago

I believe there are only efficient solutions when Euclidean distances are used, but in that case metric MDS is basically equivalent to PCA, which is already implemented.
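A minimal sketch of that equivalence, assuming plain Euclidean distances and centered, unscaled data (the simulated data and variable names are illustrative only): the `stats::cmdscale` embedding matches the `stats::prcomp` scores up to the sign of each axis, and `predict()` on the PCA fit then gives a row-independent projection for new points.

```r
set.seed(1)
x_train <- matrix(rnorm(50 * 4), nrow = 50, ncol = 4)
x_new <- matrix(rnorm(5 * 4), nrow = 5, ncol = 4)
k <- 2

# Classical (metric) MDS on Euclidean distances between training rows.
mds <- stats::cmdscale(stats::dist(x_train), k = k)

# PCA on the same data, centered but not scaled.
pca <- stats::prcomp(x_train, center = TRUE, scale. = FALSE)
scores <- pca$x[, seq_len(k), drop = FALSE]

# Eigenvector signs are arbitrary, so compare absolute values.
max_diff <- max(abs(abs(mds) - abs(scores)))

# Out-of-sample projection through the PCA fit.
batch <- predict(pca, x_new)[, seq_len(k), drop = FALSE]
by_row <- do.call(rbind, lapply(seq_len(nrow(x_new)), function(i) {
  predict(pca, x_new[i, , drop = FALSE])[, seq_len(k), drop = FALSE]
}))
row_diff <- max(abs(batch - by_row))
```

Both `max_diff` and `row_diff` should be zero up to floating-point error. For non-Euclidean dissimilarities this shortcut does not apply, since there is no linear map from the original features to the embedding.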

References:

tidymodels uses the implementation from the dimRed package, which is only implemented for Euclidean distance: https://github.com/gdkrmr/dimRed/blob/master/R/mds.R#L88

sklearn doesn't seem to have a predict method for MDS: https://scikit-learn.org/stable/modules/generated/sklearn.manifold.MDS.html