Closed PhilippKaniuth closed 2 years ago
Further, it might be confusing since distance
determines both, which (dis-)similarity is used within a single feature (i.e. when applying reweighting) and globally (i.e. for the classical RSA score that is returned for quick comparison).
Solution could be to introduce yet another parameter and rename the
distance
parameter. Proposals:1.
single_dim_distance
(denotes the (dis-)similarity measure used within a feature) and multi_dim_distance
(denotes the global (dis-)similarity for classical RSA).2. reweighted_distance
und classical_distance
.
This would also allow to compute, for the predicting system, different (dis-)similarity measures for reweighed and classical RSA.
Maybe it should be noted that distance
also refers to similarity functions. Maybe there is a better naming for distance/similarity?
Yes, what's a good hypernym though for "(dis-)similarity"? I was thinking of "measures". With regard to #32 but also #31, I think I will:
distance
to measures
.measures
would be a list of strings with 3 mandatory elements, where the first indicates the "reweighted_distance" (aka "single_dim_distance"); the second indicates the "multi_dim_distance" (aka "classical_distance"); the third indicates the (dis-)similarity of the target matrix (which I need for #30).What do you think? I really don't want to overload the function with too many (new) parameters.
I am also pondering on whether there should be the possibility at all to indicate different measures for single_dim and mulit_dim. The fact that frrsa
outputs classical scores is really just a convenience to be able to quickly compare reweighted and classical scores. However, there is a multitude of measures one could want for the classical case and many are not (and will not) be supported by frrsa
. It migh well be that some user wants a measure for the multi_dim case that will never be supported by frrsa
because there are better suited alternatives to compute that - in that case one would throw away the classical score frrsa
outputs anyway.
Currently, the name of the parameter
distance
is a bit of a misnomer as it can be used to either have squared Euclidean distance being computed within each feature (i.e. a dissimilarity) or to compute the feature-specific dot-product (i.e. a similarity measure, so a distance). What's a good hypernym though for "(dis-)similarity"?