As of now, every data point contributes equally to the loss function and derived cluster updates.
Yet, in some use cases, it might be desirable to attach weights to data points.
This PR introduces sample_weights, a sequence of numeric values, as an optional parameter for KPrototypes' fit method as well as all downstream functions.
Some basic input validation as well as some testing are provided.
Coverage increased (+1.3%) to 97.908% when pulling 03e9ac68cbf923e8d53223def7e4f98fe542c802 on kklein:master into 370d64b1067331b413d641103a52bd4c636ac702 on nicodv:master.
As of now, every data point contributes equally to the loss function and derived cluster updates.
Yet, in some use cases, it might be desirable to attach weights to data points.
This PR introduces
sample_weights
, a sequence of numeric values, as an optional parameter forKPrototypes
'fit
method as well as all downstream functions.Some basic input validation as well as some testing are provided.