modAL-python / modAL

A modular active learning framework for Python
https://modAL-python.github.io/
MIT License
2.23k stars 324 forks source link

Add explanation of ‘random_tie_break’ to documentation #130

Open fritshermans opened 3 years ago

fritshermans commented 3 years ago

It’s not clear to me what the argument ‘random_tie_break’ exactly does. Could you please add a few lines to the documentation or examples about it?

damienlancry commented 3 years ago

in uncertainty sampling, we rank the data points by informativeness as measured by an acquisition function such as entropy or margin. In case there are several data points that are ranked equally, which ones should you choose? you can either take the ones returned by uncertainty.argsort()[:n] or you can first shuffle the data and then use argsort. that is what random tie break does here.