Closed lluisp1999 closed 9 months ago
Thanks for flagging this. We should alert the user that data will be deduplicated automatically for large n
.
Do you have a use case where you'd like keep duplicate points for large n
? We could potentially add that functionality if needed
The utility lays whenever we don't want to know the pareto front but rather seeing what points are not dominated. It has some great use-cases in many fields, particularly in GFlowNets in my case. I think it would be nice to have, or at least not to misslead the users. Thanks!
🐛 Bug
There is a bug in botorch.utils.multi_objective.pareto.is_non_dominated, where deduplicate is ignored whenever the size of the data is large enough.
To reproduce
Code snippet to reproduce
Stack trace/error message
Expected Behavior
When the data is large, the function is_non_dominated calls is_non_dominated_loop. The problem is that this function ignores deduplicate=False