plantnet / gbif-dl

GBIF classification dataloaders
https://plantnet.github.io/gbif-dl/
MIT License
40 stars 6 forks source link

Shuffle responses on the server #6

Open faroit opened 3 years ago

faroit commented 3 years ago

When drawing a subset of occurances from the GBIF api, they might be sorted by some internal database query. Given that a subset should taken from random distribution of samples for best performance and generalization, it would be idea if samples could be drawn randomly from the API.

currently this is not supported and we might want to raise this issue on either of the following trackers

faroit commented 3 years ago

@timrobertson100 would you be able to share some insights on this and what would be the best place to address this?

timrobertson100 commented 3 years ago

Thanks for pinging me - I've created the mirror issue to get discussions going on our side.

waddington commented 3 years ago

Any progress on this? I feel that this is a very important feature for creating a dataset.