EpistasisLab / pmlb

PMLB: A large, curated repository of benchmark datasets for evaluating supervised machine learning algorithms.
https://epistasislab.github.io/pmlb/
MIT License
805 stars 135 forks source link

Only fetch needed datasets from git LFS in github actions workflow #147

Closed lacava closed 3 years ago

lacava commented 3 years ago

Currently we fetch all the LFS files using the lfs: true option here which is described here.

It would be better to only fetch datasets that need to be updated, especially since disk space and bandwidth are limited in github actions.