Closed emthompson-usgs closed 2 years ago
I'm guessing this would require all of users to reclone the repository again after cleaing up the history?
An alternative option to BFG Repo Cleaner, is https://github.com/newren/git-filter-repo which is more actively updated and offers a comparison of these types of tools.
If the releases are preserved and include all of the test data, I am less concerned about removing old test data from the repo.
Additionally, a long-term approach to dealing with large test datasets is to move them to a place where they can be downloaded, such as release assets.
I like the idea of moving the test data somewhere else, I just don't know how to do that.
I realize this goes against the idea of git preserving the entire repo history, but given that we recently cleaned up a bunch of large unused/redundant test data, should we clear some of those large files that out of the git history? I only just learned about BFG Repo Cleaner and I have not used it before. But it seems like it might be worth doing.