rtyley / bfg-repo-cleaner

Removes large or troublesome blobs like git-filter-branch does, but faster. And written in Scala
https://rtyley.github.io/bfg-repo-cleaner/
GNU General Public License v3.0
10.83k stars 535 forks source link

--strip-blobs-bigger-than does not search large files exhaustively #480

Open akp6031 opened 1 year ago

akp6031 commented 1 year ago

Hi host and community,

This is presumably rather a question than an issue that running "bfg --strip-blobs-bigger-than 5M repo.git" does not successfully filter out .pack files larger than 5mb.

I am preparing a submission to a journal where a provided validator enforces no file in .git history exceeds 5mb. However, even after running bfg and clearing caches in git, and seeing subsequent messages saying "bfg has already run here!", I still receive a report for a file named ".git/objects/pack/pack-xxxxxxxx.pack" by executing the journal validator. This can be removed with the traditional git filter-branch command, but I don't want to risk any side effects without knowing anything about the reason.

Could you enlighten possible situations I have to check? Thank you!