CfSOtago / GREENGridData

Code to process, document and analyse data from the Renewable Energy and the Smart Grid (GREEN Grid) project.
https://cfsotago.github.io/GREENGridData/
GNU General Public License v3.0

Lots of derived files and images are in history #16

Closed by dme26 6 years ago

dme26 commented 6 years ago

Running git clone gives a folder of around 234 MB, although the files present total only 10 MB. This is due to images from checkPlots/ and various derived HTML files that are still held in the history. It should be straightforward to rewrite that part of the git history, but that doesn't mean it's worth doing. I'm happy to look into it, but if we do, we should rewrite before publication, since the commit IDs will change.
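For anyone wanting to see where the space goes, here is a minimal sketch. It builds a throwaway repository (the file names and sizes are made up for illustration and are not taken from this repo), deletes a derived image in a later commit, and then lists the largest blobs still reachable from history. In a real clone only the `git rev-list ... | git cat-file ...` pipeline and `git count-objects` would be needed.

```shell
#!/bin/sh
set -eu

# Throwaway repository standing in for a real clone (hypothetical contents).
demo=$(mktemp -d)
cd "$demo"
git init -q .
git config user.email "demo@example.com"
git config user.name "Demo"

# Commit a derived image under checkPlots/, then delete it in a later commit.
mkdir checkPlots
head -c 200000 /dev/urandom > checkPlots/plot1.png
echo "analysis code" > process.R
git add .
git commit -q -m "Add code and derived plot"
git rm -q -r checkPlots
git commit -q -m "Remove derived plot"

# The working tree no longer contains checkPlots/ ...
ls

# ... but the blob is still reachable from history. List the blobs in
# history with their size in bytes, largest first.
largest=$(git rev-list --objects --all |
  git cat-file --batch-check='%(objecttype) %(objectsize) %(rest)' |
  awk '$1 == "blob" { print $2, $3 }' |
  sort -rn | head -n 5)
echo "$largest"

# Summary of the object store size (loose objects and packs).
git count-objects -vH
```

The deleted image still tops the list even though it is gone from the working tree, which is the same effect that makes the clone so much larger than the files in it.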

dataknut commented 6 years ago

I know. I need to delete checkPlots/ from the history somehow.

dataknut commented 6 years ago

https://rtyley.github.io/bfg-repo-cleaner/

hmm

dme26 commented 6 years ago

Looks good! (And I like Scala, too…) The advantage here is that you have very specific directory paths to lop off the git trees (e.g., checkPlots/), which I think the cleaning tool would directly assist with. I was thinking of just using the git built-in commands, but I think your plan to use the cleaner would be better. I'd think it best to fork the repository before rewriting history, just so that the existing 250MB repository is still available should we need to go back to any part of it.
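As a rough sketch of the built-in route, the script below removes a directory from every commit with `git filter-branch`, after first taking an untouched mirror as the backup. It runs against a throwaway repository with made-up contents, not this one, and it is not necessarily what was eventually done here. The BFG equivalent is shown only as a comment, assuming the jar is saved as `bfg.jar` and the mirror clone is called `my-repo.git` (both names hypothetical).

```shell
#!/bin/sh
set -eu

# Throwaway repository with a derived image in its history (hypothetical).
demo=$(mktemp -d)
cd "$demo"
git init -q original
cd original
git config user.email "demo@example.com"
git config user.name "Demo"
mkdir checkPlots
head -c 200000 /dev/urandom > checkPlots/plot1.png
echo "analysis code" > process.R
git add .
git commit -q -m "Add code and derived plot"
git rm -q -r checkPlots
git commit -q -m "Remove derived plot"
cd ..

# Keep an untouched copy before rewriting anything.
git clone -q --mirror original backup.git

# Rewrite every commit on every ref, dropping checkPlots/ from the index.
# Commits that end up empty are pruned. Commit IDs change from here on.
cd original
export FILTER_BRANCH_SQUELCH_WARNING=1
git filter-branch -f --prune-empty \
  --index-filter 'git rm -r -q --cached --ignore-unmatch checkPlots' \
  -- --all

# With BFG the rewrite step would instead look roughly like:
#   java -jar bfg.jar --delete-folders checkPlots my-repo.git

# Drop the refs/original backups and the reflog, then prune unreachable objects.
git for-each-ref --format='%(refname)' refs/original/ |
  while read -r ref; do git update-ref -d "$ref"; done
git reflog expire --expire=now --all
git gc -q --prune=now

# Count paths mentioning checkPlots in the rewritten and backup histories.
remaining=$(git rev-list --objects --all | grep -c checkPlots || true)
in_backup=$(git -C ../backup.git rev-list --objects --all | grep -c checkPlots || true)
echo "rewritten: $remaining, backup: $in_backup"
```

Recent versions of git print a warning about `filter-branch` and point to `git filter-repo` instead; the environment variable above only silences that warning. Whichever tool is used, existing clones and forks still carry the old commits, so the rewritten history has to be force-pushed and everyone needs to re-clone.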

dataknut commented 6 years ago

fixed