watson-developer-cloud / visual-recognition-coreml

Classify images offline using Watson Visual Recognition and Core ML
Apache License 2.0
490 stars 77 forks source link

Repo’s git history is too large #37

Open bourdakos1 opened 6 years ago

bourdakos1 commented 6 years ago

@stevemart @devinaconley In the past .mlmodel files and .zip files full of training data were stored and changed in this repo. This has caused an enormous growth in the git history and has left us with cloning times of upwards of 10 minutes.

I propose we purge the history of these files.

https://help.github.com/articles/removing-sensitive-data-from-a-repository/

stevemar commented 6 years ago

Yes, here are the offending commits/files/sizes:

smartinelli-mac viz2 $ ./finder.sh 
All sizes are in kB. The pack column is the size of the object, compressed, inside the pack file.
size   pack   SHA                                       location
33722  33730  431b1e6a2da6bdc322fc68e1015129809be814a7  Training  Images/vga_male.zip
27383  27389  ac59e7014e3890bba89d4ba344b37d08d3ee85ac  Training  Images/hdmi_male.zip
25571  25576  4c9c3a4d361e70cda7f264a14a773db28a4ff7e3  Training  Images/usb_male.zip
20679  20682  e525a02fae412d985b85bb4ef45c360210452fc9  Training  Images/thunderbolt_male.zip
17947  16737  ec14ed4a8d46b5fcafde97168ef081a3d19425db  Core ML Vision  Simple/watson_plants.mlmodel
16744  15610  43f001317ad1277c495f2061eeaa124df4bd577d  Core Ml Vision  Simple/MobileNet.mlmodel
13990  13082  e0129fc9d2efc6b10d01c78dbbab8b551b60923e  Core ML  Vision  Simple/watson_tools.mlmodel
11589  11591  5b6982e1a158c70a9ae88f5e8608f2e4a14ff770  Training  Images/usb_male.zip
9698   9699   af0c2e1cc2036d16b6a5cb2ec15213a4340e085f  Training  Images/hdmi_male.zip