refactoring-ai / Data-Collection

Collect refactorings with metrics from java source code.
MIT License
6 stars 1 forks source link

Duplicate Metrices in Database #2

Open jan-gerling opened 4 years ago

jan-gerling commented 4 years ago

We collect lots of duplicates for all metrices (Process, Class, Method, Variable and Field), because we insert them into the database again and again even though did not change. I suggest to only insert new metrices into the db, if they are unique for the current commit. This would reduce database size and probably increase the speed of the data-collection, because we would not recompute these metrices.

Statistics

image image image image image image