smartshark / smartshark.github.io

Project Homepage
https://smartshark.github.io
Apache License 2.0

Duplicates in the refactoring collection? #25

Closed shehan closed 3 years ago

shehan commented 3 years ago

I understand that the tools rMiner and refDiff were used to mine refactoring operations in the code. Since the types of refactoring operations these two tools detect overlap, I'd like to know whether I need to check for duplicates when querying the 'refactoring' collection.

For example, in File A, a rename method operation was applied to method ABC(). Are there duplicate entries in the 'refactoring' collection - one for rMiner and one for refDiff? Or is this refactoring instance only captured once in the 'refactoring' collection?

Thanks!

shehan commented 3 years ago

Exploring the data, I'm encountering some instances of duplication. Examples:

atrautsch commented 3 years ago

There is a detection_tool field for each document in the refactoring collection which indicates which refactoring detection tool reported the refactoring.
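
To illustrate how the detection_tool field could be used to avoid counting the same refactoring twice, here is a minimal sketch in plain Python. It works on a small in-memory list so it runs without a database. Only the detection_tool field name comes from this thread; the other field names (commit_id, type), the sample values, and the exact strings stored for the tools ("rMiner", "refDiff") are assumptions made for illustration and should be checked against the actual data.

```python
from collections import Counter

# Hypothetical sample documents. Only `detection_tool` is confirmed by the
# thread; the other field names and all values are made up for illustration.
docs = [
    {"commit_id": "c1", "type": "rename_method", "detection_tool": "rMiner"},
    {"commit_id": "c1", "type": "rename_method", "detection_tool": "refDiff"},
    {"commit_id": "c2", "type": "extract_method", "detection_tool": "rMiner"},
]


def count_by_tool(documents):
    """Count how many refactoring documents each detection tool reported."""
    return Counter(d["detection_tool"] for d in documents)


def only_from_tool(documents, tool):
    """Keep only the documents reported by a single detection tool."""
    return [d for d in documents if d["detection_tool"] == tool]


per_tool = count_by_tool(docs)
rminer_only = only_from_tool(docs, "rMiner")

print(per_tool)
print(len(rminer_only))
```

Restricting the analysis to one tool sidesteps the overlap entirely, at the cost of ignoring refactorings that only the other tool detects. Against a live MongoDB instance, the same restriction would presumably be expressed as a query filter such as {"detection_tool": "rMiner"} on the refactoring collection, again assuming that is how the tool name is stored.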