Closed shehan closed 3 years ago
Exploring the data, I'm encountering some instances of duplication. Examples:
There is a detection_tool field for each document in the refactoring collection which indicates which refactoring detection tool reported the refactoring.
I understand that the tools rMiner and refDiff were utilized to mine refactoring operations in the code. Since these two tools overlap in the type of refactoring operations they detect, I need to know if I need to perform duplicate checks when querying the 'refactoring' collection.
For example, in File A, a rename method operation was applied to method ABC(). Are there duplicate entries in the 'refactoring' collection - one for rMiner and one for refDiff? Or is this refactoring instance only captured once in the 'refactoring' collection?
Thanks!