researchart / fse16

info about artifacts from fse16
7 stars 3 forks source link

silva_whyWeRefactorDataset #20

Open danilofes opened 8 years ago

ctreude commented 8 years ago

This is a very insightful, useful and usable dataset.

In addition to the corresponding paper being a really nice contribution to the refactoring research community, the authors have done a great job putting together this dataset: The website is clearly structured, the visual presentation is appealing, data is available in a navigable format and as JSON, and all the relevant data appears to be there. I particularly appreciate the amount of detail regarding the thematic analysis which shows themes before and after consensus. The level of detail provided here works great towards increasing a user's confidence in the process of determining the themes.

As the authors point out, similar work is pretty limited and the amount of detail available in this dataset along with the ease of use make me optimistic that this dataset will be used by other researchers.

I'm happy to recommend the inclusion of this dataset in the online repository, the social media campaign and the award session.

hongyujohn commented 8 years ago

The authors have done a good job in collecting and analyzing the refactor dataset. The dataset is detailed and well-structured. The dataset not only reveals the type of refactoring in programming practices but also includes motivations behind each refactoring. To conduct thematic analysis the authors collected feedback from programmers. The dataset is easy to use with clear web items and JSON files. In summary, collecting such a dataset is a good job, which could help researchers and practitioners better understand the refactoring activity. I hope the authors can continue maintaining it in the future to make it a better resource for refactoring research.

I recommend acceptance.

yasutaka-kamei commented 8 years ago

This paper provides the dataset that investigates the motivations behind specific refactoring operations conducted by developers. The dataset includes (1) a list of 539 commits with 1,411 refactoring identified by the RefactoringMiner and confirmed with manual inspection and (2) 222 commits with 463 refactoring and their motivations.

According to the CFP, artifact papers are expected to include:

timm commented 8 years ago

Note these labels are still "under discussion" and are still subject to change prior to the final notifications Friday.

danilofes commented 8 years ago

First, we would like to thank all reviews for their comments. Second, we will address the following suggestion by Yasutaka Kamei:

Future studies may want to update this dataset because they obtain more commits with their motivation by conducting additional developer survey. Therefore, if the web page (http://aserg-ufmg.github.io/why-we-refactor) includes information about "how to contribute to improve the dataset" (e.g., how to make a pull request), it could be helpful.

This is a very interesting suggestion. We will include instructions in the website and in the camera-ready version of paper on how to contribute to improve the dataset via GitHub pull requests.