Open davidsjohnson opened 9 years ago
I'm looking to join a team for Project 2! Shoot me a message if you're looking for another person to help with your research problem.
Team: (Andrew H.) @Hoverbear (Brody H.) @BrodyHolden (Fraser D.) @fraserd
If anyone else would like to join, let us know.
Colin Knowles (@knowlesc) and Ryan McDonald (@ryanmcdonald)
Question: Does the activity on Stack Overflow, or social media sites such as Twitter, drive the number of contributors/size of a project or do social media mentions drive the contributors/size?
Datasets: We're thinking of looking at Angular, Bootstrap, and Rails, but we might modify the codebases later on. We plan to use gitstats, and the Stack Exchange API (at least) to gather data on these repositories.
Hypotheses: We think that the number of contributors/project size should drive the mentions on social media.
Plan of Attack: 1) Figure out how to use the APIs of some social media sites 2) Search for mentions of certain projects by date 2) Run gitstats on the codebases 3) Compare project statistics with social media information
Jian Guan @guand Jonathan Lam @lamj1234 Paul Moon @paulmoon
How are different types of GitHub activites correlated with user satisfaction?
We'll analyze some of the most popular GItHub repositories: AngularJS, Bootstrap, Node.js, Django, and MongoDB. The most popular repos were chosen because they generate a lot of user feedback and discussion within the community, which we can analyze to measure user satisfaction.
We are using reddit comments and posts as a primary data source for user feedback, with community wikis as the source for patches and updates
We hope to find that more successful games have a stronger relation between user feedback and patch note data. This would show that community feedback is vital to a games growth.
Jeremy Kroeker, Brayden Arthur
@BrodyHolden @Hoverbear and @fraserd Here's a study on refactoring you might find interesting. The paper touches on some metrics for complexity and coupling that may help. http://www.itworld.com/article/2891140/study-finds-that-refactoring-doesn-t-improve-code-quality.html
@Jsyro this may be of interest to your project group as well.
@fortjohnson Thanks! However, 4,500 lines is a really small project!
How do software development methods affect different properties of development?
Old Feature development time Initial use for Issue tracking New Estimated Feature completion time
We have found no relation between the time it took to develop these features between the selected development methods. However, you would think that these different methods would have different estimated rate of feature completed times. It is also assumed, that since the previously selected methods were so similar, that we will see differences for other, more varied methods.
Taken from our repositories readme: "Take the average feature completion time for the whole project (average of [end day-start day] for each feature) and divide the average estimated completion time for the whole project (average of [planned end day-start day] for each feature) to get the feature completion time ratios for each project. This would be visualized using a 'completion time ratio vs project' bar graph. An additional graph could be provided to show the average feature completion time ratios of all the projects in the same software development methods, plotted with other development methods. This graph would be a 'completion time ratio vs software development method' bar graph."
Mitchell Rivett Tyler Potter
Can we create a tree of communication to determine the organizational and social structure of a company? Will this tree reflect the structure of the code?
-Brandon Leech and Jorin Weatherston
Devin Corrigall and Andrew Hansen
For project two, we want to continue with the idea of examining the effect of progressive changes and anti-regressive changes on the number of reported bugs over time. Therefore, our question will still be "How does the number of progressive and anti-regressive changes effect the number of bugs reported over time?" Our hypotheses are that the more progressive changes there are, the more bugs will be reported in the following weeks, and that the more anti-regressive changes there are, the less bugs will be reported over the next few months. These are what we seemed to have found in project 1, but our proof and explanation of these were weak. Instead of using our script to get all this information from github, we want to find different ways to obtain the anti-regressive and progressive changes, and make sure it is on a daily basis, instead of monthly. We will probably still use our script for counting bugs, as it seems quite reliable for that. For anti-regressive changes, we would like to use the tool Transit that was created by another team. This will allow us to detect both the day of an anti-regressive change, and the size of it. For features, we are thinking of something that could count the number of lines added on a daily basis, or seeing if we can adapt another group's tool to accomplish this. For datasets, we want to use the same 3 (node, rails and bootstrap), so we can compare our results to what we got in project 1, and see what has improved. Once this is working the way we want it to, we can look at even more repositories on GitHub! :)
Parker Atkins, Rabjot Aujla, Greg Richardson, Jordan Heemskerk
We would like to continue with our old question with some modifications.
Research problem in the form of a question: Does the volume of unit tests in a project relate to the frequency of bugs?
Datasets to use: We would like to look at data sets that have a longer GitHub history and that track their bugs in a meaningful way. We've found that projects that have a more consistent development cycle (aka more mature) tend to produce better data. Some repositories we may use are: jQuery, AngularJS, Bootstrap.
A couple hypotheses based on some intuition from research or previous lessons learned: We would like to change our hypothesis to: Frequency of bugs will decrease as unit test change. The reason we are changing "increase" to "change" is because we care more about developers working on unit tests than strictly increasing the lines of code in unit tests. Developers working on unit tests, whether its adding or deleting lines of code, should help decrease the number of bugs.
Initial thoughts on how you plan to attack the problem: One of the biggest things we plan to do is refine our method to determine bugs. Right now we just grab all issues from the GitHub repo which can include bugs, but also features, and questions. We would like to change our tool to filter these issues by bugs. We need to decide how we will determine if it's a bug (ie. labelled bug vs keywords), and whether we want to filter bugs on a case by case basis, or the same for all repos. (Case by case basis may produce better results).
We would also like to make our tool work for anybody. We will need to add features that make it work for more general cases. For example, we plan to allow the user to filter their unit tests using a Regex expression instead of just a single folder of files.
Jason Syrotuck, Evan Hildebrandt, Keith Rollans
@Hoverbear Yes. I agree and have doubts about the results using such a small project. But thought the internal measure used may be of interest.
We'll use the second part of the lab for designing project 2. In this post provide the following: