-[x] 1. Collect the mailing list. Wolfgang provided the R script for that. #1
-[x] 2. Collect the version control system (VCS) of a project to parse #4.
-[x] 3. Match author identity from the mailing list to VCS. Requires parsing for authorship on mailing list and version control system + matching them if authors use different naming/email.
-[x] 4. Construct communication and collaboration networks. (Done as part of #1 and #4).
-[x] 5. Define the 4 metrics from Codeface as defined on TSE paper. "Exploring Community Smells in Open-Source: An Automated Approach".
-[x] 5.1 two of these metrics requires an additional clustering algorithm. For example, Codeface uses OSLOM 2011, but we are not constrained to it.