gotec / git2net

An Open Source Python package for the extraction of fine-grained and time-stamped co-editing networks from git repositories.
https://git2net.readthedocs.io
GNU Affero General Public License v3.0
53 stars 16 forks source link

Add example on usage of pandas in combination with git2net data #14

Closed SebastianZug closed 4 years ago

SebastianZug commented 4 years ago

Hi,

storing commit data in a sqlite db is nice, but by applying python pandas we are able to explore and analyze the data set more comfortable. In order to support new user, I prepared a small tutorial introducing the tool-chain.

Probably it would be interesting to include a pandas exporter directly?

Sebastian

gotec commented 4 years ago

Hi Sebastian,

Thank you very much for taking the time to prepare this tutorial! I also heavily use pandas to explore and analyse the repository data both during and after mining so this approach fits very nicely to git2net.

I have a couple of small questions and comments regarding the proposed pandas exporter, as well as the tutorial before integrating it with the rest of the project:

Pandas Exporter:

Tutorial:

Again thanks a lot for the very nicely written tutorial and the entertaining examples throughout. I am looking forward to hear your thoughts on the points above!

Cheers, Christoph

gotec commented 4 years ago

Content merged in https://github.com/gotec/git2net/pull/15#issue-475750039