SebastianZug closed 4 years ago
Hi Sebastian,
Thank you very much for taking the time to prepare this tutorial! I also use pandas heavily to explore and analyse the repository data, both during and after mining, so this approach fits very nicely with git2net.
Before integrating it with the rest of the project, I have a couple of small questions and comments regarding the proposed pandas exporter as well as the tutorial:
Pandas Exporter:
chunksize argument to sequentially parse large tables. Thus, all these options would need to be supported by a proper pandas exporter in the way I understand your proposal.
Tutorial:
Again, thanks a lot for the very nicely written tutorial and the entertaining examples throughout. I am looking forward to hearing your thoughts on the points above!
Cheers, Christoph
Content merged in https://github.com/gotec/git2net/pull/15#issue-475750039
Hi,
Storing commit data in an SQLite database is nice, but with Python pandas we can explore and analyze the data set more comfortably. To support new users, I prepared a small tutorial introducing the tool chain.
Would it perhaps be interesting to include a pandas exporter directly?
Sebastian
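For readers following the thread, here is a minimal sketch of the workflow discussed above: reading an SQLite table into pandas, either in one go or sequentially via the chunksize argument of pandas.read_sql_query. It is not git2net's exporter. The in-memory database, the table name commits, its columns, and the helper names build_demo_db and lines_per_author are all assumptions made up for illustration; a real git2net database will have a different schema.

```python
import sqlite3

import pandas as pd


def build_demo_db():
    """Create a tiny in-memory SQLite database (hypothetical schema, not git2net's)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE commits (hash TEXT, author_name TEXT, lines_changed INTEGER)"
    )
    rows = [
        ("a1", "alice", 10),
        ("b2", "bob", 5),
        ("c3", "alice", 20),
        ("d4", "bob", 7),
        ("e5", "alice", 0),
    ]
    conn.executemany("INSERT INTO commits VALUES (?, ?, ?)", rows)
    conn.commit()
    return conn


def lines_per_author(conn, chunksize=2):
    """Sum lines_changed per author while holding only one chunk in memory at a time."""
    totals = {}
    query = "SELECT author_name, lines_changed FROM commits"
    # With chunksize set, read_sql_query yields DataFrames instead of returning one.
    for chunk in pd.read_sql_query(query, conn, chunksize=chunksize):
        partial = chunk.groupby("author_name")["lines_changed"].sum()
        for author, count in partial.items():
            totals[author] = totals.get(author, 0) + int(count)
    return totals


conn = build_demo_db()

# Small tables can simply be loaded into a single DataFrame.
commits = pd.read_sql_query("SELECT * FROM commits", conn)

# Large tables can be aggregated chunk by chunk instead.
totals = lines_per_author(conn, chunksize=2)
```

Aggregating per chunk keeps memory use bounded by the chunk size, which is the reason a dedicated exporter would probably need to pass such options through to pandas rather than always returning one full DataFrame.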