src-d / ml-backlog

Issues belonging to source{d}'s Machine Learning team which cannot be related to a specific repository.
0 stars 3 forks source link

Launch and play with gitbase spark connector #67

Closed zurk closed 5 years ago

zurk commented 5 years ago

Launch gitbase spark connector on ~10 repos and get any dataset you want: identifiers, number of commits per developer, the average time between commits, etc.

Report any issues you found and link it to this issue.

We need it to be all on the same page and understand gitbase spark connector limitations and features.

vmarkovtsev commented 5 years ago

Oh hell yeah we've played enough.