src-d / jgit-spark-connector

jgit-spark-connector is a library for running scalable data retrieval pipelines that process any number of Git repositories for source code analysis.
Apache License 2.0
71 stars 32 forks source link

Use case comparison: source{d} engine vs. Boa language #218

Closed marnovo closed 6 years ago

marnovo commented 6 years ago

The following examples are achievable through the Boa language for "Mining Ultra-Large-Scale Software Repositories" of the Iowa State University. It seems the Boa language is so far possibly the closest to a "competitor" the engine has at this point.

It is a good idea to see which ones we can reproduce using source{d} engine queries (and make these into our example list) and which ones we cannot—and if we want these to be future features of the engine.

Their example list with the corresponding code can be found here and is listed again below:

bzz commented 6 years ago

His can be useful support for Engine adoption in academia.

Here is also a whitepaper on Boa

ajnavarro commented 6 years ago

@marnovo All the questions that are related with SVN in our use case should be answered changing SVN to GIT, right?

marnovo commented 6 years ago

@marnovo All the questions that are related with SVN in our use case should be answered changing SVN to GIT, right?

@ajnavarro indeed! It doesn't need to be answering the verbatim questions, of course there will be differences to be accounted for.

bzz commented 6 years ago

I believe those checkboxes might also be fearlessly implemented by @mcarmonaa in https://github.com/mcarmonaa/QuerySetApp/blob/master/src/main/scala/tech/sourced/queryset/BoaQueries.scala

ajnavarro commented 6 years ago

Done in: https://github.com/mcarmonaa/QuerySetApp