src-d / sourced-ce

source{d} Community Edition (CE)
https://docs.sourced.tech/community-edition
GNU General Public License v3.0
185 stars 52 forks source link

Allow easy repo management #113

Open r0mainK opened 5 years ago

r0mainK commented 5 years ago

Most of the idea(s) can be found in this Slack thread, but basically, I would like to be able to manage and especially exclude repositories with ease. I know this can also be done by adding filters on superset, but as a user I want something easier.

Feature proposals

marnovo commented 5 years ago

Brainstorming the entry points where the exclusion list could (in theory) be set:

  1. Docker compose
  2. CLI flag:
    1. Repo name(s) as args
    2. File(s) with repo name list as arg(s)
  3. Web UI

Any other?

I assume this would have to take place before/during the init, right? So probably 1 and 2 above are more likely?

r0mainK commented 5 years ago
  1. Docker compose: then we simply do not mount the repo(s) concerned on the volume
  2. / 3. Gitbase will have to do the work after being informed, either by dropping the data from it's database if it's already launched, or adding this excluding functionality if it is not.

I don't really see any other entry points, but think this should be doable at any point, not only before or during the init, as the functionality could prove useful during data exploration.

dpordomingo commented 5 years ago

I'd say that my other answer fits here.

smacker commented 5 years ago

We can start with a flag for cli but according to my experience, it would be much more useful to filter out repositories from UI.

I run srcd-ce without forks on src-d organization. After it downloaded all the data I saw some strange data in the charts. I quickly identified that go-vitess repository is the reason. It's not marked as a fork on github but it is a fork. The point is: a user, just like me, would often identify what should be excluded only AFTER init.