MetricsGrimoire / CVSAnalY

The CVSAnalY tool extracts information out of source code repository logs and stores it into a database.
http://metricsgrimoire.github.com/CVSAnalY/
GNU General Public License v2.0

Repeated repo name when analyzing repos with deep URIs #100

Open canasdiaz opened 8 years ago

canasdiaz commented 8 years ago

The 'name' field of the repositories table cannot identify a repository when several repos have different URIs but share the same last path token. Example:

mysql> SELECT * FROM repositories WHERE name = 'core';
+-----+-----------------------------------------------+------+------+
| id  | uri                                           | name | type |
+-----+-----------------------------------------------+------+------+
|   5 | https://gerrit.wikimedia.org/r/pywikibot/core | core | git  |
|  67 | https://gerrit.wikimedia.org/r/mediawiki/core | core | git  |
| 282 | https://gerrit.wikimedia.org/r/oojs/core      | core | git  |
+-----+-----------------------------------------------+------+------+
3 rows in set (0,00 sec)

This bug affects GrimoireLib because it retrieves SCM data grouped by 'name', so distinct repositories that share a name are merged into a single group. The bug was reported by the Wikimedia folks (https://phabricator.wikimedia.org/p/jayvdb/) in the following ticket: https://phabricator.wikimedia.org/T123808#1970520
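
To make the collision concrete, here is a minimal Python sketch using the three URIs from the table above. It is not CVSAnalY's or GrimoireLib's actual code: `last_token_name` only assumes that the name is taken from the last path token, as the table suggests, and `path_name` and `group_by` are hypothetical helpers that show one possible way to keep deep URIs distinct.

```python
from collections import defaultdict
from urllib.parse import urlparse

# URIs copied from the repositories table in this report.
URIS = [
    "https://gerrit.wikimedia.org/r/pywikibot/core",
    "https://gerrit.wikimedia.org/r/mediawiki/core",
    "https://gerrit.wikimedia.org/r/oojs/core",
]


def last_token_name(uri):
    """Assumed current behaviour: use only the last path token as the name."""
    return uri.rstrip("/").split("/")[-1]


def path_name(uri):
    """Hypothetical alternative: use the whole URI path as the name."""
    return urlparse(uri).path.strip("/")


def group_by(name_fn, uris):
    """Group URIs by derived name, the way a GROUP BY on 'name' would."""
    groups = defaultdict(list)
    for uri in uris:
        groups[name_fn(uri)].append(uri)
    return dict(groups)


by_last = group_by(last_token_name, URIS)
by_path = group_by(path_name, URIS)

print(sorted(by_last))  # ['core']
print(sorted(by_path))  # ['r/mediawiki/core', 'r/oojs/core', 'r/pywikibot/core']
```

With the last token only, all three repositories fall into one 'core' group. With the full path, each one gets its own group. Grouping by 'id' or 'uri' on the GrimoireLib side would be another way to avoid the merge without changing how 'name' is computed.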