opensource-observer / oso

Measuring the impact of open source software
https://opensource.observer
Apache License 2.0
74 stars, 16 forks

Transformations to detect duplicates in OSSD and clean up collections #2549

Closed ccerv1 closed 2 hours ago

ccerv1 commented 2 hours ago

What is it?

Currently our validators won't detect a collision in OSSD if two entries differ only in case (e.g., for addresses) or by a trailing slash on a GitHub URL. For now, let's use a transformation to convert everything in OSSD to lowercase and strip trailing slashes. While we're at it, a utility to sort a collection would be nice!
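
For illustration only, here is a rough Python sketch of the kind of transformation described above. It is not the code from the linked PR, and the data shapes are assumptions: `entries` is a hypothetical mapping from project slug to a flat list of URLs and addresses, and a collection is treated as a plain list of project slugs rather than the real OSSD schema.

```python
from collections import defaultdict


def normalize(value: str) -> str:
    """Lowercase a value and drop surrounding whitespace and trailing slashes."""
    return value.strip().lower().rstrip("/")


def find_collisions(entries: dict[str, list[str]]) -> dict[str, list[str]]:
    """Map each normalized value claimed by more than one project to those projects.

    `entries` is a hypothetical {project_slug: [url_or_address, ...]} mapping.
    """
    owners = defaultdict(set)
    for project, values in entries.items():
        for value in values:
            owners[normalize(value)].add(project)
    return {value: sorted(projects) for value, projects in owners.items() if len(projects) > 1}


def sort_collection(projects: list[str]) -> list[str]:
    """Return the normalized, de-duplicated project slugs in alphabetical order."""
    return sorted({normalize(project) for project in projects})
```

With this sketch, `https://github.com/Org/Repo/` and `https://github.com/org/repo` normalize to the same string, so two projects listing them would be reported as a collision. Checksummed and lowercase forms of the same address collapse in the same way.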

ccerv1 commented 2 hours ago

https://github.com/opensource-observer/oss-directory/pull/586