biopragmatics / bioregistry

📮 An integrative registry of biological databases, ontologies, and nomenclatures.
https://bioregistry.io
MIT License

simplify obo ingest #49

Closed cmungall closed 3 years ago

cmungall commented 3 years ago

There is a lot going on in the obofoundry ingester, including checking out repos and checking whether an OBO file is there.

I think this is all unnecessary. We probably have not done a good job of advertising the right way to collect metadata from OBO.

cthoyt commented 3 years ago

I would love to see some of this metadata better organized on the OBO side, or if it's already available in a structured way, I can greatly reduce this code :)

cthoyt commented 3 years ago

I updated the way OBO Foundry's registry is ingested in https://github.com/bioregistry/bioregistry/pull/80. It now grabs the download URLs for OWL, OBO, and OBO Graph JSON much more directly. The downside is that, without all of the complicated checking, there's still no explicit field denoting the GitHub/source control repository (though for a large portion of entries it could maybe be inferred from other fields like the issue tracker).
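
For illustration, here is a minimal sketch of the two ideas above: picking out the OWL, OBO, and OBO Graph JSON download URLs from a registry record, and guessing the repository from the issue tracker URL. It is not the code from the pull request. The record shape and the field names (`products`, `ontology_purl`, `tracker`) are assumptions, the sample record is made up, and both function names are hypothetical.

```python
from typing import Dict, Optional

# Made-up record, shaped the way a registry entry is ASSUMED to look.
# The field names "products", "ontology_purl", and "tracker" are assumptions.
SAMPLE_RECORD = {
    "id": "xyz",
    "tracker": "https://github.com/example-org/xyz/issues",
    "products": [
        {"id": "xyz.owl", "ontology_purl": "http://purl.obolibrary.org/obo/xyz.owl"},
        {"id": "xyz.obo", "ontology_purl": "http://purl.obolibrary.org/obo/xyz.obo"},
        {"id": "xyz.json", "ontology_purl": "http://purl.obolibrary.org/obo/xyz.json"},
        {"id": "xyz/subset.owl", "ontology_purl": "http://purl.obolibrary.org/obo/xyz/subset.owl"},
    ],
}


def get_download_urls(record: dict) -> Dict[str, str]:
    """Map 'owl' / 'obo' / 'json' to the URL of the matching main product."""
    prefix = record["id"]
    # Only the main artifacts (<prefix>.owl etc.) are wanted, not subsets.
    wanted = {f"{prefix}.{ext}": ext for ext in ("owl", "obo", "json")}
    urls = {}
    for product in record.get("products", []):
        ext = wanted.get(product.get("id"))
        url = product.get("ontology_purl")
        if ext and url:
            urls[ext] = url
    return urls


def infer_repository(record: dict) -> Optional[str]:
    """Guess the repository URL from a GitHub issue tracker URL, if possible."""
    tracker = (record.get("tracker") or "").rstrip("/")
    suffix = "/issues"
    if tracker.startswith("https://github.com/") and tracker.endswith(suffix):
        return tracker[: -len(suffix)]
    return None  # no tracker, or not hosted on GitHub


if __name__ == "__main__":
    print(get_download_urls(SAMPLE_RECORD))
    print(infer_repository(SAMPLE_RECORD))
```

The repository guess only works when the tracker is a GitHub issues URL, so entries with other trackers (or none) come back as `None` and would still need manual curation.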