Open cmungall opened 8 years ago
Hi @cmungall
What's the status on this? I'm asking because it's mentioned here: https://github.com/geneontology/go-site/blob/master/metadata/datasets/README.md
Thanks, Pascale
Also led here from the same README. Any news?
I think we should archive the contributor-data-pool and merge any relevant info into go-site.
@cmungall @kltm
@pgaudet That is a different class of project. re: https://github.com/geneontology/go-site/issues/207#issuecomment-575035899
Currently much of the GO pipeline is driven in an ad-hoc way by assumptions about directory structure layout, metadata about different GAFs embedded in Perl, etc.
We should switch to a system in which we have one YAML file for each contributing source/authority/database. For key consortium contributors, this file will live in go-site, alongside any number of external contributors.
The YAML will have metadata on each contributor, plus a list of contributed datasets/files, each with the URL for the GOC cleaned-up version and the source URL.
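
To make the proposal a bit more concrete, here is a minimal sketch in Python of what one contributor record might look like once loaded from its YAML file, plus a small validator. Everything here is an assumption for illustration only: the field names (`id`, `label`, `datasets`, `source_url`, `goc_url`), the `parse_contributor` helper, and the `example.org` URLs are hypothetical and are not an agreed schema.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class Dataset:
    id: str
    source_url: str  # hypothetical field: where the contributor publishes the file
    goc_url: str     # hypothetical field: where the GOC cleaned-up version lives


@dataclass
class Contributor:
    id: str
    label: str
    datasets: List[Dataset] = field(default_factory=list)


def parse_contributor(record: Dict[str, Any]) -> Contributor:
    """Build a Contributor from a dict, e.g. the result of loading one YAML file."""
    for key in ("id", "label"):
        if key not in record:
            raise ValueError(f"missing required field: {key}")
    datasets = []
    for d in record.get("datasets", []):
        missing = [k for k in ("id", "source_url", "goc_url") if k not in d]
        if missing:
            raise ValueError(f"dataset missing fields: {missing}")
        datasets.append(Dataset(d["id"], d["source_url"], d["goc_url"]))
    return Contributor(record["id"], record["label"], datasets)


# Illustrative record only; names and URLs are placeholders, not real GO metadata.
example = {
    "id": "exampledb",
    "label": "Example Database",
    "datasets": [
        {
            "id": "exampledb.gaf",
            "source_url": "https://example.org/source/exampledb.gaf.gz",
            "goc_url": "https://example.org/goc/exampledb.gaf.gz",
        }
    ],
}

contributor = parse_contributor(example)
```

The point of the sketch is that the pipeline could iterate over records like this one instead of relying on directory layout conventions; the actual schema would still need to be agreed on.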