identifiers-org / identifiers-org.github.io

MIT License
8 stars 1 forks source link

Registries for genomes #70

Closed Midnighter closed 4 years ago

Midnighter commented 5 years ago

It seems to me that the ability of a genome (assembly) reference are missing for NCBI, GenBank, and RefSeq. An example for Escherichia coli str. K-12 substr. MG1655.

mbdebian commented 4 years ago

Dear @Midnighter , Thanks for your feedback. Could you be more specific on your request? Thanks in advance. Kind Regards, Manuel

Midnighter commented 4 years ago

When working with genome-scale metabolic models, see for example, http://bigg.ucsd.edu/models/iML1515, there is a need to unambiguously annotate the genome that was used as a basis for reconstructing the metabolic network. Identifiers.org contains namespaces for NCBI genes and proteins but not for whole assemblies such as the one linked above. Similarly, the current RefSeq pattern only allows for genes and proteins but not for entire genomes (assemblies). GenBank, another repository for genomes is not present at all.

mbdebian commented 4 years ago

Hi @Midnighter , We have considered your request and, at identifiers.org we'd like to point out that the registry contains information that life sciences resources decided to register on it, and it is not an active crawler of the data repositories, making up prefixes for everything found in the process. I hope you find this information useful, please, don't hesitate to let us know if you'd have any other question. Kind Regards, Manuel

Midnighter commented 4 years ago

So what you're saying is that you would wait for those repositories to register themselves with Identifiers.org rather than proactively including them?

mbdebian commented 4 years ago

Identifiers.org offers, the life sciences community, resolution services for compact identifiers, but it is up to those resources to decide on whether they would like to enhance their FAIR metrics / score by adopting this mechanism or not. It depends on them and their communities to find out whether this compact identifier mechanism and the services built around it by identifiers.org are of added value for them and their communities.

Midnighter commented 4 years ago

As an afterthought, through browsing other issues I found the genome assembly database namespace. That is exactly what I wanted but I didn't manage to find through the registry browser before.

Midnighter commented 4 years ago

Oh, and you do include RefSeq now as well. :smiley: