OBOFoundry / OBOFoundry.github.io

Metadata and website for the Open Bio Ontologies Foundry Ontology Registry
http://obofoundry.org
Other
163 stars 201 forks source link

Request for new ontology : Vertebrate Breed Ontology (VBO) #1883

Closed sabrinatoro closed 2 years ago

sabrinatoro commented 2 years ago

Title

Vertebrate Breed Ontology

Short Description

A single computable resource for vertebrate breed names

Description

Online Mendelian Inheritance in Animals (OMIA) provides up-to-date summary information on all known harmful and beneficial variants in vertebrates (apart from those vertebrates that have their own similar resource), together with background information on all known inherited disorders and non-disease traits, which are collectively called ‘phenes’. OMIA is hosted by the Sydney School of Veterinary Science (SSVS) within the University of Sydney.

As OMIA developed over the past 26 years, breed names were entered as published in the papers reporting the phenes. Inevitably, the resultant OMIA list of breed names contains inconsistencies, duplications and errors. The ever-increasing inter-connectedness of the internet has highlighted the need for OMIA to adopt a standardised vocabulary for breeds that will enable OMIA to be reciprocally hyperlinked with other relevant internet resources.

The Anstee Hub for Inherited Diseases of Animals (AHIDA) is a new initiative from the SSVS. Currently under development by the Sydney Informatics Hub (SIH) in collaboration the AHIDA team, and with a projected launch in 2022, it will be a web resource for surveillance, reporting and control of inherited diseases in animals. As with OMIA, AHIDA requires a standardised vocabulary for breeds that will enable AHIDA to be reciprocally hyperlinked with other relevant internet resources.

Inspired by the development of the Livestock Breed Ontology (LBO) by Zhiliang Hu, James Reecy and Cari Park at Iowa State University (ISU), the OMIA and AHIDA teams are collaborating with the SIH, with the instigators of the Monarch Initiative and some of their colleagues, and with the creators of LBO to create a breed ontology ― a Vertebrate Breed Ontology (VBO) ― that will include all the known breed names (including synonyms) for all species that are relevant to OMIA and AHIDA.

By far the most comprehensive global resource for breeds is the list of breed names and synonyms compiled by the Food and Agriculture Organisation (FAO) of the United Nations in its Domestic Animal Diversity Information System (DADIS). A powerful attribute of DADIS is that its information is contributed and updated by designated National Coordinators from each of 182 countries. DADIS contains information on more than 15,000 national breed populations representing more than 8,800 breeds across 38 species.

In collaboration with FAO colleagues responsible for DADIS (Roswitha Baumung and Gregoire Leroy), the VBO team is using the DADIS list of breed names and synonyms as the basis for the development of the VBO.

Breed names and synonyms not included in the DADIS list, e.g. for cats, will be added to the VBO. OMIA breed names will then be mapped to VBO ids, as will LBO ids and ids or names from other relevant breed lists. Once developed, the VBO will be incorporated into OMIA and AHIDA, and will be made freely available. It is recognised that, as with the LBO, there will be a continual need for enhancement of the VBO.

Identifier Space

VBO

License

CC-BY 4.0

Domain

organisms

Source Code Repository

OBOFoundry.github.io

Homepage

https://monarch-initiative.github.io/vertebrate-breed-ontology/

Issue Tracker

https://github.com/monarch-initiative/vertebrate-breed-ontology/issues

Ontology Download Link

https://raw.githubusercontent.com/monarch-initiative/vertebrate-breed-ontology/master/vbo.owl

Contact Name

Sabrina Toro

Contact Email

sabrina@tislab.org

Contact GitHub Username

sabrinatoro

Contact ORCID Identifier

0000-0002-4142-7153

Formats

Dependencies

No response

Related

No response

Usages

No response

Intended Use Cases and/or Related Projects

Online Mendelian Inheritance in Animals (OMIA, https://omia.org) will use VBO to model their data and make it interoperable.

Data Sources

Domestic Animal Diversity Information System (DAD-IS) breed list [​​https://www.fao.org/dad-is/en/].

Additional comments or remarks

We are aware that the majority of terms in VBO do not have definitions (and therefore the corresponding checkbox below was not checked). Breeds are defined as specific groups within a species that have a phenotype, behavior and/or other characteristics sufficiently distinctive for other members of the species to have been given distinctive names. These distinguishing characteristics are not often recorded/communicated.

OBO Foundry Pre-registration Checklist

matentzn commented 2 years ago

@sabrinatoro thank you for your submission! Before we assign reviewers and start looking at your ontology, you have the pass the OBO dashboard:

https://obofoundry.org/obo-nor.github.io/dashboard/vbo/dashboard.html

Please let us know when you have fixed all issues (aside from usages this may be open/red).

matentzn commented 2 years ago

Also, we need all checkboxes above to be checked. You didn't check:

I understand that term definitions, while not mandatory, are key to understanding the intentions of a term especially when the ontology is used in curation. I made sure that a reasonable majority of terms in my ontology have definitions, using the IAO:0000115 property.

Can you provide us some detail on why you do not provide human readable definitions for your classes, or whether you plan to do so at a later stage?

sabrinatoro commented 2 years ago

Thank you @matentzn ! The errors from the OBO dashboard have (should have) been resolved. We are however aware of the following outstanding issues that will be addressed at a later stage:

matentzn commented 2 years ago

Great, this is now ready for reviewer assignment.

ddooley commented 2 years ago

Hi, Its neat to see VBO built with ODK, and it looks like some ROBOT generated input. ID's look fine.

It looks like the src/ontology/imports/ncbitaxon_terms.txt needs to be regenerated into ncbitaxon_import.owl to get rid of a handful of NCBITaxon items showing at top level of VBO.

The names of the species look a little funny with semicolons in them and countries up front, but I understand they need to be there to distinguish regions that a particular product originated from.

Is the Livestock Breed Ontology (LBO, https://bioportal.bioontology.org/ontologies/LBO) going to be rolled into VBO? Can you give some guidance about which ontology would be supported into the future? It seems like LBO is active.

I was wondering if it is appropriate to upgrade a few things that are annotations into object properties:

image

I was thinking rather than have an annotation property "exists in country" that restricted itself to a string reference to Wikipedia country, possibly the generic "located in" could be used. But RO states that is a relation between instances. I'll check with OBO folks about that. It would be nice if the wikidata countries (or other geographical location identifiers like "Mount Meru") be an import file? Then each country could get an ISO code, rather than the shadow annotation of subspecies to the ISO code.

As well, the domestication status looks like something that could be reasoned over, something that has states or qualities.

image

Perhaps a "domesticated" quality, so "'African, Ukrane; Ostrich' has quality some 'domesticated quality' "? Or perhaps 'has disposition' some 'domesticated disposition'?

bpeters42 commented 2 years ago

One question / suggestion: Are animals bred for research purposes (e.g. Syrian hamsters, B6 mice) in the scope of this ontology? That is one area we are very interested in, and it would be great if information on e.g. strains from Jackson labs could be represented in the same broader format. Completely understood if that is outside of your main domain, but it will be nice to have different extensions of NCBI taxonomy done in a compatible ways.

ddooley commented 2 years ago

Discussed on OBO Operations committee, and aside from the desire to have a few questions about annotation usage above answered, the committee has accepted VBO into the foundry! I will send a followup email.

sabrinatoro commented 2 years ago

@ddooley and @bpeters42 I want to reflect here what I said in my email as I don't think it was sent to everyone. Thank you for accepting VBO into the obo foundry! And thank you for taking the time to review it and offering feedback and suggestions. I am still consulting with the team, and therefore it is taking me longer than expected to get back to you. I will however answer to your feedback and suggestion as soon as possible. Thank you for your patience.

nlharris commented 2 years ago

What are the next steps here?

sabrinatoro commented 2 years ago

We are working (slowly) on the suggestions offered here. Here is an update:

It looks like the src/ontology/imports/ncbitaxon_terms.txt needs to be regenerated into ncbitaxon_import.owl to get rid of a handful of NCBITaxon items showing at top level of VBO.

The reason for the NCBITaxon items showing at top level of VBO was that they were not in the NCBITaxon Ontology. We added these NCBI_taxon in https://github.com/obophenotype/ncbitaxon/blob/master/subsets/taxon-subset-ids.txt, and they should show up in the VBO main classification after the next NCBITaxon ontology release.

Is the Livestock Breed Ontology (LBO, https://bioportal.bioontology.org/ontologies/LBO) going to be rolled into VBO? Can you give some guidance about which ontology would be supported into the future? It seems like LBO is active.

We are working with colleagues at LBO and have an upcoming scheduled meeting to determine our plan. It is clear to the VBO and LBO teams that eventually, VBO and LBO will merge or LBO will become a part of VBO. In the short term, we are planning on creating non-ambiguous mappings between VBO and LBO.

I was wondering if it is appropriate to upgrade a few things that are annotations into object properties: I was thinking rather than have an annotation property "exists in country" that restricted itself to a string reference to Wikipedia country, possibly the generic "located in" could be used. But RO states that is a relation between instances. I'll check with OBO folks about that. It would be nice if the wikidata countries (or other geographical location identifiers like "Mount Meru") be an import file? Then each country could get an ISO code, rather than the shadow annotation of subspecies to the ISO code.

As well, the domestication status looks like something that could be reasoned over, something that has states or qualities. Perhaps a "domesticated" quality, so "'African, Ukrane; Ostrich' has quality some 'domesticated quality' "? Or perhaps 'has disposition' some 'domesticated disposition'?

We are still investigating this. We need to review breed information from the other species to determine whether this "domestication status" is required for breeds other than the livestock breeds. If so, we need to make sure that the solution (e.g using a "domesticated" quality) will work for all the breeds.

One question / suggestion: Are animals bred for research purposes (e.g. Syrian hamsters, B6 mice) in the scope of this ontology? That is one area we are very interested in, and it would be great if information on e.g. strains from Jackson labs could be represented in the same broader format. Completely understood if that is outside of your main domain, but it will be nice to have different extensions of NCBI taxonomy done in a compatible ways.

YES! We are definitely discussing including laboratory animal breeds/strains into VBO. This has not yet become a priority as we are currently working on improving the current ontology (focused on livestock breeds) and adding cat breeds, but this is definitely on our to-do list. We have been in communication with members of the “Immune Epitope Database and Analysis Project” for whom including these strains in VBO would be useful, and are planning to work with members from the mouse and rat community to ensure that existing resources can be reused and/or leveraged.

The names of the species look a little funny with semicolons in them and countries up front, but I understand they need to be there to distinguish regions that a particular product originated from.

We recognize that the concatenation of name+country+species for livestock breed label is less than ideal. We have investigated using parenthesis instead of semicolons. We also investigated other solutions, but we have not been able to find an alternative that would ensure term labels to be unique (since the same breed name can refer to breed in different species and/or different country).

matentzn commented 2 years ago

Thank you @sabrinatoro for the detailed response! I hope VBO will continue to be a FAIR ontology resource for the representation of breeds in the ontology world.