biolink / biolink-model

Schema and generated objects for biolink data model and upper ontology
https://biolink.github.io/biolink-model/
Other
170 stars 71 forks source link

Adding has_habitat and biotically_interacts_with predicates #850

Closed diatomsRcool closed 2 years ago

diatomsRcool commented 3 years ago

I want to add predicates to the model that represent associations between an organism or organism taxon and their environments or other organisms. I am doing this in the context of eco-kg and importing EOL TraitBank data. In addition to trait data, EOLTraitBank contains information about where an organism has been observed and its organism interactions.

I would like to add has_habitat as a child of located_in. I can use located_in to describe political/geographical locations, like countries or mountains. I need something a bit different for habitat types, like saying an organism lives in a swamp or a boreal forest. I would like to add biotically_interacts_with as a child of interacts_with. I need to represent the collection of trophic interactions represented in the eco subset of RO. I will likely need more predicates as children of biotically_interacts_with, but cross that bridge.

Any objections?

@sierra-moxon @justaddcoffee @jhpoelen

jhpoelen commented 3 years ago

@diatomsRcool no objections here: I much like the idea of increasing the reach of species interactions data!

Also, much, if not all, of EOL TraitBank interaction data comes from GloBI (https://globalbioticinteractions.org) . And, I'd be happy to help produce/update a GloBI data product that would help make it easier to import/link the continuously updating species interaction data into the eco-kg.

Curious to learn more!

jhpoelen commented 3 years ago

cc-ing EOL TraitBank collaborators @katjaschulz and @jhammock .

sierra-moxon commented 3 years ago

@wdduncan tagging you for feedback too

cmungall commented 3 years ago

See https://github.com/oborel/obo-relations/issues/453 for coordinating with RO

wdduncan commented 3 years ago

Some of the classes in the envo hierarchy might be of use here. For example,

diatomsRcool commented 3 years ago

I think I was mostly thrown by the definitions for located in found in RO and in BioLink. The BioLink definition is very broad, which isn't necessarily a problem. The RO definition seems maybe a little too "physical". I think has habitat almost says more about the organism and its adaptations. Does that make sense? Maybe I'm thinking too much.

wdduncan commented 3 years ago

MIxS has properties for env_broad_scale, env_local_scale, env_medium. Guidance on how evno uses them is document here.

These properties are described as: env_broad_scale

In this field, report which major environmental system your sample or specimen came from. The systems identified should have a coarse spatial grain, to provide the general environmental context of where the sampling was done (e.g. were you in the desert or a rainforest?). | We recommend using subclasses of ENVO’s biome class: http://purl.obolibrary.org/obo/ENVO_00000428

env_local_scale

In this field, report the entity or entities which are in your sample or specimen’s local vicinity and which you believe have significant causal influences on your sample or specimen.

env_medium

In this field, report which environmental material or materials (pipe separated) immediately surrounded your sample or specimen prior to sampling

The labels for these properties are cryptic. I find the labels env_biome, env_feature, and env_material more intuitive.

@diatomsRcool if these properties were added to the biolink model, would that help you?

sierra-moxon commented 2 years ago

Per slack conversation, I believe located_in was selected as the predicate in this use case. @diatomsRcool ok to close?

diatomsRcool commented 2 years ago

yes, this can be closed.