Sage-Bionetworks / sage-monorepo

Where OpenChallenges, Schematic, and other Sage open source apps are built
https://sage-bionetworks.github.io/sage-monorepo/
Apache License 2.0
23 stars 12 forks source link

[Story] Explore the integration of EDAM ontology to OC #2209

Closed tschaffter closed 1 year ago

tschaffter commented 1 year ago

What product(s) is this story for?

OpenChallenges

As a user, I want

As a Challenge Organizer, I want my challenges to be annotated with existing ontologies so that the annotations are useful to a wider group of users.

As an OC Developer, I want to leverage existing ontologies as much as possible so that we don't reinvent the wheel.

Description

Explore how the EDAM ontology could be used to annotate challenge resources.

Acceptance criteria

No response

Tasks

No response

Anything else?

I think that the first time the EDAM ontology was suggested in the context of OC was for assigning values about the input data type.

Have you linked this story to a GitHub Project?

gaiaandreoletti commented 1 year ago
tschaffter commented 1 year ago

EDAM ontology added to OC excel file

We should probably discuss what the integration would have looked like before updating the "production" spreadsheet. :slightly_smiling_face:

@gaiaandreoletti Can you please describe in this ticket what the solution you propose is?

gaiaandreoletti commented 1 year ago

To make it easier for me to search for the appropriate ontology based on the challenge data, I added the EDAM ontologies as a new column. However, we can decide to delete this column later on. Currently, the tab called 'input_data_types' contains the ontologies that correspond to the type of data found across all challenges listed in the excel file. Previously, this tab was empty. A drop down menu listing all ontologies was made created to facilitate process. iF the EDAM onthology for the type of data used in the challange was not available , I used either the topic or the operation reflecting the expetiment used. e.g.

Preotein function prediction operation_1777 Preoteomic experiment topic_3520 Pretein interaction data data_0906 Protein structure data_1460 Protein variants topic_3120 RNA sequencing topic_3170

tschaffter commented 1 year ago

I have never used EDAM personally but here are some questions:

tschaffter commented 1 year ago

I think we have managed to narrow down how to annotate challenges with EDAM values!

Are we settling on what follows?

@gaiaandreoletti @vpchung

vpchung commented 1 year ago

Set input data type to an array of EDAM values (data, format and possibly topic ids)

By array, did you mean a comma-separated list...? If so, I think it will be easier to follow the same table format as contribution_role where each row is a challenge_id and a edam_id (or edam_name). For example:

id challenge_id edam_name
1 1 Protein variants
2 2 Systems biology
3 2 Electronic health record
vpchung commented 1 year ago

Add the property challenge.operation as a string that takes an EDAM operation id

There are only 3 challenges so far with an operation annotation, so this is done! ✅

tschaffter commented 1 year ago

Update

@gaiaandreoletti presented the EDAM ontology to the team and received positive feedback. The implementation is currently underway.