phenoscape / TraitFest-2023

Main repository for information advertising and documenting the 2023 SCATE TraitFest
Creative Commons Zero v1.0 Universal
3 stars 0 forks source link

Ontology-based data mining for morphological matrix construction #8

Open JCGiron opened 1 year ago

JCGiron commented 1 year ago

Means (tools, pipelines) to extract information from species descriptions in PDF format (text) or from images to create annotated morphological character matrices (nexus, xml). Perhaps a good starting point to update/integrate Phenex.

meghalithic commented 1 year ago

perhaps related to issue #9?

evo-palaeo commented 1 year ago

On a semi-related note, I'd also like to learn about mining binary and multistate characters using RPhenoscate

teleaslamellatus commented 1 year ago

Related to #5

One premise of using phenoscript is to generate matrices from descriptions

diegosasso commented 1 year ago

It is a nice idea for a project! Rphenoscape has some functions for classifying phenotypes in the KB into exclusivity classes. In short, these classes allow clustering phenotypes as putative alternative character states of a given phylogenetic character. We used these functions in Rphenoscate to try out to build synthetic character matrices for non-absence/presence data (i.e., qualities like shape, composition, etc). Maybe @hlapp can give an overview of the mutual exclusivity of the code!