Open JCGiron opened 1 year ago
perhaps related to issue #9?
On a semi-related note, I'd also like to learn about mining binary and multistate characters using RPhenoscate
Related to #5
One premise of using phenoscript is to generate matrices from descriptions
It is a nice idea for a project! Rphenoscape has some functions for classifying phenotypes in the KB into exclusivity classes. In short, these classes allow clustering phenotypes as putative alternative character states of a given phylogenetic character. We used these functions in Rphenoscate to try out to build synthetic character matrices for non-absence/presence data (i.e., qualities like shape, composition, etc). Maybe @hlapp can give an overview of the mutual exclusivity of the code!
Means (tools, pipelines) to extract information from species descriptions in PDF format (text) or from images to create annotated morphological character matrices (nexus, xml). Perhaps a good starting point to update/integrate Phenex.