Closed rebeccaito closed 12 months ago
It looks like using model -t mendelian_disease.MendelianDisease
instead of -t mendelian_disease.Symptom
will abstract HPO terms as expected. Are there plans to create more detailed documentation for different use-cases and for the provided templates? Many thanks.
Hi @rebeccaito, and thanks for your patience! You're absolutely correct that using the MendelianDisease
class from this template should deliver results more closely aligned with your expectations, and this reveals part of how OntoGPT parses the template. In this case, specifying the Symptom
class alone means it's only using this class:
Symptom:
is_a: NamedEntity
id_prefixes:
- HP
annotations:
annotators: sqlite:obo:hp, sqlite:obo:mondo
attributes:
characteristic:
affects:
severity:
onset_of_symptom:
range: Onset
and that's probably fine for modeling data, but it's missing the examples and multivalued property from the parent MendelianDisease
class so OntoGPT doesn't generate a prompt explicitly stating you're looking for symptoms and it doesn't expect to find more than one of them.
We certainly do need more detailed documentation about this and about modifying/creating templates. Expect it soon!
Many thanks for creating
ontogpt
. Is it correct that if I use themendelian_disease.Symptom
template, I should be able to extract HPO phenotype terms from text? I attempted this by creating a filephenotagger_example.txt
which contains the test example found here:I then called
ontogpt
with:And got the following output:
I was expecting output that would create an extracted object for each phenotype term. For example:
Am I using
ontogpt
correctly? Is there a better way to use this tool to extract phenotype terms from text? Thanks.