Open joncison opened 5 years ago
👍 for SPARQL-based quality control!
Excellent. Thanks a lot for the pointer @simonjupp. edamxpathvalidator has been super useful, but as its author I'd be willing to replace it with something more generic and powerful any day. + it's probably the best alibi I can get to do some Java and SPARQL work!
All EFO checks are expressed as SPARQL here: https://github.com/EBISPOT/efo/tree/master/src/sparql. Couple these with robot verify (http://robot.obolibrary.org/verify).
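For illustration, the "SPARQL checks plus robot verify" setup could be wired up roughly as below. This is only a sketch under assumptions: the function names and any paths passed in are hypothetical, `robot` is assumed to be on the PATH, and I am assuming `--queries` accepts several query files in one call (worth double-checking against the robot docs).

```python
from pathlib import Path
import shutil
import subprocess


def build_verify_command(ontology, queries_dir, reports_dir):
    """Return the argv list for one `robot verify` run over every .sparql file."""
    # Sort so the command is deterministic regardless of directory order.
    queries = sorted(str(p) for p in Path(queries_dir).glob("*.sparql"))
    if not queries:
        raise ValueError(f"no .sparql files found in {queries_dir}")
    return [
        "robot", "verify",
        "--input", str(ontology),
        "--queries", *queries,          # assumption: several files per call
        "--output-dir", str(reports_dir),
    ]


def run_checks(ontology, queries_dir, reports_dir):
    """Run the checks; return robot's exit code, or None if robot is missing."""
    if shutil.which("robot") is None:
        return None
    Path(reports_dir).mkdir(parents=True, exist_ok=True)
    cmd = build_verify_command(ontology, queries_dir, reports_dir)
    return subprocess.run(cmd).returncode
```

Something like `run_checks("EDAM_dev.owl", "queries", "reports")` would then be the whole validation step (file and folder names here are placeholders). As far as I understand, a non-zero exit code means at least one query returned violations, which would make it easy to hook into CI.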
Also check out the ontology starter kit, which provided the inspiration for many of the tools for all this: https://github.com/INCATools/ontology-development-kit
I'll be very happy to help in "SPARQLing" with EDAM. Also, validation queries could be integrated into a Jupyter notebook to extract and visualize metrics. A demo notebook is available here: https://github.com/albangaignard/fresh-toolbox
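As a rough idea of the kind of metric a notebook cell could compute, here is a small sketch that counts violations per check. It is not taken from the demo notebook: the function names are made up, and it assumes each validation query writes one CSV file with a header row into a reports folder.

```python
import csv
from pathlib import Path


def count_violations(reports_dir):
    """Map each report file stem (one per check) to its number of data rows."""
    counts = {}
    for report in sorted(Path(reports_dir).glob("*.csv")):
        with report.open(newline="", encoding="utf-8") as handle:
            rows = list(csv.reader(handle))
        # Assumption: the first row is a header; every other row is a violation.
        counts[report.stem] = max(len(rows) - 1, 0)
    return counts


def format_summary(counts):
    """Render the counts as aligned plain-text lines, worst check first."""
    width = max((len(name) for name in counts), default=0)
    ordered = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return "\n".join(f"{name.ljust(width)}  {n}" for name, n in ordered)
```

Calling `print(format_summary(count_violations("reports")))` would give a quick per-check overview, and the same dictionary could be fed to a plotting library for the visual part.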
@hmenager I propose something like
Does that sound about right?
Awesome if you can help @albangaignard ! I want to publish EDAM 2.0 in a good journal later in the year, and doing a systematic job on the validation would certainly be a ticket for that article!
Comprehensive validation would be to encapsulate:
it's this issue @albangaignard - pls. post the date of the Nantes meeting here.
see also https://github.com/edamontology/edamontology/issues/366 and https://github.com/edamontology/edamontology/issues/292 which are somewhat related.
Meeting in Nantes with @hmenager on 14 February 2020. Goals:
cool ... 6h+ journey (each way!) for me, so I will join remotely, if you can please set up a call-in facility ... cc @matuskalas (are you free?)
Hi @albangaignard and @hmenager
For our call today, I propose we follow the route @simonjupp @ EBI did for EFO ontology, i.e. use https://github.com/ontodev/robot. More specifically:
In summary (distilling the above thread):
src/ (script), queries/ (SPARQL queries), and /reports (query output)
(queries/README.md) - capturing things from:
If we get time, we can also think about EDAM "vital statistics" generation.
Does that sound like a plan? Ping me here when you guys are settled.
From talking to Simon Jupp in Austria last week, we should explore https://github.com/ontodev/robot for EDAM housekeeping including validation, diff, slim generation etc.
Validation is provided by the report function - we'd need to provide a list of SPARQL queries for the specific checks we'd want (@simonjupp - could you pls. send a link to some sample queries - which you showed me last week)
There's also a file structure-independent ontology diff and lots of other stuff
It could augment or perhaps eventually replace edamxpathvalidator
cc @hmenager