Open balhoff opened 2 years ago
Thanks for pointing out the issue. We will try to get this fixed in ChEBI 2.0 (a project we are currently working on to redevelop ChEBI's ageing infrastructure). In the meantime I have replaced the curly brackets in the ChEBI name with square brackets, which should temporarily fix the issue in next month's release.
Can we get some information about what exactly ChEBI 2.0 is?
I also have to do quite a bit of debugging because of OBO format issues in ChEBI. I recommend adding some minimal CI, like a roundtrip through ROBOT and a fastobo-validator check (https://github.com/fastobo/fastobo-validator).
Parsing `mirror/chebi.owl.tmp.obo`
Failed parsing `mirror/chebi.owl.tmp.obo`
--> mirror/chebi.owl.tmp.obo:863257:14
|
863257 | xref: KEGG:C 2339 ␊
| ^---
|
= expected EOL or QuotedString
While you are working on ChEBI 2.0, would it be possible to patch the release files so they can be parsed?
I strongly second @matentzn's suggestion. It would be very easy to add a fastobo validation check to the existing ChEBI infrastructure: just run the check prior to a release or during a snapshot.
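To illustrate the kind of pre-release check being suggested, here is a minimal sketch in Python that flags the two problems reported in this thread: extra text after an xref ID and an unescaped `{` in a name. It is only a heuristic and not a substitute for fastobo-validator or a ROBOT roundtrip; the function name `find_obo_problems` and the sample term are hypothetical.

```python
import re

# Heuristic: an opening brace that is not preceded by a backslash.
UNESCAPED_BRACE = re.compile(r"(?<!\\)\{")


def find_obo_problems(lines):
    """Return (line_number, message) pairs for two known failure patterns.

    Hypothetical helper for illustration only; a real parser such as
    fastobo-validator checks far more than this.
    """
    problems = []
    for number, line in enumerate(lines, start=1):
        line = line.rstrip("\n")
        if line.startswith("name:") and UNESCAPED_BRACE.search(line):
            problems.append((number, "unescaped '{' in name"))
        elif line.startswith("xref:"):
            value = line[len("xref:"):].strip()
            parts = value.split(None, 1)
            # After the ID, only a quoted description, a trailing
            # qualifier block, or a comment is accepted by this check.
            if len(parts) == 2 and parts[1][0] not in '"{!':
                problems.append((number, "unexpected text after xref ID"))
    return problems


# Made-up sample stanza; the ID and labels are placeholders.
sample = [
    "[Term]\n",
    "id: CHEBI:0000000\n",
    "name: foo {bar}\n",
    "xref: KEGG:C 2339 \n",
    'xref: KEGG:C02339 "KEGG COMPOUND"\n',
    "name: foo \\{bar}\n",
]

for number, message in find_obo_problems(sample):
    print(f"line {number}: {message}")
```

A check like this could run over the snapshot file before release and fail the build if the returned list is non-empty, with the real validators run afterwards.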
@matentzn The issue has now been fixed in the nightly OBO file (https://ftp.ebi.ac.uk/pub/databases/chebi/ontology/nightly/chebi.obo). Hopefully when the new release is completed in the next few days, it will also be fixed in the monthly chebi.obo file (https://ftp.ebi.ac.uk/pub/databases/chebi/ontology/chebi.obo).
We have just been awarded a 3-year BBSRC grant to redevelop ChEBI, so the back-end and front-end infrastructure will be redeveloped. A new annotation tool and submission tool will be built, and searching and ontology visualization will be improved. The current SOAP-based web services will be replaced by REST. We also plan to move away from commercial software such as Oracle to PostgreSQL.
Great, thank you for addressing this! It would be great if you could work with the OBO community to introduce a CI testing system for the ontology along the lines of what other ontologies implement! We would be happy to assist on OBO Slack!
There is a term in the ChEBI OBO file with a label containing an opening brace:
In OBO syntax the opening brace starts a trailing qualifier, leading to a parse error. The brace needs to be escaped like this:
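For anyone generating OBO output programmatically, the escaping can be sketched as a small Python helper. This is an illustration under assumptions, not code from ChEBI or ROBOT: `escape_obo_braces` is a hypothetical name, the label is made up, and only the opening brace is handled because that is the character that triggers the trailing qualifier parse.

```python
def escape_obo_braces(value: str) -> str:
    """Prefix each unescaped '{' with a backslash.

    Hypothetical helper for illustration. Existing escape sequences
    (a backslash plus the following character) are copied through
    unchanged, so already-escaped braces are not escaped twice.
    """
    out = []
    i = 0
    while i < len(value):
        ch = value[i]
        if ch == "\\" and i + 1 < len(value):
            # Keep an existing escape sequence as-is.
            out.append(value[i:i + 2])
            i += 2
            continue
        out.append("\\{" if ch == "{" else ch)
        i += 1
    return "".join(out)


# Made-up label, not the actual ChEBI term.
print("name: " + escape_obo_braces("foo {bar} baz"))
```

Walking the string character by character, rather than using a single regular expression, keeps a sequence such as an escaped backslash followed by a brace from being misread as an already-escaped brace.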
Here is the stack trace output from ROBOT:
cc @kltm