uniprot / enzymeportal

The EBI Enzyme Portal
http://www.ebi.ac.uk/enzymeportal/
Apache License 2.0
11 stars 4 forks source link

Compounds not found in ChEBI #167

Open rafael-alcantara opened 11 years ago

rafael-alcantara commented 11 years ago

Inhibitors and activators are extracted from UniProt "Enzyme regulation" comments using regular expressions. Sometimes this leads to strange compound names (grep 'Not found in ChEBI' mega-mapper.log) like

[WARN]-[2013-05-13 16:38:43,859]-UniprotSaxParser-main- Not found in ChEBI: through dephosphorylation by protein phosphatase type 1

We have to improve the regular expressions in order to have less of these.

On the other hand, some of the extracted compound names seem sensible but are not found in ChEBI, ex:

[WARN]-[2013-05-13 16:38:56,314]-UniprotSaxParser-main- Not found in ChEBI: p-(chloromercuri)benzenesulfonic acid

These we should report to the ChEBI team.