uniprot / enzymeportal

The EBI Enzyme Portal
http://www.ebi.ac.uk/enzymeportal/
Apache License 2.0

Data inconsistency - Compounds #160

Closed rafael-alcantara closed 11 years ago

rafael-alcantara commented 11 years ago

Currently, compounds shown in the list of filters (search results) and those shown in the small molecules tab don't match.

The tab seems to be fine, including:

However, the compound filter included this:

I suggest the following:

All these additions should be made in the mega-map (i.e. use IntEnz and Rhea, and extract more from UniProt when building it).

rafael-alcantara commented 11 years ago

mega-mapper has been modified so that the database is populated according to the suggestion above. Furthermore, there is a new table in the database for quick retrieval of compound cross-references.

ep-util is now used for text parsing (inhibitors and activators). Any improvements to the regular expressions used for that parsing will have to be made there.
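For illustration only, the kind of regular-expression parsing meant here could look roughly like the sketch below. It is written in Python for brevity and is not the ep-util code: the patterns, the function names and the assumption that the free text uses phrases such as "Inhibited by ..." and "Activated by ..." are all hypothetical.

```python
import re

# Hypothetical patterns; the real expressions live in ep-util and may differ.
INHIBITOR_RE = re.compile(r"\binhibited by ([^.;]+)", re.IGNORECASE)
ACTIVATOR_RE = re.compile(r"\bactivated by ([^.;]+)", re.IGNORECASE)
# Splits a captured phrase such as "A, B and C" into individual names.
SEPARATOR_RE = re.compile(r"\s*,\s*(?:and\s+)?|\s+and\s+")


def _names(pattern, text):
    """Return the compound names found after every match of `pattern`."""
    found = []
    for phrase in pattern.findall(text):
        for name in SEPARATOR_RE.split(phrase.strip()):
            if name and name not in found:
                found.append(name)
    return found


def parse_regulation(text):
    """Extract inhibitor and activator names from a free-text comment."""
    return {
        "inhibitors": _names(INHIBITOR_RE, text),
        "activators": _names(ACTIVATOR_RE, text),
    }
```

A sketch like this will miss names that themselves contain a full stop or the word "and", which is the sort of case where the expressions would need improving.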

ep-domain has been modified (XML schema) so that molecules have roles. This allows compounds retrieved from the mega-map to be classified, so the database can now be used as the only source of small molecule data, both for the filters and for the small molecules tab (the purpose of this ticket). Other modules which use the domain model, such as chebi-adapter, have been updated accordingly.
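The idea of one role-tagged source feeding both views can be sketched as follows. This is a Python illustration under assumptions, not the ep-domain model: the `Role` values, the `Molecule` fields and both function names are hypothetical.

```python
from dataclasses import dataclass
from enum import Enum


class Role(Enum):
    # Hypothetical role names; the actual ones are defined in the ep-domain XML schema.
    INHIBITOR = "inhibitor"
    ACTIVATOR = "activator"
    COFACTOR = "cofactor"


@dataclass(frozen=True)
class Molecule:
    compound_id: str  # cross-reference identifier as stored in the database
    name: str
    role: Role


def compound_filter(molecules):
    """Names offered in the search results filter, de-duplicated and sorted."""
    return sorted({m.name for m in molecules})


def small_molecules_tab(molecules):
    """Names grouped by role, as the small molecules tab would list them."""
    grouped = {}
    for m in molecules:
        grouped.setdefault(m.role, []).append(m.name)
    return {role: sorted(names) for role, names in grouped.items()}
```

Because both functions read the same list of molecules, every compound shown in the tab also appears in the filter, and vice versa.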