ContentMine / phylotree

A repository for ami-phylotree development

Consistency of Binomial and EGID #35

Open petermr opened 9 years ago

petermr commented 9 years ago

The OCR process provides both a binomial and an EGID (ENAGenbankID). This issue is to devise a strategy for reconciling conflicts in interpretation.

At the OCR level the parsed text is checked, characters of the wrong type (e.g. punctuation) are substituted, and the result is validated against a template syntax. All discussion below assumes valid syntax (NOT necessarily valid content).

OCR
    -> OCR_Binomial
    -> OCR_EGID
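
As a rough illustration of the check / substitute / validate step, here is a minimal Python sketch. Everything in it is an assumption made for illustration: the two regex templates, the substitution rules, and the names `OcrTaxon`, `clean_binomial`, `clean_egid` and `parse_ocr` are hypothetical and are not taken from ami-phylotree. It only tests syntax, not whether the content is correct.

```python
import re
from typing import NamedTuple, Optional

# Hypothetical templates: the real template syntax is not given in this issue.
# Assumed binomial: capitalised genus, one space, lowercase species epithet.
BINOMIAL_TEMPLATE = re.compile(r"[A-Z][a-z]+ [a-z]+")
# Assumed EGID: one or two uppercase letters followed by 5 to 8 digits.
EGID_TEMPLATE = re.compile(r"[A-Z]{1,2}[0-9]{5,8}")


class OcrTaxon(NamedTuple):
    """Syntax-checked OCR output; a field is None if it failed its template."""
    binomial: Optional[str]
    egid: Optional[str]


def clean_binomial(raw: str) -> str:
    # Substitute each run of non-letter characters (e.g. '_' or '.') with one space.
    return re.sub(r"[^A-Za-z]+", " ", raw).strip()


def clean_egid(raw: str) -> str:
    # Drop anything that is not a letter or digit, then uppercase the result.
    return re.sub(r"[^A-Za-z0-9]", "", raw).upper()


def parse_ocr(raw_binomial: str, raw_egid: str) -> OcrTaxon:
    binomial = clean_binomial(raw_binomial)
    egid = clean_egid(raw_egid)
    return OcrTaxon(
        binomial if BINOMIAL_TEMPLATE.fullmatch(binomial) else None,
        egid if EGID_TEMPLATE.fullmatch(egid) else None,
    )


if __name__ == "__main__":
    print(parse_ocr("Escherichia_coli", "AB.123456"))
```

A `None` field here means "invalid syntax", so later lookup and reconciliation would only see values that passed their template.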

The EGID and binomial are then looked up, and there is a matrix of:

... TBC ...