Add code to convert "source" taxonomy files to expected-taxonomy.tsv by extracting full-length taxonomy strings from reference database X.
Similarly, to extract database identifiers, e.g., from GenBank.
The issues with both of these is that manual curation is still very much needed and database quality can be a major issue. But the first would be approachable and would streamline the process of creating these files.
Add code to convert "source" taxonomy files to
expected-taxonomy.tsv
by extracting full-length taxonomy strings from reference database X.Similarly, to extract database identifiers, e.g., from GenBank.
The issues with both of these is that manual curation is still very much needed and database quality can be a major issue. But the first would be approachable and would streamline the process of creating these files.