epam / Indigo

Universal cheminformatics toolkit, utilities and database search tools
http://lifescience.opensource.epam.com
Apache License 2.0
292 stars 100 forks source link

Implement GenBank/GenPept sequences import #1844

Closed even1024 closed 2 months ago

even1024 commented 3 months ago

Background In addition to the traditional representation of a sequence, there is a GenBank/GenPept representation:

simple sequence: ACCLCAACCLC

GenBank/GenPept:

1 gatcctccat atacaacggt atctccacct caggtttaga tctcaacaac ggaaccattg 61 ccgacatgag acagttaggt atcgtcgaga gttacaagct aaaacgagca gtagtcagct 121 ctgcatctga agccgctgaa gttctactaa gggtggataa catcatccgt gcaagaccaa

Solution Implement GenBank/GenPept automatic detection so both simple sequence and GenBank/GenPept should be supported for the import.