epam / Indigo

Universal cheminformatics toolkit, utilities and database search tools
http://lifescience.opensource.epam.com
Apache License 2.0
315 stars 105 forks source link

Macro(Import FASTA): The '>' symbol is not recognized as indicating a new sequence #1850

Closed Zhirnoff closed 6 months ago

Zhirnoff commented 6 months ago

Steps to Reproduce

  1. Switch to Macro mode
  2. Paste FASTA sequence with '>'
    >M18404.1 Human IgG2 lambda antibody (1B8.env reactive) gp41 coding region DNA
    GCAGTGGGAAGAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGCGCAGCCTCAA
    >M18404.1 Human IgG2 lambda antibody (1B8.env reactive) gp41 coding region DNA
    GCAGTGGGAAGAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGCGCAGCCTCAA

Actual behavior The '>' symbol is not recognized as indicating a new sequence

Expected behavior The '>' symbol is recognized as indicating a new sequence. Each sequence should be imported as a separate molecule. The start of the sequence is the first symbol in a line after header line, the end of the sequence is the last symbol before ">" https://github.com/epam/Indigo/issues/1755

Desktop (please complete the following information):

Indigo version [Version 1.19.0-rc.1]

Attachments

2024-03-19_16h38_09 2024-03-19_16h38_13

AlexeyGirin commented 6 months ago

Fixed. image