Hi @agudys!
There is an off-by-one error in the encoding constructor of `GappedSequence`. The symbols should be written to the symbol array starting at offset 1, but the constructor currently starts writing at offset 0. This corrupts the internal sequence representation and, in turn, the decoding. Luckily, this constructor is not used anywhere in FAMSA itself, but when I tried wrapping it in PyFAMSA I started getting weird issues and traced them back here!
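
For illustration, here is a minimal, self-contained sketch of how this kind of off-by-one shows up. It is not FAMSA's code: `ToyGappedSequence`, `encode`, `decode`, `kGuard` and the `first_slot` parameter are all hypothetical names, and the layout (slot 0 of the symbol array reserved, residues stored in slots 1..n, gap counts stored per slot) is an assumption made for the example.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical stand-in for a gapped sequence; NOT FAMSA's actual class.
// Assumed layout: symbols[0] is a reserved guard slot, residues live in
// symbols[1..n], and n_gaps[i] counts the gaps that follow symbols[i]
// (n_gaps[0] holds the leading gaps).
constexpr char kGuard = '\0';

struct ToyGappedSequence {
    std::vector<char> symbols;
    std::vector<std::size_t> n_gaps;
};

// Encode an aligned string ('-' marks a gap).
// first_slot = 1 is the intended behaviour; first_slot = 0 reproduces
// the off-by-one by writing the first residue over the guard slot.
ToyGappedSequence encode(const std::string& aligned, std::size_t first_slot) {
    std::size_t n_residues = 0;
    for (char c : aligned) {
        if (c != '-') ++n_residues;
    }

    ToyGappedSequence seq;
    seq.symbols.assign(n_residues + 1, kGuard);
    seq.n_gaps.assign(n_residues + 1, 0);

    std::size_t seen = 0;  // residues consumed so far
    for (char c : aligned) {
        if (c == '-') {
            ++seq.n_gaps[seen];
        } else {
            seq.symbols[first_slot + seen] = c;
            ++seen;
        }
    }
    return seq;
}

// Decode under the assumed layout: leading gaps, then each residue in
// slots 1..n followed by its gaps.
std::string decode(const ToyGappedSequence& seq) {
    std::string out(seq.n_gaps[0], '-');
    for (std::size_t i = 1; i < seq.symbols.size(); ++i) {
        out += seq.symbols[i];
        out.append(seq.n_gaps[i], '-');
    }
    return out;
}
```

In this sketch, encoding with `first_slot = 1` round-trips through `decode`, while `first_slot = 0` shifts every residue one slot to the left: the first residue lands in the guard slot and the last slot is never written, so the decoded string no longer matches the input.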