Closed tseemann closed 3 years ago
Dear Torsten,
Thanks for your suggestion.
1) If you want a sub-sequence relative to a reference sequence you can do it with goalign subseq --ref-seq <sequence name>
. However, it will not take all coordinates from a file.
2) If you want to select several columns from an alignment, a workaround would be to use the goalign split --partition
command, which takes all columns defined in the partition file and output them in a separate file. That way you can define several partitions. However this one does not take reference coordinates without gaps yet...
Example of a partition file:
M1,p1=1-12/3,2-12/3
M2,p2=3-12/3
It will output 2 files:
p1.fa
: with first and second nucleotides of each codon from position 1 to 12 (1-based)p2.fa
: with third nucleotide each codon from position 1 to 12 (1-based)3) Finally, I implemented the command goalign subsites
, which extracts given sites from an input alignment. Options --ref-seq
, --reverse
and --sitefile
are available.
Do not hesitate to tell me if it is what you were looking for.
@fredericlemoine thanks for the quick response!
I just "pre"-released v0.3.3c
Thanks! Just installed now and testing.
Hi @tseemann , Can I close this issue? Also, If you want to have a look, I implemented the extract command, to extract several sub-alignments from an input alignment.
If i have a text file of positions i want to remove from an alignment. Like
subseq
but i can choose all positions (or inverse)Also would like option for the
pos.txt
to be relative to a specified "reference" taxon which ignores it's gaps.