GraffiTE is a pipeline that finds polymorphic transposable elements in genome assemblies and/or long reads, and genotypes the discovered polymorphisms in read sets using genome-graphs.
Commit c04ccd9 - in bin/repmask_vcf.sh use samtools faidx + awk instead of awk alone to generate indels.length. awk alone was missing last seq in indels.fa
Commit a6b622f - in main.nfmerge_VCFs, use find instead of ls to generate vcf.list. If ls config includes colors, will break this step.
Commit a7634fb - gzip the final GraffiTE merged genotypes to save storage
Commit 9b92167 - add check if variant IDs are less than 50 characters, will produce error in RepeatMasker if not
c04ccd9
- inbin/repmask_vcf.sh
usesamtools faidx
+awk
instead ofawk
alone to generateindels.length
.awk
alone was missing last seq inindels.fa
a6b622f
- inmain.nf
merge_VCFs
, usefind
instead ofls
to generatevcf.list
. Ifls
config includes colors, will break this step.a7634fb
- gzip the final GraffiTE merged genotypes to save storage9b92167
- add check if variant IDs are less than 50 characters, will produce error inRepeatMasker
if not