YaoZhou89 / TGG

tomato graph pangenome
MIT License
73 stars 18 forks source link

31个样本SV如何合并到一个vcf文件 #9

Open Johnsonzcode opened 1 year ago

Johnsonzcode commented 1 year ago

尊敬的周老师@YaoZhou89 请问在如下步骤中,多个个体以什么方式合并到一个vcf中,多个样本合并还是融合到一个样本? image 这里的cleanSV行使了什么功能?

另外,在如下步骤中,如果我的流程中不涉及使用多个SV calling软件,只用了一个软件,我可以不使用这个步骤来对SV去重吗? image

敬上 强森

YaoZhou89 commented 1 year ago
  1. Our SVs are defined as having a length greater than 50. To ensure data quality, we utilize cleanSV to remove any variants with lengths less than 50.
  2. This method relies on identifying duplicated SVs within the dataset. Feel free to give it a try and see if it works for your data.
Johnsonzcode commented 1 year ago

Thank you so much! Which process did you do from graph31.vcf.gz* to graph31.vcf.gz, if graph31.vcf.gz* indicates vcf file of 31 samples, graph31.vcf.gz indicates vcf file of merging them.