Open yeeus opened 3 weeks ago
Hi Quanyu,
Thank you for using our tool and sorry for a bit delayed answer.
Regarding short alleles: Tomorrow I will publish an update that will not fail immediately if alleles are too short. Nevertheless, they should be at least somewhat long (window size + 2 * boundary size, --window
and --boundary
in locityper preproc
). Basically, if allele is too short, we don't have enough information to distinguish it from other alleles. In addition, it will probably not be very accurate if some alleles are very short and some have normal size. If you have locus assemblies upstream/downstream from the gene of interest, I would recommend you to use them as well.
As for the VCF input: as I understand, even though vcfbub
will reduce the number of variants, all the information remains, you will simply have longer variants. The problem with non-vcfbub`ed VCF is that it will have overlapping variants, which are not so easy to traverse.
Finally, for the FASTA input: try putting all alleles in a single FASTA file and then providing it.
Nice tool! But when I used locityper to genotyping HGDP reads on my interested loci (five prime utrs which are usually shorter than 500 bp) with the command:
It reports that
It seems locityper can't be used on short loci?
And another question, the vcf file I used above was generated by MC and therefore was filtered by
vcfbub
which would result in less variants in my case. I want to know can I used theraw vcf
? And if I can't, can I input all samples fasta file about one locus in bed file? I tried this:OMG, it seems I can't provide all samples about one locus? What can I do? Please help me.. Best wishes!