hsinnan75 / GSAlign

GSAlign: an ultra-fast sequence alignment algorithm for intra-species genome comparison
MIT License
51 stars 16 forks source link

possibility of using GSAlign for exomes #14

Open devonorourke opened 3 years ago

devonorourke commented 3 years ago

Hello, I was wondering if you had any experience testing GSAlign with whole exome data for intra-species comparison purposes. For example, in one of your evaluations in the preprint you aligned the chimpanzee genome to the human genome. Would it be possible to align a chimpanzee exome to GRCh38 genome? Or similarly, perhaps a user has a series of mouse whole exome data and want to align it to a different mouse species genome. My sense was that it may work for long exons, but perhaps may struggle with shorter sequences? I'm guessing at very least the -alen parameter would need to be reduced. Thanks for any insights you can offer!