bcgsc / ntSynt

Detecting multi-genome synteny using minimizer graph mapping
GNU General Public License v3.0
63 stars 1 forks source link

Parameters for detecting small-scale intra-species synteny? #46

Closed zhaotao1987 closed 1 month ago

zhaotao1987 commented 1 month ago

![Uploading 20240606210546.png…]() Thanks for developing this novel tool, I installed and did a test today, I would like to use this tool for detecting syntenic or alletic contigs within the genome. For example, if I have a dotplot like the following, it would be nice if ntSynt can treat these colored contigs as syntenic, I tried with -d 5, and did not get any syntenic regions, actually the contigs I use is a tetraploid genome, which should contain groups of syntenic contigs.

lcoombe commented 1 month ago

Hi @zhaotao1987,

I'm not sure if it didn't upload correctly on your end, but I'm unfortunately not able to see your image!

Just to check - is your tetraploid assembly phased, or all together in a single fasta file? You are right that if you are just specifying the single file, ntSynt currently looks for synteny blocks across different genome files, not within a genome file. For example, I have done runs in the past using diploid assemblies, but supplied each haplotype as separate genome files, and ntSynt successfully computed synteny there. So, if you are able to do that separation, that would be the solution. It is definitely on our radar to look at inter-genomic synteny (within a given genome assembly file), but that isn't a feature that we currently have.

Thank you for your interest in ntSynt! Lauren

zhaotao1987 commented 1 month ago

20240606210546 Sorry, I re-uploaded. It's just a partial dotplot, showing that there are multiple alletic contigs. It's not phased, I was thinking using your tool and HiC tools to help phasing. Yes, intra-genome synteny will be a great implementation, and can be used for whole genome duplication inferences and so on. I found your tool currently can only find very well synteny, I mean the chromosome lengths are very comparable in different genomes, that is what we say as macrosynteny. But I think microsynteny, local genomic contexts identification would be a great improvement.

lcoombe commented 1 month ago

Thanks for the feedback! Indeed, ntSynt is currently well-suited to detecting macrosynteny across different species. But we will certainly look at finer-grain microsynteny and intra-genomic synteny detection for future features!