marbl / SALSA

SALSA: A tool to scaffold long read assemblies with Hi-C data
MIT License
172 stars 47 forks source link

Mate-pair in SALSA #170

Closed aabaricalla closed 1 year ago

aabaricalla commented 1 year ago

Hi!

I was working with old genomic data produced by Illumina Mate pair sequencing. This strategy cuts linear intrachromosomal regions in 3kb, 5kb, 8kb, or 20kb segments, ligate, circularizes the fragments, and then sequences the ligated part to recover the extremes of the fragments. Image 1 from this paper shows this..

Could this data be compared to Hi-C data? This sequencing strategy could be used with SALSA to scaffold genome assemblies?. any suggestions to optimize the results? Thanks in advance!

skoren commented 1 year ago

No, I wouldn't try to use mate-pairs in salsa. They are a very different datatype and have their own idiosyncrasies that salsa won't handle correctly.