dellytools / delly

DELLY2: Structural variant discovery by integrated paired-end and split-read analysis
BSD 3-Clause "New" or "Revised" License
422 stars 136 forks source link

Consensus sequences #311

Open ceromova opened 1 year ago

ceromova commented 1 year ago

Hello,

I would like to ask if the consensus sequences of delly's output are human or just unknown insertion sequences (maybe virus or bacteria ones).

Thank you.

tobiasrausch commented 1 year ago

Hi,

The consensus sequences are computed by local assembly of all SV-supporting reads. They allow you to examine the SV breakpoint for homology patterns or small insertions.

Best, Tobias

ceromova commented 1 year ago

Therefore, if a read is half human genome half viral genome that would be detected by delly and report it as an insertion? This half-half sequence would appear in the consensus sequence?

Thank you.

tobiasrausch commented 1 year ago

For long reads, yes.

For short reads, the length of the reads is usually not sufficient and that's why people map against "augmented" references that include the viral sequences.

ceromova commented 1 year ago

What do you mean when you say long/short reads? Are you refering to Nanopore as a long read and Illumina as a short read?

tobiasrausch commented 1 year ago

Yes.