arq5x / lumpy-sv

lumpy: a general probabilistic framework for structural variant discovery
MIT License
309 stars 118 forks source link

Pairend_distro.py warning: only x elements #281

Open Fabou13 opened 5 years ago

Fabou13 commented 5 years ago

Hello, After running lumpy_smooth on a sample bam file I got the following warning message: Warning: only 392 elements in distribution (min: 1000) I figured out it comes from pairend_distro.py that does have enough elements to define mean and stdev. So it generates NA. I tried to run pairend_distro.py and tweaked the tail -n and the -N argument and got: Warning: only 413 elements in distribution (min: 1000) Do you have an idea what are the cause of that warning (coverage depth too low?)? What could I check to understand what's going wrong with thata sample? Does the min value of 1000 is the minimum acceptable or it can be lowered? Thanks in advance Fab

ryanlayer commented 5 years ago

Is the library normal paired end illumina sequencing?

On Dec 13, 2018, at 2:46 AM, Fabou13 notifications@github.com wrote:

Hello, After running lumpy_smooth on a sample bam file I got the following warning message: Warning: only 392 elements in distribution (min: 1000) I figured out it comes from pairend_distro.py that does have enough elements to define mean and stdev. So it generates NA. I tried to run pairend_distro.py and tweaked the tail -n and the -N argument and got: Warning: only 413 elements in distribution (min: 1000) Do you have an idea what are the cause of that warning (coverage depth too low?)? What could I check to understand what's going wrong with thata sample? Does the min value of 1000 is the minimum acceptable or it can be lowered? Thanks in advance Fab

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub, or mute the thread.

Fabou13 commented 5 years ago

The library was made with NEBNext Ultra II Directional RNA. It is paired end illumina RNA-seq. I ran other samples from the same sequencing run with lumpy and they gave correct results.

ryanlayer commented 5 years ago

I have never used RNA in LUMPY, but I am going to guess you will get some pretty noisy results from all of the intron splicing. Your error indicated that it didn’t see enough normal reads to make a distribution. This may be why.

On Dec 13, 2018, at 5:24 AM, Fabou13 notifications@github.com wrote:

The library was made with NEBNext Ultra II Directional RNA. It is paired end illumina RNA-seq. I ran other samples from the same sequencing run with lumpy and they gave correct results.

— You are receiving this because you commented. Reply to this email directly, view it on GitHub, or mute the thread.

Fabou13 commented 5 years ago

I am actually running lumpy on RNA viruses to detect recombinant genomes from two parental genomes. The output of SVtyper are fine for some samples and quite coherent. But I am having some samples for which I get the warning and no lumpy output. Maybe one of the genomes involved in the recombinant has a too small fraction represented and therefore would have not enough normal reads. That's why I was thinking to lower down the min value.

ryanlayer commented 5 years ago

Interesting! What happens when you run "samtools view $BAM | wc -l"?

On Thu, Dec 13, 2018 at 5:48 AM Fabou13 notifications@github.com wrote:

I am actually running lumpy on RNA viruses to detect recombinant genomes from two parental genomes. The output of SVtyper are fine for some samples and quite coherent. But I am having some samples for which I get the warning and no lumpy output. Maybe one of the genomes involved in the recombinant has a too small fraction represented and therefore would have not enough normal reads. That's why I was thinking to lower down the min value.

— You are receiving this because you commented. Reply to this email directly, view it on GitHub https://github.com/arq5x/lumpy-sv/issues/281#issuecomment-446956742, or mute the thread https://github.com/notifications/unsubscribe-auth/AAlDUYl5rR6BmbDqTNJeuC6gZRGgN0tFks5u4kyxgaJpZM4ZRRW_ .

Fabou13 commented 5 years ago

Hi, I ran the command for two samples, here is what I got from the bam files: Sample 9: 7744319 Sample 10: 6580433 These two samples gave the Warning: only xx elements in distribution (min: 1000) Does it look abnormal to you? Thank you