WGLab / NanoCaller

Variant calling tool for long-read sequencing data
MIT License
90 stars 8 forks source link

Issue --end flag resquested in whole alignment Nanocaller #5

Open AG-Run opened 3 years ago

AG-Run commented 3 years ago

Dear all.

I'm using your Nanocaller tool in my Pacbio data. However I'm facing some issues related with the -end flag.

Currently Im planning to extract all SNPs and Indel from a bam file derived from CCS_reads aligned to contigs assembly. Despite the --end flag being optional, the tool is asking me to define this -end flag

How can I fix it?

Thanks for the help

umahsn commented 3 years ago

For NanoCaller, you are required to specify a contig name. The -end flag is optional in the sense that if a user does not provide a value for -end flag, then we use the reference genome FASTA file(.fai index file to be precise) to infer the end coordinates for the specified contig. Are you providing an indexed reference genome FASTA file, and are you specifying a contig?

AG-Run commented 3 years ago

I'm not specifying a contig name because I want to extract the SNPs and indels in all the contigs, when you mean contig name is a general name for all the contigs?

Thanks

El mié., 9 sept. 2020 a las 15:07, umahsn (notifications@github.com) escribió:

For NanoCaller, you are required to specify a contig name. The -end flag is optional in the sense that if a user does not provide a value for -end flag, then we use the reference genome FASTA file(.fai index file to be precise) to infer the end coordinates for the specified contig. Are you providing an indexed reference genome FASTA file, and are you specifying a contig?

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/WGLab/NanoCaller/issues/5#issuecomment-689791431, or unsubscribe https://github.com/notifications/unsubscribe-auth/AHFDTT6XI3CRYU6BHPOKFATSE7N63ANCNFSM4RC6743Q .

-- Adriana María Gallego Rúa *Investigadora *Laboratorio de Biotecnología Universidad de Antioquia

ColombiaMedellín-2020

AG-Run commented 3 years ago

This is the specific error is showing

in_dict={'chrom':args.chrom, 'start':start, 'end':end,

'sam_path':args.bam, 'fasta_path':args.ref, \ NameError: name 'end' is not defined

When I run next script

NanoCaller.py -bam file.sorted.bam -mode both -seq pacbio -model NanoCaller3 -vcf nanocaller -chrom CHROM -ref contigs_assembly.fasta -prefix my_test -cpu 32

And the reference fasta file as well the .fai file are both in the same directory

Thanks

El mié., 9 sept. 2020 a las 15:20, adriana gallego (< adriana.gallego.02@gmail.com>) escribió:

I'm not specifying a contig name because I want to extract the SNPs and indels in all the contigs, when you mean contig name is a general name for all the contigs?

Thanks

El mié., 9 sept. 2020 a las 15:07, umahsn (notifications@github.com) escribió:

For NanoCaller, you are required to specify a contig name. The -end flag is optional in the sense that if a user does not provide a value for -end flag, then we use the reference genome FASTA file(.fai index file to be precise) to infer the end coordinates for the specified contig. Are you providing an indexed reference genome FASTA file, and are you specifying a contig?

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/WGLab/NanoCaller/issues/5#issuecomment-689791431, or unsubscribe https://github.com/notifications/unsubscribe-auth/AHFDTT6XI3CRYU6BHPOKFATSE7N63ANCNFSM4RC6743Q .

-- Adriana María Gallego Rúa *Investigadora *Laboratorio de Biotecnología Universidad de Antioquia

ColombiaMedellín-2020

-- Adriana María Gallego Rúa *Investigadora *Laboratorio de Biotecnología Universidad de Antioquia

ColombiaMedellín-2020

umahsn commented 3 years ago

The problem is that the specified contig or chromosome is not found in the reference .fai file. Can you confirm whether that is the case or not?

umahsn commented 3 years ago

We have addressed several of your concerns in an updated version of NanoCaller. Please refer to issue #6.

AG-Run commented 3 years ago

Thanks!

El lun., 21 sept. 2020 a las 14:08, umahsn (notifications@github.com) escribió:

We have addressed several of your concerns in an updated version of NanoCaller. Please refer to issue #6 https://github.com/WGLab/NanoCaller/issues/6.

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/WGLab/NanoCaller/issues/5#issuecomment-696312544, or unsubscribe https://github.com/notifications/unsubscribe-auth/AHFDTTZ35RMVMW6VRAK7XFDSG6QB5ANCNFSM4RC6743Q .

-- Adriana María Gallego Rúa *Investigadora *Laboratorio de Biotecnología Universidad de Antioquia

ColombiaMedellín-2020