Open ashleethomson opened 2 years ago
Hi, this is not possible when using an exon-only spliced graph. The reason for this is that when creating the exon-only graph, the graph is split into smaller disjoint subgraphs corresponding to genes. Because intergenic and intronic regions are removed as part of this process it is not possible to represent the full chromosome paths anymore in the graph and they are therefore removed.
One possible solution if you want calls in chromosome coordinates instead of transcript coordinates would be to use a full spliced graph:
vg rna -p -r -n chr${i}.gtf chr${i}.vg > ${GRAPH_PREFIX}_${i}.vg
In this case the chromosome paths will still be present in the spliced graph. You can then use -p
in vg call
to specify which reference path(s) to get the calls on.
Are you using RNA-seq data for vg call
? If so, I just quickly want to mention that using vg call
for variant calling or genotyping from RNA-seq data is not something we have evaluated. It has been developed and parameterized with genomic data in mind so I am not sure how well it will perform on transcriptomic data.
Thank you so much for this information, it will help immensely! And also thank you for the advice regarding vg call
. I will make note of this if any discrepancies arise in my results. Thank you again!
Hi, I'm currently creating an exon only spliced graph (NOT haplotype specific) using
vg rna
, and wanted to know if there was a way to make the IDs (contigs) the chromosome number instead of the transcript ID. I understand that when usingvg rna
there is an option to input a transcript file:-n, --transcripts FILE transcript file(s) in gtf/gff format
(An example of my GTF is as follows)
and the option to select which attribute tag to use as the ID:
-s, --transcript-tag NAME use this attribute tag in the gtf/gff file(s) as id [transcript_id]
but I want to use the chromosome number as the ID tag.
Currently when I use
vg call
on my graph, my VCFs look like thisBut I want this format:
Is there a way around this? Or is this specific to
vg rna
? My current code forvg rna
is:vg rna -p -d -o -r -n chr${i}.gtf chr${i}.vg > ${GRAPH_PREFIX}_${i}.vg
Any advice is welcome, and thanks in advance.