velocyto-team / velocyto.py

RNA velocity estimation in Python
http://velocyto.org/velocyto.py/
BSD 2-Clause "Simplified" License
160 stars 83 forks source link

Is it the right genome annotation file for cellranger 10X human to use? Chromosome MT+/- and others not present in the annotation #319

Closed denvercal1234GitHub closed 2 years ago

denvercal1234GitHub commented 2 years ago

Hi there,

Would anyone mind confirming which Option 1 or 2 is the right gtf to use for humans run10x? Because when I ran velocyto for 10x, the .out file had 2 warnings as shown.

Thank you so much!

Screen Shot 2021-11-15 at 5 22 14 PM

Option 1 (produced current error): I downloaded the gtf from: http://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_32/gencode.v32.primary_assembly.annotation.gtf.gz

Option 2: will I need to download "wget https://cf.10xgenomics.com/supp/cell-exp/refdata-gex-GRCh38-2020-A.tar.gz" then, use the "genes.gtf" that is located in the refdata-gex-GRCh38-2020-A/genes directory

I downloaded expressed repeats annotation from (group = All tracks; track = RepeatMasker): https://genome.ucsc.edu/cgi-bin/hgTables?hgsid=611454127_NtvlaW6xBSIRYJEBI0iRDEWisITa&clade=mammal&org=Human&db=0&hgta_group=allTracks&hgta_track=rmsk&hgta_table=rmsk&hgta_regionType=genome&position=&hgta_outputType=gff&hgta_outFileName=mm10_rmsk.gtf

Releated issues: #315

hyjforesight commented 2 years ago

I solved it. check this. https://github.com/velocyto-team/velocyto.py/issues/318

denvercal1234GitHub commented 2 years ago

Solved this. It is option 2. But encountering another issue #321 "Could not retrieve index file for the cellsorted_possorted_genome_bam.bam" even when the loom was successfully generated.

AY-LIANG commented 2 years ago

Hi, I still get similar warning with option 2. Any suggestions?

denvercal1234GitHub commented 2 years ago

@AY-LIANG -- Maybe it is just a warning and not necessarily an error?