etal / cnvkit

Copy number variant detection from targeted DNA sequencing
http://cnvkit.readthedocs.org
Other
541 stars 165 forks source link

ensembl-gene-info file hg19 #386

Open patidarr opened 6 years ago

patidarr commented 6 years ago

Hi,

Could you please share the hg19 version of this file for rnaseq cnv analysis or show me what columns it should have if my rsem output looks like following:

` gene_id transcript_id.s. length effective_length expected_count TPM FPKM 6M1-18 uc010jrh.1 473 272.4 0 0 0 7M1-2 uc003wdr.3 68 0 0 0 0

`

etal commented 5 years ago

Unfortunately I didn't see hg19 available in Ensembl BioMart, where the hg38 files came from.

The columns I retrieved from Ensembl BioMart were:

The first column in this gene resources file, called gene_id internally, needs to match the gene_id column of your RSEM file. So if you ran RSEM with Ensembl's transcript annotation, these should be Ensembl gene identifiers. Otherwise, if you used another transcript annotation, you'll need a BioMart instance with that annotation's gene identifiers.