reneshbedre / bioinfokit

Bioinformatics data analysis and visualization toolkit
MIT License

Normalization on TCGA dataset #23

Closed: vappiah closed this issue 3 years ago

vappiah commented 3 years ago

Hello bioinfokit team, I have downloaded raw HTSeq gene expression data from TCGA and would like to perform FPKM normalization. According to the documentation, the commands require gene lengths, which I don't have. Is there a way to normalize without supplying the gene lengths? Thanks

reneshbedre commented 3 years ago

@vappiah FPKM normalization divides by gene length, so it cannot be computed without it. Check formula here. If you do not have gene lengths, you can use CPM or DESeq2 normalization instead. Check here.
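To illustrate the difference, here is a minimal sketch in plain Python (not bioinfokit's own API). It computes CPM from counts alone and shows where FPKM needs the gene length. The gene names, counts, and lengths are made up for the example, and the `cpm` and `fpkm` helpers are hypothetical names. It also assumes the htseq-count summary rows (those starting with `__`, such as `__no_feature`) should be dropped before computing the library size.

```python
def drop_summary_rows(counts):
    """Remove htseq-count summary rows such as '__no_feature' (assumed unwanted)."""
    return {gene: c for gene, c in counts.items() if not gene.startswith("__")}


def cpm(counts):
    """Counts per million for one sample: count * 1e6 / library size."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("library size is zero; cannot normalize")
    return {gene: c * 1e6 / total for gene, c in counts.items()}


def fpkm(counts, lengths_bp):
    """FPKM for one sample: count * 1e9 / (library size * gene length in bp)."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("library size is zero; cannot normalize")
    return {gene: c * 1e9 / (total * lengths_bp[gene]) for gene, c in counts.items()}


# Hypothetical raw counts for a single sample, including one summary row.
raw = {"geneA": 500, "geneB": 1500, "geneC": 8000, "__no_feature": 1234}
sample = drop_summary_rows(raw)

# CPM needs only the counts.
cpm_values = cpm(sample)

# FPKM additionally needs a length (in base pairs) for every gene.
lengths = {"geneA": 2000, "geneB": 1000, "geneC": 4000}  # made-up lengths
fpkm_values = fpkm(sample, lengths)
```

CPM corrects only for sequencing depth, so it is suitable for comparing the same gene across samples but not for comparing different genes within a sample.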

vappiah commented 3 years ago

Thanks @reneshbedre, I will do that.