reneshbedre / bioinfokit

Bioinformatics data analysis and visualization toolkit
MIT License
334 stars, 77 forks

What is the df in the Normalization part? #19

Closed: susutBu closed this issue 3 years ago

susutBu commented 3 years ago

Hello Dr. Bedre, Thank you for developing this tool for bioinformatics analysis; it is very useful. When I calculate CPM, RPKM, or TPM, one point confuses me. You mention that df is a Pandas dataframe containing raw gene expression values. How should I understand these expression values? Are they mapped read counts or some other number? If they are some other number, how do you generate them? Thanks

reneshbedre commented 3 years ago

Hi @susutBu

Thank you for your feedback about bioinfokit. The expression values that I mentioned in the post are the raw read counts obtained by mapping RNA-seq reads to the genome.
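
For illustration only, here is a minimal sketch of what such a dataframe of raw read counts might look like, and how CPM follows from it. The gene names, sample names, and count values are made up, and the calculation uses plain pandas with the standard CPM definition rather than bioinfokit's own functions, so treat it as an assumption about the layout, not as the library's implementation.

```python
import pandas as pd

# Hypothetical raw read counts (made-up values):
# rows are genes, columns are samples, each cell is the number of
# reads mapped to that gene in that sample.
df = pd.DataFrame(
    {"sample_A": [10, 30, 60], "sample_B": [200, 300, 500]},
    index=["gene1", "gene2", "gene3"],
)

# Standard CPM definition: count / total counts in the sample * 1e6.
# df.sum(axis=0) gives one library size per sample (column), and the
# division aligns those totals with the matching columns.
library_sizes = df.sum(axis=0)
cpm = df / library_sizes * 1e6

print(cpm)
```

In practice the counts would come from a read-counting tool run on the aligned reads (for example featureCounts or HTSeq), not from hand-typed values. RPKM and TPM additionally need a gene length for each row.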