RPK calculation - Githubissues

nabilquraishi commented 6 years ago

Hi, Really like FMAP, starting to learn how to use it. Just wondered how you calculated RPK - I saw your code says readRpk = 1 / ($proteinLength * 3 / 1000). Wondered why you multiplied proteinlength by 3. Shouldnt it just be readRpk = 1 / ($proteinLength / 1000)?

Also would there be a update that builds a database on the entire Uniref90 for mapping and quantification rather than just the IDs that have KO. I would like to use it for gene enrichment analysis using GO.

Thanks, Nabil

jiwoongbio commented 6 years ago

Hi Nabil,

The sequencing reads are nucleotide-level and the mapping database is protein-level. To resolve the discordance between nucleotide-level and protein-level, we multiply 3 on the protein length and obtain the CDS length of nucleotide-level.

I am sorry to let you know that we have never planned to add a new function of UniRef database only because one key feature of FMAP is providing a protein database including orthology information.

Thanks, Jiwoong

On Sat, Feb 10, 2018 at 7:31 AM, nabilquraishi notifications@github.com wrote:

Hi, Really like FMAP, starting to learn how to use it. Just wondered how you calculated RPK - I saw your code says readRpk = 1 / ($proteinLength * 3 / 1000). Wondered why you multiplied proteinlength by 3. Shouldnt it just be readRpk = 1 / ($proteinLength / 1000)?

Also would there be a update that builds a database on the entire Uniref90 for mapping and quantification rather than just the IDs that have KO. I would like to use it for gene enrichment analysis using GO.

Thanks, Nabil

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/jiwoongbio/FMAP/issues/7, or mute the thread https://github.com/notifications/unsubscribe-auth/AHb3kv2jwciMIecaEb3lsH0lonBce8qQks5tTZomgaJpZM4SA7dU .

nabilquraishi commented 6 years ago

Ah of course that makes sense! Thanks Jiwoong.

jiwoongbio / FMAP

RPK calculation #7