Closed Akanksha2511 closed 3 months ago
@Akanksha2511 Hi, thanks for your interests in using LIQA. ReadPerGene_corrected is the estimated number of reads assigned to the isoform. TPM can be derived using ReadPerGene_corrected. We will update the document to explain the outputs. Thanks!
Hi, thanks. Could you please elaborate on how TPM can be derived using the ReadPerGene_corrected values. Sorry I am new to this.
Just bumping this. It would be really good to have some description in the docs on what the values in the quantification output are.
EDIT: for those finding this issue in the meantime, my current best guess at the meaning of the columns is
I don't know if this is accurate though.
Another bump for this WGLab developers,
Thank you for the clarification, Raj
Another bump for this WGLab developers,
A section in the documentation explaining the quantification output, in detail, would help tremendously:
* **ReadPerGene_corrected** represents how many reads were assigned to an isoform while accounting for read bias. And that these are the read number which can be used to calculate TPM. But how does it differ from actual raw read count? Typically read counts should not have values less than 1 unless it is 0. * _If I wanted to show how many transcripts were detected, would I list all those transcripts that have a value greater than 0? What about those that have a very small value greater than 0 but very far away from 1 such as "0.000321" or "0.000636"? How would I be able to tell if an isoform is actually detected? Is it better to use Read per gene or relative abundance for this? And is there a suggested cutoff value to use?_ * _Can you provide an example of how to calculate TPM from this value? Especially since the paper mentions you use RPG 10K for quantification of isoform level which may change the math compared to conventional TPM calculation._ * **Relative abundance** is the proportion of reads pertaining to a specific isoform compared to all the reads of the transcripts from the same gene. Please correct me if I am wrong. * **Infor_ratio** is unclear and I have no idea how to understand it. I know that the value is the same for every gene so it must refer to something that is global for all transcripts from a particular gene.
Thank you for the clarification, Raj
Hi Raj,
Thanks
Hi, thanks for developing Liqa.
I was wondering if there is a documentation that explains the output column names likes ReadPerGene_corrected and infor_ratio.
Can I use relativeabundance values directly as TPMs ?
Thanks, Akanksha