iRNA-COSI / APAeval

Community effort to evaluate computational methods for the detection and quantification of poly(A) sites and estimating their differential usage across RNA-seq samples
MIT License
13 stars 14 forks source link

feat(Summary workflow): Utils script/module to filter and convert ground truth TPM data to be compatible with relative quantification benchmark #301

Open faricazjj opened 2 years ago

faricazjj commented 2 years ago

We're adding a new quantification challenge relative expression output. However, since the quantification challenge originally outputs TPM values, the ground truth data was also in TPM. Hence, we need to convert the ground truth data from TPM to relative expression to be compatible with the new quantification challenge output

Depends on relative expression implementation: https://github.com/iRNA-COSI/APAeval/issues/277

Estimate: 4h

ninsch3000 commented 2 years ago

Might this be a task for @mrgazzara when he's already busy with filtering the ground truth?

SamBryce-Smith commented 2 years ago

I'm co-opting this issue to prevent duplication. Purpose is to track implementation of utils script/module to prepare ground truth files for the relative quantification benchmark/summary workflow.

Related to proof-of-concept implementation - #399

The plan is to generally follow Joseph's blueprint (TODO: add link) for filtering the ground truth to two representative sites overlapping terminal exons. In addition to this, I propose some additional details to address challenges I encountered in the proof-of-concept #399 along with discussions with @mrgazzara .

Partially blocked by #413 - defining a 'terminal exon ID'. For now we're going to assume we're following my proposed definition.

This script/module will take as input:

The workflow is more or less as follows:

@yuukiiwa & I will be working on adapting my proof of concept into this module. I have code for most of the above steps but will need to be tidied up a little into functions / a script

SamBryce-Smith commented 2 years ago

see branch utils_filterPAS for a WIP implementation