Open faricazjj opened 2 years ago
Might this be a task for @mrgazzara, since he's already busy with filtering the ground truth?
I'm co-opting this issue to prevent duplication. The purpose is to track implementation of a utils script/module that prepares ground truth files for the relative quantification benchmark/summary workflow.
Related to proof-of-concept implementation - #399
The plan is to generally follow Joseph's blueprint (TODO: add link) for filtering the ground truth to two representative sites overlapping terminal exons. In addition, I propose some further details to address challenges I encountered in the proof-of-concept (#399), informed by discussions with @mrgazzara.
Partially blocked by #413 - defining a 'terminal exon ID'. For now we're going to assume we're following my proposed definition.
This script/module will take as input:
The workflow is more or less as follows:
@yuukiiwa & I will be working on adapting my proof of concept into this module. I have code for most of the above steps, but it will need to be tidied up a little into functions / a script.
See branch utils_filterPAS for a WIP implementation.
We're adding a new relative expression output to the quantification challenge. However, since the quantification challenge originally output TPM values, the ground truth data is also in TPM. Hence, we need to convert the ground truth data from TPM to relative expression so that it is compatible with the new quantification challenge output.
Depends on relative expression implementation: https://github.com/iRNA-COSI/APAeval/issues/277
Estimate: 4h
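A minimal sketch of what the TPM-to-relative-expression conversion could look like, purely for discussion. It assumes relative expression means each site's TPM divided by the summed TPM of all sites in the same group (e.g. the same gene or terminal exon). The actual definition is whatever #277 settles on. The function name, the tuple layout and the choice to return 0.0 for groups with zero total TPM are all hypothetical, not existing APAeval code.

```python
from collections import defaultdict


def tpm_to_relative_expression(sites):
    """Convert per-site TPM to a fraction of the group's total TPM.

    `sites` is an iterable of (group_id, site_id, tpm) tuples.
    ASSUMPTION: `group_id` is whatever unit relative expression is
    computed over (gene, terminal exon, ...); this is not confirmed.
    Returns a dict mapping site_id to its relative expression.
    """
    sites = list(sites)  # allow two passes over a generator

    # First pass: sum TPM over all sites in each group.
    totals = defaultdict(float)
    for group_id, _site_id, tpm in sites:
        totals[group_id] += tpm

    # Second pass: divide each site's TPM by its group total.
    relative = {}
    for group_id, site_id, tpm in sites:
        total = totals[group_id]
        # ASSUMPTION: unexpressed groups get 0.0 rather than NaN.
        relative[site_id] = tpm / total if total > 0 else 0.0
    return relative


# Toy ground truth records (made-up values, for illustration only).
ground_truth = [
    ("geneA", "pas1", 30.0),
    ("geneA", "pas2", 10.0),
    ("geneB", "pas3", 0.0),
]
relative_expression = tpm_to_relative_expression(ground_truth)
```

In the toy input, the two geneA sites share 40 TPM, so pas1 gets 0.75 and pas2 gets 0.25. How to handle groups with zero total TPM (drop them, keep 0.0, or emit NaN) is an open design choice that should match what the summary workflow expects.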