iRNA-COSI / APAeval

Community effort to evaluate computational methods for the detection and quantification of poly(A) sites and estimating their differential usage across RNA-seq samples
MIT License
13 stars 14 forks source link

Execution Workflow: TAPAS #39

Closed ninsch3000 closed 1 year ago

ninsch3000 commented 3 years ago

WHAT

Write execution workflow for TAPAS. Use the provided small files for testing (running the workflow on real data is a different issue).

CHECKLIST

OUTPUTS (see specification):

SamBryce-Smith commented 3 years ago

Hey @yoseopyoon

Just wanted to check if you're still going to be able to contribute to this workflow? Absolutely not a problem if not (though would be great if you still want to help out!), it's mainly so the rest of the group know what still needs to be done.

Also, if you've managed to make any progress so far it would be great to see any of your code pushed, even if it's still a work in progress! Most of all you'll get credit for your work but then others can join in too without starting from scratch.

Thanks :)

yoseopyoon commented 3 years ago

Hi Sam,

Thanks for contacting me. Since I am an RNA biologist with poor background in bioinformatics, I think it would be better to have someone else to continue work on this issue. However, I am willing to contribute to other areas such as manuscript, literature search etc.

I don’t have any code to push; below is what I tried during APAeval: • I installed TAPAS, figured out how to run it using our small sample data set • Troubleshooted converting our test annotation file to refFlat and found it requires “NM_” identifier to be run by the command. Etc. • I updated TAPAS section of methods summary spreadsheet and also uploaded TAPAS results from our small test data • Submitted feedback about the method

Thank you !!

Best,

Yoseop

Yoseop Yoon, Ph.D. Postdoctoral Scholar Shi Lab Department of Microbiology & Molecular Genetics School of Medicine University of California, Irvine

On Jul 2, 2021, at 7:26 AM, SamBryce-Smith @.***> wrote:

Hey @yoseopyoon https://github.com/yoseopyoon Just wanted to check if you're still going to be able to contribute to this workflow? Absolutely not a problem if not (though would be great if you still want to help out!), it's mainly so the rest of the group know what still needs to be done.

Also, if you've managed to make any progress so far it would be great to see any of your code pushed, even if it's still a work in progress! Most of all you'll get credit for your work but then others can join in too without starting from scratch.

Thanks :)

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/iRNA-COSI/APAeval/issues/39#issuecomment-873039218, or unsubscribe https://github.com/notifications/unsubscribe-auth/AUHATPVHPJPHL6O6I6LW4H3TVXD7VANCNFSM45PCEL4A.

yuukiiwa commented 3 years ago

Hi @SamBryce-Smith, I will be taking over the TAPAS execution workflow. Thank you so much for all your effort, @yoseopyoon!

SamBryce-Smith commented 2 years ago

Update on the progress of TAPAS execution workflow

@yuukiiwa implemented a nextflow pipeline on branch TAPAS. When trying to run with test data (i.e. much smaller than a typical dataset), the workflow exited with a 137 error code (out of memory error code). See screenshot below & attached nextflow.log for this run.

Screenshot 2022-05-13 at 4 54 46 PM

nextflow.log

This workflow is currently on hold whilst we focus on other aspects of the challenge, but we may return to this to try and debug (other APA tools/papers have managed to get it running).

SamBryce-Smith commented 2 years ago

I spotted on branch TAPAS that the read-length parameter (-l) was missing the from TAPAS APA_sites_detection call.

@yuukiiwa addressed this (3d9f687e159a63a83ebc4e000b3405f3213cc502) and re-ran the workflow with a 12MB BAM file for testing. The workflow completed successfully in 8 hours (!). image

Execution workflows are currently a low priority right now, but it does seem possible to get TAPAS to generate APAeval-compliant outputs (although the scalability of the tool to large/full-size datasets is a concern...)

faricazjj commented 2 years ago

@yuukiiwa @SamBryce-Smith thank you guys for all the work!! Are we planning to create a PR for TAPAS branch to be merged to main?