Closed ninsch3000 closed 1 year ago
Hey @yoseopyoon
Just wanted to check if you're still going to be able to contribute to this workflow? Absolutely not a problem if not (though would be great if you still want to help out!), it's mainly so the rest of the group know what still needs to be done.
Also, if you've managed to make any progress so far it would be great to see any of your code pushed, even if it's still a work in progress! Most of all you'll get credit for your work but then others can join in too without starting from scratch.
Thanks :)
Hi Sam,
Thanks for contacting me. Since I am an RNA biologist with poor background in bioinformatics, I think it would be better to have someone else to continue work on this issue. However, I am willing to contribute to other areas such as manuscript, literature search etc.
I don’t have any code to push; below is what I tried during APAeval: • I installed TAPAS, figured out how to run it using our small sample data set • Troubleshooted converting our test annotation file to refFlat and found it requires “NM_” identifier to be run by the command. Etc. • I updated TAPAS section of methods summary spreadsheet and also uploaded TAPAS results from our small test data • Submitted feedback about the method
Thank you !!
Best,
Yoseop
Yoseop Yoon, Ph.D. Postdoctoral Scholar Shi Lab Department of Microbiology & Molecular Genetics School of Medicine University of California, Irvine
On Jul 2, 2021, at 7:26 AM, SamBryce-Smith @.***> wrote:
Hey @yoseopyoon https://github.com/yoseopyoon Just wanted to check if you're still going to be able to contribute to this workflow? Absolutely not a problem if not (though would be great if you still want to help out!), it's mainly so the rest of the group know what still needs to be done.
Also, if you've managed to make any progress so far it would be great to see any of your code pushed, even if it's still a work in progress! Most of all you'll get credit for your work but then others can join in too without starting from scratch.
Thanks :)
— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/iRNA-COSI/APAeval/issues/39#issuecomment-873039218, or unsubscribe https://github.com/notifications/unsubscribe-auth/AUHATPVHPJPHL6O6I6LW4H3TVXD7VANCNFSM45PCEL4A.
Hi @SamBryce-Smith, I will be taking over the TAPAS execution workflow. Thank you so much for all your effort, @yoseopyoon!
@yuukiiwa implemented a nextflow pipeline on branch TAPAS
. When trying to run with test data (i.e. much smaller than a typical dataset), the workflow exited with a 137 error code (out of memory error code). See screenshot below & attached nextflow.log
for this run.
This workflow is currently on hold whilst we focus on other aspects of the challenge, but we may return to this to try and debug (other APA tools/papers have managed to get it running).
I spotted on branch TAPAS
that the read-length parameter (-l
) was missing the from TAPAS APA_sites_detection
call.
@yuukiiwa addressed this (3d9f687e159a63a83ebc4e000b3405f3213cc502) and re-ran the workflow with a 12MB BAM file for testing. The workflow completed successfully in 8 hours (!).
Execution workflows are currently a low priority right now, but it does seem possible to get TAPAS to generate APAeval-compliant outputs (although the scalability of the tool to large/full-size datasets is a concern...)
@yuukiiwa @SamBryce-Smith thank you guys for all the work!! Are we planning to create a PR for TAPAS branch to be merged to main?
WHAT
Write execution workflow for TAPAS. Use the provided small files for testing (running the workflow on real data is a different issue).
CHECKLIST
OUTPUTS (see specification):
[ ] Output: Adhere to output specification for Identification challenge
[ ] Output: Adhere to output specification for quantification challenge
[ ] Output: Adhere to output specification for differential usage challenge