Long reads genome guided

Hello,

I have implemented RNA-Bloom on a subset of the PacBio bulk transcriptomic data and achieved the expected results. However, I noticed that some steps in RNA-Bloom take a considerable amount of time to execute. Before running RNA-Bloom on the entire dataset, I'd like to discuss potential methods to speed up the execution.

One idea is to align the reads to the genome, extract the aligned reads from specific non-overlapping regions of the reference, and then supply those to RNA-Bloom. Essentially, I am considering adopting the genome-guided strategy used by Trinity. Do you think this approach could help accelerate the process? Additionally, would adjusting certain parameters be beneficial since we wouldn't need to compare all reads to each other anymore?

Thank you

bcgsc / RNA-Bloom

Long reads genome guided #76