stjudecloud / workflows

Bioinformatics workflows developed for and used on the St. Jude Cloud project.
MIT License

Add a read trimming step to alignment workflows #179

Open a-frantz opened 3 weeks ago

a-frantz commented 3 weeks ago

Currently, our workflows assume read trimming has already occurred upstream, so we don't perform it as part of alignment. This assumption is often violated.

As part of this issue, we need to select a read trimming tool/algorithm (which may require some comparative analysis) and then incorporate it into the *-core workflows. We also need to ensure that trimming FASTQs which have already been trimmed upstream does no harm.
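One way to frame the "no harm" question is as an idempotence check: run the trimmer on its own output and see whether anything changes. The sketch below is only an illustration of that check, not a proposal for the actual tool. `trim_3prime_adapter` and `retrim_changes_read` are hypothetical helpers, the trimmer is a toy exact-match 3' adapter trimmer (real tools tolerate mismatches and also trim on quality), and the adapter sequence and `min_overlap` value are assumptions chosen for the example.

```python
from typing import Tuple

# Assumed example adapter; the real choice depends on the library prep.
ADAPTER = "AGATCGGAAGAGC"


def trim_3prime_adapter(
    seq: str, qual: str, adapter: str = ADAPTER, min_overlap: int = 3
) -> Tuple[str, str]:
    """Toy exact-match 3' adapter trimmer (hypothetical, for illustration)."""
    # Case 1: the full adapter occurs in the read; cut at its first occurrence.
    idx = seq.find(adapter)
    if idx != -1:
        return seq[:idx], qual[:idx]
    # Case 2: the read ends with a partial adapter; find the longest adapter
    # prefix (at least min_overlap bases) that is a suffix of the read.
    for k in range(min(len(adapter) - 1, len(seq)), min_overlap - 1, -1):
        if seq.endswith(adapter[:k]):
            return seq[:-k], qual[:-k]
    # No adapter evidence: return the read unchanged.
    return seq, qual


def retrim_changes_read(seq: str, qual: str, adapter: str = ADAPTER) -> bool:
    """Return True if a second trimming pass alters an already-trimmed read."""
    once = trim_3prime_adapter(seq, qual, adapter)
    twice = trim_3prime_adapter(once[0], once[1], adapter)
    return once != twice


if __name__ == "__main__":
    reads = [
        "ACGTACGT" + ADAPTER,         # insert followed by the full adapter
        "ACGT" + "AGA" + ADAPTER,     # insert that itself ends in an adapter-like "AGA"
    ]
    for seq in reads:
        qual = "I" * len(seq)
        print(seq, "re-trim changes read:", retrim_changes_read(seq, qual))
```

In this toy model the second read loses three more bases on the second pass, because the genuine insert happens to end with a short adapter prefix. Whether a real trimmer behaves this way on our data is exactly what the comparative analysis would need to establish; the same once-versus-twice comparison could be run on real FASTQs with whichever tool we pick.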

If we opt to investigate multiple read trimming tools, we might as well write WDL tasks for all of them. It would be nice if users could select the trimming tool as a workflow option. However, we may find that the tools are not all created equal and that only one choice should be supported. TBD.