kids-first / kf-rnaseq-workflow

:microscope: RNA-Seq workflow for Kids-First DRC
Apache License 2.0
9 stars 8 forks source link

✨ Single End Support #15

Closed dmiller15 closed 4 years ago

dmiller15 commented 4 years ago

Description

Please include a summary of the change and which issue is fixed. Please also include relevant motivation and context. List any dependencies that are required for this change.

Resolves https://github.com/d3b-center/bixu-tracker/issues/623 Also contains estimates for https://github.com/d3b-center/bixu-tracker/issues/627

Type of change

How Has This Been Tested?

Test Configuration:

Checklist:

migbro commented 4 years ago

Are there plans to update documentation (both README and cavatica doc) in another PR? This will need to be completed before updating the cavatica public and cutting a new release update

dmiller15 commented 4 years ago

I'll update the README and doc today and put it in this PR. Shouldn't be much to add as the changes are relatively small from a user's perspective.

bmennis commented 4 years ago

Ok so I tested the SE samples I have used in the PBTA Splicing Analysis project. All three tasks completed successfully here: https://cavatica.sbgenomics.com/u/kfdrc-harmonization/kf-reference-pipeline/tasks/#q?search=RNA&page=1

For comparison to the analysis for the PBTA Splicing project, the completed tasks can be found here: https://cavatica.sbgenomics.com/u/d3b-bixu/rs-h7xa9jez-pbta-splicing/tasks/#q?search=RNA-Seq-single&page=1&status=COMPLETED

I have taken a quick look at the RNASeQC metrics and they appear to match up. Let me know thoughts on this or if more testing is needed

migbro commented 4 years ago

@bmennis , when you have a chance, can you quickly re-run those test samples with this updated wf? Thanks!

bmennis commented 4 years ago

Thanks so much for looking at the workflow and making the changes, I will get those re tests running.

tkoganti commented 4 years ago

All the runs that @bmennis ran were single end samples. Would it be useful to run a task on paired end as well to make sure that runs as expected?

dmiller15 commented 4 years ago

From a regression testing point of view, that would be prudent.

tkoganti commented 4 years ago

Dan made some updates on cutadapt. Maybe running some tests on paired with adapters and without adapters to make sure everything works might be good

migbro commented 4 years ago

Hi @bmennis , when you get a chance, can you run the following tests to close this out, as @tkoganti suggested:

Even if the source doesn't really have adapters, really just to make sure cutadapt runs properly given the input, so you can just use standard illumina to test. Thanks!

bmennis commented 4 years ago

I have the SE test samples running here: https://cavatica.sbgenomics.com/u/kfdrc-harmonization/kf-reference-pipeline/tasks/#q?search=SE-SRR7268791&page=1

I also have the PE samples running here: https://cavatica.sbgenomics.com/u/kfdrc-harmonization/kf-reference-pipeline/tasks/#q?page=1&search=PE-BS_4X8PQ5G6