tgen / phoenix

Jetstream compatible workflow template supporting comprehensive analysis of human sequencing data against GRCh38
MIT License
17 stars 6 forks source link

Filter to Targets #45

Closed PedalheadPHX closed 5 years ago

PedalheadPHX commented 5 years ago

For exomes we need to filter calls to the callable space based on our callable files (targets+/-100bp padding plus extended to end of touched exons)

### Require preprocessing
# compressed and indexed bed file 
# CAN NOT include the header lines
# contigs need to match reference
# intervals need to be intersected as overlapping independent intervals will result in the same variant being extracted multiple times
bgzip test.bed
tabix test.bed.gz

### Example call
bcftools filter \
--regions-file test.bed.gz \
--output-type z \
--output f2t.vcf.gz \
NMTRC_0002_1_PB_Whole_C1_KHSTX.bwa.hc.vcf.gz
ryanrichholt commented 5 years ago

For constitutional/germline variant calls as well as somatic?

PedalheadPHX commented 5 years ago

For any exome regardless of source

On Wed, Mar 20, 2019 at 10:16 AM Ryan Richholt notifications@github.com wrote:

For constitutional/germline variant calls as well as somatic?

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/tgen/phoenix/issues/45#issuecomment-474939404, or mute the thread https://github.com/notifications/unsubscribe-auth/AL1RHPBWouzkf1LxIiDSPtQvZG_HQKcEks5vYmzwgaJpZM4b9T2H .

-- Jonathan Keats, Ph.D. Assistant Professor & Director of Bioinformatics Translational Genomics Research Institute (TGen) jkeats@tgen.org | www.keatslab.org (W) 602-343-8690 | (M) 480-543-0634

--

This electronic message is intended to be for the use only of the named recipient, and may contain information that is confidential or privileged, including patient health information. If you are not the intended recipient, you are hereby notified that any disclosure, copying, distribution or use of the contents of this message is strictly prohibited. If you have received this message in error or are not the named recipient, please notify us immediately by contacting the sender at the electronic mail address noted above, and delete and destroy all copies of this message. Thank you.

ryanrichholt commented 5 years ago

Fixed here, fd6e0b641ec4cf7abffdec0e6e4eea68105a6c38. I created extended bed files for most of the commercial Agilent kits, and also Strexome v5 and v6 here 69be1175a5d7076e55a35487abc95882fc9cc5a5