UCSC-Treehouse / pipelines

Makefiles to run dockerized pipelines used in Treehouse on a single sample
Apache License 2.0

Explore lite pipeline #20

Open rcurrie opened 6 years ago

rcurrie commented 6 years ago

Creating this ticket to evaluate a lighter weight pipeline using Kallisto for quantification.

rcurrie commented 6 years ago

@jpfeil @hbeale Pizzly is a fusion caller that goes with Kallisto.

rcurrie commented 6 years ago

@hbeale Maybe I was overthinking all the details. We essentially just need something that correlates with uniqMappedNonDupeReadCount. Once we have that, we can scale appropriately. Preliminary attempt:

X = Total Reads * % Unique in first 100k of FASTQ

Y = uniqMappedNonDupeReadCount

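[image] https://user-images.githubusercontent.com/492570/43353396-b3c61120-91ec-11e8-9425-5e8eebb999ac.png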

Will look into those flat liners with a bit more tooling. Make sense?
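A minimal sketch of how X could be computed, not the code actually used for the plot. It assumes a plain 4-line-per-record FASTQ, and it treats "unique" as "distinct read sequence", which the comment above does not spell out. The names `unique_fraction` and `proxy_read_count` are hypothetical.

```python
import io
from itertools import islice


def unique_fraction(fastq_handle, max_reads=100_000):
    """Fraction of distinct sequences among the first max_reads records.

    Assumes 4 lines per FASTQ record (no wrapped sequences).
    """
    seen = set()
    n = 0
    # The sequence is the 2nd line of every 4-line record.
    seq_lines = islice(fastq_handle, 1, None, 4)
    for seq in islice(seq_lines, max_reads):
        seen.add(seq.strip())
        n += 1
    return len(seen) / n if n else 0.0


def proxy_read_count(total_reads, fraction_unique):
    """X = Total Reads * fraction unique in the sampled head of the FASTQ."""
    return total_reads * fraction_unique


# Toy input: 4 reads, 3 distinct sequences.
example = io.StringIO(
    "@r1\nACGT\n+\nIIII\n"
    "@r2\nACGT\n+\nIIII\n"
    "@r3\nTTGA\n+\nIIII\n"
    "@r4\nGGCC\n+\nIIII\n"
)
frac = unique_fraction(example)
x = proxy_read_count(1_000_000, frac)  # hypothetical total read count
print(frac, x)
```

For a gzipped file, the handle could come from `gzip.open(path, "rt")` instead of the in-memory example.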

hbeale commented 6 years ago

The trend looks right.

We need a fraction, so that we can apply it to the total number of exonic reads to get the number of non-dupe exonic reads.
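As a rough illustration of that scaling step (the function name and the numbers are made up for the example, not taken from any Treehouse sample):

```python
def non_dupe_exonic_reads(total_exonic_reads, non_dupe_fraction):
    """Scale the exonic read count by an estimated non-duplicate fraction."""
    if not 0.0 <= non_dupe_fraction <= 1.0:
        raise ValueError("non_dupe_fraction must be between 0 and 1")
    return round(total_exonic_reads * non_dupe_fraction)


# Hypothetical sample: 20M exonic reads, 75% estimated to be non-duplicates.
estimate = non_dupe_exonic_reads(20_000_000, 0.75)
print(estimate)
```

The point is that the proxy has to be a ratio in [0, 1] rather than an absolute count, so it can be applied to whichever read total is of interest.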


rcurrie commented 6 years ago

Removing samples from the infamous THR33:

image

Proxy = # Total Pairs (from FastQC) * % Pairs Unique in first 1M Pairs
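A sketch of the paired version of the proxy, under one assumption the thread does not state: a pair counts as a duplicate only when both mates' sequences match an earlier pair. `unique_pair_fraction` is a hypothetical helper, and the total pair count is passed in by the caller rather than parsed from FastQC output.

```python
from itertools import islice


def unique_pair_fraction(reads1, reads2, max_pairs=1_000_000):
    """Fraction of distinct (mate1, mate2) sequence pairs in the first max_pairs."""
    seen = set()
    n = 0
    for pair in islice(zip(reads1, reads2), max_pairs):
        seen.add(pair)  # a pair is a duplicate only if both mates match
        n += 1
    return len(seen) / n if n else 0.0


# Toy mates: the first two pairs are identical, so 3 of 4 pairs are distinct.
r1 = ["ACGT", "ACGT", "ACGT", "TTGA"]
r2 = ["GGCC", "GGCC", "GGTT", "CCAA"]
pair_frac = unique_pair_fraction(r1, r2)

total_pairs = 2_000_000  # hypothetical value, e.g. as reported by FastQC
proxy = total_pairs * pair_frac
print(pair_frac, proxy)
```

Keying on both mates is stricter than keying on the first mate alone; if the original analysis used a different definition, the fraction would shift accordingly.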