joshuagryphon / plastid

Position-wise analysis of sequencing and genomics data
https://plastid.readthedocs.io
Other
36 stars 16 forks source link

Using 'centered' for read mapping - why and when? #19

Closed bdweave closed 6 years ago

bdweave commented 7 years ago

Hi Joshua,

Thanks for helping with my previous post re: mapfactories not working. Your workaround was great. This is less of an 'issue' and hopefully more of a discussion.

My main question is why and when to use 'centered' based read mapping. Following along in your docs, it appears this normalization is used for RNA-seq data specifically, when manipulating mapping dynamics for downstream analyses.

My next question is under what circumstances would you want to 'nibble' reads as you mapped them? The only thing I can come up with is to increase your signal after reducing it substantially with fractional counting mandated by 'centered' mapping.

Lastly, and a bit of a switch in topics, what would your workflow look like to identify the proportion of various start sites used in two different conditions in riboseq data, especially if alt start sites are located in 5' UTRs? I know that plastid likely contains all of the necessary tools to ask this question, but I am having trouble conceptualizing where to begin.

My best guess is to use the segment chain and genomic segment functionality as well as the 'get_counts' method combined with some type of sub-setting command to specifically target AUGs, NUGs, etc. If you could give me a big picture idea here, that would be super helpful!

Thanks for your time and attention. Really enjoying learning how to use Plastid and all of its functionality!

Respectfully, Brad

joshuagryphon commented 6 years ago

Hi Brad,

Sorry for the delayed response on this. I have been swamped since August, and this fell off my radar. In case it is still relevant, here is my response:

Hope this helps! I'm going to close this for now, since it's not quite an issue, but feel free to write back if you do have more questions. I'll be faster to respond.

Cheers, Josh