sr320 / LabDocs

Roberts Lab Documents
http://sr320.github.io/LabDocs/
9 stars 17 forks source link

Obtaining GOSlim information with UniProt IDs #655

Closed sr320 closed 6 years ago

sr320 commented 6 years ago

What are the best / different ways to do this?

laurahspencer commented 6 years ago

I too would like to know how to do this

Ellior2 commented 6 years ago

The more I think about it as I work through it, the more logical it seems to go directly to GO slim. When I unfold the GO terms for one protein and merge those GO terms with GO slim categories, I end up "doubling up" on the level of expression, if that make sense. Here I am working on 2016 data looking at Sample 1. As you can see many of the GO terms from one protein end up matching to the same GO slim category (makes sense). However since I want to use the NSAF values as a level of expression, I don't know if it really makes sense to sum these values like I did in the pivot table. My goal is to make a pie chart of the GO slim categories with relative expression to show resource allocation.

image

sr320 commented 6 years ago

This should get you from Blast output to GO slim without leaving Jupyter.

https://github.com/sr320/nb-2017/blob/master/misc/10-blast-2-slim.ipynb

drop

kubu4 commented 6 years ago

Baller!