Open dtaylo95 opened 10 months ago
Almost there! There's just one verrrrrry small issue:
When you are grabbing the phenotype values (i.e. the CB1908 IC50) values to make the boxplot, you're incrementing your number
index before you grab each phenotype, rather than after, which means all your phenotypes are shifted by one index (i.e. you're pairing each phenotype with the wrong genotype)
That's why your plot doesn't look the same as mine. Note that when you fix this, you will also need to change/update the way you're dropping the nan
values from the heterozygous phenotypes:
This is a super minor issue though, so I'm happy to give you a 10/10. I just want you to be aware of why your plot isn't quite working right.
Whether or not you decide to make that change, feel free to close this issue.
Current grade: 10/10
README.md
with commands and analyses1/2
Missing your answer to 3.4.
plotting.py
script to produce plots3/4
While the plot itself is missing, the code for your Manhattan plot looks good. Very minor issue, but it looks like you're plotting ALL of your associations in your manhattan plots, rather than just the genotype associations. To clarify: when you run your GWAS, you include the top PCs as covariates in the regression (this is correct). But this means that you also get regression results for the covariates, not just the variants you're testing. Take a look at the
TEST
column in the.assoc.linear
output file(s) of theplink --linear
command to figure out which results you want to keep/plot.For your boxplot, it looks like you're picking the correct top SNP (for CB1908), but I... have no idea what you're plotting. It looks like you're plotting some kind of correlation between genotypes? Really not sure. What you want to be plotting is the relationship between the genotypes of your top SNP (rs10876043) and the values of your phenotype (namely, the IC50 of the CB1908 drug). So you should have a series of 3 boxplots--one for each genotype of rs10876043--where for each genotype, you plot the IC50 values of all individuals with that genotype. Also make sure you label everything all pretty.
Pretty plots
3/4
Missing your Manhattan plot.
Grade
Total: 7/10