lexiefoster / BIOS967_Foster

0 stars 0 forks source link

Project Update at Thanksgiving Break (Nov. 21st) #4

Open lexiefoster opened 11 months ago

lexiefoster commented 11 months ago

At this point in my project, I have a good grasp on creating plots of 2-3 variables, specifically surrounding the comparison of two dependent variables (such as root and shoot weights) for each of the Genotype IDs. One roadblock with this, however, is visualizing two dependent variables for ALL 600 Genotype IDs. I am currently just visualizing for up to 200-300 Genotype IDs.

In addition to plotting and comparing various different variables, I have also worked on running some statistics on these variables to compare their correlations, as well as, running ANOVAs on the Genotype IDs and select dependent variables. These have been pretty straightforward in creating.

Moving forward, I am wanting to work on over this break and into next week, creating a correlation plot for multiple different variables in my datasheet. I want to construct correlation plots for 1) all the amino acids, 2) some of the most important plant phenotypic traits, and 3) a mix of both amino acids and plant phenotypic traits. I found a few packages online, including "gclus", "corrplot", and "ggcorrplot" that I am trying to learn to use to construct these correlation plots. Right now the main issue/roadblock I am running into is an error when using these packages that says 'x' must be numeric. I am assuming that I need to specify what my 'x' factor/variable is.

dshizuka commented 11 months ago

Hi Lexie, This is a great update. If you have been working on writing script to use the corrplot and other packages, can you make sure those changes are pushed through GitHub? I don’t see it on the current repository. And if you can point me to the line in the code that generates the error, I can probably figure out what is happening. Likely, your data is being interpreted as character instead of numeric at some point along the way, so that is probably where you need to fix.

Dai

From: Lexie Foster @.> Date: Tuesday, November 21, 2023 at 3:10 PM To: lexiefoster/BIOS967_Foster @.> Cc: Subscribed @.***> Subject: [lexiefoster/BIOS967_Foster] Project Update at Thanksgiving Break (Nov. 21st) (Issue #4) Non-NU Email


At this point in my project, I have a good grasp on creating plots of 2-3 variables, specifically surrounding the comparison of two dependent variables (such as root and shoot weights) for each of the Genotype IDs. One roadblock with this, however, is visualizing two dependent variables for ALL 600 Genotype IDs. I am currently just visualizing for up to 200-300 Genotype IDs.

In addition to plotting and comparing various different variables, I have also worked on running some statistics on these variables to compare their correlations, as well as, running ANOVAs on the Genotype IDs and select dependent variables. These have been pretty straightforward in creating.

Moving forward, I am wanting to work on over this break and into next week, creating a correlation plot for multiple different variables in my datasheet. I want to construct correlation plots for 1) all the amino acids, 2) some of the most important plant phenotypic traits, and 3) a mix of both amino acids and plant phenotypic traits. I found a few packages online, including "gclus", "corrplot", and "ggcorrplot" that I am trying to learn to use to construct these correlation plots. Right now the main issue/roadblock I am running into is an error when using these packages that says 'x' must be numeric. I am assuming that I need to specify what my 'x' factor/variable is.

— Reply to this email directly, view it on GitHubhttps://urldefense.com/v3/__https:/github.com/lexiefoster/BIOS967_Foster/issues/4__;!!PvXuogZ4sRB2p-tU!EpnqCZGAtiZEZeR4ZTBzjdifCJLaX3XQGbBRakFVpHzP1WTqBnQlOg2l7SMUEmw_5m3J3FBpbNmcIeUyp-HpfUvNPw$, or unsubscribehttps://urldefense.com/v3/__https:/github.com/notifications/unsubscribe-auth/ABVQI4GEV4TL4J7QOPC2AILYFUKFXAVCNFSM6AAAAAA7VGZ5J6VHI2DSMVQWIX3LMV43ASLTON2WKOZSGAYDKMJUGA4DKNQ__;!!PvXuogZ4sRB2p-tU!EpnqCZGAtiZEZeR4ZTBzjdifCJLaX3XQGbBRakFVpHzP1WTqBnQlOg2l7SMUEmw_5m3J3FBpbNmcIeUyp-HI7FfJSA$. You are receiving this because you are subscribed to this thread.Message ID: @.***>

lexiefoster commented 11 months ago

Hi Dr. Shizuka,

I believe I committed and pushed through my most recent set of code for starting to construct the correlation plots just now, so it should be up to date! The lines where I am having errors come up are lines 118 and 123. Thanks!

Best, Lexie