UBC-MDS / DSCI522_309

Workflows Analysis Project for DSCI 522
Other
1 stars 3 forks source link

EDA #5

Closed aromatic-toast closed 4 years ago

aromatic-toast commented 4 years ago

isn't it preferable to have the .Rmd EDA document rendered as a .md so that you can see it on Github?

marvinmin commented 4 years ago

isn't it preferable to have the .Rmd EDA document rendered as a .md so that you can see it on Github?

yes, you are right, we will add the md file when we finish the final EDA

vermashivam679 commented 4 years ago

My mistake.. I made the html file because I could see the plotly interactive plots... but the github shows the rawfile. I will change that to .md

marvinmin commented 4 years ago

For the last plot in EDA, I think we may need to change the order of the x_axis. I'll look into it.

aromatic-toast commented 4 years ago

may want to add titles to the histograms shown near the top of the .md. Not completely clear what these plots are showing.

marvinmin commented 4 years ago

may want to add titles to the histograms shown near the top of the .md. Not completely clear what these plots are showing.

which histograms? the one with two features facet?

vermashivam679 commented 4 years ago

Hey guys, I reviewed everything & it seems fine to me.... There is just one thing that we should have changed in the graph...showing relative frequency in barplots instead of count, the image shows how I wanted it to show but couldn't figure out how in ggplot.

image (1)

marvinmin commented 4 years ago

may want to add titles to the histograms shown near the top of the .md. Not completely clear what these plots are showing.

which histograms? the one with two features facet?

added title for this histogram

marvinmin commented 4 years ago

Hey guys, I reviewed everything & it seems fine to me.... There is just one thing that we should have changed in the graph...showing relative frequency in barplots instead of count, the image shows how I wanted it to show but couldn't figure out how in ggplot.

image (1)

How did you calculate the frequency? Is it the total number of trues in a month divided by the total number of trues? or divided by the total number of counts for both trues and falses?