UBC-MDS / DSCI522_309

Workflows Analysis Project for DSCI 522
Other
1 stars 3 forks source link

Visualization feedback #55

Open ksedivyhaley opened 4 years ago

ksedivyhaley commented 4 years ago
aromatic-toast commented 4 years ago
  1. This has already been removed in the latest commit.
  2. Will plot the variables on a log scale to see if this fixed the violin plot
  3. The data does not come with labels for the Browser and OS variables so naming these is impossible
    1. will reorder the factors of months to put them in chrono order
  4. we decided to remove the correlation plot. We had a github issue on the day the assignment was due and the code to resize the corr matrix was lost.
ksedivyhaley commented 4 years ago
  1. 👍
  2. Log scale is worth a try, as long as the scale is clearly labelled.
  3. You mean you have no way of determining which number corresponds with each Browser and OS?
  4. 👍
  5. Too bad! If it's a choice between keeping a big correlation plot and removing it, I would keep the big plot.
aromatic-toast commented 4 years ago

Yes, the data does not come with labels of the Browser and OS. It it just numbers so we have no way of knowing what these numbers map onto. There is minimal meta data coming from the UCI website.

aromatic-toast commented 4 years ago

We removed the correlation plot because we didn't have an impact on the downstream analysis as we didn't have the time to do any feature selection later. But it was part of the initial EDA. We were not sure if it was okay to include it even thought we don't really use it to inform the downstream analysis.