kviip closed 6 years ago
The focus of this chapter will be to present approaches for visually exploring data and to demonstrate how this approach can be used to help guide feature engineering.
Both should be "approaches"
Part 4.2.1:
"However, if the outcome were transformed prior to modeling, it would ensure than negative ridership could not be predicted." should be "However, if the outcome were transformed prior to modeling, it would ensure that negative ridership could not be predicted."
"(as see in Figure 1.7)." should be "(as seen in Figure 1.7)."
(sidenote: does Figure 1.7 show this? I'm guessing you're referring to the y-axis being in natural units and not log units?)
"On station particularly stands out," should be "One station particularly stands out,"
Part 4.2.3:
"to uncover relationships between pairs of predictors, an to understand if" should be "to uncover relationships between pairs of predictors, and to understand if"
"4.3 Visualizations for Categorical Data: Explorating the OkCupid Data" should be "4.3 Visualizations for Categorical Data: Exploring the OkCupid Data"
I think of data as plural so "data are" is appropriate.
In version dated "2018-05-12":
Should be "a"
One "only" is unnecessary.
Should be "correlation".
Should be "include" not "includes"
Should be "has a".
"is" is unnecessary.
Should be "is ... part" instead.
Should be either "where the sizes ... are" or "where the size ... is".
There is no red block in the lower right (probably meant to be upper right).
"Its" in the sentence above refers to the test set, which has no "results", so the indicated sentence probably needs rephrasing.
Should be "data sets" instead of "datasets".
"of" is unnecessary.
"predictors variables" is incorrect, I think; it should be "predictor variables", or just "predictors" or "variables".
Should be "has" instead of "have".
Unfinished part of sentence probably - "during ..." ?
Should be "descent".
Should be "overestimate" instead.