At the end of the First look at the dataset notebook, under the heading "we made important observations" we mention that linear models can only capture linear interactions but there is no prior reference to linear models.
The word "complex" at the end of the Effect of the sample size in cross-validation notebook may be misleading. Here we don’t mean complexity as in the depth of a decision tree or the degree of a polynomial (which indeed would lead to overfitting), but rather that a more "expressive" model should be used.
In the Solution for Exercise M3.02 the wording refers to the "accuracy of the model" but the model is a KNeighborsRegressor.
At the end of the First look at the dataset notebook, under the heading "we made important observations" we mention that linear models can only capture linear interactions but there is no prior reference to linear models.
The word "complex" at the end of the Effect of the sample size in cross-validation notebook may be misleading. Here we don’t mean complexity as in the depth of a decision tree or the degree of a polynomial (which indeed would lead to overfitting), but rather that a more "expressive" model should be used.
In the Solution for Exercise M3.02 the wording refers to the "accuracy of the model" but the model is a
KNeighborsRegressor
.The
value_counts
function in the Classification metrics notebook is raising a warning.This PR proposes some fixes to address those issues.