alan-turing-institute / rds-course

Materials for Turing's Research Data Science course
https://alan-turing-institute.github.io/rds-course/

2.2.1 Data Consistency - confusing variable name #119

Open jack89roberts opened 1 year ago

jack89roberts commented 1 year ago

The smaller_petals variable name here:

# True for each row with body_mass_g greater than the min value of 285g
smaller_petals = df["body_mass_g"] > df["body_mass_g"].min()

# Lowest penguin weight out of all rows with weights above 285g
df.loc[smaller_petals, "body_mass_g"].min()

is a leftover from when this notebook used the Iris dataset. It should be renamed to something penguin-related, e.g. heavier_penguins.
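For illustration, the renamed snippet might look something like the sketch below. The small DataFrame is a made-up stand-in so the example runs on its own; it is not the course dataset. The name second_lowest is hypothetical and only there so the result can be printed.

```python
import pandas as pd

# Hypothetical stand-in for the notebook's penguins DataFrame (values invented for illustration)
df = pd.DataFrame({"body_mass_g": [285.0, 2700.0, 3750.0, 4200.0]})

# True for each row with body_mass_g greater than the minimum value in the column
heavier_penguins = df["body_mass_g"] > df["body_mass_g"].min()

# Lowest penguin weight out of all rows with weights above the minimum
second_lowest = df.loc[heavier_penguins, "body_mass_g"].min()
print(second_lowest)
```

Only the mask's name changes; the comparison and the df.loc lookup stay as they are in the notebook.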