DataSlingers / MoMA

MoMA: Modern Multivariate Analysis in R
https://DataSlingers.github.io/MoMA
GNU General Public License v2.0

LDA Example Data: Rockhoppers #59

Open michaelweylandt opened 5 years ago

michaelweylandt commented 5 years ago

The iris data set is overused in LDA / classification examples and is typically misapplied [1].

Let's use a new data set for our LDA examples and include it in the package. Steinfurth et al. have a paper on classifying penguins by sex from various body measurements [2], which seems like it would make a great example.

Idea from [3]; see also [4-5].
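
To make the proposal concrete, here is a rough sketch of what a two-class (sex) LDA example could look like. Everything in it is an assumption for illustration: the data are simulated rather than taken from [2], the column names `bill_length` and `bill_depth` and their distribution parameters are made up, and the fit uses `MASS::lda` as a stand-in rather than any MoMA function.

```r
# Hypothetical illustration: simulated measurements, NOT the Steinfurth et al. data.
library(MASS)  # provides lda()

set.seed(59)
n <- 100  # simulated birds per sex

# Column names and distribution parameters are invented for this sketch.
penguins_sim <- data.frame(
  sex         = factor(rep(c("female", "male"), each = n)),
  bill_length = c(rnorm(n, mean = 42, sd = 2), rnorm(n, mean = 46, sd = 2)),
  bill_depth  = c(rnorm(n, mean = 19, sd = 1), rnorm(n, mean = 21, sd = 1))
)

# Fit a two-class LDA and inspect the single discriminant direction.
fit <- lda(sex ~ bill_length + bill_depth, data = penguins_sim)
print(fit$scaling)

# In-sample confusion table (optimistic; a real example should hold out data).
pred <- predict(fit)$class
print(table(truth = penguins_sim$sex, predicted = pred))
```

A packaged version would replace `penguins_sim` with the real measurements once licensing and provenance are sorted out.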

[1] http://www.dicook.org/files/jsm19/slides#1
[2] https://www.int-res.com/abstracts/esr/v39/p293-302/
[3] https://twitter.com/dan_p_simpson/status/1164581393516527616
[4] http://www.publish.csiro.au/mu/MU16027
[5] https://figshare.com/articles/Data_from_Using_measurements_to_predict_laying_order_in_harvested_Northern_Rockhopper_Penguin_Eudyptes_moseleyi_eggs/3384109