Basically, LDA projects your X values (an n x p matrix, so n x 256 for the lab) onto a lower-dimensional space. With 5 classes for y, there are at most K - 1 = 4 discriminant directions, so the projected matrix is n x 4. The first two columns of that n x 4 matrix are LD1 and LD2.

The big picture from a data visualization perspective is that you cannot plot something in 256 dimensions (you can't even plot something in 4 dimensions!), but you can plot something that has just 2 dimensions. LD1 and LD2 are a reduced version of the large matrix of predictors, and they happen to span the best two-dimensional plane for visualizing the discriminant rule calculated from the model. I don't expect you to be able to calculate these "by hand" - we have only covered calculating the discriminant score, the probability of being in each class, and the discriminant rule in the simple 2-class, 1-predictor case.
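If it helps to see this concretely, here is a minimal sketch using scikit-learn's LinearDiscriminantAnalysis on synthetic data. The shapes mirror the lab (256 predictors, 5 classes), but the data itself is made up for illustration:

```python
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

rng = np.random.default_rng(0)
n, p, k = 500, 256, 5                       # n observations, p predictors, k classes
means = rng.normal(scale=0.3, size=(k, p))  # a distinct (made-up) mean vector per class
y = rng.integers(0, k, size=n)
X = rng.normal(size=(n, p)) + means[y]      # n x 256 predictor matrix

# With k = 5 classes, LDA yields at most k - 1 = 4 discriminant directions;
# requesting n_components=2 keeps just the first two, LD1 and LD2.
lda = LinearDiscriminantAnalysis(n_components=2)
Z = lda.fit_transform(X, y)

print(Z.shape)  # (500, 2): column 0 is LD1, column 1 is LD2
```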
Visualizing data like this can help you see, broadly, how well the model will discriminate between groups (e.g., if one group's points are tightly clustered away from the others, LDA will likely have no trouble identifying those points).
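For instance, continuing with the synthetic setup above (again, all names and data here are illustrative), a scatter of LD1 against LD2 colored by class, with each class centroid marked, gives the kind of picture shown in the slides:

```python
import numpy as np
import matplotlib.pyplot as plt
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

# same made-up data as the sketch above
rng = np.random.default_rng(0)
n, p, k = 500, 256, 5
means = rng.normal(scale=0.3, size=(k, p))
y = rng.integers(0, k, size=n)
X = rng.normal(size=(n, p)) + means[y]

Z = LinearDiscriminantAnalysis(n_components=2).fit_transform(X, y)

for cls in range(k):
    pts = Z[y == cls]
    plt.scatter(pts[:, 0], pts[:, 1], s=10, label=f"class {cls}")
    plt.scatter(*pts.mean(axis=0), marker="x", c="k", s=80)  # class centroid
plt.xlabel("LD1")
plt.ylabel("LD2")
plt.legend()
plt.show()
```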
If you're interested in a little more explanation on the math side, I really like these slides.
Regarding the graph of LD1 vs LD2 in your slides, I'm not quite sure how to interpret LD1 and LD2 for a given LDA model. From the data in your slides, they seem to provide numeric coordinates for grouping different observations, but beyond that I'm not sure where they come from, or why they are able to separate the groups into these nice centroids. I didn't see a mention of them in the book, but perhaps I missed it.
My question is: what is our big takeaway from these values? The graph lets us view groupings for different observations, but of course the hat matrix would do that as well, so I'm not sure what exactly a large LD1 and a small LD2 value really tells me.