why not using one-hot encoding and softmax for CIFAR-10 model

In the material, it's not mentioned why one-hot encoding is not used and why output layer has 10 units but not 1.

The material has

To calculate crossentropy loss for data that has its classes represented by integers (i.e., not one-hot encoded), we use the SparseCategoricalCrossentropy() function:

but it's too late to introduce here.

We should explain why one-hot encoding is not used at the point where building the architecture.

Reference:

https://keras.io/api/losses/probabilistic_losses/#sparsecategoricalcrossentropy-class

carpentries-incubator / deep-learning-intro

why not using one-hot encoding and softmax for CIFAR-10 model #292