The material does not explain why one-hot encoding is not used, or why the output layer has 10 units rather than 1.
The material does say:
"To calculate crossentropy loss for data that has its classes represented by integers (i.e., not one-hot encoded), we use the SparseCategoricalCrossentropy() function:"
but by that point it is too late to introduce the idea.
We should explain why one-hot encoding is not used at the point where the architecture is built.
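As a rough illustration of what such an explanation could show, here is a plain-Python sketch (not Keras code, and not taken from the material). The logits, the label, and all function names are made up for the example. It assumes the output layer has one unit per class, so 10 units yield 10 class probabilities, and an integer label simply picks out one of them. Under that assumption, the loss computed from the integer label matches the loss computed from the one-hot vector:

```python
import math

def softmax(logits):
    """Turn raw output-layer scores into probabilities that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def sparse_cross_entropy(probs, label):
    """Cross-entropy when the target is an integer class index."""
    return -math.log(probs[label])

def one_hot(label, num_classes):
    """Build the one-hot vector for an integer class index."""
    return [1.0 if i == label else 0.0 for i in range(num_classes)]

def categorical_cross_entropy(probs, target):
    """Cross-entropy when the target is a one-hot vector."""
    return -sum(t * math.log(p) for t, p in zip(target, probs))

NUM_CLASSES = 10  # one output unit per class

# Hypothetical raw outputs of a 10-unit output layer for one sample
logits = [0.1, 2.0, -1.0, 0.3, 0.0, 1.5, -0.5, 0.7, 0.2, -2.0]
label = 5  # hypothetical integer class label

probs = softmax(logits)
sparse_loss = sparse_cross_entropy(probs, label)
dense_loss = categorical_cross_entropy(probs, one_hot(label, NUM_CLASSES))

print(f"loss from integer label: {sparse_loss:.6f}")
print(f"loss from one-hot label: {dense_loss:.6f}")
```

Both losses agree up to floating-point rounding, which is the point worth making early: the integer labels can be used as they are, but the output layer still needs one unit per class.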
Reference: