Closed st1992 closed 3 years ago
"Why is there a mismatch of 28 and 29?" I think because CTC blank token not in vocabulary
yes, CTC black token is a special one and it is NOT part of the vocabulary. There are, however, a dim for it in the output logits.
Post transcription I checked log_probs size and got this
result['log_probs'].size()
Gottorch.Size([1, 7528, 29])
Using Google Collab
print(len(quartznet.decoder.vocabulary))
Got 28Why is there a mismatch of 28 and 29?