Closed will-rice closed 11 months ago
This change makes sense because the audio, not the logits, is returned in the model output classes.
This change makes sense because the audio, not the logits, is returned in the model output classes.