Closed SinanAkkoyun closed 6 days ago
WhisperModel now also outputs encoder embeddings optionally.
WhisperModel
Usage: normal, backwards compatible:
segments, info = model.transcribe(audio_path, beam_size=1)
for encoder outputs:
segments, info, encoder_output = model.transcribe(audio_path, beam_size=1).all()
Usage is fully backwards compatible
Hello and thanks for your contribution, however, I don't see why would the encoder output be needed after the transcription, so far I'm not in favor of merging this PR as it's a very niche use case
WhisperModel
now also outputs encoder embeddings optionally.Usage: normal, backwards compatible:
for encoder outputs:
Usage is fully backwards compatible