huggingface / optimum-nvidia

Apache License 2.0

Whisper inference #107

Closed fxmarty closed 3 months ago

fxmarty commented 3 months ago

As per the title. Tested in fp16 only for now.

This allows using Transformers Whisper checkpoints with TRT-LLM inference.

Left to do: