ochen1 / insanely-fast-whisper-cli

The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚑️
MIT License
318 stars 30 forks source link

Insanely Fast Whisper (CLI)

GitHub License Python Version

Powered by πŸ€— Transformers & Optimum and based on Vaibhavs10/insanely-fast-whisper.

TL;DR - πŸŽ™οΈ Transcribe 300 minutes (5 hours) of audio in less than 10 minutes - with OpenAI's Whisper Large v2. Blazingly fast transcription is now a reality!⚑️

Features

✨ ASR Model: Choose from different πŸ€— Hugging Face ASR models, including all sizes of openai/whisper and even use an English-only variant (for non-large models).

πŸš€ Performance: Customizable optimizations ASR processing with options for batch size, data type, and BetterTransformer, all from the comfort of your terminal! 😎

πŸ“ Timestamps: Get an SRT output file with accurate timestamps, allowing you to create subtitles for your audio or video content.

Installation

Usage

insanely-fast-whisper --model openai/whisper-base --device cuda:0 --dtype float32 --batch-size 8 --better-transformer --chunk-length 30 your_audio_file.wav

Example

Transcribing an audio file with English-only Whisper model and returning timestamps:

insanely-fast-whisper --model openai/whisper-base.en your_audio_file.wav

Output

The tool will save an SRT transcription of your audio file in the current working directory.

License

This project is licensed under the MIT License.

Acknowledgments

πŸ“ž Contact

Have questions or feedback? Feel free to create an issue!

🌟 Star this repository if you find it helpful!

Star History Chart


GitHub Issues GitHub Pull Requests

πŸš€ Happy transcribing with Insanely Fast Whisper! πŸš€