huggingface / distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
MIT License
3.33k stars 238 forks source link

How to make training data? #49

Open satani99 opened 7 months ago

satani99 commented 7 months ago

I have a folder like this: audio_1 transcript_1.txt audio_2 transcript_2.txt

how can I make this folder into huggingface dataset?

sanchit-gandhi commented 6 months ago

Hey @satani99! You can convert any custom dataset to Hugging Face Datasets' format using this guide: https://huggingface.co/docs/datasets/audio_dataset