indiana-university / automated-transcription-service

BSD 3-Clause "New" or "Revised" License
2 stars 0 forks source link

Vocabulary files #8

Open alan-walsh opened 6 months ago

alan-walsh commented 6 months ago

Allow users to include a vocabulary file with their transcription, either list or table. This is likely to make a huge difference in recordings with a lot of very domain-specific language.

In the current implementation this would require some kind of clue in the audio filename. Either the vocab file has the same name as the recording or perhaps some kind of prefix that becomes a clue to the audio-to-transcribe Lambda function to look for the vocab file.