alphacep / vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Apache License 2.0
7.57k stars 1.06k forks source link

Vocabulary reduction #1327

Open olevanss opened 1 year ago

olevanss commented 1 year ago

Hi everyone

I am looking for a way to improve transcribation quality. To achieve it I want to test cutting the vocabulary down. I have found about dynamic vocabulary change of small models.

My question: Is there a way to do it for big models? In particular, russian big model? It contains HCLG.fst and is static.

Thank you!