hoomano / mojodex

Open Source Digital Assistant Platform for Enterprises
https://hoomano.github.io/mojodex
Apache License 2.0

Vocab size #43

Open KellyRousselHoomano opened 7 months ago

KellyRousselHoomano commented 7 months ago

Mojodex can currently learn only 245 tokens of "vocabulary" for transcription. We should find a way to lift this restriction.

xbasset commented 4 months ago

The point is to control the "volume" of vocab parameters sent to Whisper prior to the call.
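
To make "controlling the volume" concrete, here is a minimal sketch, not Mojodex's actual code. It assumes the vocabulary is a priority-ordered list of terms and that the budget is the 245 tokens quoted above. The names `approx_token_count`, `select_vocab` and `build_prompt` are hypothetical, and the 4-characters-per-token estimate is only a stand-in for a real Whisper tokenizer.

```python
from typing import Callable, Iterable, List


def approx_token_count(text: str) -> int:
    """Rough token estimate (assumption: about 4 characters per token).

    A real tokenizer matching the Whisper model should replace this.
    """
    return max(1, (len(text) + 3) // 4)


def select_vocab(
    terms: Iterable[str],
    max_tokens: int = 245,  # limit quoted in this issue
    count_tokens: Callable[[str], int] = approx_token_count,
) -> List[str]:
    """Keep terms, in priority order, while they fit in the token budget."""
    selected: List[str] = []
    seen = set()
    used = 0
    for term in terms:
        term = term.strip()
        if not term or term.lower() in seen:
            continue  # skip blanks and case-insensitive duplicates
        cost = count_tokens(term + ", ")  # count the separator as well
        if used + cost > max_tokens:
            continue  # too big; a shorter, lower-priority term may still fit
        selected.append(term)
        seen.add(term.lower())
        used += cost
    return selected


def build_prompt(terms: Iterable[str], **kwargs) -> str:
    """Join the selected terms into one comma-separated prompt string."""
    return ", ".join(select_vocab(terms, **kwargs))
```

The resulting string would then be what gets sent to Whisper as the vocabulary hint. How the terms are prioritised (most recent, most frequent, most relevant to the current task) is the real design question and is left open here.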

A few options to explore: