MichaelMcCulloch / WhisperVoiceKeyboard

A voice to text keyboard based on OpenAI Whisper Model.
MIT License
50 stars 5 forks

The app struggles with names. #4

Open MichaelMcCulloch opened 1 year ago

MichaelMcCulloch commented 1 year ago

The model struggles with names: it assumes it heard an ordinary word instead. "Tostitos", for example, is often transcribed wrong.

Either lean on a larger model and suffer at least 2x inference time, or build a dictionary of common mistranscriptions and run the output through it before presenting it to the user.
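
A rough sketch of the dictionary option, in Python for brevity rather than in the app's own code. Everything named here is hypothetical: the `CORRECTIONS` table, its entries (illustrative, not observed model output), and the `correct_transcript` helper. It assumes the transcript is available as a plain string before it is committed to the text field.

```python
import re

# Hypothetical correction table: mistranscription (lowercase) -> intended name.
# Entries are made up for illustration; a real table would be collected
# from actual model output.
CORRECTIONS = {
    "toast it os": "Tostitos",
    "tostitoes": "Tostitos",
}


def build_pattern(corrections):
    """Compile one regex that matches any key as a whole word or phrase."""
    # Longest keys first so multi-word phrases win over shorter overlaps.
    keys = sorted(corrections, key=len, reverse=True)
    alternatives = "|".join(re.escape(key) for key in keys)
    return re.compile(r"\b(?:" + alternatives + r")\b", re.IGNORECASE)


def correct_transcript(text, corrections=CORRECTIONS):
    """Replace known mistranscriptions in `text` with the intended names."""
    if not corrections:
        return text
    pattern = build_pattern(corrections)
    # Keys are stored lowercase, so normalise the match before the lookup.
    return pattern.sub(lambda m: corrections[m.group(0).lower()], text)


if __name__ == "__main__":
    print(correct_transcript("Pass the toast it os please"))
```

A lookup like this costs almost nothing next to inference, but it only fixes errors that are already in the table, and a phrase that is sometimes legitimate would be rewritten too, so the table would need to stay conservative.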