mkiol / dsnote

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.
Mozilla Public License 2.0
384 stars 19 forks source link

STT language auto-detection #147

Open rubdos opened 5 days ago

rubdos commented 5 days ago

Currently, the STT Dbus API requires mentioning a model based on a language. However, Whisper has a model that does language autodetection.

It would be nice if one could call SttTranscribeFile (and related APIs, I suppose) with string:lang:auto, such that the language gets inferred by the STT. The relevant signals should then return the detected language (but I think it's mostly there already).

Additionally, the relevant user interfaces (I'm on SailfishOS) should get some element that allows downloading this general model.

mkiol commented 4 days ago

Hi, thanks for the idea. Language auto detection can indeed be useful.

I think I can add this in the next version.