okulovsky / kaia

GNU General Public License v3.0
17 stars 3 forks source link

Sound improvement #5

Open okulovsky opened 6 days ago

okulovsky commented 6 days ago

General ideas for sound improvement

okulovsky commented 6 days ago

To use https://github.com/SYSTRAN/faster-whisper insread of Whisper. Only integration with BrainBox is needed

https://github.com/m-bain/whisperX can do diarization of the video, might help if building corpus for specific speech.

https://mynoise.net/NoiseMachines/dungeonRPGSoundscapeGenerator.php can help to generate back noises such as ocean etc for athmosphere, This https://stability.ai/news/introducing-stable-audio-open can do it too but seems just a more expensive tool for the same purpose