Open okulovsky opened 6 days ago
Use https://github.com/SYSTRAN/faster-whisper instead of Whisper. Only the integration with BrainBox is needed.
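A rough sketch of what the faster-whisper side could look like, before any BrainBox wiring. `WhisperModel` and `transcribe` come from faster-whisper's README; the `Line` tuple, the `collect_lines` and `transcribe_file` helpers, the `"small"` model size and the CPU/int8 settings are assumptions picked for illustration, and nothing here reflects how BrainBox actually expects a decider to be packaged.

```python
from typing import Iterable, List, NamedTuple


class Line(NamedTuple):
    """One transcribed segment (hypothetical container, not part of faster-whisper)."""
    start: float
    end: float
    text: str


def collect_lines(segments: Iterable) -> List[Line]:
    """Convert segment objects (anything with .start, .end, .text) into plain tuples."""
    return [Line(float(s.start), float(s.end), s.text.strip()) for s in segments]


def transcribe_file(path: str, model_size: str = "small") -> List[Line]:
    """Transcribe one audio file with faster-whisper (assumed settings: CPU, int8)."""
    # Imported lazily so collect_lines can be used without faster-whisper installed.
    from faster_whisper import WhisperModel

    model = WhisperModel(model_size, device="cpu", compute_type="int8")
    # transcribe() returns a lazy generator of segments plus an info object;
    # the actual decoding happens while the generator is consumed.
    segments, _info = model.transcribe(path, beam_size=5)
    return collect_lines(segments)
```

Calling `transcribe_file("sample.wav")` (a placeholder path) would return a list of `Line` tuples that could then be serialized in whatever form the BrainBox integration ends up needing.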
https://github.com/m-bain/whisperX can do speaker diarization of a video's audio, which might help when building a corpus of a specific speaker's speech.
https://mynoise.net/NoiseMachines/dungeonRPGSoundscapeGenerator.php can help generate background noises, such as ocean sounds, for atmosphere. https://stability.ai/news/introducing-stable-audio-open can do this too, but it seems to be just a more expensive tool for the same purpose.
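Whichever source the ambience comes from, it still has to be mixed under the speech. A minimal sketch of one way to do that, assuming both signals are already decoded to float samples at the same sample rate; the function names and the 20 dB default are made up for illustration, and clipping is not handled.

```python
import math
from typing import List, Sequence


def rms(samples: Sequence[float]) -> float:
    """Root-mean-square level of a non-empty sample sequence."""
    return math.sqrt(sum(x * x for x in samples) / len(samples))


def mix_background(speech: Sequence[float], noise: Sequence[float],
                   snr_db: float = 20.0) -> List[float]:
    """Add looped noise under speech so speech RMS / noise RMS equals snr_db."""
    if not speech or not noise:
        raise ValueError("speech and noise must be non-empty")
    noise_rms = rms(noise)
    if noise_rms == 0.0:
        # Silent noise track: nothing to add.
        return list(speech)
    # Scale the noise so its RMS sits snr_db decibels below the speech RMS.
    gain = rms(speech) / (10 ** (snr_db / 20)) / noise_rms
    # Loop the noise with modulo indexing if it is shorter than the speech.
    return [s + gain * noise[i % len(noise)] for i, s in enumerate(speech)]
```

A lower `snr_db` makes the ambience louder relative to the voice; the right value would have to be chosen by ear.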
General ideas for sound improvement