I'm opening this discussion to outline and prioritize the core functionalities we plan to implement in Fab, our audio box project. Our goal is to build a flexible and modular toolkit that supports a wide range of audio-processing tasks. The proposed functionalities include:
[ ] Speech-to-Text
[ ] Speech-to-Phonemes
[ ] Text-to-Speech
[ ] Speaker Verification
[ ] Speaker Diarization
[ ] Voice Activity Detection
[ ] Speech Enhancement
[ ] Voice Conversion
These building blocks are intended to be combined into pipelines, catering to diverse applications and use cases.
I encourage everyone to provide feedback on these functionalities, suggest any additional ones you think are crucial, and discuss how we can efficiently integrate these components into our development roadmap.
Hi there! :)
I'm opening this discussion to outline and prioritize the core functionalities we plan to implement in
Fab
, our audio box project. Our goal is to build a flexible and modular toolkit that supports a wide range of audio-processing tasks. The proposed functionalities include:These building blocks are intended to be combined into pipelines, catering to diverse applications and use cases.
I encourage everyone to provide feedback on these functionalities, suggest any additional ones you think are crucial, and discuss how we can efficiently integrate these components into our development roadmap.
Best, @fabiocat93