SillyTavern / SillyTavern-Extras

Extensions API for SillyTavern.
GNU Affero General Public License v3.0
563 stars 133 forks source link

Feature: Speech-to-text module using Vosk, Whisper from audio file sent by ST. #93

Closed Tony-sama closed 1 year ago

Tony-sama commented 1 year ago

revision of PR#84, too lazy to solve all conflict of rebase so simply inject the new part in neo branch.

This module provide speech-to-text from audio file sent by ST using Vosk or Whisper. Tried to not add too much to server.py, so put the stt module into their own files and used "add_url_rule" to add the api routes.

Features

What changed

Tests