echogarden-project / echogarden

Easy-to-use speech toolset. Written in TypeScript. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.
GNU General Public License v3.0
150 stars 16 forks source link
forced-alignment language-detection language-identification source-separation speech speech-alignment speech-recognition speech-synthesis speech-to-text speech-translation text-to-speech

Echogarden

Echogarden is an easy-to-use speech toolset that includes a variety of speech processing tools.

Features

Installation

Ensure you have Node.js v18.16.0 or later installed.

then:

npm install echogarden -g

Additional required tools:

Both tools are auto-downloaded as internal packages on Windows and Linux.

On macOS, only ffmpeg is currently auto-downloaded. It is recommended to install sox via a system package manager like Homebrew (brew install sox) to ensure it is available on the system path.

Updating to latest version

npm update echogarden -g

Using the toolset

Tools are accessible via a command-line interface, which enables powerful customization and is especially useful for long-running bulk operations.

Development of more graphical and interactive tooling is planned. A text-to-speech browser extension is currently under development (but not released yet).

If you are a developer, you can also import the package as a module or interface with it via a local WebSocket service (currently experimental).

Documentation

Credits

This project consolidates, and builds upon the effort of many different individuals and companies, as well as contributing a number of original works.

Developed by Rotem Dan (IPA: /ˈʁɒːtem ˈdän/).

License

GNU General Public License v3

Licenses for components, models and other dependencies are detailed on this page.