matatonic / openedai-speech

An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.
GNU Affero General Public License v3.0
192 stars 32 forks source link

Update Dockerfile to download voices #6

Closed justinh-rahb closed 1 month ago

justinh-rahb commented 2 months ago

This pull request updates the Dockerfile to include the necessary scripts for downloading voices. The init.sh script checks if the voices directory is empty and downloads the voices if necessary. This update ensures that the Docker container downloads the required files for voice models, making it easier for users to use the application.

matatonic commented 2 months ago

Thanks Justin from the Hammer! (I'm in Toronto). I actually had the voices in the image originally and removed them... let me think about this before putting them back. Thanks for the arm changes!

justinh-rahb commented 2 months ago

Thanks Justin from the Hammer! (I'm in Toronto). I actually had the voices in the image originally and removed them... let me think about this before putting them back. Thanks for the arm changes!

Greetings @matatonic in The Big Smoke!

The reason I didn't just build the models into the image (besides size which is already quite large for the main one) was keeping potential licensing issues around distribution out of the equation. I could refac to have these downloaded/added in during the build process, no need to have them included in the repo.

Unfortunately the ARM-related changes probably need to go in a new PR, I've reverted that.

matatonic commented 1 month ago

I've merged what I think was the intent of all these changes - thanks!