Closed by doctorase 1 month ago
I found people on the Coqui TTS GitHub with the same issue. I'm not sure if they have a fix, but I hope it helps point you in the right direction.
https://github.com/coqui-ai/TTS/discussions/2100
Everything is already preconfigured in the Docker image.
To run the Docker container and start the Gradio interface, use one of the following commands:
-Run with CPU only
docker run -it --rm -p 7860:7860 athomasson2/ebook2audiobookxtts:latest
-Run with GPU speedup (NVIDIA graphics cards only)
docker run -it --rm --gpus all -p 7860:7860 athomasson2/ebook2audiobookxtts:latest
Either command will start the Gradio interface on port 7860 (localhost:7860).
I finally used the Docker method, thanks for the fast reply!
No probs! Glad to hear it worked! 😄
One question: I can't see the GUI at localhost:7860, but I can with the public URL. Do I need to do something to enable the GUI at the localhost URL?
Another question: can I stop the conversion and resume it another day? It takes a lot of time.
Thanks for your work, it's awesome!
Oh, yeah, I never figured out how to make localhost work for Docker images, but the public URL worked so I just left it at that.
For pausing... I don't have an implementation for that, but you might be able to pause the Docker container itself and resume it later.
Heads up: if you do this, I'm not sure whether the public links will still work after unpausing.
I have no idea if this will work, but it's worth a try?
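For anyone who wants to try the pause idea, here is a minimal sketch. It assumes the container is still running and that you look up its name or ID with docker ps; the helper names and the DRY_RUN switch are made up for illustration and are not part of the project. docker pause and docker unpause are standard Docker commands, but whether the conversion and the public link survive a pause is untested here.

```shell
#!/bin/sh
# Hypothetical helpers, not part of ebook2audiobookxtts.
# Set DRY_RUN=1 to print the docker command instead of running it.
run() {
  if [ -n "${DRY_RUN:-}" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# $1 is the container name or ID as shown by `docker ps`.
pause_conversion() {
  run docker pause "$1"      # suspends all processes in the container
}

resume_conversion() {
  run docker unpause "$1"    # lets the suspended processes continue
}
```

Usage would look like pause_conversion <container> and later resume_conversion <container>. A paused container most likely will not survive a reboot of the host, so this is only an option if the machine stays on.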
Thanks for the info!
Do you know why Docker removes the container each time I stop it, so that I need to pull it again?
Hm, it shouldn't, as long as you leave the --rm flag out of the docker run command you initially use to start the Docker image.
Is it still doing that with the flag omitted?
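A rough sketch of what that could look like, assuming you give the container a name so it is easy to start again. The name ebook2audiobook, the helper functions, and the DRY_RUN switch are illustrative assumptions; the image name and port come from the commands earlier in the thread.

```shell
#!/bin/sh
IMAGE="athomasson2/ebook2audiobookxtts:latest"
NAME="ebook2audiobook"   # hypothetical container name, pick any

# Set DRY_RUN=1 to print the docker command instead of running it.
run() {
  if [ -n "${DRY_RUN:-}" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# First launch: no --rm, so the container is kept after it stops.
first_run() {
  run docker run -it --name "$NAME" -p 7860:7860 "$IMAGE"
}

# Later launches: restart the kept container instead of creating a new one.
start_again() {
  run docker start -ai "$NAME"
}
```

Call first_run once, then start_again on later days. For the GPU variant, --gpus all would be added to the docker run line in the same way as in the original command.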
It works after removing --rm. Incredible tool. Thanks!
Once again no problem! It's really nice seeing people using my side projects! 😄
Do you know if using a target voice file makes the conversion take a lot longer? Does the size of the target voice file influence the conversion time?
I'm now using a target voice file and it is taking a lot more time than without it for the same book. The file is a 60-second mp3.
Adding a target voice shouldn't affect the speed as long as it's not a giant file.
Your 60-second mp3 file shouldn't really affect the speed, because the text-to-speech model (XTTS) doesn't actually work without a target voice lol
What you don't see is that in the backend I have a preset default target voice, which XTTS will use if no target voice is set by the user.
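To picture the fallback described above, here is a small sketch of the idea only. It is not the project's actual code; the function name and the default_voice.wav file name are assumptions made up for illustration.

```shell
#!/bin/sh
# Hypothetical default; the real preset file name is not given in this thread.
DEFAULT_VOICE="default_voice.wav"

# Print the voice file to clone from: the user's file if one was given
# and exists, otherwise the preset default.
pick_target_voice() {
  if [ -n "${1:-}" ] && [ -f "$1" ]; then
    echo "$1"
  else
    echo "$DEFAULT_VOICE"
  fi
}
```

Either way a target voice is always passed along, which is why supplying your own short clip should not change the amount of work by itself.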
Will Extremely Long Audio Improve Coqui_TTS-Based Voices? (Reddit)
:)
Hi, when I try to install from PowerShell with the command "pip install tts==0.21.3 pydub nltk beautifulsoup4 ebooklib tqdm", the package doesn't install correctly. I've pasted the last part of the output at the link below. It says something about the latest version of Microsoft Visual C++, but I've already installed the latest version. Does anyone know how to solve it? Thanks
https://pastebin.com/ntabHYeU