coqui-ai / STT

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
https://coqui.ai
Mozilla Public License 2.0
2.27k stars 275 forks source link

Bug: web site stt.readthedocs.io doesn't have working info on getting started with training #2164

Closed FrontierDK closed 1 year ago

FrontierDK commented 2 years ago

Hi there.

I have just tried a lot of the methods shown on stt.readthedocs.io on how to get started with training...and none of them worked. Will the web site be updated?

Background To make sure I didn't have old stuff, I used a fresh virtual Ubuntu 21.10 desktop, with unddo/snapshots of the VM, so that I could try multiple ways/methods, without having old stuff from previous attempts left over...

https://stt.readthedocs.io/en/latest/TRAINING_INTRO.html

https://stt.readthedocs.io/en/latest/DEPLOYMENT.html

https://www.tensorflow.org/install/source#configure_the_build

Preparing and training data takes a LOT of time, A lot more trained datasets would be available if the documetation is updated to a level, where you can follow the command guides from a newly installed PC/VM, use just the listed commands...and end out with a working SR PC/VM. A lot of training data might even be shared, so that our models are more versatile. Is there a chance that the documentation is updated?

JRMeyer commented 2 years ago

@FrontierDK is this the error you get with building the dockerfile? https://github.com/coqui-ai/STT/issues/2168

FWIW -- the best way to start training is to pull down the pre-built docker images:

$ docker pull ghcr.io/coqui-ai/stt-train

this is actually the recommended training setup, as noted in the docs... did this not work for you for some reason?