🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Mozilla Public License 2.0
627 stars, 71 forks
Added improvements to the fine-tuning GUI and fixed a bug when running on CPU only #107
Closed
DrewThomasson closed 5 days ago
Pull Request Details
Fixed Whisper CPU bug:
Now switches to int8 on CPU to avoid float16 issues. GPU usage remains float16.
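The device-dependent choice described in the Whisper fix can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the helper name `pick_compute_type` and the device-string handling are assumptions, and only the rule itself (int8 on CPU, float16 otherwise) comes from the description above.

```python
def pick_compute_type(device: str) -> str:
    """Return the precision label to use for a given device string.

    Follows the rule stated in the PR description: int8 on CPU,
    float16 on anything else (GPU). The function name and the
    device-string format are hypothetical.
    """
    # Normalize so inputs like " CPU " or "cuda:0" are handled consistently.
    kind = device.strip().lower().split(":")[0]
    return "int8" if kind == "cpu" else "float16"


if __name__ == "__main__":
    for device in ("cpu", "cuda"):
        print(device, "->", pick_compute_type(device))
```

If the GUI loads Whisper through faster-whisper (an assumption, the description does not say), the returned value would be passed as the `compute_type` argument of `WhisperModel`, alongside `device`.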
Updated README:
Added usage guide and pre-made Docker images to pull for x86 CPU, NVIDIA GPU, and Apple Silicon.
Apple Silicon Instructions:
Included specific install steps with a dedicated apple_silicon_requirements.txt.
New Export Model Button:
Exports a folder with:
dataset.zip
model.pth
config.json
vocab.json
Removes optimizer state for smaller model size. :)
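Dropping the optimizer state before export might look something like the sketch below. It works on a plain dictionary so it runs without PyTorch installed. The key name `"optimizer"` and the helper `strip_optimizer_state` are assumptions for illustration; the actual checkpoint layout used by the export button is not shown in this PR description.

```python
from typing import Any, Dict

# Hypothetical key name; the real checkpoint may store optimizer
# state under a different key or under several keys.
OPTIMIZER_KEYS = ("optimizer",)


def strip_optimizer_state(checkpoint: Dict[str, Any]) -> Dict[str, Any]:
    """Return a shallow copy of the checkpoint without optimizer entries.

    The input dictionary is left unmodified, so the training run can
    keep using the full checkpoint.
    """
    return {k: v for k, v in checkpoint.items() if k not in OPTIMIZER_KEYS}


if __name__ == "__main__":
    demo = {
        "model": {"w": [0.1, 0.2]},
        "optimizer": {"state": [1, 2, 3]},
        "step": 100,
    }
    # Print the keys that survive the export step.
    print(sorted(strip_optimizer_state(demo)))
```

With PyTorch, the same idea would typically be wrapped as `torch.load(path, map_location="cpu")`, followed by this helper, followed by `torch.save(...)` to write model.pth. Optimizer state often accounts for a large share of a training checkpoint, which is why removing it shrinks the exported file.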
PS: Thank you so much for keeping Coqui-TTS running smoothly!