wasertech closed this 1 year ago
Managed to make this branch export a model from scratch. See the full logs here.
I've updated `stt_branch` to point to the latest alpha of STT 1.4.0, so we'll have to update it again once 1.4.0 is considered stable.
STT 1.4.0 was released as stable! I've updated `stt_branch` accordingly. This branch, `stt140-cv9`, is now complete.
Version 10 of CV is out so I'll probably make another branch for it (I'll probably wait for more affordable energy to train cv-fr-10 though).
This branch made the mistake of deleting `commonvoice-fr/DeepSpeech/` to create `commonvoice-fr/STT/`.
It is now obsolete thanks to #168.
This branch implements everything needed to train STT models for French using CommonVoice 9.0 with STT version 1.4.0.
Notes
Check out the released models from this branch: STT French v0.9.
I've added the `import_cv_perso.sh` importer script to download personal CV data and ease the process of fine-tuning from checkpoints. See this commit and this article on Discourse.
I've also added a custom Python script for lm_optimizer that captures the results of the optimization and saves them to disk, so we can reuse them during the testing and exporting steps.
`train.sh` has been split into `train.sh`, `test.sh` and `export.sh`. See this commit.
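To illustrate the lm_optimizer note above, here is a minimal sketch of how optimized language model parameters could be written to disk after optimization and read back by later steps. It is not the script from this branch: the file name `lm_opt_results.json`, the JSON layout, and the function names `save_lm_params` and `load_lm_params` are all hypothetical, chosen only for illustration.

```python
import json
from pathlib import Path
from typing import Tuple

# Hypothetical default location; the actual script in the branch may
# use a different file name and format.
DEFAULT_RESULTS_PATH = Path("lm_opt_results.json")


def save_lm_params(lm_alpha: float, lm_beta: float,
                   path: Path = DEFAULT_RESULTS_PATH) -> None:
    """Write the optimized language model weights to a small JSON file."""
    payload = {"lm_alpha": lm_alpha, "lm_beta": lm_beta}
    path.write_text(json.dumps(payload, indent=2))


def load_lm_params(path: Path = DEFAULT_RESULTS_PATH) -> Tuple[float, float]:
    """Read the weights back, e.g. from a testing or exporting step."""
    data = json.loads(path.read_text())
    return float(data["lm_alpha"]), float(data["lm_beta"])
```

With something like this in place, the optimization step would call `save_lm_params(...)` once it has its best values, and the testing and exporting steps would call `load_lm_params()` instead of relying on hard-coded values.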