AASHISHAG / deepspeech-german

Automatic Speech Recognition (ASR) - German
Apache License 2.0
313 stars 36 forks source link

DeepSpeech 0.9.3 documentation and toolchain updates #35

Closed kaoh closed 3 years ago

kaoh commented 3 years ago

updated documentation download script added script for creating language model (scorer) script for exporting TFLite added fine-tuning support and updated transfer learning script KenLM and DeepSpeech as sub modules removed local scripts not used for speech processing added parameters to script files used DeepSpeech image without version

A lot has changed since DeepSpeech 0.5.0, I have tried to fix the scripts, make them flexible and bring the documentation in sync. The person who has generated the 0.9.0 version should be able to validate the new approach.

A potential issue is the use of a different version which was not validated against this project. By using git submodules this can be prevented and only checked versions are used. In case a new version is used these modules must be updated.

AASHISHAG commented 3 years ago

Thank you @kaoh so such detailed comments and updates. Allow me some time {two weeks} to run the steps.

Any other updates are always welcome!!

kaoh commented 3 years ago

Yes, sure. This reminds me of: I forgot one error I was facing to mention, which I have also documented during the training in the README :

NOTE: In case a Not enough time for target transition sequence (required: 171, available: 0) is thrown, the currently only known fix is to edit the file DeepSpeech/training/deepspeech_training/train.py and add , ignore_longer_outputs_than_inputs=True to the call to tfv1.nn.ctc_loss.

I assume this is related to some available training data but no corresponding wav file or an incorrect wav record length or the other way around. I was not able to trace down the error, because the Tensor class is not debugging friendly and does not reveal the vector values easily.

AASHISHAG commented 3 years ago

@kaoh : This is really awesome. I wasn't able to run all the steps (need more time). However, I will let the community to report any issues they encounter.

Merging the request.

kaoh commented 3 years ago

Thanks. I forgot to mention that I have stripped away the TODO section in the README. The TODOs were outdated and some other things have been added in the meanwhile (Mailabs, SWC) which have not been mentioned. If you want you can add a feature section again if you feel that something of this is noteworthy. For the TODOs in general I would use the GitHub issues.