JoungheeKim / K-wav2vec

Apache License 2.0

# K-Wav2vec 2.0

This is the official implementation of "K-Wav2vec 2.0: Automatic Speech Recognition based on Joint Decoding of Graphemes and Syllables".

## Requirements and Installation

Install the required libraries:

pip install -r requirements.txt

Install the package locally (development mode):

python setup.py develop

 - This implementation has only been tested on Ubuntu 18.04.
 - A Dockerfile is also included in this repo.
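
The two install commands above can be wrapped so that they run inside an isolated environment. The following is only a sketch under stated assumptions: the use of `python3 -m venv`, the `.venv` directory name, and the `install_kwav2vec` helper are illustrative and not part of this repo. It is meant to be run from the repository root, where `requirements.txt` and `setup.py` live.

```bash
# Sketch: install the requirements and the package into a virtual environment.
# install_kwav2vec and the default ".venv" directory are hypothetical names.
install_kwav2vec() (
  set -eu                                        # abort on the first failing command
  venv_dir="${1:-.venv}"                         # optional first argument: venv location
  python3 -m venv "${venv_dir}"                  # create the isolated environment
  "${venv_dir}/bin/pip" install -r requirements.txt   # install the required libraries
  "${venv_dir}/bin/python" setup.py develop      # install the package in development mode
)
```

Calling `install_kwav2vec` with no argument uses `.venv`; pass another path to place the environment elsewhere.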

## Instructions
 - Example scripts are provided to make the code easy to run (see the `script` folder).
 - Following the instructions below reproduces our results exactly.
```bash
# Guide to building a multi-model with KsponSpeech (orthographic transcription)

# [1] preprocess dataset & make manifest
bash script/preprocess/make_ksponspeech_script_for_mulitmodel.sh

# [2] further pre-train the model
bash script/pretrain/run_further_pretrain.sh

# [3] fine-tune the model
bash script/finetune/run_ksponspeech_multimodel.sh

# [4] run inference with the model
bash script/inference/evaluate_multimodel.sh
```
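
Since each stage depends on the output of the previous one, it can help to chain the four scripts and stop at the first failure. The sketch below is one way to do that; `run_pipeline` and its optional runner argument are hypothetical helpers that are not part of this repo, and only the script paths are taken from the steps above.

```bash
# Sketch: run steps [1]-[4] in order, stopping as soon as one of them fails.
# run_pipeline is a hypothetical helper; the paths are the ones listed above.
run_pipeline() (
  set -eu                    # a failing step ends the whole run
  runner="${1:-bash}"        # pass "echo" for a dry run that only prints the paths
  for step in \
    script/preprocess/make_ksponspeech_script_for_mulitmodel.sh \
    script/pretrain/run_further_pretrain.sh \
    script/finetune/run_ksponspeech_multimodel.sh \
    script/inference/evaluate_multimodel.sh
  do
    echo "==> ${step}"       # show which stage is starting
    "${runner}" "${step}"    # execute (or just print) the stage
  done
)
```

Run `run_pipeline` from the repository root to execute all stages, or `run_pipeline echo` to list them without executing anything.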

## Pretrained model

## Dataset

## Acknowledgments

## License

Our implementation code (-py) is MIT-licensed. The license applies to the pre-trained models as well.