crouchred / speaker-recognition-py3

Base on MFCC and GMM(基于MFCC和高斯混合模型的语音识别)
Apache License 2.0
245 stars 79 forks source link
machine-learning speaker-recognition voice-recognition

About

This project is a simple python3 version of speaker-recognition and I make a little change for the convenience of command line usage.

difference with speaker-recognition of python2

Usage

usage: speaker-recognition.py [-h] -t TASK -i INPUT -m MODEL

Speaker Recognition Command Line Tool

optional arguments:
  -h, --help            show this help message and exit
  -t TASK, --task TASK  Task to do. Either "enroll" or "predict"
  -i INPUT, --input INPUT
                        Input Files(to predict) or Directories(to enroll)
  -m MODEL, --model MODEL
                        Model file to save(in enroll) or use(in predict)

Wav files in each input directory will be labeled as the basename of the directory.
Note that wildcard inputs should be *quoted*, and they will be sent to glob module.

Examples:
    Train:
    ./speaker-recognition.py -t enroll -i "/tmp/person* ./mary" -m model.out

    Predict:
    ./speaker-recognition.py -t predict -i "./*.wav" -m model.out