GitHub - DannyNeo/speaker-recognition-py3: Based on MFCC and GMM (voice recognition based on MFCC and Gaussian mixture models)

About

This project is a simple Python 3 port of speaker-recognition, with a few small changes that make it more convenient to use from the command line.

Differences from the Python 2 speaker-recognition

MFCC features come from python_speech_features, rather than from bob's implementation or a custom one.
The GUI has been removed; models can be trained and used for prediction only from the command line.
sklearn functions and classes that are scheduled for removal in later versions have been replaced.
A softmax function is used to output probabilities.
Audio is converted to mono if the original is stereo.
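
The last two points can be illustrated with a minimal sketch. It is not the project's code: the function names `to_mono`, `softmax`, and `predict` are hypothetical, averaging the channels is only one common way to downmix, and the per-speaker scores are assumed to be log-likelihoods such as a GMM might produce.

```python
import math


def to_mono(frames):
    """Average the channels of each frame: [[left, right], ...] -> [mono, ...].

    Assumption: each frame is a sequence holding one sample per channel.
    """
    return [sum(frame) / len(frame) for frame in frames]


def softmax(scores):
    """Map raw scores to probabilities that sum to 1.

    Subtracting the maximum first keeps math.exp from overflowing
    and does not change the result.
    """
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def predict(scores_by_speaker):
    """Return (best_speaker, probability) from a {speaker: score} mapping.

    Assumption: higher scores mean a better match, as with log-likelihoods.
    """
    names = list(scores_by_speaker)
    probs = softmax([scores_by_speaker[name] for name in names])
    best = max(range(len(names)), key=probs.__getitem__)
    return names[best], probs[best]
```

With this sketch, `predict({"mary": -40.0, "tom": -42.0})` picks `"mary"`, and the returned probability depends only on the difference between the two scores, not on their absolute size.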

Usage

usage: speaker-recognition.py [-h] -t TASK -i INPUT -m MODEL

Speaker Recognition Command Line Tool

optional arguments:
  -h, --help            show this help message and exit
  -t TASK, --task TASK  Task to do. Either "enroll" or "predict"
  -i INPUT, --input INPUT
                        Input Files(to predict) or Directories(to enroll)
  -m MODEL, --model MODEL
                        Model file to save(in enroll) or use(in predict)

Wav files in each input directory will be labeled as the basename of the directory.
Note that wildcard inputs should be *quoted*, and they will be sent to glob module.

Examples:
    Train:
    ./speaker-recognition.py -t enroll -i "/tmp/person* ./mary" -m model.out

    Predict:
    ./speaker-recognition.py -t predict -i "./*.wav" -m model.out
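
The input handling described above (a quoted, space-separated list of wildcard patterns passed to glob, with each directory's basename used as the speaker label) could look roughly like the following. This is a sketch under assumptions, not the tool's actual code: `expand_inputs` and `label_wavs` are hypothetical names, and splitting on whitespace means paths containing spaces are not supported here.

```python
import glob
import os


def expand_inputs(pattern_string):
    """Expand a quoted string such as "/tmp/person* ./mary" into paths.

    Each whitespace-separated pattern is passed to glob; the matches of
    one pattern are sorted so the result is deterministic.
    """
    paths = []
    for pattern in pattern_string.split():
        paths.extend(sorted(glob.glob(os.path.expanduser(pattern))))
    return paths


def label_wavs(input_dirs):
    """Map each directory's basename (the label) to the wav files inside it."""
    labeled = {}
    for directory in input_dirs:
        if not os.path.isdir(directory):
            continue  # enrollment expects directories, so skip plain files
        # normpath drops a trailing slash so basename is never empty
        label = os.path.basename(os.path.normpath(directory))
        wavs = sorted(glob.glob(os.path.join(directory, "*.wav")))
        if wavs:
            labeled.setdefault(label, []).extend(wavs)
    return labeled
```

Quoting the argument in the shell matters because it stops the shell from expanding `*` itself, so the whole pattern string reaches the program as a single `-i` value.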

Repository files (history: 15 commits)

.gitignore
LICENSE
README.md
features.py
interface.py
requirements.txt
skgmm.py
speaker-recognition.py
utils.py
