kaldi/egs/mobvoi at master · tpoindex/kaldi

History

Name		Name	Last commit message	Last commit date
parent directory ..
v1		v1
README.txt		README.txt

README.txt

 The Mobvoi dataset is a ~67-hour corpus of wake word corpus
 in Chinese covering 523 speakers. It is currently not publicly available.
 The wake word is "Hi Xiaowen" (in Pinyin).
 Each speaker’s collection includes positive utterances and negative utterances
 recorded with different speaker-to-microphone distance and different
 signal-to-noise (SNR) ratio where noises are from typical home environments.
 The dataset is provided by Mobvoi. Inc.

 The recipe is in v1/

 The E2E LF-MMI recipe does not require any prior alignments for training
 LF-MMI, making the alignment more flexible during training. It can be optionally
 followed by a regular LF-MMI training to further improve the performance.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mobvoi

mobvoi

README.txt

Files

mobvoi

Directory actions

More options

Directory actions

More options

Latest commit

History

mobvoi

Folders and files

parent directory

README.txt