# Overview of example information

| Directory name | Corpus name | Task | Language | URL | Note |
| --- | --- | --- | --- | --- | --- |
| aishell | AISHELL-ASR0009-OS1 Open Source Mandarin Speech Corpus | ASR | ZH | http://www.aishelltech.com/kysjcp | |
| ami | The AMI Meeting Corpus | ASR | EN | http://groups.inf.ed.ac.uk/ami/corpus/ | |
| an4 | CMU AN4 database | ASR/TTS | EN | http://www.speech.cs.cmu.edu/databases/an4/ | |
| arctic | CMU ARCTIC databases | TTS | EN | http://www.festvox.org/cmu_arctic/ | |
| aurora4 | Aurora-4 database | ASR | EN | http://aurora.hsnr.de/aurora-4.html | |
| babel | IARPA Babel corpus | ASR | ~20 Languages | https://www.iarpa.gov/index.php/research-programs/babel | |
| blizzard_2017 | Blizzard Challenge 2017 | TTS | EN | https://www.synsig.org/index.php/Blizzard_Challenge_2017 | |
| chime4 | The 4th CHiME Speech Separation and Recognition Challenge | ASR/Multichannel ASR | EN | http://spandh.dcs.shef.ac.uk/chime_challenge/chime2016/ | |
| chime5 | The 5th CHiME Speech Separation and Recognition Challenge | ASR | EN | http://spandh.dcs.shef.ac.uk/chime_challenge/ | |
| cmu_wilderness | CMU Wilderness Multilingual Speech Dataset | Multilingual ASR | ~100 Languages | https://github.com/festvox/datasets-CMU_Wilderness | |
| commonvoice | The Mozilla Common Voice | ASR | 13 Languages | https://voice.mozilla.org/datasets | |
| csj | Corpus of Spontaneous Japanese | ASR | JP | https://pj.ninjal.ac.jp/corpus_center/csj/en/ | |
| csmsc | Chinese Standard Mandarin Speech Corpus | TTS | ZH | https://www.data-baker.com/open_source.html | |
| dirha_wsj | Distant-speech Interaction for Robust Home Applications | Multi-Array ASR | EN | https://dirha.fbk.eu/, https://github.com/SHINE-FBK/DIRHA_English_wsj | |
| fisher_callhome_spanish | Fisher and CALLHOME Spanish-English Speech Translation | ASR/Machine Translation/Speech Translation | ES->EN | https://catalog.ldc.upenn.edu/LDC2014T23 | |
| fisher_swbd | Fisher English Training Speech, Switchboard-1 Release 2 | ASR | EN | https://catalog.ldc.upenn.edu/LDC2004S13, https://catalog.ldc.upenn.edu/LDC2005S13, https://catalog.ldc.upenn.edu/LDC97S62 | |
| hkust | HKUST Mandarin Telephone Speech | ASR | ZH | https://catalog.ldc.upenn.edu/LDC2005S15, https://catalog.ldc.upenn.edu/LDC2005T32 | |
| how2 | How2: A Large-scale Dataset for Multimodal Language Understanding | ASR/Machine Translation/Speech Translation | EN->PT | https://github.com/srvk/how2-dataset | |
| hub4_spanish | 1997 Spanish Broadcast News Speech (HUB4-NE) | ASR | ES | https://catalog.ldc.upenn.edu/LDC98S74, https://catalog.ldc.upenn.edu/LDC98T29 | |
| iwslt18 | International Workshop on Spoken Language Translation 2018 | ASR/Machine Translation/Speech Translation | EN->DE | https://sites.google.com/site/iwsltevaluation2018/Lectures-task | |
| jnas | ASJ Japanese Newspaper Article Sentences Read Speech Corpus (JNAS) | ASR/TTS | JP | http://research.nii.ac.jp/src/JNAS.html | |
| jsalt18e2e | Multilingual End-to-end ASR for Incomplete Data Benchmark | Multilingual ASR | ~20 Languages | https://www.clsp.jhu.edu/workshops/18-workshop/multilingual-end-end-asr-incomplete-data/ | babel+ |
| jsut | Japanese speech corpus of Saruwatari-lab., University of Tokyo | ASR/TTS | JP | https://sites.google.com/site/shinnosuketakamichi/publication/jsut | |
| jvs | JVS (Japanese versatile speech) corpus | TTS | JP | https://sites.google.com/site/shinnosuketakamichi/research-topics/jvs_corpus | |
| li10 | Language-Independent ASR task (10 languages) | Multilingual ASR | ~10 Languages | https://www.merl.com/publications/docs/TR2017-182.pdf | csj+hkust+voxforge(7lang)+wsj |
| libri_trans | Translation Augmented LibriSpeech Corpus | ASR/Machine Translation/Speech Translation | | https://persyval-platform.univ-grenoble-alpes.fr/DS91/detaildataset | |
| librispeech | LibriSpeech ASR corpus | ASR | EN | http://www.openslr.org/12 | |
| libritts | LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech | TTS | EN | http://www.openslr.org/60/ | |
| ljspeech | The LJ Speech Dataset | TTS | EN | https://keithito.com/LJ-Speech-Dataset/ | |
| m_ailabs | The M-AILABS Speech Dataset | TTS | ~5 languages | https://www.caito.de/2019/01/the-m-ailabs-speech-dataset/ | |
| must_c | Must-C Multilingual Speech Translation Corpus | ASR/Machine Translation/Speech Translation | EN->{DE, ES, FR, IT, NL, PT, RO, RU} | https://ict.fbk.eu/must-c/ | |
| reverb | REVERB (REverberant Voice Enhancement and Recognition Benchmark) challenge | ASR | EN | https://reverb2014.dereverberation.com/ | |
| ru_open_stt | Russian Open Speech To Text (STT/ASR) Dataset | ASR | RU | https://github.com/snakers4/open_stt | |
| swbd | The Switchboard corpus | ASR | EN | https://catalog.ldc.upenn.edu/LDC97S62 | |
| tedlium2 | TED-LIUM corpus release 2 | ASR | EN | https://www.openslr.org/19/, http://www.lrec-conf.org/proceedings/lrec2014/pdf/1104_Paper.pdf | |
| tedlium3 | TED-LIUM corpus release 3 | ASR | EN | http://www.openslr.org/51/, https://arxiv.org/pdf/1805.04699 | |
| timit | TIMIT Acoustic-Phonetic Continuous Speech Corpus | ASR | EN | https://catalog.ldc.upenn.edu/LDC93S1 | |
| tweb | The World English Bible | TTS | EN | https://www.kaggle.com/bryanpark/the-world-english-bible-speech-dataset | |
| vais1000 | VAIS-1000 | TTS | VI | https://ieee-dataport.org/documents/vais-1000-vietnamese-speech-synthesis-corpus | |
| vivos | VIVOS (Vietnamese corpus for ASR) | ASR | VI | https://ailab.hcmus.edu.vn/vivos/ | |
| voxforge | VoxForge | ASR | 7 languages | http://www.voxforge.org/ | |
| wsj | CSR-I (WSJ0) Complete, CSR-II (WSJ1) Complete | ASR | EN | https://catalog.ldc.upenn.edu/LDC93S6A, https://catalog.ldc.upenn.edu/LDC94S13A | |
| wsj_mix | MERL WSJ0-mix multi-speaker dataset | Multispeaker ASR | EN | http://www.merl.com/demos/deep-clustering | |
| yesno | The "yesno" corpus | ASR | HE | http://www.openslr.org/1 | |