Skip to content

Latest commit

 

History

History
 
 

wsj

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

This dir contains fully fledged recipes to build end-to-end ASR systems using the Wall Street Journal (WSJ) corpus.

You need to obtain the WSJ dataset from LDC to run this example. The LDC catalog numbers are LDC93S6B and LDC94S13B.

There are two recipes, demonstrating different types of CTC labels

run_ctc_phn.sh - phonemes as CTC labels
run_ctc_char.sh - characters (letters) as CTC labels