asr_adapters
asr_cache_aware_streaming
asr_chunked_inference
asr_ctc
asr_hybrid_transducer_ctc
asr_transducer
asr_vad
asr_with_tts
conf
experimental
export/transducer
quantization
speech_classification
speech_multitask
speech_pretraining
speech_translation
README.md
speech_to_text_eval.py
speech_to_text_finetune.py
transcribe_speech.py
transcribe_speech_parallel.py

README.md

Automatic Speech Recognition

This directory contains example scripts for training ASR models with various methods, such as Connectionist Temporal Classification (CTC) loss and RNN Transducer (RNN-T) loss.

Examples for speech pre-training via self-supervised learning, voice activity detection, and other sub-domains are also included in this directory.

ASR Model inference execution overview

The inference scripts in this directory execute in the following order. When preparing your own inference scripts, please follow this order for correct inference.

graph TD
    A[Hydra Overrides + Config Dataclass] --> B{Config}
    B --> |Init| C[Model]
    B --> |Init| D[Trainer]
    C & D --> E[Set trainer]
    E --> |Optional| F[Change Transducer Decoding Strategy]
    F --> H[Load Manifest]
    E --> |Skip| H
    H --> I["model.transcribe(...)"]
    I --> J[Write output manifest]
    K[Ground Truth Manifest]
    J & K --> |Optional| L[Evaluate CER/WER]
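
The last steps of the graph (load a manifest, compare predictions with the ground truth, report CER/WER) can be sketched in plain Python. This is a minimal illustration, not the metric code used by the scripts in this directory. It assumes a JSON-lines manifest in which each entry carries the reference transcript under `text` and the prediction under `pred_text`; both key names are assumptions here, and the helper names `read_manifest`, `edit_distance`, and `error_rate` are hypothetical.

```python
import json
from typing import Dict, List, Sequence


def read_manifest(path: str) -> List[Dict]:
    """Read a JSON-lines manifest: one JSON object per non-empty line."""
    with open(path, encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            curr.append(min(
                prev[j] + 1,             # deletion
                curr[j - 1] + 1,         # insertion
                prev[j - 1] + (r != h),  # substitution (free on a match)
            ))
        prev = curr
    return prev[-1]


def error_rate(entries: List[Dict], ref_key: str = "text",
               hyp_key: str = "pred_text", use_cer: bool = False) -> float:
    """Corpus-level WER (or CER): total edits / total reference tokens."""
    edits = 0
    total = 0
    for entry in entries:
        # Characters for CER, whitespace-separated words for WER.
        ref = list(entry[ref_key]) if use_cer else entry[ref_key].split()
        hyp = list(entry[hyp_key]) if use_cer else entry[hyp_key].split()
        edits += edit_distance(ref, hyp)
        total += len(ref)
    if total == 0:
        raise ValueError("reference transcripts are empty")
    return edits / total


if __name__ == "__main__":
    # In-memory entries standing in for read_manifest("output_manifest.json").
    entries = [
        {"text": "the cat sat", "pred_text": "the cat sat"},
        {"text": "hello world", "pred_text": "hello word"},
    ]
    print(f"WER: {error_rate(entries):.3f}")
    print(f"CER: {error_rate(entries, use_cer=True):.3f}")
```

The sketch keeps predictions and references in one manifest for brevity. If the ground truth lives in a separate manifest, as in the graph, the entries would first need to be joined on a shared key such as the audio file path.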

During restoration of the model, you may pass the Trainer to the restore_from / from_pretrained call, or set it after the model has been initialized by using model.set_trainer(Trainer).
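
A hedged sketch of the two ways to attach the Trainer follows. The helper `load_asr_model` and its `attach_after_init` flag are hypothetical names introduced for illustration. The sketch assumes that `ASRModel` is importable from `nemo.collections.asr.models` and that `restore_from` and `from_pretrained` accept a `trainer` keyword, as the paragraph above describes. The caller is expected to construct the Lightning Trainer.

```python
from typing import Any, Optional


def load_asr_model(source: str, trainer: Optional[Any] = None,
                   attach_after_init: bool = False):
    """Restore an ASR model and attach a Trainer in one of two ways.

    `source` is treated as a local checkpoint if it ends in ".nemo",
    otherwise as a pretrained model name (an assumption of this sketch).
    """
    # Imported lazily so this module can be imported without NeMo installed.
    from nemo.collections.asr.models import ASRModel

    if source.endswith(".nemo"):
        restore, kwargs = ASRModel.restore_from, {"restore_path": source}
    else:
        restore, kwargs = ASRModel.from_pretrained, {"model_name": source}

    if trainer is not None and not attach_after_init:
        # Option 1: pass the Trainer to the restore call itself.
        return restore(trainer=trainer, **kwargs)

    model = restore(**kwargs)
    if trainer is not None:
        # Option 2: set the Trainer after the model has been initialized.
        model.set_trainer(trainer)
    return model
```

Either path should leave the model with the same Trainer attached; which one to use is mostly a matter of where the Trainer is created in your script.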
