ASR with Transducer Models

This directory contains example scripts to train ASR models using Transducer Loss (often termed RNNT Loss).

The currently supported models are:

Character-based RNNT model (speech_to_text_rnnt.py)
Subword-based RNNT model (speech_to_text_rnnt_bpe.py)

Model execution overview

The training scripts in this directory execute their steps in the order shown in the graph below. When writing your own training-from-scratch or fine-tuning scripts, follow the same order so that training and inference behave correctly.

graph TD
    A[Hydra Overrides + Yaml Config] --> B{Config}
    B --> |Init| C[Trainer]
    C --> D[ExpManager]
    B --> D[ExpManager]
    C --> E[Model]
    B --> |Init| E[Model]
    E --> |Constructor| F1(Change Vocabulary)
    F1 --> F2(Setup Adapters if available)
    F2 --> G(Setup Train + Validation + Test Data loaders)
    G --> H1(Setup Optimization)
    H1 --> H2(Change Transducer Decoding Strategy)
    H2 --> I[Maybe init from pretrained]
    I --> J["trainer.fit(model)"]
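As a quick sanity check for a custom script, the order in the graph can be written down as a plain list and compared against the steps the script performs. The sketch below is illustrative only: the step names are shorthand for the graph's nodes, and `EXECUTION_ORDER` and `first_out_of_order` are hypothetical helpers, not part of NeMo.

```python
# Step names are shorthand for the nodes in the graph above (assumed labels).
EXECUTION_ORDER = [
    "config",
    "trainer",
    "exp_manager",
    "model",
    "change_vocabulary",
    "setup_adapters",
    "setup_data_loaders",
    "setup_optimization",
    "change_decoding_strategy",
    "init_from_pretrained",
    "fit",
]


def first_out_of_order(steps):
    """Return the first step that runs later than the graph allows, or None.

    Steps may be skipped (for example, no adapters), but any step that is
    present must not come after a step that the graph places later.
    """
    rank = {name: i for i, name in enumerate(EXECUTION_ORDER)}
    highest = -1
    for step in steps:
        if step not in rank:
            raise ValueError(f"unknown step: {step!r}")
        if rank[step] < highest:
            return step
        highest = max(highest, rank[step])
    return None


print(first_out_of_order(["config", "trainer", "model", "fit"]))  # None
print(first_out_of_order(["config", "model", "trainer", "fit"]))  # trainer
```

Optional steps can be omitted from the list being checked; only the relative order of the steps that are present matters.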

When restoring a model, you can either pass the Trainer to the restore_from / from_pretrained call, or attach it after the model has been initialized by calling model.set_trainer(Trainer).
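The two ways of attaching a Trainer can be sketched as one small helper. This is a minimal sketch rather than NeMo code: `restore_with_trainer` is a hypothetical function, and it assumes only that the model class provides `restore_from(path, trainer=...)` and `set_trainer(trainer)` as described above.

```python
def restore_with_trainer(model_cls, path, trainer, attach_after_init=False):
    """Restore a model from `path` and attach `trainer` to it.

    attach_after_init=False: pass the trainer directly to restore_from.
    attach_after_init=True:  restore first, then call set_trainer.
    """
    if attach_after_init:
        model = model_cls.restore_from(path)
        model.set_trainer(trainer)
    else:
        model = model_cls.restore_from(path, trainer=trainer)
    return model


# Example usage (assumes NeMo is installed; the checkpoint path is hypothetical):
#   from nemo.collections.asr.models import EncDecRNNTBPEModel
#   model = restore_with_trainer(EncDecRNNTBPEModel, "my_model.nemo", trainer)
```

Both branches should leave the model with the same Trainer attached; the second form is useful when the Trainer is only created after the model has been loaded.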
