This is a research project, not an official NVIDIA product.
- Sequence to sequence learning
- Different cell types: LSTM, GRU, GLSTM, SLSTM
- Encoders: RNN-based, unidirectional, bi-directional, GNMT-like
- Attention mechanisms: Bahdanau, Luong, GNMT-like
- Beam search for inference
- Single-box, data-parallel multi-GPU training
- Distributed (data-parallel) multi-node, multi-GPU training using Horovod
- LARS norm scaling algorithm
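
For readers unfamiliar with the last item, here is a minimal sketch of the layer-wise rate scaling idea behind LARS, written in plain Python. It is not this project's implementation: the function names (`l2_norm`, `lars_local_lr`, `lars_sgd_step`), the default `trust_coef`, the `eps` guard, and the fallback of 1.0 for zero norms are all assumptions made for illustration. The local rate follows the commonly cited form `trust_coef * ||w|| / (||g|| + weight_decay * ||w||)`, computed separately for each layer.

```python
import math


def l2_norm(values):
    """Euclidean norm of a flat list of floats."""
    return math.sqrt(sum(v * v for v in values))


def lars_local_lr(weights, grads, trust_coef=0.001, weight_decay=0.0, eps=1e-12):
    """Per-layer learning-rate multiplier (hypothetical helper).

    Scales the step by the ratio of the weight norm to the gradient norm,
    so layers with small gradients relative to their weights still move.
    """
    w_norm = l2_norm(weights)
    g_norm = l2_norm(grads)
    # Assumption: fall back to a multiplier of 1.0 when either norm is zero.
    if w_norm == 0.0 or g_norm == 0.0:
        return 1.0
    return trust_coef * w_norm / (g_norm + weight_decay * w_norm + eps)


def lars_sgd_step(weights, grads, global_lr, trust_coef=0.001, weight_decay=0.0):
    """One plain SGD update for a single layer using the LARS multiplier."""
    local_lr = lars_local_lr(weights, grads, trust_coef, weight_decay)
    return [
        w - global_lr * local_lr * (g + weight_decay * w)
        for w, g in zip(weights, grads)
    ]


if __name__ == "__main__":
    layer_weights = [3.0, 4.0]   # toy layer with norm 5
    layer_grads = [0.6, 0.8]     # toy gradient with norm 1
    print(lars_sgd_step(layer_weights, layer_grads, global_lr=1.0))
```

In a real model the multiplier would be computed once per layer (or per variable) on each step, and it is usually combined with momentum; both are left out here to keep the sketch short.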