Stars
Laia: A deep learning toolkit for HTR based on Torch
aria2 is a lightweight multi-protocol & multi-source, cross platform download utility operated in command-line. It supports HTTP/HTTPS, FTP, SFTP, BitTorrent and Metalink.
Self-labelling via simultaneous clustering and representation learning. (ICLR 2020)
A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation
m-wiesner / wikipron
Forked from CUNY-CL/wikipronMassively multilingual pronunciation mining
Awesome Contrastive Learning for CV & NLP
[ICLR 2020] Lite Transformer with Long-Short Range Attention
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech"
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
Gecko - A Tool for Effective Annotation of Human Conversations
m-wiesner / epitran
Forked from dmort27/epitranA tool for transcribing orthographic text as IPA (International Phonetic Alphabet)
A pytorch implementation for paper 'Exploring Simple Siamese Representation Learning'
The RWTH extensible training framework for universal recurrent neural networks
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
openslr-org / openslr
Forked from danpovey/openslrRepository for the web pages and scripts associated with OpenSLR: the open speech and language repository
Unofficial Pytorch Implementation of WaveGrad2
[CVPR 2021] 3D CNNs with Adaptive Temporal Feature Resolutions https://arxiv.org/abs/2011.08652
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习
Detect Language API Python Client