Stars
Simple Python script for downloading the AVSpeech dataset
Official repository for the paper "MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement"
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
PyTorch implementation of Sequence Transduction with Recurrent Neural Networks (RNN-T) speech recognition paper
Production First and Production Ready End-to-End Speech Recognition Toolkit
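Automatic bookkeeping! An app that reads notification content and pops up a bookkeeping prompt automatically; a must-have for notification-based automatic bookkeeping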