Stars
5
results
for sponsorable starred repositories
Clear filter
FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3
Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark
Implementation of Vision Mamba from the paper: "Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model" It's 2.8x faster than DeiT and saves 86.8% GPU memory wh…
Implementation of Google's USM speech model in Pytorch
Build high-performance AI models with modular building blocks