Lists (1)
Sort Name ascending (A-Z)
Stars
Wan: Open and Advanced Large-Scale Video Generative Models
SkyReels V1: The first and most advanced open-source human-centric video foundation model
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
Slick, declarative command line video editing & API
Concats a list of videos together using ffmpeg with sexy OpenGL transitions.
Janus-Series: Unified Multimodal Understanding and Generation Models
Inference and training library for high-quality TTS models.
Awesome-LLM: a curated list of Large Language Model
A high-throughput and memory-efficient inference and serving engine for LLMs
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
A pipeline parallel training script for diffusion models.
FastVideo is a lightweight framework for accelerating large video diffusion models.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)
PhoGPT: Generative Pre-training for Vietnamese (2023)
TensorFlow code and pre-trained models for BERT
Faster Whisper transcription with CTranslate2
Underthesea - Vietnamese NLP Toolkit
Fast inference engine for Transformer models
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
A Vietnamese-English Neural Machine Translation System (INTERSPEECH 2022)