Stars
Automatically Update NLP Papers Daily using GitHub Actions (ref: https://github.com/Vincentqyw/cv-arxiv-daily)
A course on aligning smol models.
Automatically Update Text-to-speech (TTS) Papers Daily using GitHub Actions (Updates Every 12 Hours)
Foundational Model for Speech Recognition Tasks
Simple image captioning model
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
State-of-the-Art Text Embeddings
Microservice for image-text retrieval
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…
Effective LLM Alignment Toolkit
shigabeev / vits2_pytorch_bigvgan
Forked from p0p4k/vits2_pytorch
unofficial vits2-TTS implementation in pytorch
A simple stress-mark placement tool with homograph handling
unofficial vits2-TTS implementation in pytorch
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
AI-generated text boundary detection with RoFT
Get up and running with Llama 3.3, Phi 4, Gemma 2, and other large language models.
Chatbot Ollama is an open source chat UI for Ollama.
Large Language Model Text Generation Inference
Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf and PV-Tuning: Beyond Straight-Through Estimation for Ext…
The training program for libfacedetection for face detection and 5-landmark detection.