Starred repositories
Core Engine of Singing Voice Conversion & Singing Voice Clone
paulilioaica / mergekit-phi3
Forked from arcee-ai/mergekitTools for merging pretrained large language models.
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
On-device AI across mobile, embedded and edge for PyTorch
Running Hugging Face Spaces on a local machine / colab T4 GPU involves several steps. Hugging Face Spaces is a platform to host machine learning demos and applications using Streamlit, Gradio, or o…
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
A general fine-tuning kit geared toward diffusion models.
State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!
ML-powered speech recognition directly in your browser
Generate multi-round conversation roleplay data based on self-instruct and evol-instruct.
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
henk717 / KoboldAI
Forked from KoboldAI/KoboldAI-ClientKoboldAI is generative AI software optimized for fictional use, but capable of much more!
Large Language Model Text Generation Inference
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
Python bindings for the Transformer models implemented in C/C++ using GGML library.
Lord of Large Language and Multi modal Systems Web User Interface
Port of OpenAI's Whisper model in C/C++
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free