Lists (3)
Sort Name ascending (A-Z)
Stars
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
LlamaIndex is a data framework for your LLM applications
Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Instant voice cloning by MIT and MyShell.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
GUI for a Vocal Remover that uses Deep Neural Networks.
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective…
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
Blender addons to make the bridge between Blender and geographic data
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
WebRTC and ORTC implementation for Python using asyncio
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
AudioLDM: Generate speech, sound effects, music and beyond, with text.
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
A lightweight framework for building LLM-based agents
AI powered speech denoising and enhancement
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
Turn any face into a video game character, pixel art, claymation, 3D or toy
Open-source framework to review and patch code using your preferred LLM.