Lists (1)
Sort Name ascending (A-Z)
Stars
😎丰富生态、🧩支持扩展、🦄多模态 - 大模型原生即时通信机器人平台 | 适配 QQ / 微信(企业微信、个人微信)/ 飞书 / 钉钉 / Discord / Telegram 等消息平台 | 支持 ChatGPT、DeepSeek、Dify、Claude、Gemini、xAI Grok、Ollama、LM Studio、阿里云百炼、火山方舟、SiliconFlow、Qwen、Moonshot…
A simple screen parsing tool towards pure vision based GUI agent
DeepSeek Coder: Let the Code Write Itself
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"
Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents
🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.
Build multimodal language agents for fast prototype and production
fay是一个帮助数字人(2.5d、3d、移动、pc、网页)或大语言模型(openai兼容、deepseek)连通业务系统的agent框架。
Make websites accessible for AI agents
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
AigcPanel 是一个简单易用的一站式AI数字人系统,支持视频合成、声音合成、声音克隆,简化本地模型管理、一键导入和使用AI模型。
RAG (Retrieval-Augmented Generation) Chatbot Examples Using PyMuPDF
Task-Aware Agent-driven Prompt Optimization Framework
RUC-NLPIR / FlashRAG-Paddle
Forked from RUC-NLPIR/FlashRAG⚡FlashRAG: A Python Toolkit for Efficient RAG Research
A lightweight next-gen data explorer - Postgres, MySQL, SQLite, MongoDB, Redis, MariaDB, Elastic Search, and Clickhouse with Chat interface