Skip to content
View ailijian's full-sized avatar

Block or report ailijian

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

😎丰富生态、🧩支持扩展、🦄多模态 - 大模型原生即时通信机器人平台 | 适配 QQ / 微信(企业微信、个人微信)/ 飞书 / 钉钉 / Discord / Telegram 等消息平台 | 支持 ChatGPT、DeepSeek、Dify、Claude、Gemini、xAI Grok、Ollama、LM Studio、阿里云百炼、火山方舟、SiliconFlow、Qwen、Moonshot…

Python 9,467 688 Updated Mar 12, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 20,071 1,628 Updated Mar 11, 2025

DeepSeek Coder: Let the Code Write Itself

Python 20,947 2,346 Updated May 21, 2024
Python 3,921 314 Updated Mar 12, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 11,821 1,171 Updated Mar 10, 2025

Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"

Python 2,040 169 Updated Feb 10, 2025

Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents

Python 331 32 Updated Feb 8, 2025

🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.

Python 14,502 1,270 Updated Mar 12, 2025

Build multimodal language agents for fast prototype and production

Python 2,223 237 Updated Mar 11, 2025

fay是一个帮助数字人(2.5d、3d、移动、pc、网页)或大语言模型(openai兼容、deepseek)连通业务系统的agent框架。

JavaScript 10,336 1,935 Updated Feb 27, 2025

Make websites accessible for AI agents

Python 42,024 4,279 Updated Mar 11, 2025

一个超轻量级、可以在移动端实时运行的数字人模型

Python 1,649 239 Updated Mar 5, 2025

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,204 277 Updated Nov 5, 2024
C++ 4,530 669 Updated Mar 12, 2025

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Python 3,660 457 Updated Nov 27, 2024

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 11,510 2,428 Updated Feb 10, 2025

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 8,303 1,090 Updated Sep 14, 2024

Bring portraits to life!

Python 14,300 1,537 Updated Feb 28, 2025

Real Time High-Fidelity Faceswap

Python 287 72 Updated Aug 21, 2024

[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 3,196 376 Updated Feb 27, 2025

AigcPanel 是一个简单易用的一站式AI数字人系统,支持视频合成、声音合成、声音克隆,简化本地模型管理、一键导入和使用AI模型。

TypeScript 1,954 288 Updated Mar 7, 2025

An open-sourced end-to-end VLM-based GUI Agent

Python 820 60 Updated Feb 19, 2025

RAG (Retrieval-Augmented Generation) Chatbot Examples Using PyMuPDF

Python 809 116 Updated Mar 10, 2025

Task-Aware Agent-driven Prompt Optimization Framework

Python 2,965 244 Updated Jan 10, 2025

Get your documents ready for gen AI

Python 23,798 1,380 Updated Mar 11, 2025

⚡FlashRAG: A Python Toolkit for Efficient RAG Research

Python 23 4 Updated Dec 11, 2024

A lightweight next-gen data explorer - Postgres, MySQL, SQLite, MongoDB, Redis, MariaDB, Elastic Search, and Clickhouse with Chat interface

TypeScript 2,934 93 Updated Mar 10, 2025
Next