Stars
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, and more.
An extremely fast implementation of the Aho-Corasick algorithm based on a double-array trie.
My ComfyUI workflows collection
A generative world for general-purpose robotics & embodied AI learning.
🥥 Coco AI App - Search, Connect, Collaborate, Your Personal AI Search and Assistant, all in one space.
Deezer source separation library including pretrained models.
Video Graph Transformer for Video Question Answering (ECCV'22)
Netflix-level subtitle cutting, translation, alignment, and even dubbing - a one-click, fully automated AI video subtitle team
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Open-source release of the MuCGEC Chinese error-correction dataset and SOTA text-correction models; Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction"
A multimodal Chinese spell checker released at ACL 2021.
Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…
Source code for the paper "C-LLM: Learn to Check Chinese Spelling Errors Character by Character"
Align Anything: Training All-modality Model with Feedback
Robust Speech Recognition via Large-Scale Weak Supervision
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achieving exceptional performance on the edge.
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction…
Intelligent multilingual AI video dubbing/translation tool - Linly-Dubbing: "AI-empowered, language without borders"
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Bring projects, wikis, and teams together with AI. AppFlowy is the AI collaborative workspace where you achieve more without losing control of your data. The leading open source Notion alternative.
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)