Skip to content
View deepminder's full-sized avatar

Block or report deepminder

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 6,610 579 Updated Jan 11, 2025

Official code implementation of Slow Perception:Let's Perceive Geometric Figures Step-by-step

Python 83 4 Updated Jan 11, 2025

This repository includes the official implementation of OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs.

Python 592 58 Updated Dec 19, 2024

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 29,351 2,769 Updated Jan 23, 2025

💬 Ready-to-use, flexible RAG Chatbot. 基于大模型和 RAG 的知识库问答系统。

Python 12,757 1,669 Updated Jan 23, 2025

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,237 282 Updated May 4, 2024

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7,564 928 Updated Aug 21, 2024

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 66,672 34,556 Updated Jan 15, 2025

A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频

Python 7,897 822 Updated Dec 7, 2024

The first real AI developer

Python 32,239 3,271 Updated Oct 3, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,343 631 Updated Jan 20, 2025

[AI Agent Application Development Framework] - 🚀 Build AI agent native application in very few code 💬 Easy to interact with AI agent in code using structure data and chained-calls syntax 🧩 Enhance …

Python 1,207 138 Updated Dec 13, 2024

FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.

Python 3,850 416 Updated Dec 20, 2024

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Python 67,143 8,193 Updated Jan 9, 2025

Browse the web with GPT-4V and Vimium

Python 2,656 200 Updated Sep 25, 2024

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python 4,664 355 Updated Jul 10, 2024

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Jupyter Notebook 13,849 3,275 Updated Aug 12, 2024

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,789 491 Updated Nov 27, 2024

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Python 13,601 1,589 Updated Jan 13, 2025

🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation with AI everywhere (mobile, web and terminal) using LLM OpenAI …

JavaScript 6,077 748 Updated Jul 17, 2024

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,293 426 Updated May 29, 2024

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Python 25,954 3,284 Updated Dec 30, 2024

📷 EasyPhoto | Your Smart AI Photo Generator.

Python 5,047 402 Updated Jul 10, 2024

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Jupyter Notebook 9,237 866 Updated Dec 10, 2024

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 45,676 5,449 Updated Dec 18, 2024

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,839 3,460 Updated May 18, 2024

Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/

Python 11,601 1,272 Updated Jan 16, 2025

OpenAI 接口管理 & 分发系统,支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元,可用于二次分发管理 key,仅单可执行文件,已打包好 Docker 镜像,一键部署,开箱即用. OpenAI key management & redistributi…

JavaScript 20,977 4,568 Updated Dec 27, 2024

FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…

TypeScript 19,895 5,229 Updated Jan 23, 2025
Next