![awesome logo](https://raw.githubusercontent.com/github/explore/80688e429a7d4ef2fca1e82350fe8e3517d3494d/topics/awesome/awesome.png)
Starred repositories
Make websites accessible for AI agents
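😎 Rich ecosystem, 🧩 extensible, 🦄 multimodal - an LLM-native instant-messaging bot platform 🤖 | Works with QQ / WeChat (enterprise and personal) / Feishu / DingTalk / Discord and other messaging platforms | Supports OpenAI GPT, ChatGPT, DeepSeek, Dify, Claude, Gemini, Ollama, LM Studio, SiliconFlow, Qwen, Moonshot, ChatGLM and other LL…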
Qwen2.5 is the large language model series developed by the Qwen team, Alibaba Cloud.
AAAI 2025: Diff-Shadow: Global-guided Diffusion Model for Shadow Removal
🚀 Large models: train a 27M-parameter vision-language model (VLM) from scratch in just 3 hours! 🌏
A generative world for general-purpose robotics & embodied AI learning.
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
Anthropic's educational courses
Automate browser-based workflows with LLMs and Computer Vision
Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)
Face recognition algorithms in the PyTorch framework, including ArcFace, CosFace, SphereFace, and so on
Similarities: a toolkit for similarity calculation and semantic search. Supports text-to-text, text-to-image, and image-to-image search over hundreds of millions of records; written in Python 3 and ready to use out of the box.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance.
MULTI-Benchmark: Multimodal Understanding Leaderboard with Text and Images
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper
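FinQwen: a project dedicated to building an open, stable, high-quality large model for finance. It builds an intelligent question-answering system for financial scenarios on top of large models, using open source to advance "AI + finance".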
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by the OpenAI Solution team.