Stars
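A practical interactive interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design; supports custom shortcut buttons & function plugins; supports project analysis & self-translation for Python, C++, and other languages; PDF/LaTeX paper translation & summarization; supports querying multiple LLMs in parallel; supports local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…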
A natural language interface for computers
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
A generative world for general-purpose robotics & embodied AI learning.
Open-Sora: Democratizing Efficient Video Production for All
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Fast and memory-efficient exact attention
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
A state-of-the-art open visual language model | multimodal pre-trained model
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Representation learning on large graphs using stochastic graph convolutions.
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
Unified framework for robot learning built on NVIDIA Isaac Sim
A minimalist environment for decision-making in autonomous driving
This repository includes the official project of TransUNet, presented in our paper: TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation.
Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvement, and skill curatio…
A library for advanced large language model reasoning
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
PyBullet Gymnasium environments for single and multi-agent reinforcement learning of quadcopter control
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.