- Institute of Automation, Chinese Academy of Sciences
- Beijing
Stars
A collection of MARL benchmarks based on TorchRL
Official implementation of "DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning" in ICML'24
Library with search algorithms for task and path planning for multi robot/agent systems
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Tr…
This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Methods, Applications, and Challenges".
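Deploy large models offline and build an Agent that accepts an uploaded local knowledge base for RAG question answering and can call tools on its own.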
Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL).
Computer vision course project: an image-text retrieval system based on Chinese-CLIP
llama3 implementation one matrix multiplication at a time
An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowe…
Dream to Control: Learning Behaviors by Latent Imagination
We perform functional grounding of LLMs' knowledge in BabyAI-Text
An index of algorithms for offline reinforcement learning (offline-rl)
Example models using DeepSpeed
Curated tutorials and resources for Large Language Models, AI Painting, and more.
Some thoughts on prompts for Large Language Models.
Guidance for courses in the Department of Computer Science and Technology, Tsinghua University
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
Using Low-rank adaptation to quickly fine-tune diffusion models.
Chinese NLP solutions (large models, data, models, training, inference)
Luotuo (骆驼): A Chinese finetuned instruction LLaMA. Developed by 陈启源 @ Central China Normal University & 李鲁鲁 @ SenseTime & 冷子昂 @ SenseTime
low-cost, high-efficiency, easy-to-implement
Simple and easily configurable 3D FPS-game-like environments for reinforcement learning
Collection of OpenAI parametrized action-space environments.
Reinforcement Learning Algorithms Based on PyTorch