zhanyon
  • Institute of Automation, Chinese Academy of Sciences
  • Beijing

A collection of MARL benchmarks based on TorchRL

Python 315 46 Updated Dec 20, 2024

Official implementation of "DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning" in ICML'24

Python 143 19 Updated Dec 3, 2024

Library with search algorithms for task and path planning for multi robot/agent systems

C++ 851 221 Updated Aug 10, 2023

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Tr…

Jupyter Notebook 394 21 Updated Dec 9, 2024

This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Methods, Applications, and Challenges".

115 5 Updated Sep 3, 2024

Deploy a large model offline and build an Agent that can answer RAG questions over an uploaded local knowledge base and call tools on its own.

Python 24 2 Updated Apr 23, 2024

Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL).

75 7 Updated Mar 27, 2024

Computer vision course project: an image-text retrieval system based on Chinese-CLIP

Python 53 3 Updated Jun 20, 2023

DIP & NLP final assignment (course project)

Jupyter Notebook 18 3 Updated Dec 11, 2022

Multi-Agent Reinforcement Learning with JAX

Python 467 90 Updated Dec 19, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,950 1,136 Updated May 23, 2024
Python 217 60 Updated Feb 22, 2023

An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowe…

Jupyter Notebook 2,080 194 Updated Dec 17, 2024

Dream to Control: Learning Behaviors by Latent Imagination

Python 519 110 Updated Sep 10, 2021

We perform functional grounding of LLMs' knowledge in BabyAI-Text

Python 232 25 Updated Aug 23, 2024

An index of algorithms for offline reinforcement learning (offline-rl)

948 87 Updated May 23, 2024

Example models using DeepSpeed

Python 6,179 1,057 Updated Dec 24, 2024

Curated tutorials and resources for Large Language Models, AI Painting, and more.

3,969 267 Updated Mar 31, 2024

Homework showcase from the "Losers" (卢瑟), answer walkthroughs, and some C++ knowledge

C++ 680 137 Updated Dec 29, 2024

Some thoughts on prompts for Large Language Models.

Python 9 Updated Jun 17, 2023

Guidance for courses in the Department of Computer Science and Technology, Tsinghua University

HTML 33,622 7,662 Updated Dec 29, 2024

One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)

Python 970 158 Updated Nov 28, 2024

Using Low-rank adaptation to quickly fine-tune diffusion models.

Jupyter Notebook 7,130 486 Updated Mar 22, 2024

Chinese NLP solutions (large models, data, models, training, inference)

Jupyter Notebook 3,117 378 Updated Dec 17, 2024

Luotuo (骆驼): A Chinese instruction-finetuned LLaMA. Developed by 陈启源 @ Central China Normal University & 李鲁鲁 @ SenseTime & 冷子昂 @ SenseTime

Jupyter Notebook 711 85 Updated May 30, 2023

low-cost, high-efficiency, easy-to-implement

Python 651 353 Updated Dec 10, 2023

Live posts of FeedBot on Libera.Chat

38 11 Updated Dec 30, 2024

Simple and easily configurable 3D FPS-game-like environments for reinforcement learning

Python 714 132 Updated Dec 19, 2024

Collection of OpenAI parametrized action-space environments.

Python 62 10 Updated Feb 22, 2023

Reinforcement Learning Algorithms Based on PyTorch

Python 447 93 Updated Oct 21, 2021