Skip to content
View zhanyon's full-sized avatar
  • Institute of Automation, Chinese Academy of Acience
  • Beijing

Block or report zhanyon

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A collection of MARL benchmarks based on TorchRL

Python 313 46 Updated Dec 20, 2024

Official implementation of "DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning" in ICML'24

Python 139 18 Updated Dec 3, 2024

Library with search algorithms for task and path planning for multi robot/agent systems

C++ 848 221 Updated Aug 10, 2023

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Tr…

Jupyter Notebook 390 20 Updated Dec 9, 2024

This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Methods, Applications, and Challenges".

114 5 Updated Sep 3, 2024

离线部署大模型,构建一个可以上传本地知识库进行RAG问答且可以自行调用工具的Agent。

Python 24 2 Updated Apr 23, 2024

Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL).

74 7 Updated Mar 27, 2024

计算机视觉课程设计-基于Chinese-CLIP的图文检索系统

Python 52 3 Updated Jun 20, 2023

DIP & NLP期末大作业 — 课程设计

Jupyter Notebook 18 3 Updated Dec 11, 2022

Multi-Agent Reinforcement Learning with JAX

Python 465 89 Updated Dec 19, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,903 1,134 Updated May 23, 2024
Python 216 60 Updated Feb 22, 2023

An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowe…

Jupyter Notebook 2,065 194 Updated Dec 17, 2024

Dream to Control: Learning Behaviors by Latent Imagination

Python 518 110 Updated Sep 10, 2021

We perform functional grounding of LLMs' knowledge in BabyAI-Text

Python 232 24 Updated Aug 23, 2024

An index of algorithms for offline reinforcement learning (offline-rl)

945 87 Updated May 23, 2024

Example models using DeepSpeed

Python 6,167 1,053 Updated Dec 14, 2024

Curated tutorials and resources for Large Language Models, AI Painting, and more.

3,947 267 Updated Mar 31, 2024

卢瑟们的作业展示,答案讲解,以及一些C++知识

C++ 679 137 Updated Dec 11, 2024

Some thoughts on prompts for Large Language Models.

Python 9 Updated Jun 17, 2023

清华大学计算机系课程攻略 Guidance for courses in Department of Computer Science and Technology, Tsinghua University

HTML 33,573 7,659 Updated Jul 23, 2024

One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)

Python 963 157 Updated Nov 28, 2024

Using Low-rank adaptation to quickly fine-tune diffusion models.

Jupyter Notebook 7,112 485 Updated Mar 22, 2024

中文nlp解决方案(大模型、数据、模型、训练、推理)

Jupyter Notebook 3,088 374 Updated Dec 17, 2024

骆驼:A Chinese finetuned instruction LLaMA. Developed by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子昂 @ 商汤科技

Jupyter Notebook 711 85 Updated May 30, 2023

low-cost, high-efficiency, easy-to-implement

Python 647 353 Updated Dec 10, 2023

Live posts of FeedBot on Libera.Chat

38 11 Updated Dec 20, 2024

Simple and easily configurable 3D FPS-game-like environments for reinforcement learning

Python 712 132 Updated Dec 19, 2024

Collection of OpenAI parametrized action-space environments.

Python 61 10 Updated Feb 22, 2023

Reinforcement Learning Algorithms Based on PyTorch

Python 447 93 Updated Oct 21, 2021
Next