zhanyon

Follow

zhan yuan zhanyon

Follow

0 followers · 3 following

Institute of Automation， Chinese Academy of Acience
Beijing

Stars

facebookresearch / BenchMARL

A collection of MARL benchmarks based on TorchRL

Python 315 46 Updated Dec 20, 2024

guosyjlu / DS-Agent

Official implementation of "DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning" in ICML'24

Python 143 19 Updated Dec 3, 2024

whoenig / libMultiRobotPlanning

Library with search algorithms for task and path planning for multi robot/agent systems

C++ 851 221 Updated Aug 10, 2023

Coobiw / MPP-LLaVA

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Tr…

Jupyter Notebook 394 21 Updated Dec 9, 2024

BAAI-Agents / GPA-LM

This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Methods, Applications, and Challenges".

115 5 Updated Sep 3, 2024

junyuyang7 / ChatAgent_RAG

离线部署大模型，构建一个可以上传本地知识库进行RAG问答且可以自行调用工具的Agent。

Python 24 2 Updated Apr 23, 2024

Guozheng-Ma / DA-in-visualRL

Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL).

75 7 Updated Mar 27, 2024

sugarandgugu / Text2Image-Retrieval

计算机视觉课程设计-基于Chinese-CLIP的图文检索系统

Python 53 3 Updated Jun 20, 2023

BeatsLeo / ClipCap-Chinese

DIP & NLP期末大作业 — 课程设计

Jupyter Notebook 18 3 Updated Dec 11, 2022

FLAIROx / JaxMARL

Multi-Agent Reinforcement Learning with JAX

Python 467 90 Updated Dec 19, 2024

naklecha / llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,950 1,136 Updated May 23, 2024

songwenas12 / fjsp-drl

Python 217 60 Updated Feb 22, 2023

breezedeus / Pix2Text

An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowe…

Jupyter Notebook 2,080 194 Updated Dec 17, 2024

danijar / dreamer

Dream to Control: Learning Behaviors by Latent Imagination

Python 519 110 Updated Sep 10, 2021

flowersteam / Grounding_LLMs_with_online_RL

We perform functional grounding of LLMs' knowledge in BabyAI-Text

Python 232 25 Updated Aug 23, 2024

hanjuku-kaso / awesome-offline-rl

An index of algorithms for offline reinforcement learning (offline-rl)

948 87 Updated May 23, 2024

microsoft / DeepSpeedExamples

Example models using DeepSpeed

Python 6,179 1,057 Updated Dec 24, 2024

luban-agi / Awesome-AIGC-Tutorials

Curated tutorials and resources for Large Language Models, AI Painting, and more.

3,969 267 Updated Mar 31, 2024

Mq-b / Loser-HomeWork

卢瑟们的作业展示，答案讲解，以及一些C++知识

C++ 680 137 Updated Dec 29, 2024

ShenDezhou / Open-Prompt-Research

Some thoughts on prompts for Large Language Models.

Python 9 Updated Jun 17, 2023

PKUanonym / REKCARC-TSC-UHT

清华大学计算机系课程攻略 Guidance for courses in Department of Computer Science and Technology, Tsinghua University

HTML 33,622 7,662 Updated Dec 29, 2024

Replicable-MARL / MARLlib

One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)

Python 970 158 Updated Nov 28, 2024

cloneofsimo / lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Jupyter Notebook 7,130 486 Updated Mar 22, 2024

yuanzhoulvpi2017 / zero_nlp

中文nlp解决方案(大模型、数据、模型、训练、推理)

Jupyter Notebook 3,117 378 Updated Dec 17, 2024

LC1332 / Chinese-alpaca-lora

骆驼:A Chinese finetuned instruction LLaMA. Developed by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子昂 @ 商汤科技

Jupyter Notebook 711 85 Updated May 30, 2023

Melelery / c-binance-futures-quant

low-cost, high-efficiency, easy-to-implement

Python 651 353 Updated Dec 10, 2023

feedarchive / libera-feedbot-live

Live posts of FeedBot on Libera.Chat

38 11 Updated Dec 30, 2024

Farama-Foundation / Miniworld

Simple and easily configurable 3D FPS-game-like environments for reinforcement learning

Python 714 132 Updated Dec 19, 2024

thomashirtz / gym-hybrid

Collection of OpenAI parametrized action-space environments.

Python 62 10 Updated Feb 22, 2023

StepNeverStop / RLs

Reinforcement Learning Algorithms Based on PyTorch

Python 447 93 Updated Oct 21, 2021