Stars
DSBench: How Far are Data Science Agents from Becoming Data Science Experts?
Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.
Zhehui-Huang / quad-swarm-rl
Forked from amolchanov86/gym_artAdditional environments compatible with OpenAI gym
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Materials for the Hugging Face Diffusion Models Course
Matplotlib styles for scientific plotting
Modular reinforcement learning library (on PyTorch and JAX) with support for NVIDIA Isaac Gym, Omniverse Isaac Gym and Isaac Lab
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
This repository contains demos I made with the Transformers library by HuggingFace.
Some Conferences' accepted paper lists (including AI, ML, Robotic)
🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
An extension of the PyMARL codebase that includes additional algorithms and environment support
Reinforcement Learning Environments for Omniverse Isaac Gym
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
Repository of continual learning papers
Brain Agent for Large-Scale and Multi-Task Agent Learning
A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
Multi-Joint dynamics with Contact. A general purpose physics simulator.
⏰ AI conference deadline countdowns
Pytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)
Training scripts, training data, and experimental data for Neural Fly
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
🌈谷粒-Chrome插件英雄榜, 为优秀的Chrome插件写一本中文说明书, 让Chrome插件英雄们造福人类~ ChromePluginHeroes, Write a Chinese manual for the excellent Chrome plugin, let the Chrome plugin heroes benefit the human~ 公众号「0加1」同步更新
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization