Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to efficiently tune RL hyperparameters.

Python 74 13 Updated Nov 27, 2023

carla-simulator / carla

Open-source simulator for autonomous driving research.

C++ 12,086 3,887 Updated Feb 28, 2025

facebookresearch / level-replay

This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the fact that not all levels are equally useful for agents to le…

Python 85 16 Updated Jun 11, 2021

kaixin96 / mixreg

Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization

Shell 32 9 Updated Oct 22, 2020

xiwan / AWSTools

Centralized place holding any AWS tools

Jupyter Notebook 5 1 Updated Feb 10, 2025

allegro / allRank

allRank is a framework for training learning-to-rank neural models based on PyTorch.

Python 913 121 Updated Aug 6, 2024

tsmatz / minecraft-rl-pigchase-attention

Applying "Stabilizing Transformers for Reinforcement Learning" in Minecraft pig chase (Nov 2021)

Python 5 2 Updated Nov 8, 2023

awslabs / scale-out-computing-on-aws

Scale-Out Computing on AWS is a solution that helps customers deploy and operate a multiuser environment for computationally intensive workflows.

Python 124 58 Updated Feb 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Kaige Yang yang0110

Achievements

Achievements

Block or report yang0110

Stars

volcengine / verl

kvfrans / shortcut-models

Video-as-Agent / VideoAgent

yizhongw / self-instruct

OpenRLHF / OpenRLHF

rsshyam / GRPO

opendilab / awesome-RLHF

AntoineTheb / RNN-RL

YeWR / EfficientZero

tmoer / alphazero_singleplayer

seawee1 / efficientalphazero

NM512 / dreamerv3-torch

fonseca-carlos / fantasy_nfl

aravindsrinivas / curl_rainbow

michaelnny / alpha_zero

submit-paper / Doudizhu_plus

Vincentzyx / Douzero_Resnet

DamonDeng / ai-town-evolution

kvfrans / fre

robertoschiavone / flappy-bird-env

datamllab / awesome-game-ai

CharlesPikachu / AIGames

facebookresearch / how-to-autorl