Skip to content
View longzh211's full-sized avatar

Block or report longzh211

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • cleanrl Public

    Forked from vwxyzjn/cleanrl

    High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

    Python Other Updated Oct 27, 2023
  • SimTPR Public

    Forked from dojeon-ai/SimTPR

    Code for the paper "On the Importance of Feature Decorrelation for Unsupervised Representation Learning for RL" (ICML 2023)

    Python Updated Jun 13, 2023
  • Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to efficiently tune RL hyperparameters.

    Python Apache License 2.0 Updated May 31, 2023
  • Example models using DeepSpeed

    Python Apache License 2.0 Updated Apr 14, 2023
  • LA3P Public

    Forked from baturaysaglam/LA3P

    Actor Prioritized Experience Replay

    Python MIT License Updated Mar 25, 2023
  • seed_rl Public

    Forked from google-research/seed_rl

    SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.

    Python Apache License 2.0 Updated Nov 29, 2022